Genbank accession
YP_009216963.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,67
Protein sequence
MVTKTIIPTDITTLKQDVSTAKTDISKLKEDVSNISSKNVFNEKVEVEKVVSPPSSGQTNNGPDLISTIKRNTGSIETQVKYGSRAKYGYPSHAVIGIKYSNSSGVVEWLVGSDGHFYFPGPDGYIHSGNMNIHWNQDSNSFIFENTSTIDVCHIDKSFAFQCAGTSVSNGRIHLWGNGGDRKDVIEFGTSEGYLFYAERNASGDRNLTLNNGAINCRVVNQSSDRDLKDNITPIKNATDKVRKINGYTYTFKSNGMPYAGVIAQEIMEVLPEAIGSTTVYPDDSSGIDGSEGQRFFTVDYSAVVALLVQTCKESDDRITKLDSEVQELKAIVKTLLTDSDTSSTDLP
Physico‐chemical
properties
protein length:348 AA
molecular weight: 38036,77160 Da
isoelectric point:5,15452
aromaticity:0,08046
hydropathy:-0,43161

Domains

Domains [InterPro]
DC_0421
STR
4–346
G3DSA:1.20.5.190
STR
7–50
Coil
Unmapped
11–31
YP_009216963.1
1 348
Architecture
STR
STR 4-346 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009216963.1
1 348
Domain Start End Length (AA) Confidence
N-terminal 1 20 20 0,8723
Central domain 21 219 200 0,2169
C-terminal 220 348 128 0,9961
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-20
Central
21-219
C-terminal
220-348

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterobacteria phage JenP2
[NCBI]
1610838 Uroviricota > Caudoviricetes > Queuovirinae > Nonagvirus JenP2 >
Host Escherichia coli K-12
[NCBI]
83333 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009216963.1 [NCBI]
Genbank nucleotide accession
NC_028997.1 [NCBI]
CDS location
range 22191 -> 23237
strand +
CDS
ATGGTGACTAAGACTATTATACCAACAGATATAACTACACTTAAACAAGATGTGAGTACAGCTAAGACTGACATTTCGAAACTCAAAGAGGATGTGTCTAACATTAGTTCGAAGAACGTATTTAATGAAAAGGTTGAAGTAGAGAAGGTTGTTTCTCCTCCGAGTAGTGGACAGACTAATAATGGGCCTGACTTAATTTCTACAATAAAAAGAAATACGGGCAGTATTGAGACTCAAGTTAAATACGGTTCTAGAGCAAAATATGGCTATCCATCACATGCAGTAATTGGCATCAAGTATTCTAATTCTTCCGGAGTTGTGGAGTGGCTTGTTGGTAGTGATGGTCACTTTTATTTCCCTGGCCCCGATGGGTATATTCATTCTGGTAACATGAATATACACTGGAATCAGGATTCCAATTCTTTTATATTTGAAAATACAAGTACAATAGATGTATGTCACATCGACAAATCGTTTGCATTTCAATGTGCTGGAACATCTGTGAGTAATGGTAGAATTCATCTATGGGGCAATGGTGGTGATCGAAAGGATGTAATAGAATTCGGTACAAGTGAAGGTTATTTATTTTATGCAGAACGTAATGCAAGCGGAGACAGAAACTTAACATTAAATAACGGAGCTATCAACTGCCGTGTCGTCAACCAATCATCGGATAGAGATTTAAAAGATAACATAACACCTATCAAAAATGCTACAGATAAAGTCAGAAAAATAAACGGGTACACGTACACTTTTAAATCAAACGGAATGCCTTATGCTGGTGTGATAGCACAAGAAATAATGGAGGTTCTACCTGAAGCGATTGGCAGTACCACGGTATATCCAGATGATAGTTCCGGTATAGATGGTAGTGAAGGTCAGCGTTTCTTTACAGTAGATTACTCTGCTGTAGTTGCACTTCTAGTTCAAACATGCAAAGAATCAGATGACAGAATAACAAAACTAGATTCCGAAGTCCAGGAACTCAAAGCTATTGTAAAAACTTTGTTAACAGATAGCGACACATCTTCAACTGACTTACCGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
4b17bb31a7cbe116efeb1d7fa0e1c3c2a32c8147e7f04528e3870839e4a52bec
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7328
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50