Genbank accession
YP_012026716.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
Protein sequence
MSFFAGKLNNKSILSLRRGSGGDTNQHINPDSQTIFHSDMPHVIITETHSTGLRLDQGAGDYYWSEMPSRITQLHNNDPNRVVLTEIEFSDGSRHMLSGMSMGVGAKAYGIINPQVMSQGGLKTQITASADLSLDVGYFNTGTSGTIPQKLRDGTGCQHIFGAFSGRRGFASSAMYLGGAALYKSAWDGSGYVVPDSGTLTIPSDYVRHPGARNFGFNAIYVRGRSCNRVLYGMEGPNYTTAGAVHGASSSGALNFMSNPSNPSAPKYSVGFARADPTNYAYWESMGDPNDAANGPIGIYSEHLGIYPSKITWYVTNLVYNGSGYSIDAGLFTGNDIKLSPTEFIIKGINVNNTSWKFINFIEKNFNVGNRADFRDVGCNLSMGAPSTGISGIATFGLPTTESNNAPSIKGGNVGGLNTNVVSIYNFLPSASWYVSSNPPKIGSNYGDVWSENLLPLRLLGGSGSTVLSGNIVFQGNGSVHVGTVGLDLNSSRNGAIVCTMEFLDDTWLSAGGIGCFNPTEMLSKGAEYGDSRFRMGGNTIDKKLHQILSLPAGEYVPFFTIKGTVVNACKLQAAGYTPTPYWVSGLPGSVGQTGYYTLTYYMRNDGNNNISIWLESSMSNIIGMKACLPNIKLIIQRLT
Physico‐chemical
properties
protein length:640 AA
molecular weight: 68524,12270 Da
isoelectric point:7,61412
aromaticity:0,10312
hydropathy:-0,18172

Domains

Domains [InterPro]
IPR059609
RBD
1–640
YP_012026716.1
1 640
Architecture
RBD
RBD 1-640
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_012026716.1
1 640
Domain Start End Length (AA) Confidence
N-terminal 1 155 155 0,0619
Central domain 156 354 200 0,2773
C-terminal 355 640 285 0,9445
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-155
Central
156-354
C-terminal
355-640

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC100
[NCBI]
2894397 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_012026716.1 [NCBI]
Genbank nucleotide accession
NC_105587.1 [NCBI]
CDS location
range 105528 -> 107450
strand -
CDS
ATGAGTTTTTTCGCTGGTAAGCTAAATAACAAATCTATACTCTCACTGCGAAGGGGTAGTGGTGGAGATACTAACCAACATATCAACCCTGATTCCCAAACCATCTTTCATTCAGATATGCCGCACGTAATTATAACGGAAACTCATTCTACTGGATTAAGATTGGATCAGGGCGCTGGTGACTATTACTGGTCTGAAATGCCTAGTAGAATTACACAGCTACATAATAATGACCCTAATAGGGTTGTATTAACAGAAATTGAGTTTTCTGATGGCTCTAGACATATGTTATCTGGCATGTCTATGGGGGTAGGTGCAAAAGCCTATGGTATTATAAATCCTCAGGTTATGTCTCAGGGTGGGTTAAAAACACAAATAACAGCTAGTGCCGATCTTTCACTTGATGTAGGCTATTTTAATACAGGTACTAGTGGCACTATACCACAAAAGTTAAGAGATGGTACTGGATGTCAGCATATATTTGGAGCCTTTAGTGGACGTAGAGGGTTTGCATCTAGTGCAATGTACTTAGGGGGTGCCGCTCTTTATAAGTCTGCCTGGGATGGTTCAGGATATGTAGTGCCGGATTCAGGAACTCTAACTATCCCTAGCGACTACGTTAGGCATCCCGGAGCTAGAAACTTTGGATTTAATGCCATATATGTTCGTGGAAGGTCTTGTAATAGAGTTCTTTACGGTATGGAGGGGCCTAACTATACTACTGCTGGAGCCGTGCACGGTGCTTCTAGTTCTGGGGCTCTTAACTTTATGTCTAATCCTAGCAATCCCTCGGCCCCTAAGTACTCTGTAGGTTTCGCACGTGCTGATCCCACTAACTATGCGTATTGGGAAAGTATGGGTGATCCGAATGATGCTGCTAATGGGCCCATAGGTATATATAGTGAGCACCTCGGTATTTATCCATCTAAAATTACCTGGTATGTTACTAACTTAGTGTATAATGGTTCAGGTTATAGTATTGATGCTGGATTATTCACAGGTAATGATATAAAATTAAGCCCCACTGAATTTATTATTAAAGGGATTAACGTTAATAATACATCTTGGAAGTTTATCAACTTTATAGAGAAAAACTTCAATGTCGGTAACAGGGCTGATTTTCGTGATGTTGGGTGTAACTTGAGTATGGGTGCCCCCTCTACTGGGATCTCTGGCATAGCAACATTTGGGCTGCCCACTACGGAAAGCAATAACGCACCAAGTATTAAAGGCGGTAACGTTGGTGGTTTAAATACTAATGTAGTTAGTATTTATAACTTTTTACCTTCAGCATCTTGGTATGTTTCTAGTAATCCTCCAAAAATAGGCAGCAATTATGGAGATGTTTGGAGTGAGAATCTTTTACCTTTAAGACTTCTAGGTGGTAGCGGAAGTACTGTACTATCTGGTAACATAGTATTTCAGGGTAATGGTTCCGTGCATGTTGGCACCGTGGGACTAGATCTTAATAGTAGTAGAAATGGCGCAATTGTATGTACTATGGAGTTCCTGGATGACACATGGTTGTCTGCAGGAGGTATTGGCTGCTTTAATCCTACGGAAATGCTATCTAAAGGCGCTGAATATGGCGATAGTAGGTTTAGAATGGGTGGCAATACTATCGATAAAAAACTTCACCAAATATTGTCCCTACCTGCAGGAGAATATGTACCATTTTTTACTATTAAAGGTACTGTAGTAAATGCTTGTAAATTACAGGCTGCTGGGTATACTCCAACTCCGTACTGGGTATCTGGGTTGCCAGGATCCGTAGGTCAAACGGGTTACTATACTCTAACATACTATATGAGAAATGATGGAAATAATAATATCTCTATTTGGTTAGAATCCTCTATGAGCAATATAATTGGCATGAAAGCATGCTTACCAAATATTAAACTTATAATTCAACGCCTTACCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
431ff945980233d2b7759459eb8a6114bf622a1631d91a1160a081bb8dd41f12
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,2606
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L. 2022-10-20 GenBank