Genbank accession
UGO52881.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,82
TF
Evidence RBPdetect2
Probability 0,89
Protein sequence
MATRLRGTLVDGLNKPIVNATVALLAKGNSLLVLSGSEAIFKTSATGTYDITVQTGYYKVIIGPQGSEPYKAGEIAIYADSKEGTLNNYLTSWAPEELTPEVIAQVKDLVSQAETAKNASATSATKSEQERVKAETAAQNAVNTANGMKASIGLTTAPRHVADITVNPSSFLGFIRILRSSTVGYPDIASSENVLAGFVSAMDGTPGFIGVFVGDMTGTIYSYRWIKDVGVTWWKQVNYSTVNRYTQLNAETNFTNQLGNAKLLIDNNKVWGAYDNENKRFIPLAIAQGGTGALTDADARTNLRLGSNDTPQFRNLNLVTVADSAQAPSGIVSGYLNNSSGVQRCRYRIYSEIRGDNKAWLTLHLQSDTATNKYAGLSVDGNFQINGNFIGNALQLSDAPNSRINLQLDRFYQSANETVIYTPSRQAYMTISNNKTWGAYDAEAQNFIPLPINRGGTGALTISDAKTNLQIPSVGGGDWLTYNAPPGVEAGKYYPVIIDMAYSSLYASGAFIDIKTRSAAGDDPMNCCTFNGFIRCGGWSDRKDGGYGYFNNYAKNEIAMKCILSSSKDAERYVAIYVEGRGFPLQLRVPAFCEVTVPTSNFTYKNTTYAWGTANPATDSVAINTLFDFSLNRVGFYQATTEGNYYIGNGERIVLSNGMSVGEELSLTTPKVSFSGTIAAGNGVIADGTSVSNATFYSRYRVGDVIYDSEFRASENAGQIIVRDPTGGTAHQFFNFNLNGTFSAPAGLLSSTGVDWNGQINTVNKFYGIAGQVNTPENNVVYGGIHVGFSGNYAFQICGRKGKSFFRTFEAGVEGQWQQLVTKGNYGVGLIGTFKPEDGSSGFYTDSDGANTWSPANGGGFQSSYTQQRIFQFWMTSSSQGFIRFNDSGNAQASKTDKPWTTLQAAGTSDINFKRVHGVMDTDVALDNISKLEFVYFNYLSDGPEREIRRGVIAQQAQEVDPEYVHSAETSGKMTLDSNPLLLDALAAIQSLKKKDQDNKDRISKLETEVEELKTLVATLVNKEQPLP
Physico‐chemical
properties
protein length:1028 AA
molecular weight: 111412,70080 Da
isoelectric point:5,48726
aromaticity:0,10506
hydropathy:-0,29358

Domains

Domains [InterPro]
IPR013609
ATT
1–130
DC_1669
ATT
6–160
IPR030392
CHP
909–1010
UGO52881.1
1 1028
Architecture
ATT
STR
ATT 1-160 | STR 223-1027 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
UGO52881.1
1 1028
Domain Start End Length (AA) Confidence
N-terminal 1 462 462 0,6232
Central domain 463 688 227 0,6857
C-terminal 689 1028 339 0,9619
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-462
Central
463-688
C-terminal
689-1028

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS_OddieOddie
[NCBI]
2902674 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UGO52881.1 [NCBI]
Genbank nucleotide accession
OL539454.1 [NCBI]
CDS location
range 18626 -> 21712
strand +
CDS
ATGGCAACTCGTTTACGTGGTACTTTAGTTGATGGTTTAAATAAACCAATTGTCAACGCTACAGTTGCACTTTTAGCGAAAGGGAATAGCTTATTGGTATTGTCCGGTAGTGAAGCTATCTTTAAAACAAGCGCAACCGGAACATATGATATTACAGTCCAAACAGGCTATTACAAGGTTATTATCGGTCCGCAAGGTAGCGAGCCTTATAAAGCTGGTGAAATTGCTATTTACGCGGACAGTAAAGAAGGTACGCTCAATAACTATTTAACTTCCTGGGCACCGGAAGAGCTAACTCCGGAAGTCATTGCACAGGTTAAAGATTTAGTTTCACAAGCTGAGACAGCTAAGAATGCTTCTGCTACATCTGCAACTAAATCCGAGCAAGAACGCGTTAAAGCAGAGACTGCGGCTCAGAATGCTGTAAACACTGCTAACGGTATGAAAGCTTCTATTGGTTTGACTACTGCACCAAGACATGTTGCAGATATTACAGTCAACCCGTCATCATTCTTGGGTTTTATACGGATTTTACGGTCTAGTACAGTAGGTTATCCGGATATTGCGTCCTCAGAAAACGTATTAGCCGGTTTTGTTTCCGCTATGGATGGAACCCCTGGATTTATAGGTGTTTTTGTTGGAGACATGACCGGGACTATATATAGCTATAGATGGATCAAGGACGTTGGCGTTACATGGTGGAAGCAGGTTAACTATTCTACTGTCAACAGATATACCCAGCTAAACGCAGAAACTAATTTTACAAACCAATTAGGTAATGCAAAACTCCTCATAGATAATAATAAAGTTTGGGGGGCATATGACAATGAAAATAAACGCTTCATACCTCTAGCAATAGCTCAAGGTGGTACAGGGGCGCTCACAGACGCGGATGCTCGTACAAACTTAAGACTTGGTTCTAACGATACACCTCAATTCAGAAACCTTAATCTTGTTACAGTAGCAGACTCCGCACAAGCGCCTTCCGGCATTGTGAGCGGTTATTTAAATAATAGCTCTGGAGTTCAAAGATGTCGCTACCGCATATATTCTGAAATAAGAGGCGACAATAAAGCGTGGTTAACCTTACACCTTCAATCAGACACAGCAACAAATAAATATGCTGGTCTAAGCGTTGATGGTAATTTTCAAATCAATGGAAATTTTATTGGTAATGCTTTGCAGTTGTCAGACGCTCCAAATTCCAGGATTAATCTACAGTTAGATCGTTTTTATCAAAGTGCTAACGAAACGGTTATTTATACTCCGTCTCGTCAAGCATATATGACAATTTCTAATAACAAGACTTGGGGCGCTTATGACGCGGAAGCTCAAAACTTCATTCCTTTACCTATTAATAGGGGAGGGACTGGTGCCCTAACAATATCCGACGCTAAGACTAATCTTCAAATTCCGTCTGTAGGTGGGGGTGATTGGTTAACTTATAACGCTCCTCCTGGCGTGGAAGCTGGAAAGTATTATCCAGTTATCATCGATATGGCATACAGTTCTTTATATGCTTCTGGTGCTTTTATAGATATAAAGACACGTTCTGCCGCTGGTGACGACCCAATGAACTGTTGCACTTTTAACGGATTTATTAGATGTGGTGGTTGGAGCGACCGAAAAGACGGCGGTTATGGTTACTTCAACAACTATGCAAAAAATGAAATTGCTATGAAATGTATTCTTTCTTCATCTAAAGATGCGGAAAGATACGTTGCAATATATGTTGAAGGTCGTGGTTTCCCTCTTCAATTGCGTGTCCCTGCATTCTGTGAAGTAACAGTACCAACATCAAACTTTACTTATAAAAATACGACCTATGCGTGGGGTACAGCTAACCCTGCAACGGATTCTGTAGCTATAAACACCTTATTTGATTTTTCATTAAACCGCGTTGGTTTTTACCAGGCCACGACAGAAGGTAATTATTATATCGGAAATGGGGAACGTATTGTTTTATCTAACGGAATGTCTGTGGGCGAAGAGTTAAGTCTAACCACACCTAAAGTGTCTTTCAGTGGAACCATTGCTGCTGGTAACGGTGTCATTGCAGATGGAACATCTGTATCCAATGCTACTTTTTATAGTCGTTATCGTGTTGGTGACGTGATATATGACAGCGAGTTTCGCGCAAGTGAAAATGCTGGTCAAATTATAGTCCGCGATCCAACGGGTGGGACTGCGCATCAATTCTTTAACTTCAATCTCAACGGGACATTCAGCGCTCCCGCTGGTTTGCTATCTTCTACTGGAGTAGATTGGAACGGTCAGATTAATACCGTCAATAAATTCTATGGGATTGCTGGACAAGTTAACACACCAGAAAACAATGTGGTTTATGGTGGTATACATGTTGGATTCAGCGGCAACTATGCTTTCCAGATATGCGGAAGGAAAGGTAAATCCTTCTTCCGCACATTTGAAGCAGGAGTAGAAGGACAATGGCAACAATTAGTAACTAAAGGGAACTACGGAGTTGGTCTAATTGGAACTTTTAAACCGGAAGATGGTTCAAGTGGTTTTTACACTGATTCCGATGGGGCAAATACTTGGTCGCCCGCTAACGGAGGGGGTTTTCAATCTTCGTATACTCAGCAGCGAATATTCCAATTTTGGATGACATCAAGCTCGCAGGGGTTTATTCGTTTTAACGATAGCGGTAACGCTCAAGCTTCTAAGACGGATAAACCTTGGACAACACTTCAAGCAGCTGGCACATCGGATATTAATTTCAAGCGTGTTCACGGAGTGATGGATACAGATGTTGCATTGGATAACATAAGCAAGCTTGAATTTGTATACTTTAACTATCTGTCCGATGGTCCAGAGCGTGAAATCCGTAGGGGTGTCATTGCTCAACAGGCCCAGGAAGTTGATCCTGAATATGTGCATAGTGCTGAAACATCCGGGAAAATGACTTTGGATTCTAACCCATTGTTATTGGATGCATTAGCTGCAATACAATCCCTCAAGAAAAAGGATCAAGATAATAAAGACCGTATTAGTAAACTTGAAACTGAAGTTGAGGAACTTAAGACGTTAGTTGCTACGCTTGTTAATAAAGAGCAACCATTACCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
a6d6b31955aad46db8022d1d19e74d5978c11b1dcdb35ce4684144e7139656aa
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6281
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50