Genbank accession
VUD37820.1 [GenBank]
Protein name
Phage tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,84
Protein sequence
MAVQISGVLKDGAGKPIQNCTIQLKAKRNSTTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSKPGTLNDFLGAATEDDVRPEALYRFEKMVEEVARNAEAASQSAAAAKKSETAAASSRNAAKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARASEEASADSEEASRRNAESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAADRAEAAAEVTAEPYANIVPPLPDVWIPFNDSLDMIAGFSPGYKKIAIGDDVVQVASDKQVNFSRASTATYINKSGELKTAEINEPRFECDGLLIEGQRTNYMLNSESPASWGKSSNMDVPETGTDSFGFTYGKFVCNDSLVGQTSAINMASIAATKSVDVSGDNKYVTTSCRFKTERQVRLRIRFDKYDGSATTFLGDAYIDTQTLEINMTGGAAGRITARVRKDKTTGWIFAEATIQAIDGELKIGSQIQYSPEQGGATVSGDYIYLATPQVENGPCVSSFIISGGSATTRASDLVSIPTRNNLYKLPFTFLLEIHKNWDIAPNAAPRVWDIAAANTGQSAIAAINRGSGKLYMSLSNPSGSYVNSAATDVFTEKTTFGCIAKADGHFHVVTNGKAVNEVYCEYNGVTADKNIRFGGQTNTGERHLFGHIRNFRIWHKELNDRQLKEVV
Physico‐chemical
properties
protein length:686 AA
molecular weight: 73236,32490 Da
isoelectric point:5,39086
aromaticity:0,07289
hydropathy:-0,39373

Domains

Domains [InterPro]
IPR013609
ATT
1–131
IPR008969
ATT
5–82
G3DSA:2.60.40.1120
STR
5–99
VUD37820.1
1 686
Architecture
ATT
STR
ATT 1-131 | STR 215-686
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
VUD37820.1
1 686
Domain Start End Length (AA) Confidence
N-terminal 1 211 211 0,7572
Central domain 212 410 200 0,1598
C-terminal 411 686 275 0,7138
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-211
Central
212-410
C-terminal
411-686

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia virus mEp460_4F5
[NCBI]
2686058 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
VUD37820.1 [NCBI]
Genbank nucleotide accession
LR595868.1 [NCBI]
CDS location
range 18673 -> 20733
strand +
CDS
ATGGCAGTACAGATTTCAGGCGTGCTGAAAGACGGTGCAGGAAAACCGATACAGAACTGCACCATTCAGCTCAAAGCAAAACGTAACAGCACCACGGTTGTGGTGAACACGGTGGCCTCAGAAAATCCGGATGAAGCCGGACGTTACAGCATGGATGTCGAGTATGGCCAGTACAGCGTTATCCTGCTGGTTGAAGGTTTTCCGCCTTCACATGCCGGGACCATCACCGTGTATGAGGATTCTAAGCCGGGGACGCTGAATGATTTTCTGGGCGCTGCAACAGAAGATGATGTTCGTCCGGAGGCACTGTATCGTTTTGAAAAGATGGTGGAAGAGGTGGCACGCAACGCTGAAGCCGCCTCTCAGAGCGCAGCGGCAGCAAAGAAATCAGAAACAGCAGCGGCATCGTCCAGGAACGCGGCGAAAACATCAGAGACGAATGCAGGTAACAGCGCGAAAGCGGCAGCTTCTTCAAAAACAGCCGCACAAAACGCAGCAACAGCGGCAGAACGTTCAGAGACAAATGCCCGTGCGTCAGAAGAAGCCTCCGCAGACAGTGAAGAGGCTTCCCGCCGTAATGCAGAGTCAGCCGCTGAAAATGCCGGAGTCGCCACCACAAAAGCGCGGGAGGCTGCAGCAGACGCAACAAAGGCCGGGCAGAAAAAGGATGAGGCTCTGTCGGCAGCGACACGAGCTGAAAAGGCGGCAGACCGCGCAGAAGCCGCAGCGGAAGTGACTGCAGAGCCCTATGCGAATATAGTGCCGCCGCTGCCTGATGTGTGGATACCGTTTAACGATTCACTGGATATGATTGCGGGTTTTTCTCCGGGCTATAAAAAAATAGCTATTGGTGACGATGTGGTTCAGGTCGCCAGTGATAAACAGGTTAATTTCAGTCGCGCATCAACGGCAACATATATCAACAAATCTGGCGAACTGAAAACGGCGGAAATTAATGAGCCGCGATTTGAGTGTGATGGCCTGCTTATTGAGGGACAAAGAACGAACTACATGCTCAATTCGGAAAGTCCAGCCAGCTGGGGGAAGTCATCAAACATGGATGTGCCCGAAACCGGGACGGATAGTTTTGGTTTTACTTATGGAAAGTTTGTCTGCAACGATTCTCTGGTTGGGCAAACTTCGGCTATTAATATGGCATCAATTGCTGCAACAAAGTCAGTTGATGTCTCAGGCGATAACAAGTACGTGACAACCTCATGCCGTTTTAAAACAGAACGACAGGTAAGGTTACGTATACGGTTTGATAAGTATGATGGTAGTGCAACAACTTTTCTTGGCGATGCGTACATTGATACGCAAACGCTTGAAATTAATATGACAGGTGGTGCTGCCGGCAGAATTACGGCACGAGTCAGGAAGGATAAGACCACGGGCTGGATTTTTGCAGAGGCAACGATTCAGGCAATTGATGGTGAGTTAAAAATAGGCTCTCAGATACAGTATTCTCCTGAGCAGGGTGGGGCAACAGTATCTGGTGACTATATTTATCTTGCCACCCCACAAGTAGAGAATGGGCCGTGTGTATCATCATTTATTATTTCAGGAGGCAGCGCAACGACAAGAGCCAGTGATTTGGTTAGTATCCCCACCAGAAATAATCTTTATAAGTTACCATTTACTTTTTTACTTGAGATTCATAAAAACTGGGATATTGCACCAAACGCCGCACCCCGCGTGTGGGATATAGCAGCAGCCAATACCGGGCAATCAGCAATTGCAGCAATCAACAGAGGTAGTGGTAAGTTATATATGAGTCTGTCAAACCCTTCAGGCTCGTATGTTAATAGCGCAGCGACAGATGTATTTACAGAGAAAACCACATTTGGATGTATTGCAAAAGCTGATGGTCACTTTCATGTGGTGACAAATGGTAAAGCGGTTAATGAAGTTTATTGTGAATATAATGGCGTGACCGCTGATAAAAATATCCGATTTGGAGGGCAGACGAATACTGGAGAACGACATCTGTTTGGCCATATTCGCAATTTCCGCATATGGCATAAAGAATTAAATGACAGGCAATTAAAAGAGGTCGTATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
6f2d4359948879e8d3dbe59eebff4e009dd163dcf9d34ce580060adfad2b9832
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7903
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50