Genbank accession
YP_007878095.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,83
TF
Evidence RBPdetect2
Probability 0,94
Protein sequence
MPVVIPYIAIALSVASAIYAASLDLSPTADPTDTGTLINKSGTRATRDVVYGRCLTGATSLWSNVNDSSTNELIQVFSTGIAVSEIHQLYIDNVEVLVEKSYRETTDTSSLILSGEADLVNGFEKQCTIQLRTGFPIGAPMNGDLPIGTPMQLAIDNSDGEWTDKMRGDYTSMVAIKSKRIIDDEAIRIMSESYAVNLEVSGVPVFDPRVGSNPAVKSYSRNPALCALDYITNSYYGMGIGFDKVDTDSFVTAANHCDVNKFYIDGQLNQGESFASNLENICSTAQLHLFIENGKLVCRVETVAMSSWSFNEDNILKGTLRVTEQTSASYANVISVNYKNSELDDKEDVFTVPENIYPVQSDPSYPTKPQVDGYIATTELNMPMTRYIGSNANDANSPMKKFANIELLRQDFQKEIEFDIDREEYNVSVFDVIEISDSGIGWVNKQFRITGFATRISETDMNIVTLKCKEYSSTIYNGNMDGTPPSTKPTPPIEVTPPTNLTFTLQDLIISGSATLSWDRTWFESNVQYDIDYKRSSSSTWTRVGRSSTNEWKFPNLYPDTYDFRVATWSNLYGSSNFTQLTGVIISQLGAFPSVTGLECDTTTQDFKWTWDDMLNSSVVLPSDPRPDAPTNPIVRDYFSHYQVDILDGSTLIDSYQATTNLYDYTYTTNINNGILRTIRANVYIVAKDGTKSQIGSGSDLTATNPQQSAPTGIVTSTELSNTIITWDQPSDFSDYRATRFYQSTTKGFSPSQSNFLKEQVGTLFSHIWADKNVHYIRFAHVDVFGNDSASYSHEIAVTPSTIDALLPIDPDFAEIRDPQGAVGKEQVLKAADGEYVSGIGIYADNPSKSTKILMAAEQILMGVGGRPFYKSSTAYRVGDKVLYKRTSTITALYECKVANTGNAPTNTTYWNVLQSNADQTVFAVDESGKVLIRNAVISDLTSDNIKSRSIVADDIATNTITANEIAGNTITGSKIKSTTTIIAGSGSTSAGMNGDDSTSSSYKNYRFWSGAALPASAPFRVSRTGKLTATGVDISGELNATSGVMDNVTVNKTCTVLGTIQANQIVGDIVSGITKTNTAASVNVTQYEDETWSRTSSFGSVTIANARGWDRTLVVTLHIEINENSGEAYGAGRMRLFVSGSSGGSDYYSRQYNGDEKETTRSRVYTLGADVTMVIPIAKNTTPKFTFYAEGARKQGRSYNFDVSVSAPQSNNLWCAYLISNGGDLS
Physico‐chemical
properties
protein length:1227 AA
molecular weight: 134138,43960 Da
isoelectric point:4,73596
aromaticity:0,09372
hydropathy:-0,29267

Domains

Domains [InterPro]
DC_0187
STR
1–633
IPR013783
STR
496–581
IPR036116
STR
497–581
IPR003961
STR
497–580
IPR003961
STR
497–590
YP_007878095.1
1 1227
Architecture
STR
ATT
STR
RBD
STR 1-633 | ATT 699-806 | STR 807-1013 | RBD 1020-1226 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007878095.1
1 1227
Domain Start End Length (AA) Confidence
N-terminal 1 83 83 0,8872
Central domain 84 282 200 0,1655
C-terminal 283 1227 944 0,3067
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-83
Central
84-282
C-terminal
283-1227

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage henriette 12B8
[NCBI]
573174 No lineage information
Host Vibrio splendidus
[NCBI]
29497 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007878095.1 [NCBI]
Genbank nucleotide accession
NC_021073.1 [NCBI]
CDS location
range 75640 -> 79323
strand -
CDS
ATGCCAGTAGTAATTCCTTATATTGCTATAGCTTTATCGGTGGCTAGTGCAATATACGCAGCATCATTAGATCTGTCTCCTACAGCAGACCCAACCGATACGGGTACGCTAATAAACAAATCTGGTACACGAGCAACGAGAGATGTAGTTTATGGTAGATGTTTGACCGGAGCAACCTCATTGTGGTCCAATGTGAACGATTCGAGTACCAACGAGCTTATACAAGTATTCTCTACTGGTATAGCAGTATCCGAAATCCATCAATTGTATATAGATAACGTAGAAGTTCTAGTAGAAAAATCATATAGGGAAACTACAGACACATCTTCTCTGATACTATCAGGTGAAGCAGACCTAGTTAATGGTTTCGAAAAACAATGTACAATACAGCTACGCACAGGATTCCCGATTGGTGCACCAATGAACGGAGACTTACCAATCGGTACACCTATGCAGTTGGCTATAGATAATTCTGATGGTGAGTGGACCGACAAAATGCGAGGCGATTATACATCAATGGTGGCGATTAAATCGAAACGTATTATCGACGACGAAGCAATCAGGATAATGAGCGAGAGTTATGCGGTTAACCTAGAAGTATCTGGTGTTCCTGTATTTGACCCAAGGGTTGGTAGCAATCCAGCAGTCAAGTCTTATAGTCGCAACCCAGCATTATGTGCATTAGATTATATCACAAACTCATACTACGGTATGGGCATCGGTTTTGATAAAGTTGATACAGACAGTTTTGTTACGGCAGCAAATCATTGTGATGTAAATAAATTCTATATTGATGGTCAATTAAATCAGGGCGAATCTTTCGCATCAAACTTAGAAAATATATGCAGTACAGCACAATTACATTTATTTATAGAGAATGGAAAATTGGTATGTAGAGTTGAAACTGTTGCTATGTCTAGTTGGTCGTTTAATGAAGACAATATACTGAAGGGTACTTTACGAGTAACCGAACAAACATCAGCATCATACGCTAACGTGATATCAGTAAACTACAAAAACTCTGAATTGGATGATAAGGAAGATGTGTTCACCGTACCAGAAAATATATACCCAGTACAAAGTGATCCTTCATACCCAACAAAACCACAAGTTGATGGGTATATAGCGACGACCGAACTAAACATGCCGATGACTCGATATATTGGGTCTAACGCAAATGACGCTAATTCGCCAATGAAGAAATTTGCTAACATCGAGTTGCTCAGACAAGATTTTCAGAAGGAAATCGAATTTGATATAGATAGAGAGGAATATAATGTATCTGTATTTGATGTGATAGAAATATCAGATTCTGGTATTGGGTGGGTCAATAAACAATTTCGTATAACTGGATTTGCCACAAGAATATCAGAAACTGATATGAATATAGTCACTTTAAAATGCAAAGAATACAGCAGCACAATATATAACGGAAATATGGATGGTACACCACCATCAACAAAACCAACACCACCAATCGAGGTAACGCCACCAACTAACCTAACATTCACATTACAAGACTTGATAATATCTGGTAGCGCAACGTTATCTTGGGATAGAACTTGGTTTGAGTCTAACGTACAATATGACATTGATTATAAACGTTCCAGTTCGTCGACATGGACTAGGGTTGGTAGATCTTCAACTAATGAGTGGAAATTTCCGAATCTATACCCAGACACTTATGACTTTCGGGTAGCAACTTGGTCGAACCTATACGGGTCATCAAACTTCACTCAACTGACTGGTGTAATAATCAGTCAATTAGGTGCTTTCCCATCAGTTACTGGCCTCGAATGTGACACAACAACTCAAGACTTCAAATGGACTTGGGATGATATGTTAAATTCGTCGGTTGTTTTGCCATCAGATCCTAGACCTGATGCGCCAACTAATCCAATAGTGAGGGATTACTTTAGTCATTATCAAGTTGATATATTGGATGGTTCTACGTTAATTGACTCATACCAAGCAACTACCAACCTATACGACTACACATACACAACAAACATAAATAATGGTATACTGCGCACAATTAGAGCTAATGTTTATATAGTGGCTAAGGATGGAACTAAGAGCCAAATAGGATCAGGCTCAGACCTAACCGCAACAAACCCACAACAATCAGCACCTACTGGTATCGTTACATCAACCGAACTAAGCAATACAATAATCACTTGGGATCAACCTAGCGATTTTTCTGACTATAGAGCAACTAGGTTTTATCAAAGTACGACTAAGGGTTTTTCACCAAGCCAGTCAAACTTCCTTAAGGAGCAAGTTGGCACATTATTCAGCCATATATGGGCAGATAAAAATGTGCATTATATTAGATTTGCTCACGTTGATGTATTCGGTAATGATTCCGCGTCATATTCGCATGAAATAGCAGTGACTCCATCCACTATAGATGCTCTACTTCCCATAGATCCAGATTTTGCTGAAATCCGTGATCCGCAGGGTGCTGTCGGAAAGGAGCAGGTACTTAAGGCTGCTGATGGTGAATATGTTTCTGGTATAGGGATTTATGCAGACAATCCATCAAAAAGTACCAAAATATTGATGGCGGCAGAACAAATCCTTATGGGCGTTGGTGGTAGACCGTTTTACAAATCGAGTACAGCTTATAGAGTTGGTGATAAGGTATTATACAAAAGAACATCAACGATTACTGCGTTGTATGAATGTAAGGTAGCAAACACAGGCAACGCCCCAACAAACACAACTTACTGGAATGTATTACAATCAAACGCAGATCAGACCGTATTTGCTGTAGATGAGTCTGGTAAAGTTCTGATACGCAATGCAGTAATATCAGATCTAACATCAGATAACATTAAGTCCAGATCTATTGTTGCTGATGATATAGCAACCAATACAATAACGGCAAATGAAATAGCGGGTAATACCATAACTGGTTCAAAAATAAAATCAACTACTACAATAATTGCTGGTAGTGGTTCAACTAGTGCAGGGATGAATGGTGATGACAGTACCAGCTCATCTTATAAAAATTATCGATTTTGGTCCGGTGCTGCGTTACCCGCTAGTGCTCCTTTTAGAGTTAGTAGAACAGGTAAACTTACTGCCACAGGAGTTGATATTAGTGGTGAACTTAATGCCACTAGTGGTGTAATGGATAACGTAACAGTAAACAAAACGTGTACAGTACTGGGAACGATACAAGCAAACCAAATAGTTGGTGATATTGTGTCAGGTATAACCAAAACCAACACAGCGGCATCAGTTAACGTAACTCAGTATGAGGATGAAACATGGAGTAGAACATCATCGTTTGGTAGTGTCACTATAGCAAATGCACGAGGTTGGGATAGGACTTTGGTAGTAACTCTGCATATAGAGATAAACGAAAATTCTGGTGAAGCATATGGTGCAGGACGAATGCGTTTATTCGTTTCTGGGTCTTCTGGTGGTTCTGATTATTATTCTCGACAATATAATGGCGACGAAAAGGAAACTACTAGGTCTAGGGTGTATACATTGGGTGCAGACGTAACCATGGTTATCCCTATAGCAAAAAACACGACACCAAAATTCACATTCTATGCTGAAGGTGCAAGAAAACAGGGCAGATCGTATAATTTCGATGTGTCCGTTAGTGCCCCACAATCCAATAATTTGTGGTGTGCGTACTTGATATCTAATGGTGGTGACTTGAGTTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
d5c79e2b7c659335c79e4800bfafd29f9093761b7d151452d418401659132050
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6917
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
The Genome Sequence of Vibrio phage henriette 12B8 Henn,M.R., Polz,M., Kauffman,A.K.M., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Koehrsen,M., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Borenstein,D., Chen,Z., Engels,R., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hepburn,T., Howarth,C., Jen,D., Larson,L., Lewis,B., Mehta,T., Park,D., Pearson,M., Richards,J., Rizzolo,K., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., Walk,T., White,J., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank