Genbank accession
YP_007673829.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MALTRLKNVFTSKTGRCLYVNSDDFDASDSFDNRGNSPNRPFKSIQRALIEAARFSYKSGQFNDTFESFSIVLYPGDYVIDNRPARDYSQNTQDGRAFIPANLSELSASSDVDLVDENGNVNPNNVLYKFNSVDGGVIVPRGTSIVGMDLRKTKLRPLYVPDPTAATASSAIFRVTGGCYFWQFSFFDGITSGVYNNAANTASTLPPTFSHHKLTCFEYADGKNNITSVKQNDALPGATPSDFSVSDLQLYYQKVAKAWEDIPDSTSVISADELQARVEENRIVGPNTAGPKTIGSVVTDFVSTNVFTTTAEVTTSDAHGFSVGTPVLVEGITGTDAARFNGSFFISAIPTPTTFRYIIRNPANGAPSGNPTAGGSTVKVEVDNVDSSSPYIFNISLRSTWGMQGMHADGSKATGFKSMVVAQFTGVSLQKDDNAFIKWDGSAYIAGSHVDGDSIYKADYRNFHVKASNDSVIQAVSVFAVGFADHFVAESGGDQSITNSNSNFGSCALRAKGFKAAPFTQDKAGTITHVIPPQKLARTYALISGTTFTTTYDNVTVTATNNSHGIVAGDYVRFETSDNIESYLVTTVNPTSGDLTLNRGYRNLHGATSGSGKAAYKGTISEIPVGYVALDVQKIQDNASQGNAAWTINQSGISVGDSRTNGGNAYLATAVGGSGSTAGAGSGPTHTSGVAVDNEVTWAYIGAVNTRLYLYGYTSIATKPPYKLQGFSIGARKQDKIYVSLIDGSTQTTFAALISPDGTASPVDSAYTNITQQQFTPGDTNHPLQYDTYHQNWYLRVTPATSGDSGVNGVTGYGGIHYHLGNETFYANSLFTGSSYTQRIADNRSSRDRTYRMRYTVDNSASLSREPINGYVFQVRNSVTNYNNVYYIYDIQVAQELKQSVQDGIYYLTVLKGSISPTNGNLTQFSFAQNINNLYPTLDKDNPTEDPNAATSIASNITVGLVETTDGSGVEDLSLSITKEAVNTYIEEGNNSYTNSGGAGNPAQTNYITLEARDGDASEVDKTLRMVQVNNTGGTATELRRPSILRSGNHTFEYVGFGPGNYSTGLPSVQNRVLTEAETLLAQSQKEDGGIAFYSGLNSNGDLFIGNTRISAVTGEEASLDTPSLSIVGETANLRPVFDEIIVRDKITVENTQLTSVFKGSVEVNEDVIVTKGLESADITIKGEASNNQATKKFDVTVGTPSTSNAANTGDISFLGNIGNGTNLGYYWTGAAWAKFGLTDTGNLEITGGSASGSTWTDGAGDLQLKNGLGLDIQSGGALNVANGNSTLGGNLSVSGTLTVTSTSEFNNTVDVDANFAVRSGTTDKFTVASSSGNVSTDGTLTVAGQTDLNGHVNLGDGTGDNITISGRVDSDIDPDTSATYDLGSSSLKWRNAQFSGTVTAPTLAGNVDIGSGTSTFNNVTVNGTLSAGNLTGNADTATDLAINATQQLVIQTANNATSTLSSGTNNYILTSNGSGAAPSWQQNFNGNADTATQVYVTETTTNSNYPIVFTDGSTTSNSANRGLQKDNSTLYFNPSTNILTCTSIQATTFGTSSQNAYGARTVSNGNPSGGSNGDIHYKI
Physico‐chemical
properties
protein length:1580 AA
molecular weight: 167095,37970 Da
isoelectric point:4,81718
aromaticity:0,08861
hydropathy:-0,32810

Domains

Domains [InterPro]
DC_0066
STR
1–1578
YP_007673829.1
1 1580
Architecture
STR
STR 1-1578 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007673829.1
1 1580
Domain Start End Length (AA) Confidence
N-terminal 1 80 80 0,8897
Central domain 81 313 234 0,9400
C-terminal 314 1580 1266 0,5389
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-80
Central
81-313
C-terminal
314-1580

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage MED4-213
[NCBI]
889956 Uroviricota > Caudoviricetes > Eurybiavirus >
Host Prochlorococcus
[NCBI]
1218 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007673829.1 [NCBI]
Genbank nucleotide accession
NC_020845.1 [NCBI]
CDS location
range 86924 -> 91666
strand -
CDS
ATGGCACTTACTAGACTAAAAAACGTCTTTACATCAAAAACAGGACGTTGCTTATACGTCAACTCTGATGACTTTGATGCATCAGATAGTTTTGACAATAGAGGTAACTCTCCTAACCGTCCTTTTAAGAGTATACAAAGGGCGTTAATTGAAGCAGCGAGATTTTCATATAAAAGTGGACAGTTTAACGATACGTTTGAGTCATTTAGTATTGTATTATATCCTGGTGACTATGTTATTGACAACAGACCAGCTAGAGATTATAGTCAAAATACTCAAGATGGACGAGCATTTATTCCCGCCAATCTTTCCGAATTAAGTGCATCTAGTGATGTTGATCTAGTAGATGAAAATGGTAATGTAAATCCAAACAACGTATTATATAAATTTAACTCAGTAGATGGTGGTGTAATAGTTCCTAGAGGTACATCTATCGTAGGTATGGACTTACGTAAGACTAAATTACGTCCTCTATATGTTCCTGATCCTACAGCTGCTACTGCTAGTTCTGCAATATTCAGAGTTACTGGTGGATGTTATTTCTGGCAATTCTCATTCTTTGATGGTATAACATCTGGTGTATACAACAATGCTGCGAACACAGCTTCAACTTTACCCCCAACATTTTCTCATCATAAACTTACATGTTTTGAGTATGCCGATGGTAAGAATAATATAACTTCTGTAAAACAAAACGATGCTCTTCCTGGAGCAACCCCAAGTGATTTTTCAGTTAGTGACTTACAACTATACTATCAAAAGGTAGCAAAAGCATGGGAAGACATACCTGATAGCACAAGTGTTATATCTGCTGACGAATTACAGGCAAGAGTAGAAGAGAATAGAATCGTAGGTCCTAACACAGCAGGTCCTAAAACAATAGGTAGTGTTGTTACTGACTTTGTTAGCACAAACGTATTCACAACCACAGCAGAGGTCACAACATCTGATGCTCACGGGTTCTCCGTTGGAACTCCCGTATTAGTTGAGGGTATTACAGGAACTGATGCTGCAAGATTTAATGGATCGTTCTTTATTAGTGCAATACCAACACCAACAACATTTAGATATATTATTAGAAACCCTGCTAATGGTGCACCATCTGGTAACCCAACTGCGGGTGGATCAACGGTGAAAGTAGAAGTTGATAACGTTGATAGTTCATCACCATACATCTTTAACATATCTCTACGTTCTACATGGGGTATGCAGGGTATGCATGCTGATGGTAGTAAAGCAACTGGATTCAAATCTATGGTTGTTGCACAGTTTACTGGTGTATCACTACAGAAAGATGACAATGCATTTATTAAGTGGGACGGATCTGCATATATTGCAGGATCACACGTTGATGGAGACAGTATATACAAAGCAGACTATAGAAACTTCCATGTAAAGGCATCTAATGACTCAGTTATACAGGCTGTCTCAGTGTTTGCTGTTGGATTTGCTGATCACTTTGTTGCTGAGTCTGGTGGTGACCAATCTATCACCAACTCTAATAGTAACTTTGGTTCATGTGCATTAAGAGCAAAAGGATTCAAAGCAGCACCATTTACACAGGATAAAGCTGGAACTATAACACACGTCATACCTCCACAAAAACTTGCAAGAACATATGCACTTATAAGTGGAACTACATTTACTACTACGTATGATAATGTTACTGTAACAGCAACAAACAATTCTCATGGTATTGTAGCGGGTGACTATGTAAGATTTGAAACCTCTGATAACATAGAATCATATCTAGTTACCACAGTTAATCCTACAAGTGGAGACTTAACCTTAAACAGAGGATATAGAAACTTACATGGTGCAACATCTGGTTCTGGTAAAGCAGCATATAAGGGAACTATCAGTGAAATACCTGTTGGTTATGTTGCACTTGACGTTCAGAAAATACAAGACAACGCATCACAAGGTAATGCTGCATGGACAATTAACCAATCTGGTATATCAGTCGGTGACTCTAGAACAAATGGTGGTAATGCATATCTAGCAACTGCGGTAGGTGGATCTGGATCCACAGCAGGAGCAGGATCAGGACCTACTCATACATCTGGTGTTGCTGTTGATAATGAGGTCACATGGGCATATATTGGTGCAGTCAACACAAGATTATATCTTTATGGTTACACATCTATTGCAACCAAACCTCCATATAAATTACAAGGTTTCAGTATTGGTGCACGTAAACAAGACAAAATATATGTGTCATTGATTGATGGGTCTACACAAACTACATTTGCAGCTCTAATTTCTCCTGATGGAACTGCGTCACCTGTTGACTCAGCATATACAAATATAACACAACAACAATTTACACCTGGCGACACTAACCACCCACTACAGTATGATACTTATCATCAAAACTGGTATTTAAGAGTAACTCCTGCCACATCTGGAGATAGTGGAGTTAATGGAGTCACAGGATATGGGGGTATTCATTATCATCTAGGTAATGAGACATTCTATGCTAACTCATTATTCACTGGATCATCATATACTCAACGTATTGCTGACAATAGATCATCTAGAGATAGAACATATAGAATGCGTTACACTGTAGATAACTCTGCAAGTTTGTCAAGAGAACCTATTAACGGTTACGTATTCCAAGTAAGAAACAGTGTTACAAATTACAATAATGTTTACTACATTTATGATATACAAGTAGCACAAGAACTTAAACAGTCAGTGCAAGATGGTATTTACTACTTGACAGTATTGAAAGGTAGTATATCACCGACAAATGGTAACTTAACTCAGTTCTCATTTGCACAGAATATTAATAACTTATATCCTACCTTAGACAAAGATAACCCAACTGAAGATCCTAACGCTGCAACATCTATTGCAAGTAATATCACTGTTGGTTTAGTTGAGACTACTGACGGATCAGGTGTAGAGGATCTATCTTTATCAATTACAAAAGAAGCAGTAAACACATATATTGAGGAAGGAAACAACTCATATACAAACTCTGGTGGAGCAGGAAACCCTGCACAGACAAATTATATTACTCTTGAAGCAAGAGATGGTGATGCATCTGAAGTTGATAAAACCTTACGTATGGTACAGGTCAACAACACAGGTGGTACAGCAACTGAACTTAGACGACCTAGTATCCTAAGATCTGGTAACCACACATTTGAATACGTTGGTTTCGGACCAGGTAACTATTCAACTGGTCTACCTTCAGTTCAGAACAGAGTTCTTACTGAAGCTGAGACACTACTAGCACAGTCACAGAAAGAAGACGGTGGTATCGCATTCTACTCTGGTCTTAACAGTAATGGTGACTTATTCATTGGTAATACTAGAATCTCTGCTGTTACTGGTGAGGAAGCATCACTTGATACACCATCACTATCAATTGTTGGTGAGACTGCAAACTTACGTCCTGTATTTGATGAGATCATCGTTAGAGATAAGATTACAGTTGAAAATACACAGTTAACCAGTGTATTCAAGGGTAGTGTTGAAGTCAATGAGGATGTAATAGTAACTAAAGGTTTAGAATCTGCTGATATTACAATCAAAGGAGAAGCATCTAATAACCAAGCAACTAAAAAGTTTGACGTTACAGTAGGAACACCAAGCACTTCTAACGCTGCAAACACAGGAGATATATCATTCTTAGGAAATATTGGTAATGGAACTAATCTTGGTTACTACTGGACAGGTGCAGCATGGGCAAAGTTTGGACTAACTGACACAGGTAACTTAGAAATTACAGGTGGTAGTGCATCTGGTTCCACATGGACTGATGGTGCAGGAGACTTACAACTTAAGAATGGATTAGGACTAGACATACAATCTGGTGGTGCACTTAATGTTGCCAATGGTAATTCTACACTTGGTGGTAATTTAAGCGTCAGTGGAACTCTAACTGTTACAAGCACATCTGAATTTAATAATACAGTTGATGTTGATGCAAACTTTGCAGTCAGATCTGGAACAACTGATAAGTTTACAGTCGCATCAAGTTCTGGTAATGTATCTACAGATGGAACATTAACAGTTGCAGGACAGACTGATTTAAACGGACATGTTAATCTTGGTGATGGAACAGGTGATAACATAACAATCAGCGGTAGAGTAGATTCAGATATAGATCCAGATACATCTGCAACATATGATTTAGGTTCTAGTTCATTAAAGTGGAGAAATGCTCAGTTCTCAGGTACAGTCACTGCACCTACACTAGCTGGTAACGTAGATATAGGATCTGGAACATCTACATTTAATAATGTAACAGTAAATGGAACATTATCTGCAGGAAACTTAACTGGTAACGCTGATACAGCAACTGATCTTGCTATCAATGCAACACAGCAACTTGTTATTCAAACAGCTAATAATGCAACATCTACATTATCATCTGGAACTAATAACTATATCCTAACATCTAACGGATCAGGAGCAGCACCATCATGGCAGCAAAACTTTAATGGTAATGCTGATACAGCGACACAAGTATATGTTACTGAAACAACTACTAACAGTAACTATCCTATTGTCTTCACTGATGGCAGCACTACATCAAACTCTGCTAATAGGGGACTACAGAAAGATAATTCTACCTTGTATTTCAATCCTAGCACTAATATACTTACTTGCACCAGTATACAAGCAACAACATTTGGAACATCATCACAAAACGCATATGGTGCAAGAACCGTATCCAATGGTAATCCTAGTGGTGGAAGTAATGGAGATATCCACTATAAAATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
321d5fd1925c85f6100dccd76522bcb527c82cbc8e40948040653731e6951133
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7888
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
The Genome Sequence of Cyanophage MED4-213 Henn,M.R., Sullivan,M.S., Osburne,M.S., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Yu,Q., Coleman,M.L., Huang,K.H., Weigele,P.R., DeFrancesco,A.S., Kern,S.E., Thompson,L.R., Fu,R., Hombeck,B., Chisholm,S.W., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank