UniProt accession
H6WFV8 [UniProt]
Protein name
Virion structural protein and packaging
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,77
Protein sequence
MPHIGNHVDAIFLPESVNTSTTLKINGGDLNVNNGQFFVDQSTTNVGIGTVTPSTLLDIYGDGVNTTQISLRQWNTGNDGPDIRFFCSGGTIASPVEPNTGDILGKVNAFVNTGSSYDQFGGFGWTYRTQGFGLNINRGSSFAIETKSIYETTNSAKIYISEEGNVGIGTASPFRTFQINGSYNSSTSEYGAPNQWFINSLSSASAGTNLGSIVFSRSTGSTGASAKIQATATGTANETDLYFYNRTSGGADNVNNYLNTTPTLKLYHDKTAEFASDVNLPDDAKLLLGTDDDFQIYYDGGTAHIDNNQGQIRMRAASAFLFYYEGNSGVEDYAKFLQNGAVELYHNNVKAFETTSTGINVPLGTITGPATLNIDPAVVGDNTGTVVIKGDLQVNGTTTTINSTTLTVDDKNIVLASGAANSAAADGAGITIDGANASLTWVDSNESFKFSTRLTIGSATSASANLRLSKDVGGENTTTYYSFLNNGLVQPDVTGTAYYNFVQVRTDGNNGTGYTISKLEGYSASVGSGTIHADTTITNLIGFEVKNTWTAGTNNYGFRGEIATGTNRWNVYMDGTAPNYFAGNVGIGTDNPDKKLVVLSDDSEVVIDDTNGSPVLRLRNNGTTGGTVELTSSNSLRFRAGGTTERLRITADGDLLVNHDSSDGSGKLQVFTNSQDGIDILGFSSGATAGGRLTFYRSKSTGVGNFSEVADGDSLGRIDWRGYNDDGTTNNLGATIEALVSGAVNSTTDMPSDLVFKTSPDGGASPTERLRITSAGRVGIGTDNPGERLDVRGKIRIEDDSPSPGLLIRDTDASGDIEIYQYNSGDLQILNNATSRNIIFQTHNGTSVGEKLRITSDGRVGIGTDSPATKLEIAGTGSPAIRIKDLDGTSQFGQIVSNNGLLIIESRNENSDGQIVFRGRDNTDTNEYGRFDEVGRFGVGVQNLNSNLHVKGGSESTDNLLLTLQSNGVANDGSLSTGLRLINSTSNTSVHGADIRAIRTGTNTADLTFSLYNGSTPQEERLRITSDGNVQLKTANAQLEWQAASGGDNPFIRSIGTDQEALEFNTGGSSRVRIGSGGSVGIGTDSPEALLDVNGGNLVVQNSSGNSITLRTHVGNGNDSHFNFQKSRGGLGTIADVQTGDHIGTISFSGYFGGAYNTESTITVEVDLGNPLNPTYSDRMFYDSNEHRFRTNGSTRIAISAAGSVGINTTAPSARLHVRGTQNAGGILVEDSSTSTQAPAIEVIGKRQDGNVHHSFSGKLLLAKNRTDAKIQGSDSILGTVAFGGNHSTGSTGNILYAAAIHGVAENSFDANNDMPTGITFRTGSTGRSGDTNNVHITNERLRITSDGNVGIGTNAPSTKLDVFGAIKSSPVAYGNNQDGTYLIAGSTSWSGATSNWGLMAFSTKLNQMPGEPQE
Physico‐chemical
properties
protein length:1415 AA
molecular weight: 148754,20750 Da
isoelectric point:4,83117
aromaticity:0,07279
hydropathy:-0,37562

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
H6WFV8
1 1415
Domain Start End Length (AA) Confidence
N-terminal 1 498 498 0,0147
Central domain 499 697 200 0,3334
C-terminal 698 1415 717 0,8752
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-498
Central
499-697
C-terminal
698-1415

Taxonomy

  Name Taxonomy ID Lineage
Phage Cyanophage S-TIM5
[NCBI]
1137745 Uroviricota > Caudoviricetes > Aurunvirus >
Host Synechococcus sp. WH 8102
[NCBI]
84588 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AEZ65683.1 [NCBI]
Genbank nucleotide accession
JQ245707 [NCBI]
CDS location
range 66179 -> 70426
strand +
CDS
ATGCCACATATCGGTAATCACGTAGACGCAATATTTTTACCAGAGTCAGTTAATACTTCGACTACTCTGAAGATTAATGGCGGGGACCTTAACGTAAACAACGGTCAATTTTTTGTTGATCAGAGCACAACCAATGTTGGCATAGGCACTGTAACACCATCGACCCTCCTAGATATATACGGAGATGGTGTAAATACAACTCAAATAAGTTTGAGACAGTGGAATACTGGAAATGATGGACCAGATATTAGGTTCTTTTGTTCTGGTGGAACTATTGCGTCTCCTGTAGAACCCAATACCGGAGACATACTTGGTAAGGTAAATGCTTTTGTAAATACCGGGAGTTCTTATGATCAGTTCGGTGGATTTGGGTGGACGTACCGTACCCAAGGTTTTGGGCTTAATATTAATCGTGGGTCTAGTTTTGCAATTGAAACTAAATCTATTTACGAGACTACCAATTCCGCAAAAATATATATTTCTGAAGAAGGCAACGTTGGAATTGGAACCGCCTCTCCATTTAGAACATTCCAAATTAACGGGTCTTATAACTCTAGCACATCTGAGTATGGAGCACCGAATCAGTGGTTTATAAACTCCTTAAGTTCTGCGTCTGCCGGAACTAATCTAGGCTCAATTGTATTTTCCAGAAGTACGGGGAGTACGGGTGCTTCTGCTAAGATCCAGGCTACAGCAACGGGCACTGCAAATGAGACAGATTTGTACTTTTACAATAGAACCTCTGGCGGAGCTGATAATGTAAATAATTATTTAAATACAACTCCGACTCTAAAGCTCTACCACGACAAAACTGCAGAGTTTGCAAGTGATGTTAACCTTCCAGATGACGCTAAGCTTCTCCTAGGAACTGACGACGATTTTCAGATCTATTATGATGGCGGCACTGCTCATATAGATAATAACCAGGGCCAAATTAGGATGAGAGCAGCTTCTGCTTTCTTATTCTATTATGAAGGTAATAGCGGCGTAGAAGATTATGCTAAATTTTTGCAAAATGGAGCAGTAGAGCTTTATCATAATAATGTAAAGGCATTTGAAACTACCTCTACTGGCATTAATGTGCCCTTAGGTACAATTACTGGTCCTGCCACGCTTAACATTGACCCAGCTGTTGTTGGAGATAATACCGGAACTGTTGTTATTAAAGGAGACCTTCAGGTCAATGGCACGACCACAACCATCAACTCCACAACGCTTACTGTTGACGATAAAAATATTGTATTAGCTTCTGGTGCAGCAAACAGTGCAGCAGCCGATGGCGCCGGGATTACTATCGATGGAGCTAATGCGAGTTTAACATGGGTTGATAGTAATGAATCTTTTAAATTCTCCACAAGATTAACAATAGGTAGCGCTACCTCTGCATCCGCCAACTTAAGATTATCAAAAGACGTCGGTGGAGAGAACACTACTACGTATTATAGTTTCTTAAACAACGGTCTCGTTCAACCGGATGTAACAGGGACAGCATATTATAACTTTGTTCAAGTAAGAACCGATGGCAATAACGGTACAGGATATACAATTTCTAAACTCGAAGGGTACTCGGCCTCTGTCGGAAGCGGTACTATTCATGCGGATACCACGATTACAAATCTTATAGGATTTGAGGTAAAAAATACCTGGACTGCAGGAACAAATAACTACGGATTTAGAGGGGAGATAGCTACTGGTACAAATAGATGGAACGTCTATATGGACGGTACTGCTCCGAATTATTTTGCAGGTAATGTTGGGATTGGAACTGATAATCCCGATAAAAAATTAGTTGTATTGAGCGATGATAGTGAAGTTGTTATTGATGATACTAATGGAAGTCCTGTATTAAGATTGAGAAATAATGGTACGACAGGTGGAACAGTTGAATTAACTTCATCTAATAGTTTGCGTTTTAGAGCAGGAGGAACTACAGAAAGACTTCGCATAACTGCCGATGGTGATTTATTAGTAAATCATGATTCCTCTGACGGCAGTGGTAAATTACAAGTCTTTACTAATAGTCAAGATGGCATTGATATCCTTGGATTTTCAAGTGGTGCTACTGCTGGTGGAAGATTAACATTCTATAGAAGTAAAAGTACTGGAGTTGGTAACTTCTCAGAAGTAGCAGATGGTGACTCCTTAGGTAGAATAGATTGGAGAGGATATAATGATGATGGAACTACTAATAATCTAGGAGCAACAATTGAAGCACTTGTTTCTGGTGCTGTAAATTCAACAACTGATATGCCATCAGATCTTGTATTTAAGACAAGTCCCGATGGTGGTGCGTCACCAACAGAAAGACTTCGTATAACTTCTGCTGGTCGGGTTGGTATCGGAACCGATAATCCAGGAGAAAGATTGGATGTTCGTGGAAAAATACGTATCGAAGACGATAGTCCTTCACCAGGATTGCTTATAAGAGATACTGATGCTAGTGGAGATATTGAAATTTATCAGTATAATTCTGGTGATTTGCAAATTCTAAATAATGCCACTAGCAGAAACATCATATTTCAAACTCATAATGGAACATCAGTTGGAGAAAAACTTCGTATAACTTCTGATGGTAGGGTTGGTATTGGTACGGACAGCCCAGCAACAAAACTTGAAATTGCTGGAACTGGATCTCCTGCAATTAGAATAAAAGATTTAGATGGAACAAGTCAATTTGGGCAAATTGTTTCAAATAATGGTCTTCTTATAATTGAAAGTAGAAATGAAAATTCTGATGGTCAAATTGTTTTCCGTGGTAGAGATAATACTGACACCAATGAGTATGGAAGATTTGATGAAGTTGGTAGATTTGGTGTCGGTGTTCAAAACCTTAATAGTAATTTACATGTAAAAGGTGGAAGTGAATCAACTGATAATCTTCTTTTAACTTTACAGTCAAATGGAGTTGCAAATGATGGTTCTTTATCCACAGGACTCAGATTAATCAACTCTACATCTAATACTTCAGTTCATGGTGCAGATATTAGAGCGATTCGTACTGGTACTAATACTGCTGATCTTACTTTTTCTCTTTATAATGGTAGTACTCCCCAAGAAGAAAGACTTCGTATAACCTCTGATGGTAATGTTCAATTAAAAACAGCAAATGCTCAATTAGAATGGCAAGCAGCAAGTGGCGGTGATAATCCTTTCATTAGATCTATTGGTACTGATCAAGAAGCATTAGAATTTAATACCGGTGGATCTTCAAGAGTTCGTATAGGCTCTGGTGGCAGCGTTGGTATTGGGACGGATAGTCCAGAAGCTCTTTTAGATGTAAACGGAGGTAACCTTGTAGTTCAAAACAGCTCCGGCAATAGCATTACTCTTAGAACACACGTGGGCAACGGAAACGATAGCCATTTTAATTTTCAAAAATCAAGAGGCGGATTGGGAACAATTGCCGATGTTCAAACCGGCGATCATATTGGCACAATTTCTTTCTCAGGGTATTTTGGCGGAGCTTATAATACAGAATCTACCATCACAGTCGAGGTTGATTTAGGTAATCCTCTGAATCCTACGTATAGTGACAGGATGTTTTATGATTCTAACGAGCACCGTTTCCGAACTAACGGCTCCACTAGAATTGCAATTTCAGCAGCTGGCTCCGTAGGCATTAACACTACCGCCCCATCCGCCCGACTCCACGTGCGTGGTACACAAAATGCTGGTGGTATTCTGGTAGAGGATAGCAGCACGAGCACCCAAGCACCAGCAATTGAAGTCATTGGAAAGAGACAAGATGGTAATGTTCATCATAGTTTTTCTGGAAAATTACTCTTAGCAAAAAATCGCACGGATGCTAAGATTCAGGGATCCGATAGCATACTAGGCACTGTTGCATTTGGAGGTAACCACTCTACTGGCAGCACGGGCAACATCTTATATGCTGCGGCTATTCACGGTGTTGCTGAGAATTCTTTTGATGCAAATAACGACATGCCTACAGGCATAACATTCCGCACTGGTTCCACTGGTCGAAGTGGAGATACAAATAATGTACACATCACAAATGAAAGACTTCGTATAACTTCTGATGGTAACGTCGGTATTGGAACAAATGCTCCAAGTACAAAACTTGATGTTTTTGGAGCTATCAAAAGTAGCCCAGTTGCCTATGGAAACAATCAAGACGGAACTTACCTTATTGCAGGATCAACGTCGTGGTCAGGAGCCACTAGTAACTGGGGGCTTATGGCTTTCAGCACAAAATTAAATCAAATGCCGGGGGAACCGCAAGAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e8f79cb24fdf645736723e67e322c0564d76cd19b034139f269e806e2d01fbd1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5217
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50