UniProt accession: A0A385EEY6 [UniProt]
Protein name: Tail fiber protein
RBP type Evidence Probability
TF UniProt/TrEMBL 1,00
TF GenBank 1,00
TF Phold 1,00
TSP DepoScope 1,00
TF RBPdetect2 0,75
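The five RBP type predictions listed for this protein can be combined into a single call. The sketch below uses a probability-weighted vote, which is an assumption made for illustration and not necessarily how this database reconciles its evidence sources. The `parse_decimal` and `weighted_vote` helpers are hypothetical names; the values are copied from the record, with decimal commas converted to points.

```python
from collections import defaultdict

# (RBP type, evidence source, probability as printed, with decimal comma)
PREDICTIONS = [
    ("TF", "UniProt/TrEMBL", "1,00"),
    ("TF", "GenBank", "1,00"),
    ("TF", "Phold", "1,00"),
    ("TSP", "DepoScope", "1,00"),
    ("TF", "RBPdetect2", "0,75"),
]


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '0,75' to a float."""
    return float(text.replace(",", "."))


def weighted_vote(predictions):
    """Sum the probabilities per RBP type and return (winner, totals).

    Assumption: every evidence source carries equal weight.
    """
    totals = defaultdict(float)
    for rbp_type, _source, prob in predictions:
        totals[rbp_type] += parse_decimal(prob)
    winner = max(totals, key=totals.get)
    return winner, dict(totals)


winner, totals = weighted_vote(PREDICTIONS)
print(winner, totals)
```

Under this scheme TF collects 3.75 against 1.0 for TSP, so the single DepoScope call is outvoted.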
Protein sequence
MPLSKLQNFIKNTEGKILYVNPNDIGATDSIENQGNSLSQPFKTLQRALLESARFSYVRGNDNDLFDRTTILLFPGEHFIDNRPGYRIKDDGGVAKAISPSGTESLAAGTLTLSLDSIFDLNVEDNMLYKFNDHRGGCILPRGTSIVGLDLRKTKIRPMYVPNPTDSEAGKSALIRLTGTCYFRDFTFFDGALDQEVYTDPQDFSQINKSKPTFSHHKLVCFGFADGVNEVDGTGLTDLDMYYSKLSNAFNEASGRNIDQKFPAQSQGFSKSRIEWEIVGAFASDPITIKEIISGDGISPSTLITVTTNIPHGLTVGTPVKIKGVTPADYNISTFVTGVPSETVFTYLLPDVDATLVATGNVSGSTATIETDTVTGASPYVFNVSLRSVYGMNGMLADGATASGFKSMVVAQFTAVSLQKDDRAFVKYNPVSRTYDGISISKVTGAQLASESSSTNSNTVYHLDSRAVYRKGWDTHHISMINDAIIQIVSVFAIGFNKHFDCQSGGDASITNSNSNFGQIALVSEGFKAEAFTKDDQGYVTSIVAPQTISEEEVNIDLFTLDVGLTTSVGITSHLYLFGFKDQDNPPAIISQGYRVGAREGDKIYVDINNTAYTANILITDNLVSTASTIATGQGSKERITPIVGLDAEGRFSCQVNHNLVTGEKVRIYSEDGDLPENLEDDTVYYAIVDGINLALFQVASTYSDALRGEAITIYGGTGLRVESRVSDKSAGEIGCPVLFDPNQNNWFLHVAENNDIYNAFLSQGGIIDIGAQTSETYLKRRSDNRSLGDKIYKLRYFVPKESTLGRDPVNGFILQDSGSTAARDDQDFTISSIDITDYDYNRNSRYISTCSVSGSTITVRVDKPHELCRDDVIKIIDVESTTNASGAALSGYNGTFTVTSIVDDKTFTYSTTDVNGNGRTPGDFTSDMNTRIALMARYERVDNKTNLSVYRSEVIQNHIPGISDGIYHFSVLAANNSIVEEFDNLSFLPKIERFYPQLDRDNVDSNPRASKSFAKRNPIGDVAVDDPENSITRETLDHFARTIGVGRTVISFERDDAAGIATVTLDRPHNLSGIVTYTSLAGSSGFTEGDYYNVKLLDNVESTWNGATARVTVGSGGGVENVQIISCGSAYSDGDVLKLDGFTGANITLAETGISTFVGNSIQITGIGSTATGVYRVTGTPTKNQVSFALTSGDPNPVNNQYLVNCGRSVGINTVISFSGISTIVTNNAHGLVAGSSFRVVDNNNNSIGDLTVLERIGINTVTVQTDVELQTGPTFILPTAYAAKGGEISGDTESIGSRVYSFYGGEDAVLSNDLGSGESDNKIILSLANSGISTNSRFSIGSFIEINGEIMRIADSQFSGSGNNELEVIRGYLGSQTKTHQQGALVRKILVRGMELRRPSILRGSGHTFEYLGYGPGNYSTGLPQVQTITLTDKEEFLTQSQKRSGGVVVYTAMNNDGDFFIGNTIINPSTGEQTTFDAPIPTVRGEDPSVLSVIFDEVTIRQRLVVEGGPAKTLLSQFDGPLRVNNVVNITGNTKIDANLEVTGRFRSSGSADIEGSLNVAGVGTFAGKIEGDAGADLGDIRIGVGNSTARIESLNSKNLTLKSATTNVMVEDNLNVDGTITGDFIQVDNIPPIGSIMLWTGSDASLPGSWLPCNGATRDQADYPALYNVLTTNGTDFRWGANPSGTTFVVPDLRGHFVIGSGTGGGASYPRASSGGSKDSVTIVHTHTTNASTVSDHTHNVSNQGNHNHPISNQADHAHNTNTSPNHSHPVGGAGQHRHNTNTAPNHGHPVGGGGAHGHNISNNGAHRHGYTRLNTSSTEYGNRNNNAVRTGQINANTGFSGAHRHNTNTAPNHGHPVGGGGAHGHNISNLGNHGHPVGGGGAHAHNIGGSGQHGHTTTGAGQHGHNTANAGGHGHTISASPAGESGTNRNLPPYIAVRYIIRAL
Physico-chemical properties
protein length: 1949 AA
molecular weight: 208006,98100 Da
isoelectric point: 5,38785
aromaticity: 0,07491
hydropathy: -0,30036
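Aromaticity and hydropathy can be recomputed from the protein sequence. This is a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not say which definitions it uses, so both are assumptions. The example runs on the first 22 residues only, not on the full protein.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue; assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 22 residues of the protein sequence in this record.
n_term = "MPLSKLQNFIKNTEGKILYVNP"
print(len(n_term), round(aromaticity(n_term), 5), round(gravy(n_term), 5))
```

Passing the full 1949-residue sequence to the same functions gives values to compare with the ones listed here, provided the definitions match.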

Domains

Domains [InterPro]
DC_0066 STR 1–1949
IPR023366 STR 295–374
SSF88874 STR 1630–1811
IPR011083 ATT 1638–1702
Sequence track: A0A385EEY6, residues 1–1949
Architecture
STR 1-1627 | ATT 1628-1702 | STR 1703-1949
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
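The architecture string is easy to parse and sanity-check. The sketch below assumes 1-based inclusive coordinates and the `LABEL start-end | ...` layout shown in this record. The function names are hypothetical. It checks that the segments tile the protein from residue 1 to 1949 without gaps or overlaps.

```python
import re

ARCHITECTURE = "STR 1-1627 | ATT 1628-1702 | STR 1703-1949"
PROTEIN_LENGTH = 1949


def parse_architecture(text):
    """Return a list of (label, start, end) tuples, 1-based inclusive."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        label = match.group(1)
        start, end = int(match.group(2)), int(match.group(3))
        segments.append((label, start, end))
    return segments


def tiles_protein(segments, length):
    """True if the segments cover 1..length with no gaps or overlaps."""
    expected_start = 1
    for _label, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


segments = parse_architecture(ARCHITECTURE)
print(segments, tiles_protein(segments, PROTEIN_LENGTH))
```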

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (A0A385EEY6, residues 1–1949): N-terminal, Central, C-terminal
Domain Start End Length (AA) Confidence
N-terminal 1 119 119 0,8038
Central domain 120 318 200 0,7036
C-terminal 319 1949 1630 0,3004
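The Length column can be cross-checked against the Start and End columns. The sketch below assumes 1-based inclusive coordinates, which the page does not state. Under that reading the Central and C-terminal rows each differ from the listed length by one residue, while the listed lengths still sum to 1949. That pattern suggests the boundary between the two domains is reported under a different convention, not that residues are missing.

```python
# Rows copied from the segmentation table: (domain, start, end, listed length).
ROWS = [
    ("N-terminal", 1, 119, 119),
    ("Central domain", 120, 318, 200),
    ("C-terminal", 319, 1949, 1630),
]


def inclusive_length(start: int, end: int) -> int:
    """Span length assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# Collect rows whose listed length disagrees with the inclusive span.
mismatches = [
    (name, listed, inclusive_length(start, end))
    for name, start, end, listed in ROWS
    if listed != inclusive_length(start, end)
]
listed_total = sum(row[3] for row in ROWS)

for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, inclusive span {computed}")
print("sum of listed lengths:", listed_total)
```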
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-119, Central 120-318, C-terminal 319-1949

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-T4 [NCBI] 2268578 Uroviricota > Caudoviricetes > Pantevenvirales > Tamkungvirus > Tamkungvirus ST4
Host Synechococcus sp. WH 7803 [NCBI] 32051 Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

GenBank protein accession: AXQ70431.1 [NCBI]
GenBank nucleotide accession: MH412654 [NCBI]
CDS location: range 17571 -> 23420, strand -
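The CDS coordinates are consistent with the 1949-residue protein length. The arithmetic below assumes 1-based inclusive genome coordinates and a terminal stop codon, both of which are assumptions about how this record reports the range. The helper names are hypothetical.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Length of a CDS given 1-based inclusive genome coordinates."""
    return end - start + 1


def protein_length_aa(cds_nt: int) -> int:
    """Residues encoded, assuming the final codon is a stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_nt // 3 - 1


nt = cds_length_nt(17571, 23420)
print(nt, protein_length_aa(nt))
```

The range spans 5850 nucleotides, or 1950 codons: 1949 residues plus the stop codon. Because the feature is on the minus strand, the coding sequence corresponds to the reverse complement of that genome interval.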
CDS
ATGCCTCTTAGTAAGCTTCAGAACTTTATTAAGAATACCGAAGGTAAGATTTTATACGTAAATCCTAACGACATCGGTGCGACTGATAGTATTGAAAACCAAGGTAATTCACTATCACAACCATTCAAAACTCTACAGAGAGCCCTTCTAGAATCTGCTAGATTCTCTTATGTAAGAGGTAATGATAATGACCTGTTTGACAGAACAACAATTCTTCTGTTCCCAGGTGAGCACTTTATTGATAACAGACCTGGATATAGAATCAAGGATGATGGTGGTGTAGCAAAAGCCATCTCTCCTTCAGGCACAGAATCTCTTGCTGCTGGAACTCTTACATTATCCCTAGATTCAATCTTTGACTTAAATGTCGAAGACAACATGCTCTACAAGTTTAATGATCATAGGGGTGGTTGCATTCTTCCTAGAGGTACATCAATCGTTGGTCTAGACCTAAGAAAGACCAAGATTCGTCCAATGTATGTGCCCAATCCCACAGACAGTGAAGCGGGCAAATCAGCACTAATTCGATTAACTGGTACTTGTTACTTCAGAGACTTTACATTTTTTGATGGGGCTCTAGATCAAGAAGTTTATACTGACCCACAAGATTTCTCCCAAATCAATAAATCAAAACCAACATTCTCTCACCATAAACTTGTTTGTTTTGGTTTCGCTGATGGTGTTAATGAGGTTGATGGCACTGGTCTAACTGACCTTGACATGTATTACAGCAAACTATCAAATGCTTTTAACGAAGCATCTGGTAGAAATATTGACCAGAAATTCCCAGCACAATCACAAGGATTCTCTAAGAGTAGAATTGAATGGGAAATTGTTGGTGCATTTGCATCAGACCCAATCACAATCAAAGAAATTATATCTGGTGATGGTATCAGCCCATCTACTCTAATCACAGTAACTACTAACATCCCTCATGGGCTTACTGTGGGTACTCCCGTCAAGATTAAGGGTGTTACTCCTGCCGATTATAATATCTCAACATTCGTAACAGGTGTACCTTCAGAAACAGTATTCACATATCTATTACCAGATGTAGATGCTACTCTAGTGGCGACAGGAAACGTTTCTGGGTCCACTGCAACTATTGAGACCGATACAGTCACAGGTGCATCTCCATACGTCTTTAACGTGTCTCTGAGATCCGTATATGGCATGAATGGTATGCTTGCGGATGGTGCTACCGCATCTGGATTCAAGTCCATGGTTGTGGCACAGTTTACGGCTGTATCACTACAGAAGGATGACCGTGCATTCGTTAAGTATAACCCAGTATCCAGAACATATGATGGTATTTCTATCTCGAAAGTTACTGGTGCTCAACTAGCATCAGAATCTAGTTCTACTAATAGTAACACTGTTTACCACCTAGACAGTAGAGCTGTTTATAGAAAGGGTTGGGATACTCACCATATTTCAATGATTAATGATGCGATCATTCAGATCGTATCCGTGTTCGCTATCGGATTCAATAAGCACTTTGATTGTCAGTCTGGTGGTGACGCATCTATCACAAACTCAAACTCCAACTTTGGTCAGATTGCTCTTGTTTCTGAAGGATTCAAGGCTGAAGCATTCACAAAGGATGATCAGGGATATGTAACTTCTATTGTTGCACCTCAAACTATTTCTGAGGAAGAAGTAAACATTGACCTATTCACATTAGATGTTGGCCTCACAACTTCTGTTGGTATTACAAGTCACCTATATCTCTTTGGTTTCAAAGATCAGGATAACCCACCAGCAATTATTTCACAGGGTTATCGTGTTGGTGCAAGAGAAGGCGATAAAATTTATGTTGACATCAACAACACTGCATATACGGCAAACATTCTAATCACGGATAATCTTGTCTCAACTGCATCCACAATAGCTACTGGTCAGGGTTCAAAGGAAAGAATCACACCAATTGTGGGTCTAGATGCAGAAGGACGTTTCTCATGTCAAGTAAATCATAATCTGGTTACTGGTGAAAAGGTTCG
TATCTATAGTGAAGATGGAGACCTTCCAGAAAATCTAGAAGATGATACTGTTTATTATGCAATTGTTGATGGAATCAACTTAGCCCTATTCCAAGTTGCATCAACATATAGTGATGCTCTTAGAGGAGAGGCAATTACCATTTATGGTGGAACTGGACTTAGAGTAGAGAGTAGGGTATCCGATAAATCTGCTGGTGAAATTGGATGTCCAGTATTATTCGATCCAAATCAAAACAACTGGTTCCTCCATGTTGCTGAGAATAATGATATTTACAATGCGTTCTTAAGTCAAGGTGGTATTATTGATATTGGTGCTCAAACATCAGAGACATATTTGAAGAGACGTTCTGACAATAGATCTCTTGGTGATAAAATTTATAAATTAAGATATTTTGTTCCAAAAGAATCAACACTTGGTAGAGATCCTGTAAATGGATTCATTCTTCAGGATAGTGGAAGCACAGCAGCCAGAGATGATCAAGATTTCACTATTTCATCAATCGATATTACTGATTATGATTATAATCGTAATTCAAGATATATTTCAACCTGTTCAGTAAGTGGATCAACCATTACAGTTAGAGTTGATAAACCACATGAACTGTGCCGTGATGATGTCATCAAAATCATTGATGTTGAAAGTACGACAAATGCTTCTGGTGCAGCCTTAAGTGGATACAACGGAACTTTCACCGTAACTTCTATTGTTGATGACAAGACATTTACATATTCCACAACAGATGTAAATGGAAATGGAAGAACTCCTGGTGATTTCACATCAGATATGAACACCAGAATTGCTCTGATGGCAAGATATGAGAGAGTTGATAATAAAACAAACCTCTCAGTGTATCGTTCCGAAGTAATTCAGAATCACATTCCTGGAATTAGTGATGGTATCTATCACTTCAGTGTTCTTGCTGCAAACAATTCTATTGTGGAGGAATTTGATAATCTATCATTCCTACCTAAGATTGAAAGATTCTATCCACAGTTAGATAGAGACAATGTTGATTCTAACCCAAGAGCTTCAAAGTCATTTGCAAAGAGAAATCCAATTGGTGATGTTGCAGTTGATGACCCAGAAAACAGCATCACTAGAGAAACTCTTGATCACTTTGCAAGAACCATTGGTGTAGGAAGAACTGTCATTTCCTTTGAGAGAGATGACGCTGCTGGTATTGCAACAGTAACTCTTGATAGACCACACAATTTATCTGGTATTGTAACTTATACATCTCTGGCTGGATCAAGTGGATTCACCGAAGGTGATTATTATAATGTCAAACTACTTGATAATGTAGAATCCACATGGAATGGTGCAACCGCAAGAGTCACAGTAGGCTCTGGTGGTGGTGTAGAAAATGTTCAGATTATTTCTTGTGGATCTGCATATTCTGATGGAGATGTATTAAAACTTGATGGATTCACTGGTGCAAACATTACTCTTGCAGAGACTGGTATTTCAACATTTGTTGGAAATTCGATTCAAATCACTGGTATTGGTAGCACTGCAACTGGCGTATATCGTGTAACTGGAACTCCCACTAAGAATCAAGTTTCATTCGCACTAACATCAGGAGATCCCAATCCTGTTAATAATCAATATCTAGTCAACTGTGGAAGATCTGTTGGTATCAACACTGTTATTAGCTTCAGTGGAATCTCAACCATTGTAACAAATAATGCACATGGATTGGTTGCTGGATCCAGTTTCAGAGTTGTTGATAACAACAACAATTCTATTGGCGATCTCACAGTATTGGAAAGAATTGGTATCAATACCGTAACCGTTCAGACAGATGTAGAATTACAAACAGGACCTACATTCATTCTACCAACTGCATATGCAGCTAAGGGTGGTGAAATTAGTGGAGACACTGAATCTATTGGTTCTAGAGTTTATAGTTTCTATGGTGGAGAGGATGCGGTTCTAAGTAATGATCTTGGATCTGGTGAATCTGATAATAAGATTATTCTATCACTTGCAAACTCTGGTA
TTTCAACCAATTCAAGATTCTCAATTGGATCCTTTATTGAGATTAACGGTGAAATTATGAGAATTGCAGATTCTCAGTTCTCTGGATCTGGAAACAATGAATTGGAAGTCATTAGAGGCTATCTTGGATCACAAACCAAGACTCATCAGCAAGGAGCATTGGTTCGTAAGATCCTTGTCAGGGGAATGGAACTTCGTAGACCATCAATTCTAAGAGGATCTGGTCATACATTTGAATACCTAGGTTATGGCCCTGGTAACTACTCAACTGGTCTACCTCAGGTACAGACAATCACTCTAACTGATAAGGAAGAGTTCCTAACTCAGTCCCAGAAGAGATCTGGTGGTGTGGTTGTATATACCGCTATGAATAATGATGGTGACTTCTTCATTGGAAACACAATCATTAACCCATCTACTGGAGAGCAGACAACTTTTGATGCACCAATTCCAACTGTTCGTGGTGAGGATCCTTCAGTTCTATCTGTTATCTTCGATGAAGTTACAATTCGTCAGAGACTAGTTGTTGAAGGTGGCCCTGCTAAGACATTACTATCTCAGTTTGATGGTCCACTAAGAGTCAACAATGTTGTCAATATCACTGGCAATACCAAGATCGATGCAAACCTTGAAGTAACTGGTAGATTTAGATCGAGTGGAAGTGCCGATATTGAAGGATCACTAAATGTTGCTGGTGTTGGTACATTTGCTGGTAAGATTGAGGGTGATGCTGGTGCTGATCTCGGTGATATAAGAATTGGTGTTGGTAATTCTACTGCTAGAATTGAGTCCCTCAATAGCAAGAACCTAACACTAAAGAGTGCCACAACCAATGTGATGGTTGAGGATAACTTGAATGTTGATGGAACAATCACTGGTGACTTTATTCAGGTGGATAATATTCCACCAATCGGTTCAATAATGCTGTGGACAGGATCTGATGCTTCACTTCCTGGATCATGGCTTCCTTGCAATGGTGCAACACGCGATCAGGCAGATTATCCAGCATTATATAACGTACTAACAACTAATGGAACCGATTTTAGATGGGGTGCAAATCCAAGTGGGACTACATTTGTTGTTCCAGATTTAAGAGGTCATTTTGTCATTGGATCTGGTACTGGTGGTGGAGCAAGTTACCCTCGAGCTTCAAGTGGTGGATCAAAGGATTCTGTTACCATTGTTCATACACATACGACTAATGCGTCAACAGTATCTGATCACACACATAATGTAAGCAACCAGGGAAATCACAATCACCCAATCAGTAATCAAGCTGATCATGCTCACAATACAAATACATCACCAAACCACTCTCACCCTGTAGGTGGTGCTGGTCAACACAGACACAATACCAATACAGCTCCAAACCATGGCCACCCTGTAGGTGGTGGTGGTGCTCATGGTCACAATATCAGTAACAATGGTGCTCACAGACATGGTTATACACGTCTGAACACATCTAGTACCGAATATGGTAATAGAAACAATAATGCTGTAAGAACTGGTCAGATTAACGCAAATACTGGTTTTTCTGGAGCACACAGACACAATACCAATACAGCTCCAAACCATGGCCACCCTGTTGGCGGCGGTGGTGCTCATGGTCACAATATCAGTAACCTTGGAAACCATGGACACCCTGTAGGTGGTGGTGGTGCTCATGCACACAATATTGGTGGATCAGGACAGCACGGTCACACAACAACTGGAGCTGGTCAACATGGACACAATACAGCTAATGCTGGTGGTCATGGTCACACTATTTCAGCATCTCCTGCTGGAGAAAGTGGAACCAATAGAAATCTACCTCCATATATTGCCGTAAGATACATCATCAGAGCACTATAA
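The CDS can be checked against the protein sequence by translating it. A minimal sketch using the standard codon assignments, which the bacterial and phage table shares for all sense codons. The `translate` helper is a hypothetical name, and the example translates only the first 30 and last 18 nucleotides of the CDS in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCTCTTAGTAAGCTTCAGAACTTTATT"))  # first 30 nt of the CDS
print(translate("ATCATCAGAGCACTATAA"))              # last 18 nt of the CDS
```

The two fragments translate to `MPLSKLQNFI` and `IIRAL`, matching the start and end of the protein sequence in this record.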

Genome Context

Tertiary structure

PDB ID: a5f02dd07ae0b43c4817f0eb9496ff0da876406e9cca583c5d38bf1c79b18d23
Source: ColabFold
Method: ColabFold
Resolution: 0,8033
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
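The confidence bands translate directly into a small classifier. The legend uses strict inequalities and does not say where values of exactly 90, 70 or 50 belong, so the sketch below assigns them to the lower band. That choice is an assumption, and `plddt_class` is a hypothetical helper name.

```python
def plddt_class(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its confidence band.

    Assumption: boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 80.0, 61.5, 32.0):
    print(score, plddt_class(score))
```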