GenBank accession
YP_009103150.1 [GenBank]
Protein name
tail fiber protein
RBP type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TF RBPdetect 0.86
TF RBPdetect2 0.88
TF UniProt/TrEMBL 1.00
Protein sequence
MATPEALYTGNGSTDIYSYAFDALELSHIKASLDGVDTTAFTIPVAGQVQFNTAPTNGAAIRIYRRTPDDSIDATFTPGSALRSQDLNSNFEQLLYVTQEDRVTASNSGSLAQQAITDAAAATGVANNAFATSVNAVASANTATTNANNAITAANSAQSVANSALSAANAADSKATTALDAVAGSLQYVPVATVGDIPGSPANNYAVEVADSTGIESFTPLAGLPAGFVGDSGLRVRIQYTSAGSTWNYLSYLPNSPDQRYLRNTAIDTTTSIGLSTGGTERLSIDASGRVGIGTKPVSRALEVSSPLQIVSAFKSTQTASRIAFVDSTSTDDARSGIGSVGDGTAIYAGATERFRITADGKLGLGTSSPSVLLQSQQVSAGSSVIGCSVVNNSATAGTGVIFDLTPSFAEPGVRGAQIEAVNTNGLNEISLSLKTSSGGNPPATRLHIAPEGRVGIGTEVPTAKLQVQAANSDTAATAFTARQNNAADTSQTSLSILIDPTTNTARLDATGTSSPNLAFLTAGTEHVRLDSSGRLLVGTSSAFNGTPSGANINLLSANEFGPQFRMRGTYDGSNPVFLVMDRARGSNVVQSGDKIMQLDCRGFDGTNYLSSARILAQVDGPPGANDMPGRIVLSTTPSGSATPVERLRVASTGRVTVTNSAVNPPGTLGTSGVVTIDLQSANNFSMSMTGNVTLGNPTNAVAGQSGCVVITNPSTHTLSFDTNWKFAGGTVPALTASATSVLTYYVASPTMIIGSLLGDVK
Physico-chemical properties
protein length: 762 AA
molecular weight: 77649.86920 Da
isoelectric point: 4.90034
aromaticity: 0.05381
hydropathy: -0.03478
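
The page does not say which tool produced these values. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), the two ratios could be computed as below. The function names are hypothetical. Under this assumption the listed aromaticity of 0.05381 over 762 residues corresponds to about 41 aromatic residues.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition)."""
    return sum(KD[aa] for aa in seq) / len(seq)
```

Passing the protein sequence listed above to both functions would give values to compare against the page; a mismatch would indicate that the database uses a different scale or definition.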

Domains

Domains [InterPro]
ID Type Range
DC_0126 ATT 5–205
IPR005604 ATT 8–111
Protein YP_009103150.1, residues 1–762
Architecture
ATT 5-205 | STR 375-590 | RBD 591-761
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
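
The architecture string leaves some residues outside any annotated segment, which the legend calls "Unmapped". A minimal sketch, assuming 1-based inclusive coordinates and a protein length of 762 AA, that parses the string and lists the uncovered ranges (the helper names are hypothetical):

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'LABEL start-end | LABEL start-end | ...' into tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:
            continue  # skip the empty piece left by a trailing "|"
        label, span = part.split()
        start, end = span.split("-")
        segments.append((label, int(start), int(end)))
    return segments


def unmapped(segments: list[tuple[str, int, int]], length: int) -> list[tuple[int, int]]:
    """Return inclusive residue ranges not covered by any segment."""
    gaps, pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


segments = parse_architecture("ATT 5-205 | STR 375-590 | RBD 591-761 |")
gaps = unmapped(segments, 762)
```

With these inputs the uncovered ranges are residues 1-4, 206-374 and 762.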

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (YP_009103150.1, residues 1–762)
Domain Start End Length (AA) Confidence
N-terminal 1 213 213 0.9843
Central domain 214 412 200 0.1362
C-terminal 413 762 349 0.9288
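
The listed lengths can be cross-checked against the start and end coordinates. A minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention used by the segmentation tool):

```python
# Rows as listed in the table: (domain, start, end, reported length).
domains = [
    ("N-terminal", 1, 213, 213),
    ("Central domain", 214, 412, 200),
    ("C-terminal", 413, 762, 349),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


computed = {name: inclusive_length(s, e) for name, s, e, _ in domains}
reported = {name: n for name, _, _, n in domains}

# Domains whose reported length differs from the coordinate-derived one,
# stored as (reported, computed).
mismatches = {
    name: (reported[name], computed[name])
    for name in computed
    if reported[name] != computed[name]
}
```

Under the inclusive assumption the central domain spans 199 residues and the C-terminal domain 350, each one off from the listed 200 and 349. Both sets of lengths sum to 762, so the discrepancy may come from how the boundary between these two domains is counted.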
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-213
Central 214-412
C-terminal 413-762

Taxonomy

Phage
Name: Synechococcus phage S-CBP2 [NCBI]
Taxonomy ID: 756277
Lineage: Uroviricota > Caudoviricetes > Autographivirales > Kembevirus SCBP2
Host
Name: Synechococcus sp. CB0208 [NCBI]
Taxonomy ID: 255252
Lineage: Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus

Coding sequence (CDS)
GenBank protein accession
YP_009103150.1 [NCBI]
GenBank nucleotide accession
NC_025455.1 [NCBI]
CDS location
range 35959 -> 38247
strand +
CDS
ATGGCAACACCTGAAGCGCTATACACCGGGAATGGGTCTACTGACATCTATTCCTATGCGTTCGATGCCCTCGAACTAAGCCATATCAAGGCCAGTCTCGACGGCGTCGACACTACAGCGTTCACGATTCCTGTTGCCGGCCAGGTGCAATTCAATACCGCCCCTACCAATGGGGCGGCTATTCGTATCTATCGCCGGACACCAGACGATTCGATTGACGCTACATTCACCCCCGGCTCTGCTCTGCGGTCACAGGACCTCAATAGCAACTTTGAGCAGCTCCTGTACGTTACTCAGGAGGACCGGGTCACTGCATCTAACTCTGGTTCTCTGGCACAGCAGGCTATTACCGACGCAGCTGCTGCTACTGGCGTAGCCAATAACGCCTTTGCTACGTCGGTGAACGCTGTGGCATCAGCTAACACGGCCACAACAAACGCCAACAATGCCATTACCGCCGCTAACTCAGCCCAGTCGGTGGCCAATAGTGCTCTGAGTGCAGCTAATGCAGCTGACTCCAAAGCCACCACCGCCCTTGATGCTGTGGCCGGTAGCCTTCAGTATGTACCTGTGGCTACGGTCGGCGACATTCCTGGTTCGCCAGCCAACAACTACGCTGTTGAGGTTGCTGACTCTACCGGCATCGAAAGTTTTACCCCACTTGCTGGTCTGCCGGCTGGGTTTGTGGGGGACTCTGGCTTACGTGTTCGTATCCAATACACGAGCGCCGGCTCAACCTGGAACTACCTAAGCTATCTTCCTAACTCTCCAGACCAGCGCTACCTCAGAAATACCGCTATTGACACTACAACGTCTATTGGCCTTTCGACGGGTGGTACTGAGCGGCTGTCTATTGATGCGAGTGGTCGGGTTGGAATCGGAACGAAGCCTGTATCTAGGGCTCTTGAAGTTAGTTCCCCACTGCAAATTGTTTCTGCGTTTAAGTCCACACAAACTGCTTCTCGTATTGCTTTTGTCGATTCAACCAGCACTGATGACGCCCGTAGTGGCATCGGATCCGTTGGTGATGGTACAGCCATTTATGCAGGCGCTACTGAACGATTCCGCATCACAGCGGACGGGAAGCTGGGTCTGGGGACGAGTAGCCCCAGTGTGTTGCTGCAAAGCCAACAAGTTAGTGCAGGTAGCTCAGTCATTGGTTGCAGTGTTGTCAATAACTCTGCTACAGCTGGTACTGGGGTAATTTTCGATCTGACACCATCTTTTGCCGAGCCAGGTGTTCGTGGTGCTCAAATAGAAGCAGTTAATACGAATGGCCTTAACGAGATTAGCCTTTCCTTAAAGACATCCTCTGGCGGCAACCCACCAGCAACACGGCTGCATATTGCTCCCGAAGGCCGTGTCGGCATCGGAACGGAGGTTCCTACGGCAAAGCTGCAAGTCCAAGCGGCCAACTCAGATACTGCGGCTACAGCGTTTACTGCTAGACAAAACAACGCAGCGGATACCAGTCAAACATCACTGTCAATTTTAATTGATCCGACCACAAATACTGCTCGGCTGGACGCTACTGGTACATCTTCGCCAAATCTTGCATTTCTTACCGCAGGAACCGAACACGTCCGCCTCGACAGCTCCGGCAGGCTGTTGGTGGGGACGTCTAGTGCGTTTAACGGCACACCTTCTGGCGCCAACATAAATCTACTTTCTGCAAATGAATTTGGTCCGCAATTTAGGATGCGTGGCACCTATGACGGGTCCAATCCTGTTTTCCTGGTTATGGATCGCGCTCGCGGTTCAAACGTTGTGCAATCAGGCGACAAAATCATGCAGCTTGATTGCCGAGGATTCGATGGCACAAACTATCTATCATCGGCTCGTATTCTGGCCCAAGTAGACGGCCCCCCTGGCGCTAACGACATGCCTGGGCGGATTGTCCTGAGCACAACGCCTTCGGGAAGCGCAACGCCCGTCGAGCGCCTTCGTGTTGCCTCAACGGGCCGTGTCACTGTTACCAACAGTGCCGTCAACCCTCCTGG
CACCCTGGGCACCTCTGGTGTTGTAACGATTGATCTTCAATCTGCTAATAACTTCAGCATGAGCATGACGGGTAACGTCACCCTCGGCAATCCAACTAACGCTGTTGCTGGTCAGAGCGGCTGTGTCGTGATCACCAACCCCAGCACCCACACCTTGTCATTTGATACCAACTGGAAGTTTGCAGGCGGTACTGTACCGGCCCTCACTGCTTCCGCTACAAGTGTCCTGACCTATTACGTGGCCAGTCCAACAATGATCATTGGTAGTCTGCTTGGAGATGTGAAATGA
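
The CDS coordinates can be checked against the protein length with simple arithmetic. A minimal sketch, assuming 1-based inclusive genome coordinates and that the final codon is a stop codon (the listed CDS ends in TGA):

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based inclusive range."""
    return end - start + 1


nt = cds_length(35959, 38247)      # range given for NC_025455.1
codons, remainder = divmod(nt, 3)  # remainder should be 0 for a complete CDS
protein_length = codons - 1        # drop the terminal stop codon
```

This gives 2289 nt, 763 codons and 762 amino acids, which matches the protein length listed under the physico-chemical properties.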

Tertiary structure

PDB ID
e4c040f50c26877d5655ec645ef94f0f53b86ec38adb306a21002df1129f507d
Source ESMFold
Method ESMFold
Resolution 0.7451
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
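
The confidence bands above can be expressed as a small lookup. This is a sketch with a hypothetical function name; the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's band name."""
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"  # assumption: a score of exactly 50 falls here
```

For example, a residue with pLDDT 74.5 would be labelled "High" under this mapping.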