UniProt accession
A0A8S5SZ41 [UniProt]
Protein name
Tail protein
RBP type
TSP
Evidence RBPdetect
Probability 0.87
TF
Evidence RBPdetect2
Probability 0.90
Protein sequence
MGSIRIAILSANNTQVAFMDNAHKKSMHYWGDELHEYLQGTANTYTFTVNAKHPDAQHITVGNKVAFTYKGKSYYLNIVNTDQTEKIITATAWSLSFELINEDAGEYKAGKAMSFAEYLTVFDAERTLKLGLNEVSDKRITNEWTGTTSVLKRLFSLAHVFSAEIEFETVLNRDYSLKEIVLNVYRKHSDTDSGVGEYRNDIVLRYGKGITGIRKTTDAEKLYTCIQPTGKDGLTINGLDKKEYDENGNIEYFTDGALIRAPKARDRFPSNIVNKDDAYIMLRKEYDTDNQDKLYSMALSDLKTASEPVVTYEVDGYFDTNIGDTVRMQDQEWTPVLYLQARVSEQVRSLTNPKTAKTVFSNYKELTSEISDDLLQWMEDLINKNKVYTCSISTNNGIIFKNGIGSTTLTAYAYDNGVDVTGKLEIRWSKDGTEFYVGKSVTVNATDVDTKAVYSFEALENGIKRGYYEVTITKVDDGEPGPVGPQGPPGEQGIPGKPGADGRTQYTHIAYANSADGTKDFSVSDSNREYIGMYVDFELLDSTDPSKYAWSKIKGADGADGVPGKPGADGRTPYFHVAYSNSADGSKDFSVSDSTNKQYIGQYTDFTQADSTDYRKYAWTKIKGEDGHDGADGVGIKNVTKYYLASEKNSGITVNTPGWTTTMQTMTEAKKYLWSYEIIAYTDGTSTKTTPVIIGVHGQNGEDGTSGIIVSPTPPENPKVGQLWQTASGEPIKRWDGSRWVLHYVSVENLDVQTLSAITVNAGELTAGKIKSKNGIMLIDIDAGKIVSKLIDNGVVDSTMELNSASLAFSGKDSGGNPANMTFSMQGLTYINRNTGGRSKLVLSDGDIYAQNGNNPLIGLSSYSKYDSGIKQGPFPEINPSNSIRIKLIRTAFVVTCTIIMKAQFPWKGRIEEIQEVRIPDGYRPAIEVLAPISEVSNGQIFGTGRYIIKENGAIAIDVENESYLERMLTTTWITEN
Physico-chemical properties
protein length: 977 AA
molecular weight: 107935.17090 Da
isoelectric point: 5.15293
aromaticity: 0.09724
hydropathy: -0.47564
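
Two of the properties above can be approximated directly from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value per residue (GRAVY). The page does not state which definitions it uses, so both definitions and the function names are assumptions for illustration.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue; assumed definition."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First residues of the sequence listed above, used only as a short demo.
    fragment = "MGSIRIAILSANNTQVAFMDNAHKK"
    print(f"aromaticity: {aromaticity(fragment):.5f}")
    print(f"hydropathy:  {gravy(fragment):.5f}")
```

Passing the full 977-residue sequence instead of the fragment gives values that can be compared with the listed 0.09724 and -0.47564, if the assumed definitions match the ones used by the page.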

Domains

Domains [InterPro]
DC_0002 STR 1–561
PTHR24637 Unmapped 477–715
Sequence track: A0A8S5SZ41, residues 1–977
Architecture
STR 1-815
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (A0A8S5SZ41, residues 1–977)
Domain          Start  End  Length (AA)  Confidence
N-terminal          1  825          825      0.9226
Central domain    826  966          142      0.1663
C-terminal        967  977           10      0.9961
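
As a quick consistency check on the table above: with 1-based inclusive coordinates, a domain's length is end - start + 1. The sketch below applies that formula to the listed rows. The inclusive-coordinate convention is an assumption, and the page may use a different boundary convention. Under that assumption, the central and C-terminal rows come out as 141 and 11 residues rather than the listed 142 and 10, while the listed lengths do sum to the full 977 residues.

```python
# Rows copied from the segmentation table: (name, start, end, listed length).
DOMAINS = [
    ("N-terminal", 1, 825, 825),
    ("Central domain", 826, 966, 142),
    ("C-terminal", 967, 977, 10),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1


def length_mismatches(rows):
    """Return (name, listed, computed) for rows whose listed length differs."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in rows
        if inclusive_length(start, end) != listed
    ]


if __name__ == "__main__":
    for name, listed, computed in length_mismatches(DOMAINS):
        print(f"{name}: listed {listed} AA, computed {computed} AA")
    print("sum of listed lengths:", sum(row[3] for row in DOMAINS))
```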
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-825
Central 826-966
C-terminal 967-977

Taxonomy

       Name                            Taxonomy ID  Lineage
Phage  Siphoviridae sp. ctL4w2 [NCBI]  2827844      Uroviricota > Caudoviricetes >
Host   No host information

Coding sequence (CDS)

Genbank protein accession
DAF56160.1 [NCBI]
Genbank nucleotide accession
BK032709 [NCBI]
CDS location
range 4283 -> 7216
strand -
CDS
ATGGGCAGCATTAGAATTGCGATTTTAAGCGCAAATAACACACAAGTAGCATTTATGGATAATGCACATAAAAAATCCATGCACTACTGGGGAGACGAGCTGCACGAATACTTACAGGGTACGGCAAATACTTACACTTTTACGGTAAATGCAAAGCATCCAGACGCGCAGCATATCACGGTCGGGAATAAGGTAGCGTTTACTTATAAAGGTAAATCTTACTACTTAAATATTGTAAATACCGATCAGACGGAGAAGATAATTACCGCTACGGCATGGTCACTGTCGTTTGAGTTAATCAACGAGGATGCTGGCGAATATAAAGCCGGAAAAGCAATGAGCTTTGCAGAGTACCTTACCGTATTTGACGCCGAGAGAACACTAAAATTAGGACTTAACGAGGTATCAGATAAGCGGATCACCAACGAATGGACAGGCACAACGTCCGTATTAAAGAGATTATTCTCTCTGGCCCATGTATTTTCTGCGGAGATCGAATTTGAGACGGTATTGAACAGAGACTACTCTTTAAAAGAGATTGTCCTAAATGTATATCGGAAACACTCCGATACAGACAGCGGAGTCGGAGAATACCGGAATGACATTGTACTGCGGTACGGGAAAGGAATTACCGGAATCCGTAAAACCACAGATGCCGAGAAGCTTTATACCTGTATCCAGCCGACCGGGAAAGACGGGCTTACGATCAATGGACTGGACAAAAAGGAATACGATGAGAACGGGAATATCGAGTACTTTACAGACGGGGCGCTCATTCGGGCACCGAAGGCAAGAGACCGATTCCCGTCCAACATCGTAAACAAGGACGATGCTTATATCATGCTACGGAAAGAGTACGATACGGACAATCAAGATAAGCTCTATAGCATGGCACTATCTGACCTCAAGACCGCATCCGAGCCAGTAGTGACTTACGAGGTGGACGGGTACTTCGATACCAATATCGGGGACACGGTGAGAATGCAAGATCAGGAGTGGACACCAGTCCTTTACCTACAGGCGAGGGTATCTGAACAGGTACGCAGTCTTACCAATCCAAAGACTGCCAAGACGGTGTTTAGTAATTATAAGGAGTTGACATCCGAGATTTCGGACGATTTGCTACAATGGATGGAAGACTTGATTAACAAAAATAAGGTTTATACTTGCTCTATCTCAACCAACAACGGCATTATTTTTAAAAATGGTATCGGTAGCACTACTCTGACCGCTTACGCTTACGATAACGGCGTGGATGTGACAGGCAAACTGGAAATCCGATGGAGCAAAGACGGTACAGAGTTTTATGTCGGCAAGAGCGTTACGGTAAATGCTACGGACGTGGATACAAAGGCGGTGTACTCGTTTGAGGCTTTGGAAAATGGGATAAAACGCGGGTATTACGAGGTTACGATTACTAAAGTCGATGATGGAGAACCGGGACCTGTGGGACCACAAGGGCCACCTGGAGAACAAGGGATTCCCGGAAAGCCAGGGGCGGACGGGAGAACACAGTATACCCACATTGCTTACGCAAACAGTGCGGATGGAACGAAGGACTTTTCCGTGTCCGACAGCAATCGGGAATATATCGGAATGTATGTGGATTTTGAACTATTGGATAGCACAGATCCATCGAAGTACGCATGGAGTAAGATTAAGGGAGCGGATGGAGCGGACGGAGTGCCTGGGAAGCCTGGAGCGGATGGCCGAACGCCTTATTTTCACGTCGCTTATTCCAACAGTGCAGATGGTTCAAAAGATTTTTCGGTATCAGACAGTACAAATAAGCAGTACATCGGACAGTATACCGATTTTACACAGGCGGACAGCACAGACTACAGAAAGTATGCTTGGACTAAGATTAAAGGTGAAGACGGACATGACGGCGCAGACGGTGTAGGGATTAAGAATGTCACAAAGTATTATCTGGCAAGCGAAAAAAACTCTGGGATTACGGTAAATACGCCGGGATGGACTACGACAATGCAGACCATGAC
GGAAGCAAAAAAATACTTATGGAGCTACGAGATAATCGCTTACACAGACGGAACATCCACCAAGACAACACCTGTCATCATCGGAGTGCATGGCCAGAATGGAGAGGACGGAACATCCGGCATCATCGTGTCTCCCACGCCCCCGGAAAATCCAAAAGTAGGACAGCTCTGGCAGACAGCAAGTGGAGAACCAATTAAAAGATGGGATGGAAGTCGTTGGGTGCTGCATTACGTATCGGTCGAGAATCTGGACGTGCAAACATTAAGTGCGATCACAGTGAATGCCGGCGAATTAACAGCAGGTAAGATAAAAAGCAAAAACGGGATCATGCTCATAGATATTGACGCAGGGAAAATCGTAAGCAAATTGATTGATAATGGAGTTGTCGACAGCACGATGGAACTTAATTCCGCTTCCCTTGCATTTTCCGGTAAGGACTCGGGAGGTAATCCTGCAAATATGACTTTTTCCATGCAGGGGCTTACTTATATAAACCGGAATACCGGAGGACGTTCAAAACTTGTGTTATCAGACGGAGATATATATGCGCAGAACGGAAATAATCCATTAATTGGGTTGTCTTCGTACAGCAAATACGATTCTGGCATAAAGCAGGGACCATTCCCGGAAATAAATCCATCAAATTCCATCAGGATAAAGCTTATAAGAACTGCTTTTGTTGTAACGTGCACGATTATTATGAAAGCACAGTTCCCGTGGAAAGGAAGAATTGAAGAAATACAAGAGGTAAGAATCCCTGACGGATACAGGCCGGCAATAGAAGTGTTGGCACCAATCAGCGAGGTTTCCAACGGACAAATATTCGGAACAGGCAGGTACATAATAAAAGAAAACGGCGCAATAGCTATAGATGTGGAGAATGAGTCGTACCTAGAAAGGATGCTAACAACAACTTGGATAACGGAAAACTAA
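
The CDS location can be cross-checked against the protein length. The sketch below assumes the range 4283 -> 7216 is 1-based and inclusive and that the CDS ends with a single stop codon (the sequence above ends in TAA). The function name is illustrative only.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Protein length implied by a 1-based, inclusive CDS range.

    Assumes an uninterrupted CDS (no introns or frameshifts).
    """
    n_nt = end - start + 1
    if n_nt <= 0 or n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} nt is not a positive multiple of 3")
    codons = n_nt // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    # 7216 - 4283 + 1 = 2934 nt = 978 codons, one of which is the stop codon.
    print(cds_protein_length(4283, 7216))  # prints 977
```

The result agrees with the 977 AA protein length listed for this entry.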

Genome Context

Tertiary structure

PDB ID
1da5a3cf5b8286865216f2dd643d2748522078a88f4efbd52c08b8612a52a294
Source ColabFold
Method ColabFold
Resolution 0.8459
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
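
The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the legend uses strict inequalities and so leaves the exact values 90, 70 and 50 unassigned. Placing those boundary values in the lower band is an assumption made here, and the function name is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Boundary values (90, 70, 50) fall into the lower band; this is an
    assumption, since the legend does not assign them.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 84.6, 61.0, 32.5):
        print(score, plddt_band(score))
```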