UniProt accession
A0A8S5PY23 [UniProt]
Protein name
Tail protein
RBP type
TSP
Evidence RBPdetect
Probability 0,85
TF
Evidence RBPdetect2
Probability 0,90
Protein sequence
MGRFIIYSKDGQTQRCVANKLEYNGEFMGACSVNITVTSPTPIDFTVGDYLIYRGERFEINYDPTELKQASKNTYGEAFKYENVVFNSLADELTRCEFLDYVKEDNLIHYSSLPTFSFYAESINALAERIQVNLDRIYKGEQKWTVTVHPEYVNEANKSISISSINVWDALALVNSEFKANFIIKERTITIGTAGIAVGNMFGYGKGKGLYSIQKTADSSQKIITRLRAYGGTKNLPYNYYTTYGSPIVEAPIEDVSYGYDPNTHLIDGAVVTLPFYMKFLSDTALYDVTINGHSYKMKRGSFLGKCYVLLNSEADKDNVHIGAKMRIEKGIETDNVPRKYKRPSGALVPNNMAVKNLMLPDFPEKTLDPYLDSKNIDIIGVREGSVFFDGSDTSLPEIYPSMEGMTAQQLKDAGIIVNATGALDEIASDSVNKDNTPIADDGYFEEGETIPPFKIYLKDIGFDINDYLTGETATISMKSGMCGGREFEILGDADKPVKQGNMWVLTCNRVYDEGLNLYFPYKDFTIKAGDKFVLLGIDMPDVYIKAASQRLLTASKEYLAKNDYVRYTYEPKVDEIFMARHPELHDSIKEGDLMLFEDEDLNINGSIIIDSLTIKEGNGLIPTYDITLRNDKAVGTLEKIQNQIDSIVGGQGGGGLTTQQVESIIKAFGEKLFLNKTKPDQTSYLIKFLGGLFSDYIQSMNFSSGALGEGFVIKVDSKTGKSYIEVDELFVRIKAMFSELEIKKLSYAGGNYMFTAAGMKCGKVEEHEDFWRCYLLVDDGETAIENPFKEGDQVRFQDFNIKPGVYENVSNRYYWRLCVGVGEDYIDLSKTDCDANSDTPQEGDSLVQLGNRTDKKRQNAITLSVYGDDAPSIHQYAGINSYSLAGKEVTVISPQGNKFMGDFILKTGINIMTQFKILEDLIYSEISKVLDEVQAKDNYLYNAAFASNTNGWETKNDVRFFTVNGKFLLVNDKFYSRKDAMAAIIRDGDRNVLRILSSGIKQSNANLANKPTYEEGEEPKNFFISFRYKVATAGTLTIGFPGQNLHFTERIEPGEEYAMKEYSGTWDGTGDFELKFTGDIYIHSLALAENAFEDLYTKLSSEIKQTAESIRLEVKELSESNNQKFSQIEQTAENLKLSVTKIEEDVTQLGLDINGVTDELKLYVKKDGLGSEINVALDNISVVSKNIYFTGNISANGNVSIQADGTIKAIGGYFEGEINANSGVFKNVRTPNNSLVIDENGNVSIVGKISTASSGTKIEINPNSNSLKFYNSKGYDVGGISFLDSGGGGTSVTYPRLKLDNIASDGNLTASTTLFAGSLSMISNLSGSRYQVSLGISGLSFYKDGRLTKSYPSS
Physico‐chemical
properties
protein length:1355 AA
molecular weight: 150363,37470 Da
isoelectric point:4,90813
aromaticity:0,10554
hydropathy:-0,32458

Domains

Domains [InterPro]
DC_1573
STR
1–274
Coil
Unmapped
1126–1146
A0A8S5PY23
1 1355
Architecture
STR
STR
STR 1-274 | STR 408-1351 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A8S5PY23
1 1355
Domain Start End Length (AA) Confidence
N-terminal 1 40 40 0,9754
Central domain 41 1307 1268 0,4351
C-terminal 1308 1355 47 0,8347
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-40
Central
41-1307
C-terminal
1308-1355

Taxonomy

  Name Taxonomy ID Lineage
Phage Myoviridae sp. ctXwe21
[NCBI]
2825123 Uroviricota > Caudoviricetes >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
DAE11778.1 [NCBI]
Genbank nucleotide accession
BK015537 [NCBI]
CDS location
range 7335 -> 11402
strand -
CDS
ATGGGACGGTTTATAATATACAGCAAAGACGGGCAGACGCAACGATGTGTCGCTAACAAGTTAGAGTATAACGGGGAGTTCATGGGAGCTTGTTCCGTTAACATTACCGTTACGTCCCCCACTCCGATTGATTTTACAGTCGGGGACTATCTGATATACCGCGGAGAAAGATTTGAAATAAACTACGACCCGACTGAATTGAAGCAAGCCTCCAAAAACACATACGGAGAGGCTTTCAAATATGAGAACGTAGTTTTCAATTCTCTTGCAGACGAACTGACAAGATGCGAATTTTTGGACTATGTAAAAGAGGATAACTTAATCCACTACTCTTCCCTGCCTACATTCAGTTTTTACGCTGAAAGTATAAATGCTCTCGCGGAAAGAATACAGGTGAACCTTGACCGTATCTATAAAGGAGAACAAAAATGGACAGTTACAGTACATCCGGAATATGTTAATGAGGCTAACAAATCCATATCAATAAGCAGTATAAACGTTTGGGACGCACTCGCTTTGGTAAATAGCGAGTTTAAGGCAAACTTTATCATAAAGGAGCGAACGATAACAATAGGCACTGCCGGAATTGCAGTAGGAAACATGTTCGGGTATGGAAAGGGCAAAGGGCTGTACTCCATACAAAAAACCGCGGATTCGTCACAGAAGATAATTACCCGCCTAAGAGCATATGGTGGTACCAAAAACTTGCCTTACAACTATTATACAACATATGGTAGTCCTATTGTCGAAGCTCCCATCGAGGATGTATCTTACGGATATGACCCTAATACACATCTGATAGACGGAGCTGTTGTGACGCTTCCTTTTTACATGAAATTCCTATCTGACACAGCATTGTATGATGTGACAATCAATGGACATTCCTATAAAATGAAAAGAGGTAGCTTTCTTGGGAAATGCTACGTTTTGTTGAATAGCGAAGCCGACAAGGACAACGTTCATATAGGCGCAAAGATGCGGATAGAAAAAGGCATTGAGACGGACAATGTTCCAAGAAAGTACAAAAGACCTTCCGGAGCATTAGTCCCCAATAATATGGCTGTTAAAAACTTGATGCTTCCTGATTTTCCGGAAAAGACACTTGACCCATACCTTGATAGTAAAAACATAGATATTATCGGAGTTCGGGAAGGTTCGGTTTTCTTTGACGGGAGCGATACTTCTTTGCCGGAAATATATCCGTCTATGGAAGGAATGACGGCACAGCAGTTGAAAGACGCGGGAATAATCGTAAATGCTACCGGAGCGTTGGATGAAATCGCTTCCGATTCTGTGAATAAGGATAATACGCCAATCGCGGATGATGGTTACTTTGAAGAAGGGGAAACCATCCCACCGTTCAAAATATATCTCAAAGACATTGGATTTGACATAAACGATTATCTAACAGGGGAAACCGCCACCATATCCATGAAAAGCGGAATGTGTGGTGGACGTGAATTTGAAATACTTGGAGATGCAGACAAGCCCGTAAAACAAGGTAATATGTGGGTCTTGACATGCAACAGAGTCTATGATGAAGGTTTGAATCTTTATTTCCCATATAAGGATTTTACTATCAAGGCCGGAGATAAATTTGTGCTTTTGGGCATTGATATGCCGGATGTGTATATAAAAGCCGCTTCCCAAAGATTGCTAACAGCTTCCAAAGAATATCTTGCAAAAAATGATTATGTAAGATATACTTACGAGCCTAAAGTAGACGAAATATTTATGGCGCGTCACCCGGAACTGCATGACAGTATAAAGGAAGGTGATTTAATGTTGTTCGAGGATGAAGACCTAAACATCAATGGGAGCATTATCATTGACAGCCTTACGATAAAAGAAGGAAATGGACTTATTCCAACGTATGATATTACCCTTCGCAATGACAAAGCGGTAGGAACTTTAGAAAAGATACAGAATCAGATAGATTCAATAGTAGGCGGGCAAGGCGGTGGAGGATTAACTACCCAACAAGTGGAATCAATCATTAAAGCCTTTGGAGAAAAGCTGTTTTTGAATAAAACCAAACCTGACCAAACCAGCTATTTAATAAAGTTTTTAGGTGGATTGTTTTCAGACTACATCCAGTCCATGAACTTTTCTTCCGGTGCACTCGGTGAAGGCTTTGTTATTAAAGTAGACAGCAAGACGGGTAAATCCTACATTGAGGTGGACGAACTCTTTGTGCGTATCAAAGCGATGTTCTCCGAACTGGAGATAAAGAAGCTCTCTTATGCAGGCGGAAACTACATGTTCACCGCTGCCGGAATGAAATGTGGAAAGGTAGAAGAACACGAGGATTTTTGGCGTTGCTATCTTTTGGTGGATGATGGAGAAACGGCTATCGAGAACCCGTTCAAGGAAGGCGACCAGGTCCGTTTCCAAGACTTCAATATCAAACCGGGTGTCTATGAGAATGTTTCCAACCGTTATTATTGGCGCTTATGTGTAGGTGTTGGCGAGGATTACATAGACCTTAGTAAGACAGATTGTGACGCAAACAGCGACACACCGCAGGAAGGAGACAGCCTTGTACAGCTCGGTAACAGAACAGACAAGAAGCGTCAGAACGCAATCACCCTGTCTGTGTATGGTGATGATGCGCCGAGTATCCACCAGTATGCAGGAATAAATTCTTATTCTTTAGCAGGCAAGGAAGTGACAGTTATCAGCCCGCAAGGCAACAAGTTCATGGGAGACTTTATCTTGAAAACGGGAATAAACATTATGACCCAATTCAAGATATTGGAAGATTTGATTTACTCTGAAATTTCCAAAGTGCTTGACGAGGTGCAGGCAAAGGATAATTATCTGTACAATGCGGCATTTGCATCCAATACGAACGGTTGGGAGACAAAGAACGATGTTCGCTTCTTTACTGTGAACGGAAAGTTCTTATTGGTTAACGACAAGTTCTATTCCCGTAAGGACGCTATGGCTGCCATTATTAGAGACGGAGATAGAAACGTGCTTCGTATCCTTTCTTCCGGAATTAAACAGTCAAATGCGAATTTAGCCAATAAACCGACCTATGAGGAAGGGGAAGAACCGAAGAATTTCTTTATCTCTTTCCGGTATAAGGTAGCTACAGCCGGAACGCTGACAATAGGATTTCCCGGTCAGAACCTGCATTTCACCGAACGTATTGAACCGGGTGAGGAATACGCAATGAAGGAATATTCCGGCACATGGGACGGAACGGGCGATTTTGAGTTGAAGTTTACGGGGGATATATACATACATTCGCTGGCTCTTGCCGAAAACGCATTCGAGGATTTGTATACTAAATTGAGTTCCGAAATAAAGCAGACAGCGGAAAGTATCAGGTTGGAAGTAAAGGAGCTTTCTGAAAGTAATAATCAGAAGTTCTCACAGATTGAGCAGACAGCGGAAAACCTCAAATTGTCTGTTACCAAAATAGAGGAAGATGTAACGCAGTTGGGGCTGGACATCAATGGGGTTACCGATGAACTTAAATTATATGTCAAAAAAGACGGATTAGGTTCAGAAATCAATGTGGCACTTGATAATATTTCCGTGGTTTCCAAAAATATATACTTTACCGGAAATATATCCGCCAACGGGAATGTGTCTATTCAGGCAGACGGGACAATAAAGGCTATTGGTGGATATTTTGAAGGAGAGATAAATGCAAACAGCGGGGTGTTTAAAAATGTAAGAACTCCTAACAACTCTTTGGTGATAGACGAAAATGGGAATGTTAGCATTGTTGGCAAAATATCAACCGCTTCGTCAGGTACAAAAATAGAAATAAACCCAAATTCAAACAGCCTAAAATTTTATAATTCAAAAGGATATGATGTGGGTGGAATTTCATTCCTTGATAGTGGAGGCGGAGGTACTTCTGTTACTTACCCAAGATTAAAATTGGACAATATAGCAAGTGATGGCAACTTAACTGCGTCTACCACCCTTTTTGCAGGGTCATTGTCAATGATTTCAAATTTAAGTGGGTCAAGATACCAAGTGTCTCTTGGCATCAGCGGACTTTCTTTTTATAAAGATGGAAGATTAACTAAATCATACCCAAGCTCATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
29d82c5c8286031654f2ef9864d59c65f28b78655793970e0c7034d24b1b5eb5
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7838
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50