UniProt accession
A0A8S5QC64 [UniProt]
Protein name
Tail spike protein
RBP type
TSP
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
TSP
Evidence RBPdetect2
Probability 0,94
Protein sequence
MAFSKKTFTDGQTVIHADTLNAIQDELIRVAGLLGKDIQSAAINDRGHLILTLTDGTTLDAGVAKGAQGEKGATGPAGPQGPKGAPGTDASVTAANVAAAMGLSGLAADDQIMVSAVDADGKPTGWRKKYRDMLNVRDFGAKGDSTTDDTAAIQAALDAASTRGISAVLFPTGTYKVSATTADNNFFAALTVHSGQRLLFDAATLQLTANGYDFYAVLNIHNVNNVTVEGGLTIIGDRESHTATTGESGHGIRIVNSHNVHVSDVDIRYTWGDGVCVGGNGTMEEISQNVTLERIRTYKCSRNGLSIIEADGVVVRDCDFTYTDRTAPQYGIDVEPNLGTATNITIENVRMLNNGIGGFALYTTKATLPGVLTLRNIETDAKTIIYTSSAAGGTFDVRVDGWRHTQKSGETNPTLRLSGKGSLRIRQLYVVNKSAKRVIIPLDIENLRMDGVTVEDDPAVSIKGTLSVQSAVNTSIGKAVITGFLSRNPAEETWYSGNHLTVDNLQDTVVNLNEHLTTGGSSESYKLLLCDKALVLGTALSAKARVFIPYTYGNVNPFRVVNTTATEVQLYTTVSAGITFVGDVPGGSSANSAALTGNASYEVTPMLASGLVYVRKLNTRVPVKISEITNDSGYQTAAQVESTVTGKGYQTAAQVGAAITAAVGAAMEASY
Physico‐chemical
properties
protein length:671 AA
molecular weight: 70482,10240 Da
isoelectric point:5,55393
aromaticity:0,05961
hydropathy:-0,08376

Domains

Domains [InterPro]
G3DSA:1.20.5.320
STR
64–111
IPR011050
STR
132–409
IPR024535
ENZ
134–352
A0A8S5QC64
1 671
Architecture
STR
STR
STR 64-111 | STR 126-521 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A8S5QC64
1 671
Domain Start End Length (AA) Confidence
N-terminal 1 142 142 0,9894
Central domain 143 516 375 0,9884
C-terminal 517 671 154 0,7936
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-142
Central
143-516
C-terminal
517-671

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ctvdw32
[NCBI]
2825723 Uroviricota > Caudoviricetes >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
DAE16349.1 [NCBI]
Genbank nucleotide accession
BK015621 [NCBI]
CDS location
range 7328 -> 9343
strand +
CDS
ATGGCCTTTAGCAAAAAGACCTTCACGGACGGTCAGACCGTCATCCACGCTGATACCCTCAACGCCATCCAGGATGAGCTGATCCGGGTGGCCGGACTGCTGGGCAAGGACATCCAGTCCGCTGCCATTAACGACCGCGGCCATCTGATTTTGACGCTGACGGACGGCACCACGCTGGACGCTGGCGTTGCCAAGGGCGCACAGGGAGAAAAAGGCGCGACCGGCCCGGCTGGCCCGCAAGGCCCGAAGGGAGCCCCTGGGACGGATGCAAGCGTGACGGCGGCCAACGTAGCCGCCGCCATGGGCCTGTCCGGTCTGGCGGCGGATGACCAGATCATGGTGTCCGCCGTGGATGCAGACGGCAAGCCCACCGGTTGGCGGAAAAAGTACCGGGACATGCTCAACGTCCGGGATTTCGGAGCTAAGGGCGACAGCACCACCGACGACACGGCGGCCATCCAGGCGGCGCTGGATGCGGCCAGCACACGGGGTATCTCCGCCGTGTTGTTCCCCACCGGGACCTACAAGGTCAGCGCGACCACGGCGGACAACAATTTCTTTGCCGCCCTGACGGTGCACAGCGGGCAGCGGCTGCTTTTCGACGCCGCTACCCTCCAGCTCACCGCCAATGGCTACGATTTCTACGCCGTGCTGAACATCCACAACGTGAACAACGTCACCGTTGAGGGCGGACTGACCATCATCGGCGACCGGGAGTCCCACACGGCCACCACCGGTGAGAGCGGTCACGGCATCCGCATCGTCAACAGCCACAACGTCCATGTCAGCGATGTGGATATCCGGTACACCTGGGGCGACGGCGTGTGCGTAGGCGGCAACGGGACCATGGAGGAGATCTCTCAGAACGTGACGCTGGAGCGGATCCGCACCTACAAATGCAGCCGCAACGGCCTGTCCATTATCGAGGCGGACGGCGTGGTGGTGCGGGACTGTGACTTCACCTATACCGACCGCACGGCGCCCCAGTACGGCATCGACGTGGAGCCCAATCTGGGCACGGCCACCAACATCACCATCGAAAACGTGCGGATGCTGAACAACGGCATCGGCGGCTTCGCACTGTACACCACCAAGGCTACCCTGCCCGGCGTGCTGACCCTGCGGAACATCGAGACGGATGCCAAGACCATTATCTACACCAGCAGCGCCGCAGGCGGCACCTTCGACGTCCGCGTGGACGGCTGGCGGCATACCCAGAAGTCCGGGGAGACCAATCCCACGCTGCGGCTGTCCGGCAAGGGGAGCCTGCGGATCAGGCAGCTGTACGTCGTCAACAAGAGCGCCAAACGGGTCATTATCCCGCTGGACATTGAAAACCTGCGCATGGACGGCGTGACGGTGGAGGACGACCCGGCGGTGAGCATAAAGGGAACGCTGTCCGTTCAGAGCGCCGTCAACACCTCCATTGGCAAGGCCGTTATCACCGGCTTTCTCAGCCGGAACCCGGCGGAGGAAACCTGGTATTCCGGCAATCATCTGACGGTGGACAATTTGCAGGACACGGTGGTAAACCTCAACGAACACCTGACCACCGGCGGCAGCAGCGAGAGTTACAAGCTGCTGCTGTGCGACAAGGCGCTGGTGCTGGGTACGGCCCTCAGCGCCAAGGCCCGGGTGTTCATCCCGTATACCTACGGCAACGTGAATCCCTTCCGGGTGGTGAACACCACGGCCACGGAGGTACAGCTGTACACCACGGTGTCGGCAGGCATTACGTTTGTGGGCGACGTCCCCGGCGGCAGCTCCGCCAACTCCGCCGCACTGACCGGAAACGCCAGCTATGAGGTCACGCCCATGCTGGCCAGCGGTCTGGTGTATGTGCGGAAGCTGAACACCCGTGTGCCTGTCAAGATCTCCGAGATCACCAACGACAGCGGGTACCAGACGGCGGCACAGGTAGAATCCACCGTGACAGGCAAGGGCTACCAGACGGCGGCACAGGTCGGCGCCGCTATCACGGCGGCCGTCGGCGCGGCAATGGAGGCGAGCTACTGA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0044423 virion component Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0051701 biological process involved in interaction with host Biological Process IEA:UniProtKB-ARBA (UniProt)
GO:0019058 viral life cycle Biological Process IEA:UniProtKB-ARBA (UniProt)

Tertiary structure

PDB ID
e5ae2cf3e2235441255c8f4cdf6d88864b333cdccf3fd66a9fd8369032258b22
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7051
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50