Genbank accession
QSJ04338.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MIYVKDFTGITEAVKIQNAINAAAVSTSPSKTVMLEEKDYYLESSITLLNDVELLFGYRSRLVIGGNFPVLLIGRNASVTNPFIAIDAPTFDSPVFYLDGKNKYYNTWNKTAIKDGVVLNWSGSHKGIGVRFYSGGTDHEISFVDVSNIKLVGLRKGIELEAKAPASGMAWVNANRFNNISIEDCVEMITLDSSETIPNECSGNIFSGLQLQPSTATTKVLKVSGQHNRFDGMLWDISLIPTAKFVDVTANSSYTKIDFNRSLPSTKVQDSGVSTILQ
Physico‐chemical
properties
protein length:278 AA
molecular weight: 30441,21480 Da
isoelectric point:5,67790
aromaticity:0,09353
hydropathy:-0,04101

Domains

Domains [InterPro]
IPR012334
STR
11–270
IPR011050
STR
15–205
QSJ04338.1
1 278
Architecture
STR
STR 11-278
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QSJ04338.1
1 278
Domain Start End Length (AA) Confidence
N-terminal 1 11 11 0,8503
Central domain 12 267 257 0,9940
C-terminal 268 278 10 0,2039
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-11
Central
12-267
C-terminal
268-278

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage BCPG3
[NCBI]
2812883 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Bacillus sp.
[NCBI]
1409 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QSJ04338.1 [NCBI]
Genbank nucleotide accession
MW584228.1 [NCBI]
CDS location
range 30770 -> 31606
strand -
CDS
ATGATATATGTGAAGGATTTTACTGGAATAACTGAAGCAGTTAAGATACAAAATGCAATTAATGCAGCAGCTGTATCAACTTCTCCATCCAAGACCGTAATGCTAGAAGAGAAGGATTATTACCTTGAGTCATCTATCACTCTGTTAAACGATGTAGAATTACTGTTTGGCTATCGTTCAAGACTAGTCATTGGTGGTAACTTCCCTGTTTTATTGATAGGGCGAAATGCTTCTGTTACAAATCCTTTCATAGCTATCGATGCTCCGACTTTCGATTCACCTGTCTTTTATCTTGATGGTAAGAATAAATATTACAATACATGGAATAAGACAGCGATAAAAGATGGTGTAGTTCTAAACTGGTCAGGCTCACATAAAGGTATAGGCGTTCGATTCTACTCCGGAGGAACTGACCATGAGATTTCATTTGTGGATGTGTCCAACATTAAACTTGTCGGTCTAAGAAAAGGTATTGAGTTAGAGGCTAAAGCCCCAGCATCAGGAATGGCATGGGTAAACGCTAATCGATTCAATAACATTTCCATAGAGGATTGCGTAGAAATGATTACACTGGATAGCAGCGAGACAATACCGAACGAATGTAGCGGTAACATCTTCTCAGGTCTTCAGTTACAACCATCTACTGCAACAACAAAAGTCTTGAAAGTAAGTGGGCAGCATAACCGTTTCGATGGTATGCTTTGGGACATCTCTTTAATACCTACAGCTAAGTTTGTAGATGTTACAGCGAACAGCTCCTACACGAAGATAGATTTCAATCGCTCATTGCCTAGTACGAAAGTACAGGATAGTGGAGTAAGCACAATTTTACAATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
9a2458cf4d7025ecebabfc35a1d10c1edd06b64fbe5dcf58392c83031d804126
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,9220
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50