Protein
View in Explore- Genbank accession
- WIT27903.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MTLFNNTLRYYVDFDQWGINAFNSNPIETTKGFNEALIYASENNFPIVEIPKGNFIIDSVNTLNQRNPEIGGGIKIPSNMELLLDPEAVFQVNPNRYQGYSCFYIGLAENVIIRGGRIIGDRYQHDYSLIDTDRKTHEWGFGIHVHGSKNVLIENVQISDCIGDNIWIAAHGMMNYPGMVYTPSKSVTVRKCELKRGRRNNLATNGCEGLLVEDCDIEEAGGDTIGPQLGIDLEGYGENGRKYDHPYELTISDCRFRKNGRGSVTAHTSGKVSIKDNYCDNVISYGYSTDVSIKGNKIINEGDSKEYGIDSVGVSSTETGNRIQITDNNIQGFKIGMMIRGKGVSIDNNTVKNASNCAIATHMAEDVSISNNRIQDSDCIQIQVRNSSDIKVSNNKGKGTTSAYAIKVMDSNDVKFLNNTFSNLYGGLYCERSQAVRIKLNDFLLSGKGYGIYWDKDSEVFLTRNEIFEPRNVAIMGAADMYNIRISDNQIYNCKAIIAIHLIGGSEHMVRGNEIMFNRDSDQGYGIYLNGTKKVRLIRNDVQGIGARVLSHPFATFNASSTTLIHNTYDSGTPRLALDDTVIDYK
- Physico‐chemical
properties -
protein length: 586 AA molecular weight: 65177,36960 Da isoelectric point: 5,80681 aromaticity: 0,08703 hydropathy: -0,40290
Domains
Domains [InterPro]
IPR012334
STR
28–377
STR
28–377
IPR011050
STR
30–376
STR
30–376
IPR051550
Unmapped
82–473
Unmapped
82–473
IPR006626
Unmapped
108–147
Unmapped
108–147
IPR039448
ENZ
139–298
ENZ
139–298
IPR011050
STR
338–567
STR
338–567
IPR006626
Unmapped
364–384
Unmapped
364–384
1
586
Architecture
STR 28-583 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
586
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 26 | 26 | 0,9598 |
| Central domain | 27 | 575 | 550 | 0,9963 |
| C-terminal | 576 | 586 | 10 | 0,3327 |
Note: Constraints were applied during segmentation.
C-terminal too short, adjusted boundary
C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-26
1-26
Central
27-575
27-575
C-terminal
576-586
576-586
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage SPbetaL6 [NCBI] |
3053440 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Bacillus subtilis BEST7003 [NCBI] |
1204342 | Bacillota > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis BEST7003 |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
WIT27903.1
[NCBI]
Genbank nucleotide accession
OQ921346.1
[NCBI]
CDS location
range 112652 -> 114412
strand +
strand +
CDS
ATGACATTGTTTAATAATACCTTACGATACTATGTTGACTTTGATCAGTGGGGAATAAATGCATTTAACTCGAATCCAATTGAAACAACTAAAGGATTTAATGAAGCTCTTATATATGCATCAGAGAACAACTTTCCTATTGTTGAAATTCCAAAAGGGAATTTTATTATTGACTCAGTAAATACATTAAATCAACGAAATCCTGAAATTGGTGGGGGAATTAAAATCCCATCAAATATGGAGCTCCTTTTGGATCCAGAAGCAGTGTTTCAAGTTAACCCCAATAGGTATCAGGGCTATTCTTGTTTTTATATTGGGCTTGCAGAGAACGTAATAATTAGAGGAGGTCGTATTATAGGTGACCGGTATCAACATGATTATTCTCTAATTGATACCGATAGAAAAACACATGAATGGGGATTTGGAATACATGTTCATGGAAGCAAAAATGTTTTGATTGAAAATGTACAAATCTCAGATTGTATTGGAGACAACATTTGGATTGCAGCTCACGGAATGATGAATTACCCAGGGATGGTTTATACGCCTTCCAAAAGTGTGACCGTAAGAAAATGCGAACTAAAAAGAGGGAGACGGAACAATTTAGCTACTAACGGTTGTGAAGGATTATTGGTTGAAGACTGTGATATAGAGGAAGCTGGAGGAGATACAATTGGCCCTCAACTAGGTATTGATTTAGAGGGTTACGGGGAAAACGGAAGAAAGTATGATCATCCTTATGAGTTAACGATATCGGATTGCAGGTTTAGAAAAAATGGTCGTGGTTCGGTTACTGCTCATACAAGCGGTAAAGTTTCCATCAAAGATAACTACTGTGACAATGTTATTTCATATGGCTACAGTACAGATGTGAGTATTAAGGGTAACAAGATAATAAATGAAGGGGATTCTAAAGAGTACGGAATAGACTCTGTAGGTGTTTCGAGCACTGAGACTGGCAACAGAATTCAAATAACTGATAACAATATTCAAGGGTTTAAAATAGGCATGATGATTAGAGGGAAAGGGGTATCGATTGATAATAATACCGTAAAGAACGCTTCAAATTGTGCAATAGCAACACATATGGCCGAAGATGTTTCCATTTCAAACAACAGAATACAGGACAGCGATTGTATTCAGATCCAGGTGAGAAATTCATCAGATATTAAAGTAAGCAATAATAAAGGGAAAGGTACAACTTCAGCATATGCAATTAAAGTGATGGATTCGAATGACGTTAAATTCTTAAATAATACGTTCTCCAATCTTTACGGAGGTCTATATTGTGAGAGATCTCAAGCAGTCAGAATTAAGCTAAACGATTTCTTATTGAGTGGAAAAGGTTACGGTATATATTGGGATAAAGATTCAGAAGTCTTCCTAACAAGGAATGAGATATTTGAACCAAGAAATGTTGCAATTATGGGGGCAGCTGATATGTACAATATCAGAATAAGTGATAACCAGATATATAATTGCAAAGCAATCATTGCGATCCACTTAATAGGCGGTTCTGAGCATATGGTTAGAGGAAATGAAATCATGTTTAACAGAGATTCAGATCAAGGCTATGGAATATATTTAAACGGGACAAAAAAGGTTCGATTAATAAGGAATGATGTTCAAGGTATTGGTGCAAGAGTGCTTTCGCATCCATTTGCAACGTTTAATGCTTCCAGTACAACTTTAATACATAATACATACGACAGCGGCACACCTAGATTAGCCTTAGATGATACAGTAATCGATTATAAATAA
Genome Context
Genome Context
Tertiary structure
PDB ID
6f2ac030c5ae9e16f1a93d169c28fbd11d4af9af566570ee7a66facc4e693a15
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50