Protein
View in Explore- Genbank accession
- XAG96615.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MTNGWMYFIDFDQFGIKDDGTDATSTTSGFVAAFKRAVELGHSAVYVPEGTYLIDAVGVGDYLPEYGGGLQFPSNIEVILHEKALFKVQPNDSTGYACFNLEGVENVTIRGGHIVGDRHEHNYRQDVNENRRTHEWGFGIQVRGSKNVTIEDVTIEDCTGDNIWVTSKGMMNWPGVYIPSESVTIRKCRTFRGRRNNIAAGASVGLLIDDCDIIEAGGDEIGPQLGIDLEGYADNSIKYQHPYEINVINCRFKDNGRGSMNINVSGKVNAIGNFCDDYIGYGFSTDVTISNNVITNETGVHKKFGIDSIRKSTSETANRAVITGNVIRGFQTGIAARGKTVTVSNNILEDISSIGIYPYLCDQAVVSSNIIDSDCLHIWVRESKDIKVSDNKGTGAANNVSIKVDASKDVLLSDNEVSGKGGVRVSRSTNVRIVDNDIDMIGPDYGIYFDKQSEVHLRDNLVKNAAFTAIRGYADQYSSYIKGNIIQDCKYMIAIHIDGGSKHMIKDNDITFRRGSNAGYGVYLIGANDSRLHNNDIRVMDGFGLINSFYTIQSTNTKLIGNTYDTGEMKTNDTDFLRYNEKLPK
- Physico‐chemical
properties -
protein length: 585 AA molecular weight: 64447,16860 Da isoelectric point: 5,33124 aromaticity: 0,08889 hydropathy: -0,35009
Domains
Domains [InterPro]
IPR012334
STR
12–373
STR
12–373
IPR011050
STR
29–418
STR
29–418
IPR051550
Unmapped
86–539
Unmapped
86–539
IPR006626
Unmapped
104–144
Unmapped
104–144
IPR039448
ENZ
135–273
ENZ
135–273
IPR006626
Unmapped
383–404
Unmapped
383–404
IPR012334
STR
394–581
STR
394–581
1
585
Architecture
STR 12-581 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
585
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 23 | 23 | 0,9383 |
| Central domain | 24 | 574 | 552 | 0,9958 |
| C-terminal | 575 | 585 | 10 | 0,4413 |
Note: Constraints were applied during segmentation.
C-terminal too short, adjusted boundary
C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-23
1-23
Central
24-574
24-574
C-terminal
575-585
575-585
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage KKP_4049 [NCBI] |
3109402 | No lineage information |
| Host |
Bacillus licheniformis [NCBI] |
1402 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
XAG96615.1
[NCBI]
Genbank nucleotide accession
PP579744.1
[NCBI]
CDS location
range 138525 -> 140282
strand +
strand +
CDS
TTGACTAATGGGTGGATGTACTTCATTGATTTTGATCAGTTCGGTATCAAGGATGACGGGACAGATGCTACATCAACAACTAGTGGGTTCGTAGCAGCGTTTAAAAGGGCCGTAGAACTTGGTCACTCTGCTGTATATGTACCAGAAGGTACTTACTTAATTGACGCTGTAGGCGTTGGTGACTACCTCCCTGAGTACGGAGGAGGTCTTCAGTTCCCGTCCAACATTGAAGTTATCCTACATGAGAAAGCCCTTTTTAAAGTCCAACCGAATGACTCTACCGGGTACGCTTGTTTTAACTTAGAAGGCGTAGAAAATGTAACAATCCGAGGCGGACATATTGTAGGTGATCGTCATGAACACAACTACAGACAAGACGTTAACGAGAACAGACGTACTCACGAATGGGGATTCGGTATCCAAGTTCGTGGAAGTAAAAATGTTACAATCGAAGATGTAACAATTGAAGACTGCACCGGAGATAACATCTGGGTAACATCTAAAGGTATGATGAACTGGCCGGGAGTGTACATTCCATCTGAAAGCGTAACAATCAGGAAATGTAGAACATTTAGAGGAAGACGAAATAACATTGCTGCCGGAGCAAGCGTAGGTCTACTTATCGATGACTGCGATATTATTGAAGCAGGCGGGGACGAAATTGGACCGCAGCTAGGTATTGACCTCGAAGGATACGCAGATAATAGCATCAAGTACCAGCATCCTTATGAGATAAATGTGATTAACTGCCGATTTAAAGACAATGGAAGAGGTTCTATGAACATCAACGTGTCTGGTAAAGTTAATGCTATCGGTAACTTCTGCGATGATTACATTGGTTACGGGTTTTCAACTGACGTAACTATTAGTAACAATGTTATCACAAACGAAACAGGGGTTCACAAAAAATTCGGGATAGACTCTATTCGTAAGTCTACTTCTGAAACAGCTAACAGAGCAGTTATTACTGGAAATGTAATTCGTGGATTTCAAACGGGTATCGCTGCTAGAGGTAAAACGGTTACTGTAAGCAACAACATCCTAGAAGATATTAGTTCTATTGGAATCTACCCTTATCTATGCGATCAAGCAGTAGTATCGAGTAATATCATAGACAGCGACTGTCTACACATTTGGGTTAGAGAATCGAAGGATATAAAAGTTAGTGACAATAAAGGTACTGGCGCAGCTAACAATGTATCTATAAAAGTAGACGCCTCTAAAGACGTATTACTAAGCGATAATGAAGTTTCTGGAAAAGGTGGAGTTCGAGTAAGTCGCTCAACAAATGTGAGAATTGTGGATAATGACATTGATATGATCGGCCCAGACTACGGCATCTACTTTGATAAACAGTCTGAAGTACATCTTAGAGATAACCTTGTTAAAAACGCCGCTTTCACCGCAATCAGGGGTTATGCAGATCAGTACAGCAGCTACATAAAAGGAAACATCATTCAAGATTGTAAGTACATGATCGCTATTCATATTGACGGTGGGTCAAAACATATGATTAAAGACAATGATATTACATTCCGCAGAGGGTCTAATGCAGGCTACGGTGTCTATTTAATCGGAGCAAATGACTCTCGTTTACATAATAATGACATTAGAGTAATGGATGGGTTTGGCCTTATCAACTCTTTCTACACGATCCAGTCTACGAACACTAAGTTGATAGGGAATACATATGATACAGGTGAAATGAAAACAAACGATACTGACTTCTTACGATACAACGAGAAGCTACCTAAGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
83668d9fea161c21e4a8003789418f1212bda0d5f60bda37f412a2cde785e3f9
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50