GenBank accession: AGK86945.1
Protein name: tail spike protein
RBP type
Type  Evidence    Probability
TF    GenBank     1.00
TF    Phold       1.00
TSP   DepoScope   1.00
TSP   RBPdetect   0.91
TSP   RBPdetect2  0.95
Protein sequence
MSEVYVKSFGAVGDGVTDDTEALQAAIDATPVNGTLLLEAGATYKTSKDLVGVKNIHFLGKGAAIVATHHKGRGLVFEGSLKATTTTSAAYTANTTYVTVGSTSGMAVGDQIRIYHTGDLYDTSRAYYYKGGNFLITKISGNNVYISRRIPYDMKSGAKVEVYKPITVTVDDLTIKHTGTLGSSVYGSYGLNIRFSKYSEVSNVTVDNFNHNIKMDMTLNCLLYRVKTGKAYWSGSSESYGVSNYSGNGLMIMHSTLNSGRHGYTTTGQETSYDTALNRCTIGQDDAVDLAGLDCHGNNYSLRALECTIKRFHLAGNCLLERCTVNESAKGNSSSFMVAETRPRSNFFLKDCYLYSPVYKIDAWGQQPTTSRKYIGSIVFENVKGGDAYTQCTFKSRDTGGSVQAIIDKLIVRGCENFTLVTNEQINNMYFEEVNTKRDAKILEQVGDAKADKIEFKKCTLPARWRTFYLTNFKNLKLIDCKWNVINSSAASMWVTSSAASVDLIRTDFTFGGGVETGGLDKFTTTQTSSIKFKQPSSIKTKRRVTYTNI
Physico-chemical properties
protein length: 550 AA
molecular weight: 60431.49320 Da
isoelectric point: 8.95932
aromaticity: 0.10182
hydropathy: -0.29255
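
Aromaticity and hydropathy are commonly defined as the fraction of F, W and Y residues and the mean Kyte-Doolittle hydropathy (GRAVY), respectively. The page does not state which tool or definition produced the values above, so the following is only a minimal sketch under that assumption; the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence above, used only as a demo;
    # pass the full 550-residue sequence to compare with the listed values.
    example = "MSEVYVKSFG"
    print(len(example), round(aromaticity(example), 5), round(gravy(example), 5))
```

Both functions expect the 20 standard one-letter amino acid codes; any other character raises a `KeyError` in `gravy`.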

Domains

Domains [InterPro] (AGK86945.1, residues 1–550)
ID         Type  Residues
DC_2190    STR   1–440
IPR012334  STR   2–510
IPR011050  STR   3–430
IPR024535  ENZ   6–216
Architecture
STR 1-510 | RBD 511-550

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (AGK86945.1, residues 1–550)
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      15   15           0.8452
Central domain  16     524  510          0.9909
C-terminal      525    550  25           0.7024
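
A segmentation like this one can be sanity-checked by confirming that the segments tile the protein without gaps or overlaps. The sketch below assumes 1-based, inclusive coordinates, which the page does not state; lengths computed this way need not match the Length column if the page uses a different boundary convention. The names `inclusive_length` and `tiles_protein` are hypothetical helpers.

```python
from typing import List, Tuple

# (name, start, end); coordinates assumed to be 1-based and inclusive.
Segment = Tuple[str, int, int]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in start..end when both ends are included."""
    return end - start + 1


def tiles_protein(segments: List[Segment], protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


# Coordinates copied from the Domain Layout table.
segments = [
    ("N-terminal", 1, 15),
    ("Central domain", 16, 524),
    ("C-terminal", 525, 550),
]

if __name__ == "__main__":
    for name, start, end in segments:
        print(name, inclusive_length(start, end))
    print("tiles 1..550:", tiles_protein(segments, 550))
```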
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-15
Central: 16-524
C-terminal: 525-550

Taxonomy

Phage
Name: Bacillus phage SIOphi
Taxonomy ID: 1285382
Lineage: Uroviricota > Caudoviricetes > Herelleviridae > Siophivirus > Siophivirus SIOphi

Host
Name: Bacillus subtilis
Taxonomy ID: 1423
Lineage: cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession: AGK86945.1
GenBank nucleotide accession: KC699836.1
CDS location: range 77984 -> 79636, strand -
CDS
ATGAGCGAAGTATATGTAAAATCTTTTGGAGCAGTAGGGGACGGAGTTACAGACGATACAGAAGCATTACAAGCAGCAATTGACGCTACACCTGTAAATGGGACTCTTTTACTTGAAGCCGGGGCAACTTATAAAACAAGCAAAGACCTAGTCGGAGTTAAAAACATTCATTTTCTAGGTAAAGGCGCAGCAATTGTCGCTACTCATCACAAAGGAAGAGGTTTAGTCTTTGAAGGCTCCTTAAAGGCTACAACAACCACTTCCGCAGCGTATACAGCCAACACTACATACGTAACGGTTGGGAGTACATCAGGAATGGCTGTAGGAGATCAGATCAGAATTTACCACACTGGAGACTTATATGATACATCTAGAGCTTACTATTATAAAGGTGGTAACTTCCTTATTACAAAAATCTCAGGAAACAATGTCTACATCAGCAGACGTATTCCTTATGACATGAAGTCTGGAGCAAAAGTCGAAGTATATAAACCGATTACAGTTACAGTAGATGACCTTACAATCAAGCATACGGGTACTCTCGGAAGCTCTGTATACGGGTCATACGGTTTAAATATCCGTTTCTCTAAATACTCTGAAGTAAGTAATGTTACAGTAGACAACTTTAACCACAACATTAAAATGGATATGACATTGAACTGTCTGTTGTATCGAGTTAAAACAGGTAAAGCATACTGGTCAGGTTCTTCTGAAAGCTATGGTGTGTCTAACTACTCCGGTAACGGTTTGATGATCATGCACTCTACCTTGAATAGTGGTAGACACGGATACACAACAACTGGACAAGAGACTTCATACGATACAGCCCTAAACAGATGTACGATCGGACAGGATGACGCTGTAGATTTAGCCGGACTAGACTGTCATGGTAACAACTACTCTTTACGGGCGCTAGAGTGTACAATTAAACGATTCCACTTGGCAGGAAATTGTTTGTTAGAGAGATGTACAGTTAACGAGAGTGCTAAGGGTAATAGCAGCAGCTTTATGGTAGCAGAAACACGACCTAGATCAAACTTCTTTTTGAAGGATTGCTATTTGTACAGCCCCGTATATAAAATTGATGCTTGGGGACAACAACCGACAACATCACGTAAATACATCGGTAGTATCGTATTTGAAAATGTGAAAGGTGGTGATGCTTATACACAATGTACGTTTAAGTCTCGTGACACAGGTGGCTCTGTTCAGGCAATTATCGACAAACTAATTGTGCGTGGCTGTGAGAACTTTACACTTGTTACAAATGAACAGATTAACAATATGTATTTTGAAGAAGTGAATACAAAACGAGATGCTAAAATTCTTGAACAAGTTGGAGATGCTAAAGCAGATAAAATTGAGTTTAAAAAATGCACACTGCCTGCACGTTGGAGAACCTTCTATCTAACTAATTTCAAGAACTTAAAACTAATCGACTGTAAATGGAATGTAATTAATTCTTCCGCAGCATCTATGTGGGTTACAAGTTCTGCTGCTTCAGTAGACTTAATCCGTACAGACTTTACATTTGGCGGAGGAGTAGAGACTGGTGGTCTTGATAAGTTTACAACAACTCAGACATCATCTATCAAATTTAAGCAGCCATCCTCTATTAAAACGAAACGAAGAGTAACGTATACTAATATTTAA
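
The CDS coordinates and the protein length can be cross-checked: 77984 to 79636 spans 1653 nucleotides, which is 551 codons, or 550 amino acids plus a stop codon. The sketch below translates a CDS with the standard codon table (bacterial translation table 11 assigns the same amino acids to each codon). It assumes the sequence is already in coding orientation, as the one shown above is (it starts with ATG and ends with TAA); `reverse_complement` is included for the case where the minus-strand interval is cut directly from the genome. All function names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1


def reverse_complement(dna: str) -> str:
    """Reverse complement, for features annotated on the minus strand."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(77984, 79636)
    print(n, n // 3 - 1)  # nucleotides, amino acids excluding the stop codon
    # First four codons of the CDS above plus a stop codon, as a short demo.
    print(translate("ATGAGCGAAGTATAA"))
```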

Tertiary structure

PDB ID: 9cf66d5902008368e7a8cafcafd24af138d4e60f71ba78b04af3b2d95b00a855
Source: ESMFold
Method: ESMFold
Resolution: 0.7788
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
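
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the legend does not say how values exactly equal to 90, 70 or 50 are handled, so assigning them to the lower band is an assumption, as is the 0-100 scale and the function name `plddt_class`.

```python
def plddt_class(plddt: float) -> str:
    """Map a pLDDT score (assumed 0-100 scale) to the legend's confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # boundary value 90 falls here by assumption
        return "High"
    if plddt > 50:  # boundary value 70 falls here by assumption
        return "Low"
    return "Very low"  # includes exactly 50 by assumption


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_class(score))
```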