Genbank accession
AOZ62238.1 [GenBank]
Protein name
tail spike protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MGLLERNDEMSNVRKVGLLPTSPYRRYLPSAFDESMNIYEQLIACIEHVNNLGISFNELVDWLDKVVLQQNERLDEQDKKIDMLRDEWHIFEDYVINTLLKKKVVEVLKEWLEDGTLAEIINNDVMNMKVDKAGTVYVKDFKRLETETDDTGRIKRAIEELKKDENSTLVFEPIKYVVSEGFDVPSNKVIRGYKGKTVIDGSNIETATTLYQKGLFHVKGTLAEPIGLAQSVAQGDSKIVAPACDALLIGDLLIITSDESYAEGAPPSSRRGEIVTVKSFDGSNIELQGEVYFSYDQTKNARVQCMRGMKDVTIKDLDIIMGGKGKGHNGVIVENAQRIIIKNVFIDGAEDCGVVMTTCYNSHVYKSDIINNTSPGGTIGTSGYGVAFLSSKECSARDNFFRNSRHAIAGGGFIPAFACDITGNKAVDCPQYAYDCHEPCFFWNFSNNSATSCVGGFTIRGQFTRVVGNTITSSFANGILVESYTPVDNQKGNLIADNTIQYSKLNGIFCDGTNAIQTDLTITNNKIFDVKFAGINVFNTMNTIIEGNSVKNDYENGIRVLGRATGYRSNKLVIKGNTVYKSRFSNIRVAGVDNVIISGNNVETNTKDGIELFGAKEFIVDGNLVKDCEFYGIRSEESTNGSYTNNYLKTVRGENSDGLRIIKGGKIVMNGNTVVDPQRFGIYTTDTLYTVITSNNVYDCPSDGVKIDGPIKTHINQNNLTKAIDA
Physico‐chemical
properties
protein length:726 AA
molecular weight: 80185,57280 Da
isoelectric point:5,27559
aromaticity:0,08402
hydropathy:-0,27934

Domains

Domains [InterPro]
Coil
Unmapped
67–87
IPR011050
STR
149–557
IPR011050
STR
521–720
AOZ62238.1
1 726
Architecture
ATT
STR
ATT 3-211 | STR 212-725 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AOZ62238.1
1 726
Domain Start End Length (AA) Confidence
N-terminal 1 150 150 0,9926
Central domain 151 715 566 0,9952
C-terminal 716 726 10 0,1407
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-150
Central
151-715
C-terminal
716-726

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage QCM11
[NCBI]
1909400 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AOZ62238.1 [NCBI]
Genbank nucleotide accession
KX961631.1 [NCBI]
CDS location
range 17926 -> 20106
strand +
CDS
GTGGGTTTACTAGAAAGGAATGATGAGATGAGTAATGTTAGAAAAGTTGGTTTATTACCAACAAGTCCTTACAGACGATATTTACCTAGTGCTTTTGATGAATCAATGAACATTTATGAGCAACTTATTGCATGTATTGAACATGTAAATAATCTTGGTATATCTTTCAATGAATTAGTTGATTGGTTAGATAAAGTTGTATTGCAGCAGAATGAAAGACTAGATGAACAAGATAAAAAGATTGATATGTTACGTGATGAGTGGCATATTTTTGAAGATTATGTAATTAACACTCTTCTAAAGAAAAAAGTTGTTGAAGTTCTTAAAGAATGGCTAGAAGATGGAACGCTTGCTGAAATAATCAACAATGATGTAATGAATATGAAAGTAGATAAAGCTGGAACAGTTTATGTGAAAGATTTTAAGAGATTAGAAACTGAAACAGATGATACTGGTAGAATTAAAAGAGCAATTGAAGAATTAAAGAAAGATGAAAATTCGACTCTTGTATTTGAACCAATTAAATATGTTGTGTCTGAGGGTTTTGATGTTCCATCTAATAAAGTTATTAGAGGCTATAAAGGTAAAACTGTTATTGATGGCTCAAATATTGAAACGGCTACTACATTATATCAAAAAGGATTATTCCATGTAAAGGGAACTCTTGCTGAACCTATTGGTTTAGCTCAAAGTGTTGCTCAAGGAGATAGTAAAATTGTAGCTCCTGCTTGTGATGCTTTGTTGATTGGTGATCTACTAATTATAACAAGTGATGAATCTTATGCTGAAGGTGCACCTCCTAGTTCTCGTAGAGGTGAAATTGTAACAGTGAAGTCATTTGATGGTTCAAATATTGAATTACAAGGTGAAGTTTACTTTAGTTATGATCAAACTAAAAACGCTAGAGTTCAATGTATGCGAGGAATGAAAGATGTAACAATTAAAGATTTAGATATCATTATGGGTGGTAAAGGAAAAGGTCATAATGGTGTTATTGTTGAGAATGCTCAACGAATTATTATTAAAAATGTGTTTATTGATGGTGCTGAAGATTGTGGAGTTGTTATGACAACTTGTTATAACTCACATGTTTATAAATCAGATATTATCAACAATACTTCTCCGGGTGGAACGATTGGAACAAGTGGGTATGGTGTTGCTTTCTTATCGTCAAAAGAATGTTCAGCAAGAGATAACTTCTTTAGAAATTCACGTCACGCTATTGCGGGCGGTGGTTTCATTCCAGCATTTGCTTGTGATATTACAGGAAATAAAGCTGTTGATTGTCCACAATATGCTTATGATTGTCATGAACCTTGTTTCTTCTGGAATTTCTCTAATAACTCAGCTACATCTTGTGTAGGTGGTTTTACGATTCGTGGTCAGTTTACTAGAGTTGTAGGAAATACAATTACAAGTTCATTTGCAAATGGGATCTTAGTTGAAAGTTATACACCTGTTGATAATCAAAAAGGAAACTTAATTGCTGACAACACAATTCAATATTCTAAATTAAATGGTATTTTCTGCGACGGAACAAATGCGATTCAAACAGATTTAACTATTACTAATAATAAAATATTTGATGTTAAATTTGCTGGTATTAACGTTTTCAACACAATGAATACAATTATTGAAGGTAATAGTGTTAAAAATGATTATGAAAATGGTATCCGTGTACTAGGTAGAGCGACTGGTTATAGAAGTAATAAATTAGTTATTAAAGGTAACACTGTTTATAAGAGTCGTTTCTCTAATATTCGTGTTGCTGGTGTAGATAATGTAATTATTAGTGGAAACAATGTTGAAACAAATACAAAAGATGGTATTGAATTATTCGGTGCTAAAGAATTTATTGTTGATGGTAACTTAGTTAAAGATTGTGAATTCTATGGAATAAGATCTGAAGAGTCAACAAATGGTTCTTACACAAATAACTATCTTAAAACTGTTCGTGGTGAAAACTCAGATGGTTTACGTATTATCAAGGGTGGAAAGATCGTTATGAATGGTAACACTGTTGTTGATCCTCAAAGATTTGGTATTTATACAACTGACACTCTTTACACTGTTATTACTTCCAACAATGTGTATGATTGTCCTTCTGATGGTGTGAAAATTGATGGACCGATTAAGACTCATATCAATCAAAATAACTTAACAAAGGCTATTGACGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
9a8e9fac2e23d72cc449ad92b8d758565d7213a492be892ab196ef0db4e306f9
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8325
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Predicting Genome Terminus Sequences of Bacillus cereus-group Bacteriophage using Next Generation Sequencing data Chung,C.-H., Walter,M.H., Yang,L., Chen,S.-C., Winston,V. and Thomas,M.A. GenBank