GenBank accession
QBX25250.1 [GenBank]
Protein name
endopeptidase
RBP type
TSP
Evidence RBPdetect
Probability 0.69
TF
Evidence RBPdetect2
Probability 0.94
Protein sequence
MKPILFNKNEQQFDTYGLGEIDVTTGNVTRERNGLYTFYAEYPANGPLASVLEKEMKIKADAGLRTKNQTFEISRIVKDSSGVLKIYGSHIKHKLEYMAVRHGINLSGTASVALAIWANNLIGDYRFSTWSDIDTTGSTTFTADKMTNAHLALGGVEGSILDVWGGEYEFDNLTVRLHKQLGRRAPTVLEYGRNIISAESDESIEESYTSVYPFATYTPESQGSDSTPAPVTVTIPGDYVDSKYISMYANRRIKVVDFSSEFKEKEIPTPDKLRTMAVKFMEHNKIGAPKINTKIEYVDLASTLDYQDNKIIEELEFCDIVPVYYPSIGITEDDAKVTKIVYDFVNERNESVEFGIIGESIRSAMTGGLSGRMDSLENRQKAIESGLPDYLLNASGNKVWYQKPAEGTEHKVGDLWFEKNGQYDRMYVWNGEMWEKRIDTEDVDRVKKDIDEKLKQSTESIQQVENKASEALTKAGAIIDSQELLDKINAHLYSDTNNDDNGILGRKFRQQREANRSTRNIATSTRDKLTDYQRTNDENLVRIGQQLDNTVSKAEMKQTADGIRETIAKITVGARNLMVGTKDFSGDWFNKAKWTLEDEKYLGLSVYSRQEEWLGLSEIVEVRVGETYTFSAYVKSSIENDLVFMYLDNRLVEPRASLSLVRKDIQVSTNWTRVSATFSVTKAGLMTPRFERNNKNAKLYVAGYKLELGNVPSDWSQAIEDTNQIIDDKITTFDRTIDGIRATVAEAKSYIDADGQRRQELNQLIRDETAKGINTVLSTVEQSGYAKRTEIQSITETQRLYDRIIGTTEDGIKQNIARMTLTDSLFQTEVSKAVNKELTPSNYVANPFTMSDYVRKYNGNGSSATVTLSSTGINVFGKLELHAHSKLFYSDAIWLPLTRIPESVKDLSFSIIIDGIDKCKISVYLGTERSSTYINYQRQDNTIYGTLTGINSDYRGSDGVYLKLSFEGLNGQSVFLQKPIVVEGREPKFDFEINKRTELDQAVRSVQTQLAGSWSVKNINSAGDLISGINLGADGRNRITGKLTHITNETLMDRASIKSAAIESITADQITTGTLNASRINVINLNARSITSGTFRGLEYEGGIIRGNNGNTIINLNTNITTYNGTARIEFKSAHNNLVYSSSGTHAFLAPTQRQGTSYAAWAFGVGTSSSLDPNAGFVGLKIFNDPGSRKVILVGDVQIVKDTFTRDAPATALSDVLSQIQYNFVQIKNWFSRNDLGHPGLYDIEL
Physico-chemical properties
protein length: 1247 AA
molecular weight: 139362.38670 Da
isoelectric point: 5.50050
aromaticity: 0.08741
hydropathy: -0.45662
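The record does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), the length, aromaticity and hydropathy can be recomputed from the protein sequence. The function names are illustrative, not part of the database. Molecular weight and isoelectric point need residue mass and pKa tables and are left out.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def describe(seq: str) -> dict:
    """Return length, aromaticity and hydropathy for a protein sequence."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return {
        "length": len(seq),
        "aromaticity": round(aromaticity(seq), 5),
        "hydropathy": round(gravy(seq), 5),
    }


if __name__ == "__main__":
    # First 20 residues of QBX25250.1, used only as a short demo input.
    print(describe("MKPILFNKNEQQFDTYGLGE"))
```

Passing the full sequence from the "Protein sequence" field instead of the 20-residue demo should reproduce the listed values if the assumed definitions match those used by the database.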

Domains

Domains [InterPro]
DC_1353 (STR): residues 1–705 of QBX25250.1 (1–1247)
Architecture
STR 1-707 | ATT 921-969 | STR 984-1212
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain | Start | End | Length (AA) | Confidence
N-terminal | 1 | 585 | 585 | 0.9255
Central domain | 586 | 1100 | 516 | 0.3966
C-terminal | 1101 | 1247 | 146 | 0.4249
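The segmentation can be sanity-checked against the protein length. The sketch below assumes 1-based, inclusive coordinates (an assumption; the page does not state its convention) and uses hypothetical helper names. Under that convention the three spans are 585, 515 and 147 residues. The Length column lists 516 and 146 for the last two, so the boundary between the central and C-terminal domains may follow a different convention.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    name: str
    start: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)


def span(seg: Segment) -> int:
    """Number of residues covered by an inclusive segment."""
    return seg.end - seg.start + 1


def tiles_protein(segments: list[Segment], protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.start != expected_start or seg.end < seg.start:
            return False
        expected_start = seg.end + 1
    return expected_start == protein_length + 1


# Start/End values as listed in the Domain Layout table.
SEGMENTS = [
    Segment("N-terminal", 1, 585),
    Segment("Central domain", 586, 1100),
    Segment("C-terminal", 1101, 1247),
]

if __name__ == "__main__":
    for seg in SEGMENTS:
        print(seg.name, span(seg))
    print("tiles 1..1247:", tiles_protein(SEGMENTS, 1247))
```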
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-585 | Central 586-1100 | C-terminal 1101-1247

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Streptococcus phage Javan246 [NCBI] | 2548068 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: Streptococcus gordonii [NCBI] | 1302 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

GenBank protein accession
QBX25250.1 [NCBI]
GenBank nucleotide accession
MK448884 [NCBI]
CDS location
range 30858 -> 34601
strand +
CDS
TTGAAGCCTATTTTATTTAATAAGAATGAGCAACAATTCGACACTTACGGGTTGGGAGAAATTGATGTAACAACAGGAAATGTCACCCGCGAGAGAAACGGTCTCTACACGTTTTATGCGGAATATCCAGCTAATGGCCCTCTAGCCTCTGTCTTAGAAAAAGAAATGAAAATTAAGGCAGACGCTGGACTTCGGACGAAGAATCAGACTTTTGAAATATCCAGAATTGTCAAAGACAGTAGCGGAGTTTTGAAAATCTATGGTAGTCACATCAAGCACAAGCTAGAGTACATGGCAGTGCGTCACGGGATCAACCTAAGCGGTACAGCTTCCGTGGCTCTTGCCATCTGGGCTAATAATCTGATAGGTGACTATCGCTTTTCTACTTGGTCAGACATTGACACAACAGGTAGCACAACATTTACCGCAGACAAGATGACGAACGCACATCTTGCTCTCGGTGGTGTTGAGGGCTCTATTTTGGATGTCTGGGGCGGTGAGTACGAGTTTGACAACCTGACTGTCAGATTGCACAAACAGCTTGGTAGAAGAGCTCCTACGGTCTTAGAATACGGTAGGAATATCATATCAGCTGAGAGTGATGAATCTATTGAAGAATCGTATACCTCAGTCTATCCGTTTGCTACCTATACACCAGAGAGCCAAGGAAGCGATAGCACACCAGCTCCCGTCACGGTAACGATACCAGGCGATTATGTAGACAGTAAGTACATCAGTATGTACGCCAATCGACGTATAAAAGTAGTGGATTTCTCCAGCGAGTTTAAGGAGAAGGAAATTCCAACCCCTGACAAGCTGAGAACTATGGCAGTGAAATTTATGGAGCACAACAAGATTGGCGCTCCTAAAATCAATACTAAAATTGAGTATGTGGACTTAGCAAGCACACTTGATTATCAAGACAATAAGATTATCGAAGAGCTGGAGTTCTGCGACATCGTGCCAGTCTACTATCCATCTATCGGAATCACAGAGGACGATGCGAAAGTCACTAAAATTGTTTACGATTTTGTCAATGAACGCAACGAATCTGTAGAGTTTGGGATCATCGGAGAATCTATCCGCTCTGCTATGACTGGTGGATTATCAGGGCGTATGGACTCGCTAGAGAACAGGCAGAAAGCCATTGAAAGCGGGTTGCCTGATTATCTCTTAAATGCGTCTGGTAACAAGGTCTGGTACCAGAAACCAGCTGAAGGAACAGAGCACAAAGTTGGTGACTTGTGGTTTGAAAAAAACGGACAATATGACCGCATGTACGTCTGGAATGGTGAGATGTGGGAGAAGCGCATTGATACCGAAGATGTGGATCGGGTCAAGAAAGACATTGACGAGAAGCTGAAACAATCCACAGAATCTATCCAGCAAGTCGAGAACAAAGCCTCCGAAGCCTTGACGAAGGCTGGGGCAATCATTGATAGCCAAGAGTTGCTGGATAAGATTAATGCCCATCTATATTCAGACACTAATAATGACGATAACGGAATTTTAGGTAGAAAATTTCGACAACAACGAGAGGCCAATAGGTCAACTCGAAATATAGCGACATCAACTAGAGATAAACTCACAGACTACCAACGCACGAACGACGAGAACCTAGTCCGCATTGGTCAGCAACTGGACAACACAGTCAGCAAGGCCGAGATGAAGCAGACGGCTGACGGTATCAGAGAGACGATAGCTAAAATCACCGTAGGAGCTCGAAACCTAATGGTCGGAACAAAAGATTTTTCTGGCGATTGGTTCAACAAGGCGAAATGGACGCTAGAAGACGAGAAATATTTAGGCTTGTCAGTATATAGCCGTCAGGAAGAATGGCTGGGCCTCTCTGAAATCGTTGAAGTACGAGTCGGTGAAACCTACACATTCAGTGCCTACGTTAAAAGTAGCATTGAAAATGACCTTGTCTTTATGTACTTAGATAATCGATTAGTAGAGCCTAGAGCTTCGCTGTCTTTGGTAAGAAAAGACATACAGGT
CAGCACGAACTGGACGAGAGTGTCTGCAACGTTTTCGGTGACAAAAGCAGGGTTGATGACTCCTCGATTTGAGCGCAACAACAAGAACGCAAAACTGTATGTTGCAGGCTACAAGCTAGAACTGGGAAACGTACCGTCCGATTGGTCGCAAGCAATCGAAGATACCAATCAGATAATTGATGATAAAATCACGACTTTTGACCGTACGATAGACGGCATTAGAGCGACTGTCGCAGAAGCTAAGAGCTACATCGACGCAGACGGACAGAGAAGACAAGAGCTAAACCAGTTAATCAGAGACGAGACGGCTAAGGGCATTAATACAGTCTTATCTACTGTAGAGCAGTCAGGATATGCCAAGCGGACAGAGATACAGTCTATCACAGAGACACAAAGGCTCTATGACCGTATCATTGGCACAACGGAAGATGGCATCAAGCAGAACATCGCTCGGATGACGTTGACGGATAGTCTGTTTCAGACAGAAGTCTCAAAAGCAGTTAACAAAGAGCTGACACCGTCGAACTATGTGGCCAATCCGTTTACTATGTCTGATTACGTGAGGAAGTATAATGGAAACGGCAGTTCTGCAACTGTAACGCTATCGAGTACTGGCATAAATGTTTTTGGCAAGCTTGAGTTACATGCTCATTCTAAACTATTCTACAGTGATGCCATATGGTTGCCATTGACCCGTATACCAGAGAGTGTTAAGGATTTATCGTTCTCAATCATCATTGACGGCATTGATAAATGTAAGATTTCCGTGTACTTAGGGACAGAGAGATCCTCAACATATATCAACTATCAGCGACAGGATAACACGATTTATGGAACATTAACAGGTATCAATTCTGATTACAGAGGCTCTGATGGCGTGTATCTAAAACTCTCATTCGAGGGGCTAAATGGTCAGAGCGTGTTCCTTCAAAAGCCTATTGTGGTCGAAGGACGAGAACCTAAGTTTGACTTTGAAATAAACAAGCGGACAGAGCTAGACCAAGCTGTCCGAAGCGTCCAAACTCAACTAGCAGGATCTTGGTCTGTCAAAAATATAAATTCTGCTGGAGATTTAATCTCTGGTATTAATTTAGGGGCAGATGGTAGGAATCGTATTACTGGTAAGTTAACGCACATAACCAATGAGACGCTGATGGATAGGGCTAGTATTAAGAGTGCAGCGATCGAGAGTATAACAGCTGACCAGATAACCACTGGGACGCTTAATGCATCTCGTATCAATGTAATTAATCTCAATGCACGAAGTATCACGTCTGGAACATTCAGAGGGTTAGAGTATGAAGGTGGTATTATCCGAGGCAATAACGGAAATACCATCATCAATCTTAATACTAACATTACTACATACAACGGAACGGCTAGGATTGAGTTTAAATCAGCCCATAACAACTTGGTCTACAGCTCAAGCGGGACACATGCGTTTTTAGCACCAACTCAAAGGCAAGGCACATCTTATGCAGCATGGGCATTTGGCGTGGGTACTAGCAGTAGTCTTGACCCCAATGCTGGTTTTGTCGGATTGAAAATCTTCAACGACCCTGGCTCTCGCAAAGTCATACTGGTTGGCGATGTGCAGATTGTGAAAGATACGTTTACCCGAGATGCTCCAGCTACAGCATTAAGCGATGTATTATCTCAAATACAATACAATTTTGTGCAAATTAAAAATTGGTTCTCAAGGAATGACCTAGGTCACCCTGGTCTATATGACATCGAATTATAA
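The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive and that the CDS ends with a single stop codon (both assumptions, though consistent with the TAA at the end of the sequence above); the function names are illustrative. The CDS starts with TTG, an alternative bacterial start codon that is read as methionine at initiation, which matches the leading M of the protein.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, excluding one terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1


if __name__ == "__main__":
    print(cds_length(30858, 34601))               # 3744
    print(expected_protein_length(30858, 34601))  # 1247
```

The result agrees with the protein length of 1247 AA listed under the physico-chemical properties.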

Genome Context

Tertiary structure

PDB ID
a3fa442c9122e31776632567da4a81ee344eb8e7ad696422ee1f70187df2efd1
Source ColabFold
Method ColabFold
Resolution 0.2962
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
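The confidence bands can be applied to per-residue pLDDT scores with a small helper. This is a sketch with a hypothetical function name. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 fall, so assigning them to the lower band is an assumption made here.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence bands in the legend.

    Boundary values (exactly 90, 70, 50) go to the lower band; this choice
    is an assumption, since the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 34.9):
        print(value, plddt_band(value))
```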

Literature

Title | Authors | Date | PMID | Source
Prophages and satellite prophages are widespread among Streptococcus species and may play a role in pneumococcal pathogenesis | Rezaei Javan, R., Ramos-Sevillano, E., Akter, A., Brown, J. and Brueggemann, A.B. | 2018-12-20 | | GenBank