Genbank accession
QBP07052.1 [GenBank]
Protein name
tail spike protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MLNLTKVATWLVNWKASATGAVERTMESKLMDNISVADFGAIPGNTDSRAAFQAAINHAHSLGGGAVTVPSGTWTLKGSLTFYNNVHLVGAGQGSTVLLYSAINVPFAAIGTASTNINNVHIRDLQIIVDGTHTSGDFITWTNGYNLCLENVRVDGPFYNFLKMHGGPQQFIYRVNNCTINGATTSEDVIVIGDTSSAVVQDVFLQNLVLSGGGKGCGVKEVNASGVYASNIDCLGMKYGYAWLPSGATKCRGSFYTAILGDTCKECGLMFAPTGGASVSDLNFNGCWGSSCGTTALHPGVHLNAENGGLENLNFNGLTCVNNKGSGILLTGYNVGGINFSNISCNANSMATRGQKHGIELGAGVQNVNFTNVKAGATPVFEFNNQGYGIFISDGIGSGVKFVNVDARGNVNGAISNASDAAVTIENCPQYVTYNTGAAKIASGTNSVTVPTGISRALQGRYVQVTPTTDINAPFWFEVSGQNIIIRVRNNVSGDQYFSWTVSAQR
Physico‐chemical
properties
protein length:506 AA
molecular weight: 53097,89910 Da
isoelectric point:6,49365
aromaticity:0,08893
hydropathy:0,03379

Domains

Domains [InterPro]
IPR012334
STR
4–490
IPR011050
STR
25–377
IPR051801
Unmapped
33–88
IPR011050
STR
33–377
IPR024535
ENZ
34–233
QBP07052.1
1 506
Architecture
STR
RBD
STR 4-490 | RBD 491-505 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QBP07052.1
1 506
Domain Start End Length (AA) Confidence
N-terminal 1 43 43 0,9340
Central domain 44 430 388 0,9927
C-terminal 431 506 75 0,9905
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-43
Central
44-430
C-terminal
431-506

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage Minorna
[NCBI]
2547246 Uroviricota > Caudoviricetes > Autographivirales > Slopekvirinae > Drulisvirus
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QBP07052.1 [NCBI]
Genbank nucleotide accession
MK598851.1 [NCBI]
CDS location
range 160 -> 1680
strand -
CDS
ATGCTAAATCTTACCAAAGTGGCTACATGGTTAGTTAACTGGAAAGCATCCGCAACAGGTGCAGTTGAACGTACCATGGAGTCTAAACTAATGGATAACATTAGTGTTGCAGACTTCGGAGCTATTCCAGGTAATACTGACTCGCGTGCTGCCTTCCAAGCCGCGATTAATCACGCTCACAGCCTAGGAGGAGGGGCTGTAACTGTACCCTCTGGTACATGGACTTTAAAGGGCTCTCTTACATTCTATAATAACGTGCATTTGGTGGGGGCAGGACAGGGTAGCACTGTACTTTTATACTCTGCAATTAACGTGCCTTTTGCAGCCATCGGTACAGCATCTACCAATATTAATAATGTTCATATCCGTGACCTACAAATTATTGTGGATGGGACACATACAAGCGGCGACTTCATTACGTGGACTAACGGATACAACTTATGCTTGGAGAATGTTCGTGTAGATGGTCCCTTCTATAACTTCCTTAAGATGCACGGCGGTCCCCAGCAGTTTATCTACCGCGTAAACAACTGTACCATTAACGGTGCAACTACTAGCGAAGATGTTATTGTGATAGGCGATACCTCCTCGGCTGTAGTTCAGGATGTGTTCCTGCAGAACCTAGTACTGTCAGGGGGTGGTAAAGGATGTGGGGTTAAGGAGGTCAATGCCTCTGGCGTCTATGCGTCTAACATCGACTGTTTGGGAATGAAGTATGGTTACGCCTGGCTACCGTCTGGTGCTACTAAATGTAGAGGCAGTTTCTACACTGCTATCTTAGGCGATACCTGTAAGGAGTGCGGGTTAATGTTTGCCCCTACTGGGGGAGCCAGTGTCTCAGACCTGAACTTCAACGGTTGCTGGGGAAGTTCTTGTGGGACAACTGCATTACATCCTGGAGTACATCTTAATGCTGAGAATGGTGGGCTTGAGAACCTGAACTTTAACGGACTGACCTGTGTTAACAATAAAGGCTCAGGAATACTGCTCACAGGTTATAATGTAGGAGGCATTAACTTCAGTAACATCAGTTGTAATGCTAACAGTATGGCAACCCGCGGACAGAAGCACGGAATAGAGTTAGGGGCCGGGGTGCAGAATGTGAACTTCACAAACGTTAAGGCTGGTGCAACTCCTGTATTCGAGTTTAACAATCAAGGTTACGGGATCTTTATATCTGACGGTATTGGTAGTGGTGTTAAGTTCGTTAACGTAGATGCCCGAGGTAACGTAAACGGTGCAATAAGTAATGCTAGCGATGCAGCTGTCACCATCGAGAACTGCCCGCAGTATGTTACTTATAATACTGGGGCAGCTAAGATAGCTTCAGGGACCAACTCGGTTACTGTACCTACCGGTATATCGCGCGCCTTACAAGGTCGCTACGTTCAGGTAACTCCTACTACAGATATCAACGCCCCGTTCTGGTTTGAGGTATCCGGTCAGAATATTATTATACGTGTTCGTAATAATGTATCGGGGGACCAATACTTTAGCTGGACGGTATCAGCGCAGCGGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
009e718bbc772565d008e3ac98f6f854e85d52d82e8ac61ac74ca9475de68d7c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8880
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete Genome Sequence of Escherichia coli Myophage Minorna Rogers,K., Min,L., Newkirk,H., Liu,M. and Ramsey,J. 2019 31171610 GenBank