GenBank accession: QEI23578.1 [GenBank]
Protein name: tail spike protein
RBP type
Type  Evidence        Probability
TSP   DepoScope       1.00
TF    GenBank         1.00
TSP   Phold           1.00
TSP   RBPdetect       0.91
TSP   RBPdetect2      0.95
TSP   UniProt/Swiss   1.00
TSP   UniProt/TrEMBL  1.00
Protein sequence
MTDITANVVVSNPRPIFTESRSFKAVANGKIYIGQIDTDPVNPANQIPVYIENEDGSHVQITQPLIINAAGKIVYNGQLVKIVTVQGHSMAIYDANGSQVDYIANVLKYDPDQYSIEADKKFKYSVKLSDYPTLQDAASAAVDGLLIDRDYNFYGGETVDFGGKVLTIECKAKFIGDGNLIFTKLGKGSRIAGVFMESTTTPWVIKPWTDDNQWLTDAAAVVATLKQSKTDGYQPTVSDYVKFPGIETLLPPNAKGQNITSTLEIRECIGVEVHRASGLMAGFLFRGCHFCKMVDANNPSGGKDGIITFENLSGDWGKGNYVIGGRTSYGSVSSAQFLRNNGGFERDGGVIGFTSYRAGESGVKTWQGTVGSTTSRNYNLQFRDSVVIYPVWDGFDLGADTDMNPELDRPGDYPITQYPLHQLPLNHLIDNLLVRGALGVGFGMDGKGMYVSNITVEDCAGSGAYLLTHESVFTNIAIIDTNTKDFQANQIYISGACRVNGLRLIGIRSTDGQGLTIDAPNSTVSGITGMVDPSRINVANLAEEGLGNIRANSFGYDSAAIKLRIHKLSKTLDSGALYSHINGGAGSGSAYTQLTAISGSTPDAVSLKVNHKDCRGAEIPFVPDIASDDFIKDSSCFLPYWENNSTSLKALVKKPNGELVRLTLATL
Physico-chemical properties
protein length: 667 AA
molecular weight: 71855.98720 Da
isoelectric point: 5.33857
aromaticity: 0.08846
hydropathy: -0.16432
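
The aromaticity and hydropathy figures above can in principle be recomputed from the protein sequence. The sketch below is a minimal illustration, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY). The record does not state which definitions it uses, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y). Assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues. Assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 30 residues of QEI23578.1, used only as a short demo input;
    # pass the full 667-residue sequence to compare with the listed values.
    demo = "MTDITANVVVSNPRPIFTESRSFKAVANGK"
    print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Both functions raise on an empty sequence, and `gravy` raises on non-standard residue codes, which is deliberate here so that unexpected input is not silently averaged away.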

Domains

Domains [InterPro]
Accession                    Category  Range
IPR009093                    ATT       1–113
IPR009093                    ATT       1–100
G3DSA:2.170.14.10:FF:000001  Unmapped  2–110
IPR036730                    ATT       7–109
Architecture
ATT 1-113 | STR 114-667
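
The architecture line is consistent with taking the union of the ATT-labelled InterPro hits and assigning the remaining residues to STR. The record does not say that this is how the architecture was derived, so the sketch below is only one plausible reconstruction under that assumption; `merge_intervals` and `uncovered` are hypothetical helper names.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def uncovered(merged, length):
    """Return the residue ranges in 1..length not covered by `merged`."""
    gaps, cursor = [], 1
    for start, end in merged:
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


if __name__ == "__main__":
    # ATT hits copied from the InterPro listing (the Unmapped hit is ignored).
    att_hits = [(1, 113), (1, 100), (7, 109)]
    att = merge_intervals(att_hits)
    print("ATT:", att)
    print("rest (assumed STR):", uncovered(att, 667))
```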

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain          Start  End  Length (AA)  Confidence
N-terminal      1      130  130          0.9594
Central domain  131    554  425          0.9864
C-terminal      555    667  112          0.8663
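
As a quick consistency check on the segmentation table, the sketch below recomputes each domain length from its start and end, assuming 1-based inclusive coordinates (an assumption; the record does not state its convention). Under that convention the Central and C-terminal spans come to 424 and 113 residues, one off from the tabulated 425 and 112, so the table may use a different boundary convention. The three spans still tile residues 1 to 667 without gaps.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive (assumed convention)


# Coordinates copied from the segmentation table.
DOMAINS = [
    Domain("N-terminal", 1, 130),
    Domain("Central domain", 131, 554),
    Domain("C-terminal", 555, 667),
]


def length(domain: Domain) -> int:
    """Number of residues in an inclusive start..end span."""
    return domain.end - domain.start + 1


def covers_protein(domains, protein_length: int) -> bool:
    """True if the domains tile 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for d in sorted(domains, key=lambda d: d.start):
        if d.start != expected_start:
            return False
        expected_start = d.end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for d in DOMAINS:
        print(d.name, length(d))
    print("tiles 1..667:", covers_protein(DOMAINS, 667))
```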
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Taxonomy

Phage: Salmonella phage SE21 [NCBI]
Taxonomy ID: 2592200
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes

Host: Salmonella sp. [NCBI]
Taxonomy ID: 599
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession: QEI23578.1 [NCBI]
GenBank nucleotide accession: MK972692.1 [NCBI]
CDS location: 5127 -> 7130 (strand -)
CDS
ATGACAGACATCACTGCAAACGTAGTTGTTTCTAACCCTCGTCCAATCTTCACTGAATCCCGTTCGTTTAAAGCTGTTGCTAATGGGAAAATTTACATTGGTCAGATTGATACCGATCCGGTTAATCCTGCCAATCAGATACCCGTATACATTGAAAATGAGGATGGCTCTCACGTCCAGATTACTCAGCCGCTAATTATCAACGCAGCCGGTAAAATCGTATACAACGGCCAACTGGTGAAAATTGTCACCGTTCAGGGTCATAGCATGGCTATCTATGATGCCAATGGTTCTCAGGTTGACTATATTGCTAACGTATTGAAGTACGATCCAGATCAATATTCAATAGAAGCTGATAAAAAATTTAAGTATTCAGTAAAATTATCAGATTATCCAACATTGCAGGATGCAGCATCTGCTGCGGTTGATGGCCTTCTTATCGATCGAGATTATAATTTTTATGGTGGAGAGACAGTTGATTTTGGCGGAAAGGTTCTGACTATAGAATGTAAAGCTAAATTTATAGGAGATGGAAATCTTATTTTTACGAAATTAGGCAAAGGTTCCCGCATTGCCGGGGTTTTTATGGAAAGCACTACAACACCATGGGTTATCAAGCCTTGGACGGATGACAATCAGTGGCTAACGGATGCCGCAGCGGTCGTTGCCACTTTAAAACAATCTAAAACTGATGGGTATCAGCCAACCGTAAGCGATTACGTTAAATTCCCAGGAATAGAAACGTTACTCCCACCTAATGCAAAAGGGCAAAACATAACGTCTACGTTAGAAATTAGAGAATGTATAGGGGTCGAAGTTCATCGGGCTAGCGGTCTAATGGCTGGTTTTTTGTTTAGAGGGTGTCACTTCTGCAAGATGGTAGACGCCAATAATCCAAGCGGAGGTAAAGATGGCATTATAACCTTCGAAAACCTTAGCGGCGATTGGGGGAAGGGTAACTATGTCATTGGCGGACGAACCAGCTATGGGTCAGTAAGTAGCGCCCAGTTTTTACGTAATAATGGTGGCTTTGAACGTGATGGTGGAGTTATTGGGTTTACTTCATATCGCGCTGGGGAGAGTGGCGTTAAAACTTGGCAAGGTACTGTGGGCTCGACAACCTCTCGCAACTATAATCTGCAATTCCGCGACTCGGTCGTTATTTACCCCGTATGGGACGGATTCGATTTAGGTGCTGACACTGACATGAATCCGGAGTTGGACAGGCCAGGGGACTACCCTATAACCCAATACCCACTGCATCAGTTACCCCTAAATCACCTGATTGATAATCTTCTGGTTCGCGGGGCGTTAGGTGTAGGTTTTGGTATGGATGGTAAGGGCATGTATGTGTCTAATATTACCGTAGAAGATTGCGCTGGGTCTGGCGCGTACCTACTCACCCACGAATCAGTATTTACCAATATAGCCATAATTGACACCAATACTAAGGATTTCCAGGCGAATCAGATTTATATATCTGGGGCTTGCCGTGTGAACGGTTTACGTTTAATTGGGATCCGCTCAACCGATGGGCAGGGTCTAACCATAGACGCCCCTAACTCTACCGTAAGCGGTATAACCGGGATGGTAGACCCCTCTAGAATTAATGTTGCTAATTTGGCAGAAGAAGGGTTAGGTAATATCCGCGCTAATAGTTTCGGCTATGATAGCGCAGCGATTAAACTGCGGATTCATAAGTTATCAAAGACATTAGATAGCGGAGCATTGTACTCCCACATTAACGGGGGGGCCGGTTCTGGCTCAGCGTATACTCAACTTACTGCTATTTCAGGTAGCACACCTGACGCTGTATCATTAAAAGTTAACCACAAAGATTGCAGGGGGGCAGAGATACCATTTGTTCCTGACATCGCGTCAGATGATTTTATAAAGGATTCCTCATGTTTTTTGCCATATTGGGAAAATAATTCTACTTCTTTAAAGGCTTTAGTGAAAAAACCCAATGGAGAATTAGTTAGATTAACCTTGGCAACACTTTAG
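
The CDS coordinates and the protein record can be cross-checked: the range 5127 -> 7130 spans 2004 nt, that is 668 codons, or 667 residues plus a stop codon, matching the protein length listed under the physico-chemical properties. The sketch below is a minimal illustration using the standard genetic code (NCBI translation table 1; table 11 assigns the same amino acids to codons). The sequence as listed starts with ATG, so it is treated here as already being in coding orientation, which is an assumption given the minus-strand annotation. Function names are illustrative.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    n = cds_length(5127, 7130)
    print(n, "nt ->", n // 3 - 1, "residues plus stop")
    # First six and last four codons of the CDS listed above.
    print(translate("ATGACAGACATCACTGCA"))
    print(translate("GCAACACTTTAG"))
```

Passing the full CDS string to `translate` and stripping the trailing `*` should reproduce the protein sequence if both records are consistent.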

Literature

Title: Salmonella bacteriophage diversity and host specificity revealed by physiological characterization and whole genome sequencing
Authors: Fong, K., Tremblay, D., Delaquis, P., Moineau, S., Goodridge, L., Levesque, R. and Wang, S.
Date: 2019-09-14
PMID: (none listed)
Source: GenBank