Genbank accession
YP_005097997.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,96
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MTVSTEVDHNDYIGNGVTTSFPYTFRIFKKSDLVVQVADLSENITELVLDTDYTVTGAGEYTGGNVILSTPLTSGYQISISRELPVTQEIDFRNQGKFFAEVHEDGFDKLTMLIQQAISWLRLSLRKPSFVANYYDALNNYIRNLRDPSRPQDAATKNYVDSVANTNLSHTLRTPEAIPSLPGIEQRKNKIVAMNDSGDPIMVLPESGSAADVLIELAKPTGAELIGTLSSKSVQQELMIKTSSFPTLQDAANYAVNGIIVDDDYHFTDGETVDFSGKKLTIECKAKFIGDGKLTFENLGSGSRIVHPHMQSQTVPYVISRWDSNGEWITEPSTIISTLTQSRTQGYAPTVNDVDIYNSLPDNVKNQNLISHLIISNSSGIDVFYPKATFGSYESFKNNNVKFWYPRDFYGDMSNCIAFTAWDSTDYYHGNYVIGGSTNYGSGSGVCFYRNDGGVGHDGGVIGGFTPYRCGESGVKTYQNEVNGISQRCYNLRFIDINPIETYYDGVDLNADYGTPTERQHDYTLAQYAWNNLPTNHIVSNIQAYKTHGVGIWGDGSTGFYRDIYASYSRGAGIFIKGSGKNFKNLTSIQNNAANTPGENQITLDGANIIDGVNIINYTQPTGLAIFAPNSTVTNLNAPSVPSSSINIGNIEGLVVGNLIHVQPNLANQTSAVYLNVVNTSVASKREDTIKIGPGASEVTRYVISGSSPRLTMRENHGDFGAVNIAFSGTVLPDEAVPDANSYAVYWDGTNLTALINHGGVLTRQKLTT
Physico‐chemical
properties
protein length:769 AA
molecular weight: 84028,28030 Da
isoelectric point:5,04272
aromaticity:0,09883
hydropathy:-0,29935

Domains

Domains [InterPro]
DC_0055
ATT
8–171
IPR011050
STR
240–766
IPR015331
RBD
240–767
YP_005097997.1
1 769
Architecture
ATT
STR
ATT 8-171 | STR 233-769
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_005097997.1
1 769
Domain Start End Length (AA) Confidence
N-terminal 1 246 246 0,9960
Central domain 247 674 429 0,9875
C-terminal 675 769 94 0,8006
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-246
Central
247-674
C-terminal
675-769

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage SPN1S
[NCBI]
1125653 Uroviricota > Caudoviricetes > Uetakevirus >
Host Salmonella
[NCBI]
590 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales
Host bacterium
[NCBI]
1869227 cellular organisms > Bacteria > unclassified Bacteria >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_005097997.1 [NCBI]
Genbank nucleotide accession
NC_016761.1 [NCBI]
CDS location
range 20161 -> 22470
strand +
CDS
ATGACGGTCTCAACTGAAGTTGACCATAACGACTACATCGGGAACGGGGTCACAACTTCATTCCCTTATACCTTCCGAATTTTTAAGAAGTCTGATCTGGTTGTGCAGGTTGCAGACCTGAGCGAGAACATCACAGAATTAGTCCTTGATACTGATTACACCGTTACTGGTGCGGGAGAGTACACAGGAGGGAATGTAATACTGTCAACGCCACTGACAAGTGGATACCAAATCTCAATTTCACGTGAGCTTCCAGTTACGCAGGAAATCGATTTCAGGAATCAGGGTAAATTTTTTGCAGAGGTTCATGAAGATGGTTTCGATAAGCTGACAATGTTGATACAGCAGGCTATTAGCTGGTTGCGCCTTTCTCTACGTAAGCCGTCGTTTGTGGCTAATTATTATGATGCGCTGAACAACTACATCCGAAACCTTCGCGACCCGTCACGACCTCAGGATGCCGCCACCAAAAATTATGTAGATAGCGTAGCTAATACCAATCTAAGCCACACACTGAGAACTCCTGAAGCAATCCCTTCATTGCCCGGGATAGAACAACGTAAAAATAAAATTGTCGCGATGAATGATAGTGGCGATCCGATAATGGTTCTTCCTGAGTCTGGTTCCGCTGCTGATGTGCTGATAGAACTGGCAAAGCCAACAGGTGCTGAATTAATAGGAACTCTTTCTAGTAAATCAGTTCAGCAAGAGTTGATGATAAAGACATCCTCATTCCCAACTTTACAGGATGCTGCTAACTATGCTGTAAACGGGATAATTGTTGATGATGATTATCATTTTACTGACGGAGAAACTGTTGATTTTAGTGGTAAAAAGTTAACAATTGAATGTAAGGCAAAATTTATTGGAGATGGTAAATTAACATTTGAAAATTTAGGCTCAGGATCACGCATTGTTCATCCACACATGCAGTCACAAACGGTGCCTTACGTTATATCAAGATGGGATAGCAATGGGGAGTGGATAACTGAACCCTCTACTATCATTTCTACTCTTACTCAAAGCAGAACGCAAGGCTACGCACCTACAGTTAATGATGTAGATATATATAACTCTCTACCAGATAATGTTAAAAACCAAAATTTAATATCACATCTCATTATATCTAATTCATCAGGCATAGATGTGTTTTATCCAAAAGCAACGTTTGGATCATATGAATCATTTAAAAATAATAATGTGAAGTTTTGGTATCCACGTGATTTTTATGGAGACATGTCAAACTGTATCGCATTTACTGCATGGGATAGCACCGATTACTACCATGGTAATTATGTAATCGGAGGTTCAACTAATTATGGATCAGGAAGTGGGGTGTGTTTTTATCGAAATGATGGAGGGGTTGGCCATGATGGAGGAGTCATTGGTGGATTTACCCCTTACAGATGCGGTGAATCAGGTGTTAAAACATACCAGAACGAAGTTAACGGGATAAGTCAAAGATGTTATAATCTTCGTTTCATCGATATCAATCCGATAGAAACGTACTATGATGGTGTAGATCTGAATGCTGACTATGGCACGCCAACTGAACGCCAGCATGATTACACATTGGCGCAATACGCTTGGAACAACCTTCCAACAAACCACATCGTTAGCAACATTCAAGCGTATAAGACTCATGGAGTTGGTATTTGGGGTGACGGATCTACAGGGTTTTATCGAGATATCTATGCATCATATTCTCGTGGCGCAGGTATATTTATCAAAGGAAGTGGGAAGAATTTTAAAAACCTAACTTCCATTCAAAACAATGCAGCTAACACGCCAGGAGAAAACCAGATTACACTTGACGGAGCAAACATAATTGATGGTGTAAATATAATAAATTACACACAACCAACAGGACTTGCGATTTTTGCTCCAAATTCTACAGTCACTAATCTTAATGCTCCAAGTGTTCCTTCATCATCCATAAACATTGGCAATATTGAGGGGCTGGTGGTTGGCAACCTAATACATGTGCAGCCAAATCTTGCAAATCAAACTTCAGCTGTGTATTTAAATGTAGTCAATACTAGTGTGGCATCTAAAAGAGAGGATACCATAAAGATTGGCCCAGGAGCGTCAGAGGTTACCAGATATGTAATTTCAGGTAGTTCACCTAGGTTAACCATGAGAGAAAACCATGGCGATTTTGGGGCGGTAAATATTGCATTCTCTGGGACCGTCCTGCCAGACGAGGCCGTACCGGATGCAAATTCCTATGCTGTATATTGGGATGGGACAAACCTCACTGCTTTGATAAATCACGGTGGTGTTCTTACAAGACAGAAGTTAACAACATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
830d956c782b1e86cba6942f28e4c92c50e7cdad5a04f9abafc2021b35fbb6cd
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6838
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50