GenBank accession: YP_009113179.1 [GenBank]
Protein name: tail spike protein
RBP type
Type Evidence Probability
TSP DepoScope 1.00
TSP GenBank 1.00
TF Phold 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.96
TSP UniProt/TrEMBL 1.00
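The evidence entries above mostly agree on TSP, with one TF call. As a rough illustration of how such calls could be combined, the sketch below takes a probability-weighted vote. The voting rule and the names `PREDICTIONS` and `consensus_type` are assumptions made for this example; the page does not say how, or whether, it derives a consensus.

```python
from collections import defaultdict

# (predicted type, evidence source, probability) as listed in this entry
PREDICTIONS = [
    ("TSP", "DepoScope", 1.00),
    ("TSP", "GenBank", 1.00),
    ("TF", "Phold", 1.00),
    ("TSP", "RBPdetect", 0.91),
    ("TSP", "RBPdetect2", 0.96),
    ("TSP", "UniProt/TrEMBL", 1.00),
]


def consensus_type(predictions):
    """Return (label, share of total weight) from a probability-weighted vote.

    Hypothetical rule: each source adds its probability to its label's total.
    """
    totals = defaultdict(float)
    for label, _source, prob in predictions:
        totals[label] += prob
    best = max(totals, key=totals.get)
    return best, totals[best] / sum(totals.values())


label, share = consensus_type(PREDICTIONS)
print(label, round(share, 2))
```

A weighted vote treats all sources as equally trustworthy, which may not hold when some sources simply echo an upstream annotation.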
Protein sequence
MSTGCSDVLTLNDLQIAKKHQIFEAEVITGKQGGVAGGADIGYATNQVTGQTQKTLPAVLRDAGFSPASFNFATGGTLGINDANKAVLWPIEDGGDGNYYAWRGSLPKVIPAASTPLTTGGISDSAWVAFGDITFRAEADKKFKYSVKLSDFTTLQQLADAAVDSVLIDRDYTFSNNETVNFGGKTLTIDCKAKFIGDGMLIWEQLGEGSVVNQPHMQTQTTPYTVYRFDDNGNWVTNPSTVLASVVQRMDKGYKPNINDLDIWGSLPDHIKNQTAGATLRVMSGSNITVNSPEATFGGYVFTLCNRILVKNPRNFIAWESGITFENHHTSAWGYGNWVVGGEIKYGSGCAVLFIRNDGGEDHDGGVRDLISYRVGESGVKTYQNEIGGRSARNYRLVFDNITTIQCYYDGIDINADTGPQVERVDDYPLSQYPWFQLPTEHIIRNIITRDCMGIGAWWDGQRNIIDNVVTYEAHKEGIFDRGTNNDITNVTVIGANKDVVNVNQLTCEGSSRLRGVMIHAYTTQGYAVYAPQSEISAVACAGSGTKKILCTYVSDVQGGNINVQHNENQMTLAMRPAMHGTINPSLLMTADCQVAAPGGEASIVKLSAIQEGVRVGEMQLNRLGFKHMSIPVAPSALPESALEHNSSIGFFFGDDGVLRILIKKPDGTYKTHDLS
Physico-chemical properties
protein length: 676 AA
molecular weight: 73436.50580 Da
isoelectric point: 5.31157
aromaticity: 0.08876
hydropathy: -0.23314
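Values such as aromaticity and hydropathy can be recomputed from the protein sequence. The sketch below assumes the conventional definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The page does not state which tool or definitions produced its numbers, so this is an illustration and not a reproduction of its pipeline; the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid)
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input; paste the full protein sequence from this entry to compare values.
example = "MSTGCSDVLTLNDLQIAKKHQIFEAEVITGKQGG"
print(len(example), round(aromaticity(example), 5), round(gravy(example), 5))
```

Small differences from the listed values would be expected if the original tool uses a different scale or rounding.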

Domains

Domains [InterPro]
Accession Type Range
G3DSA:2.10.10.80 ATT 65–136
IPR040775 RBD 71–132
IPR015331 RBD 138–670
Architecture (YP_009113179.1, residues 1–676)
ATT 65-136 | STR 137-675

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 155 155 0.9925
Central domain 156 558 404 0.9868
C-terminal 559 676 117 0.9255
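The Start, End, and Length (AA) columns can be cross-checked against one another. The sketch below assumes 1-based inclusive coordinates, which is an assumption and not something the page states. Under that reading, the Central domain and C-terminal rows each differ by one residue from their listed lengths, so the segmentation tool may use a different boundary convention. The helper name `check_segments` is hypothetical.

```python
# (name, start, end, listed length) copied from the segmentation table
SEGMENTS = [
    ("N-terminal", 1, 155, 155),
    ("Central domain", 156, 558, 404),
    ("C-terminal", 559, 676, 117),
]
PROTEIN_LENGTH = 676


def check_segments(segments, protein_length):
    """Return a list of inconsistencies, assuming 1-based inclusive ranges."""
    issues = []
    expected_start = 1
    for name, start, end, listed in segments:
        if start != expected_start:
            issues.append(f"{name}: starts at {start}, expected {expected_start}")
        span = end - start + 1  # inclusive residue count
        if span != listed:
            issues.append(
                f"{name}: {start}-{end} spans {span} residues, table lists {listed}"
            )
        expected_start = end + 1
    if expected_start - 1 != protein_length:
        issues.append(f"segments end at {expected_start - 1}, not {protein_length}")
    return issues


for issue in check_segments(SEGMENTS, PROTEIN_LENGTH):
    print(issue)
```

The listed lengths still sum to 676, so the discrepancy concerns only where the Central/C-terminal boundary is counted.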
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-155), Central (green, 156-558), C-terminal (pink, 559-676).

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage LSPA1 [NCBI] 1540823 Uroviricota > Caudoviricetes > Sarkviridae > Jerseyvirus > Jerseyvirus LSPA1
Host Salmonella paratyphi [NCBI] 54388 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession: YP_009113179.1 [NCBI]
GenBank nucleotide accession: NC_026017.1 [NCBI]
CDS location: range 23643 -> 25673, strand +
CDS
ATGTCTACTGGTTGCAGCGATGTACTGACACTTAACGATTTACAAATAGCTAAAAAACACCAGATTTTCGAAGCCGAGGTGATCACCGGCAAACAGGGTGGTGTAGCCGGCGGTGCAGATATCGGCTACGCCACTAACCAGGTAACAGGGCAGACGCAGAAGACGCTGCCGGCGGTCTTACGTGACGCCGGTTTCTCCCCGGCGTCCTTTAACTTCGCAACCGGCGGAACCCTGGGAATTAACGATGCCAATAAAGCTGTTCTTTGGCCTATAGAGGATGGCGGGGATGGGAACTATTACGCATGGCGTGGCTCCCTGCCGAAAGTTATCCCCGCGGCGTCCACCCCTCTAACAACCGGCGGCATTTCTGATTCGGCTTGGGTAGCTTTTGGGGACATTACCTTTCGCGCGGAAGCGGATAAGAAATTTAAATACTCCGTTAAGCTGTCCGACTTTACTACGTTACAACAATTGGCGGATGCCGCTGTTGATAGTGTTCTTATCGACCGCGATTACACTTTCAGCAATAACGAGACCGTTAACTTCGGCGGGAAGACCCTGACCATCGACTGTAAAGCGAAGTTTATCGGCGACGGCATGCTAATATGGGAACAACTCGGCGAAGGGTCTGTTGTGAATCAACCACATATGCAGACACAAACCACACCGTACACGGTGTATAGATTCGACGACAACGGTAACTGGGTGACTAACCCATCAACGGTGCTGGCGTCGGTAGTCCAAAGGATGGATAAGGGGTATAAGCCCAATATTAACGATTTGGATATCTGGGGTAGCCTTCCTGATCACATAAAAAATCAAACAGCCGGTGCGACCCTCCGCGTTATGAGCGGATCAAACATAACCGTAAATTCACCGGAAGCGACTTTCGGCGGTTATGTATTCACTCTATGTAATCGTATATTGGTTAAAAACCCACGAAATTTTATCGCATGGGAGTCGGGTATTACTTTTGAAAACCACCATACATCCGCATGGGGCTATGGTAACTGGGTCGTCGGCGGAGAGATAAAGTACGGTTCAGGGTGCGCCGTTTTGTTTATCCGCAATGACGGCGGTGAAGACCATGATGGCGGGGTCAGGGATTTAATATCATATCGCGTTGGTGAATCTGGAGTTAAAACTTATCAAAACGAGATTGGTGGAAGGTCCGCCCGAAACTACCGTCTGGTGTTTGATAACATTACGACCATACAGTGCTATTACGACGGGATAGATATCAACGCGGATACAGGCCCCCAGGTTGAGCGCGTAGATGATTACCCGCTCTCCCAATACCCCTGGTTTCAGTTGCCGACTGAACACATCATCCGCAATATCATTACACGTGACTGCATGGGTATCGGCGCGTGGTGGGATGGGCAAAGAAATATCATTGATAATGTTGTAACCTACGAGGCCCATAAAGAGGGTATTTTTGATAGAGGTACTAACAACGACATCACTAACGTAACAGTTATCGGCGCAAACAAGGACGTAGTTAACGTTAACCAGCTTACTTGCGAGGGCAGTAGCAGATTGCGGGGCGTTATGATTCATGCCTACACCACGCAAGGGTATGCCGTATACGCACCACAATCGGAAATATCCGCTGTCGCTTGCGCAGGGAGCGGGACCAAGAAAATACTTTGTACCTATGTCAGTGATGTGCAAGGGGGTAACATCAATGTCCAACACAATGAAAACCAGATGACACTCGCTATGCGCCCTGCGATGCACGGTACCATAAATCCATCACTATTGATGACCGCAGATTGTCAAGTGGCGGCACCTGGTGGGGAGGCAAGCATTGTGAAGCTTTCCGCAATCCAGGAGGGGGTGCGCGTGGGCGAGATGCAGCTTAACCGCTTAGGCTTTAAGCATATGAGCATACCAGTAGCCCCATCAGCTCTACCGGAAAGCGCATTAGAGCATAATTCATCTATAGGCTTTTTCTTTGGAGATGACGGGGTGCTGCGAATCCTCATCAAGAAACCAGA
CGGGACTTACAAAACCCACGACTTATCCTAA
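The CDS coordinates and the protein length are related by simple arithmetic. Assuming the range is 1-based and inclusive and that the final codon is a stop codon (the sequence above ends in TAA), the sketch below derives the nucleotide length, the codon count, and the expected protein length. The function name `cds_summary` is hypothetical.

```python
def cds_summary(start, end):
    """Return (nucleotides, codons, amino acids) for an inclusive CDS range.

    Assumes the last codon is a stop codon and is not translated.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError(f"CDS length {length_nt} is not a multiple of 3")
    codons = length_nt // 3
    return length_nt, codons, codons - 1


print(cds_summary(23643, 25673))  # (2031, 677, 676)
```

The result of 676 amino acids matches the protein length reported for this entry.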

Genome Context

Tertiary structure

PDB ID: aabd133a1474d45d86cbee28b48779ace35c36a8db980a0318fe27ee17589755
Source: ESMFold
Method: ESMFold
Resolution: 0.7302
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
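The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend does not say which band the exact values 90, 70, and 50 belong to, so placing them in the lower band is an assumption of this example, and `plddt_category` is a hypothetical name.

```python
def plddt_category(score):
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) go to the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 82.0, 61.5, 34.0):
    print(value, plddt_category(value))
```

Applied to every residue of a predicted model, the same function yields the per-band counts typically used to summarize model quality.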