Genbank accession
NP_848228.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,94
TSP
Evidence UniProt/Swiss
Probability 1,00
Protein sequence
MTVSTEVDHNDYTGNGVTTSFPYTFRIFKKSDLVVQVVDLNENITELILDTDYTVTGAGGYTCGDVVLSSPLANGYQISISRELPVTQETDLRNQGKFFAEVHENAFDKLTMLIQQVRSWLSLALRKPSFVANYYDALGNYIRNLRDPSRPQDAATKNYVDNLSEGNNSYADNLFSRTLRVPEKINTLPSSLDRANKIPAFDSNGNAIVIIPQSGSASDVLIELAKPSGSGLVGFSHSNNYNPGMVGEKLQNVVYPTDAPFYAPTDGTSDATTALQSAITHCEGKNAVLCINKSFSVSDSLSISSPLCVFAMNEQCGIVSSAPAGHAAVIFNGDNICWNGGFIRGLNQPSSSTIRQDGVLLNGNDCVLDNVSINGFFAKGLHTSNADGSGVGIRDYGTRNTISKCRVEYNKFGISLEGKDGWVLGNYVSNHYRMSSEAKPWDDTSNYWDGIVGGGEWLGVATGYLIDGNEFEDNGQSGIYAGGNGGIFAKNRITNNHIHGNWNRGIDFGVVQRLANSDVYENIITDNIVHNNRAANIWLAGVRDSIINNNNSWFTDDYRSMFAGNFDACVCLTLADGGEKAAPTGNQVNGNRCKTLESDDQISGFTLNITDTARGNQVRDNVLSPIGEAYIPNPELYAVNNIDIPTEFAFTPQLIGGSGVTLGNSSGKLTANGNVFSLSLSISAQSVSSPSGSLTIGYIPGLSGTSVRHHNVRTEFYNNLNTTMQRAQPYVNIGDSADQLRVYRLADGLSKDDLLEYFMSNSDLRMVGDIEIEPYNFSRSVTVVGHSFCTSDVMSTELNRLLGTDIYNFARGGASDVEVAMSQEAITRQYAPVGGSIPASGSVALTPTEVGIFWNGATGKCIFGGIDGTFSTTLVNAGTGETQLVFTRDSAGSAVSVSTTATFAMRPYTRFNTNTIPAGRKHSLHRDDIYIVWGGRNSTDYTRYVSELHTMVANMHTQRFVICPEFPYDTETTGTTGATNLAALNNNLKADFPDNYCQISGVDLLQNFKSKYNPAYAGDVTDIANGITPRSLREDNLHPSETLQPNGLYIGAKVNADFIAQFIKSKGWGG
Physico‐chemical
properties
protein length:1070 AA
molecular weight: 115674,86090 Da
isoelectric point:4,99560
aromaticity:0,09159
hydropathy:-0,26682

Domains

Domains [InterPro]
IPR012334
STR
261–631
NP_848228.1
1 1070
Architecture
ATT
STR
ATT 8-176 | STR 177-1070
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
NP_848228.1
1 1070
Domain Start End Length (AA) Confidence
N-terminal 1 269 269 0,9900
Central domain 270 646 378 0,9929
C-terminal 647 1070 423 0,6803
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-269
Central
270-646
C-terminal
647-1070

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage epsilon15
[NCBI]
215158 Uroviricota > Caudoviricetes > Uetakevirus >
Host Salmonella enterica
[NCBI]
28901 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
NP_848228.1 [NCBI]
Genbank nucleotide accession
NC_004775.2 [NCBI]
CDS location
range 20017 -> 23229
strand +
CDS
ATGACGGTTTCAACCGAAGTTGACCACAATGACTACACAGGGAACGGGGTCACGACATCATTCCCCTATACCTTTCGAATTTTTAAGAAGTCTGATCTGGTTGTGCAGGTTGTTGACCTGAACGAGAACATCACAGAACTGATTCTTGACACTGATTACACCGTGACTGGTGCCGGTGGATACACTTGCGGGGATGTTGTCTTATCATCTCCTCTTGCCAATGGTTATCAGATTTCAATTTCACGTGAACTTCCTGTTACTCAGGAAACAGATCTACGAAATCAGGGGAAGTTCTTCGCAGAAGTGCATGAGAATGCTTTTGATAAACTGACGATGCTGATTCAGCAGGTACGCAGTTGGTTAAGTCTGGCCCTGCGTAAGCCATCATTTGTCGCCAACTACTATGATGCACTTGGCAACTACATCCGCAATCTTCGCGACCCGTCTCGGCCTCAGGACGCCGCAACGAAAAATTATGTTGATAACCTTAGTGAAGGTAATAATTCCTATGCGGATAATCTTTTTAGTAGAACGCTTAGAGTTCCTGAGAAAATAAACACTCTACCATCATCGCTGGATCGGGCTAACAAAATCCCTGCTTTTGATAGTAATGGAAATGCAATTGTTATCATCCCGCAATCTGGCTCAGCATCAGATGTATTGATCGAACTTGCTAAACCATCTGGGTCTGGTTTAGTCGGATTCTCACACAGCAATAATTACAACCCAGGGATGGTTGGTGAAAAGCTTCAAAACGTTGTTTATCCAACTGACGCCCCATTTTATGCACCAACCGATGGGACTAGCGATGCAACGACTGCGCTTCAAAGCGCCATTACCCACTGCGAGGGAAAAAATGCAGTTTTATGCATCAATAAAAGTTTTTCGGTCTCTGACAGTCTTTCAATTTCATCACCGCTATGTGTATTTGCCATGAATGAGCAGTGCGGGATTGTATCATCCGCTCCAGCCGGGCATGCTGCTGTTATTTTTAATGGAGATAATATTTGCTGGAATGGTGGTTTTATTCGTGGTTTAAATCAACCAAGTAGTTCCACTATAAGACAAGATGGCGTCCTGCTTAATGGGAATGATTGTGTTTTAGATAATGTCTCTATCAATGGTTTCTTCGCTAAAGGGTTACATACCTCTAATGCAGATGGGAGCGGTGTTGGCATCCGGGACTATGGTACGCGAAATACCATCAGTAAGTGCCGGGTAGAGTATAATAAATTCGGCATATCTCTCGAAGGGAAAGACGGTTGGGTACTCGGAAACTATGTGAGTAACCATTACCGGATGTCTTCTGAAGCCAAGCCGTGGGACGATACCAGTAACTACTGGGATGGTATTGTTGGCGGCGGTGAATGGCTTGGCGTTGCAACCGGATATCTGATTGATGGTAATGAGTTTGAGGATAATGGTCAGAGCGGTATCTATGCTGGTGGCAACGGGGGTATTTTCGCCAAGAACAGGATTACTAATAACCACATACATGGAAACTGGAATCGCGGTATAGATTTTGGGGTTGTACAGCGTCTTGCTAATAGTGATGTTTATGAAAATATAATCACCGACAACATAGTGCATAACAACCGAGCAGCTAACATATGGTTAGCTGGCGTTCGGGATAGCATAATAAATAACAATAACTCCTGGTTTACTGATGATTATCGGTCTATGTTCGCTGGGAATTTTGATGCCTGCGTGTGCCTGACGTTAGCAGACGGCGGTGAAAAAGCAGCGCCAACCGGTAATCAGGTAAACGGTAACCGGTGTAAGACCTTGGAATCTGATGATCAGATCAGCGGTTTTACGTTAAATATTACAGACACCGCCAGAGGAAACCAGGTACGGGATAATGTGTTGTCCCCTATAGGGGAGGCATATATTCCAAATCCAGAACTATATGCTGTTAATAATATCGATATCCCTACTGAGTTCGCATTCACACCGCAACTCATAGGCGGGTCAGGTGTGACACTGGGTAACAGTTCTGGCAAGTTAACCGCTAACGGAAATGTGTTTAGCCTAAGTTTGTCTATCTCTGCCCAGTCTGTCTCATCCCCAAGCGGCAGCCTGACAATCGGGTATATACCGGGGCTTAGTGGTACTAGTGTTCGCCATCACAACGTACGAACGGAATTCTATAACAACCTGAATACTACAATGCAACGGGCGCAGCCGTACGTAAATATCGGTGATAGCGCGGACCAATTGCGTGTATACAGACTGGCTGATGGATTATCTAAAGATGATTTACTAGAGTATTTTATGTCTAATTCAGATCTACGTATGGTTGGCGATATTGAAATAGAGCCATATAACTTTAGCCGTTCAGTTACCGTGGTTGGGCATAGCTTCTGTACCAGTGATGTTATGAGCACAGAGTTGAACCGGCTGCTTGGTACCGATATATACAACTTCGCCAGGGGCGGGGCTAGTGATGTTGAAGTTGCCATGTCGCAAGAGGCAATAACACGACAATATGCGCCTGTAGGCGGGTCAATACCTGCGTCTGGTTCAGTAGCTCTTACGCCTACGGAAGTAGGTATATTCTGGAACGGCGCTACGGGGAAATGTATCTTTGGAGGTATCGACGGTACATTTTCAACAACGCTGGTAAACGCGGGAACTGGTGAGACTCAGCTTGTATTCACGCGTGATTCTGCTGGTAGTGCGGTAAGTGTGTCAACAACTGCAACATTTGCTATGCGGCCGTATACAAGATTTAATACAAATACTATCCCAGCAGGGCGAAAGCACTCTCTGCATAGGGATGATATCTATATCGTTTGGGGCGGTCGTAACTCAACTGACTATACTAGATATGTGTCAGAGTTGCATACCATGGTTGCTAATATGCATACTCAGCGCTTTGTTATTTGCCCTGAGTTTCCTTATGATACGGAGACAACGGGAACTACTGGAGCTACAAATTTAGCAGCTCTCAATAACAACCTGAAAGCCGATTTTCCAGATAACTATTGCCAAATTAGCGGCGTTGATTTATTGCAGAACTTTAAAAGCAAATATAACCCAGCCTATGCAGGAGATGTAACTGATATTGCAAACGGTATAACCCCTCGCTCTCTGCGAGAAGATAACCTGCACCCATCTGAAACACTACAGCCAAATGGCTTGTATATAGGTGCAAAAGTAAACGCTGATTTTATTGCTCAGTTTATTAAGTCGAAGGGGTGGGGTGGGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
63b7c4d1eccb9161c4b744084675986287c9ee1e52580e8c27cf278832c1823f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7589
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50