UniProt accession
A0A2Z5HP68 [UniProt]
Protein name
Tail fiber protein
RBP type
Type Evidence Probability
TSP DepoScope 1,00
TF GenBank 1,00
TF Phold 1,00
TF RBPdetect 0,88
TF RBPdetect2 0,95
TF UniProt/TrEMBL 1,00
Protein sequence
MSRNLMPKSGAMAPYIVVNRDAAVAGVFSIDGEAGAVVLTSKYLQISKYNLDQQALATRLTGIDSSIQSNTEAIGNINTAIGSINSSLSAKAAKGANNDITELNALTKAITVSQGGTGATSPYGARLALELNHLTKSTDVSYMGSPDGSKLLYISNEGAWGVTAEGGGTNYALPITHGGTGALSAAEALVKLMDGKPLPLAADGQAPYDAVTVRQLSNISGGGNASMSGVMNNFIGAVEWFNGNRTKLPAGYIPADGQLVSRTDTRTRDLWSAVSGGLFYAVSDALWISSGDPVRPYAWRASYSTGDGSTTFRIPDLNGTQLNSLKHLFLSGSSGATNEPSANQVWAQSSPNMVGSFPTAIASQFELGMNRYSERFGLPGVFGAENSITTAPDGSHKSQSVTGDNPYGITFNAAFASKTYGRGPAYQTDPGGTGPIGDLYPNHATGIWIIRANGSFNAAGSQYHVINGDSTRPADGTTTYGGFAHSDYRIGETVNHSALIVSAKRFGNPYSVAVLQSYNNETGDLANLEVQSNGVVNLPTSINNKSWIKAFGANGLHLDAATTAQDLHTMPGGGCGVSQAERNAIELNNNPGAVGSGGYVNWLQGWWYDDRFQFGAIRSGSTVLDAVALSTYSNALGLKQWFFRANDGRIASTAGTIAIEGSDIKLKENIVRAPEGALDRITKITPREFDWKAGGRHDRGYIAQELRDVDPTYVYSSTVGEGEEVLNVSTSALISDLIAAVTTLKQELDSAKLEIKKLKAK
Physico-chemical properties
protein length: 761 AA
molecular weight: 80032,91750 Da
isoelectric point: 5,83938
aromaticity: 0,08147
hydropathy: -0,22891
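The aromaticity and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions were used, so both are assumptions, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (per-residue scores).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a short demo.
# Replace with the full 761-residue sequence to compare against the record.
demo = "MSRNLMPKSGAMAPYIVVNR"
print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Molecular weight and isoelectric point need residue masses and pKa tables, so they are left out of this sketch.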

Domains

Domains [InterPro]
ID Type Start End
DC_0017 STR 1 761
IPR030392 CHP 662 755
IPR030392 CHP 662 714
Architecture
STR 1-761
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 138 138 0,9009
Central domain 139 337 200 0,0878
C-terminal 338 761 423 0,8865
Legend: N-terminal Central domain C-terminal
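The segment boundaries and lengths in the table can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates, which is an assumption and not something the record states. Under that assumption the computed lengths of the central and C-terminal segments differ by one residue from the listed values, although both sets of lengths sum to 761.

```python
# (name, start, end, listed_length) as given in the segmentation table above.
SEGMENTS = [
    ("N-terminal", 1, 138, 138),
    ("Central domain", 139, 337, 200),
    ("C-terminal", 338, 761, 423),
]
PROTEIN_LENGTH = 761


def inclusive_length(start: int, end: int) -> int:
    """Residue count of a segment, assuming 1-based inclusive coordinates."""
    return end - start + 1


def check_segments(segments, protein_length):
    """Return computed lengths and whether the segments tile the protein."""
    computed = {name: inclusive_length(s, e) for name, s, e, _ in segments}
    # Each segment must start right after the previous one ends.
    contiguous = all(
        segments[i][2] + 1 == segments[i + 1][1]
        for i in range(len(segments) - 1)
    )
    # The first segment must start at 1 and the last must end at the C-terminus.
    covers = segments[0][1] == 1 and segments[-1][2] == protein_length
    return computed, contiguous and covers


computed, tiles = check_segments(SEGMENTS, PROTEIN_LENGTH)
for name, start, end, listed in SEGMENTS:
    print(f"{name}: {start}-{end} computed={computed[name]} listed={listed}")
print("segments tile the full protein:", tiles)
```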
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-138
Central 139-337
C-terminal 338-761

Taxonomy

Phage
name: Salmonella phage S132 [NCBI]
taxonomy ID: 2231355
lineage: Uroviricota > Caudoviricetes > Demerecviridae > Epseptimavirus > Epseptimavirus S132
Host
name: Salmonella enterica subsp. enterica [NCBI]
taxonomy ID: 59201
lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

GenBank protein accession
AXC41793.1 [NCBI]
GenBank nucleotide accession
MH370379 [NCBI]
CDS location
range 24151 -> 26436
strand +
CDS
ATGTCACGTAATTTAATGCCTAAATCTGGCGCAATGGCGCCCTACATTGTAGTTAATAGAGATGCTGCTGTTGCTGGAGTTTTCTCTATTGACGGAGAGGCTGGCGCTGTTGTACTTACTTCTAAGTACTTACAAATCTCTAAGTATAATTTAGATCAGCAAGCTCTAGCTACTCGTCTTACTGGAATAGATAGCTCTATTCAAAGTAATACTGAAGCTATTGGAAATATTAATACTGCAATTGGTAGTATTAATAGTAGTCTAAGTGCCAAGGCGGCTAAGGGTGCTAATAATGATATTACAGAATTAAATGCACTTACTAAGGCTATTACAGTATCTCAAGGTGGTACAGGTGCTACTTCTCCATATGGAGCACGTTTAGCACTAGAATTAAATCATCTCACTAAGTCAACAGATGTCTCATATATGGGATCTCCTGATGGCTCAAAGTTACTTTATATTAGTAATGAGGGGGCTTGGGGAGTTACTGCTGAGGGTGGTGGCACTAACTATGCACTACCTATTACTCATGGTGGTACAGGTGCTTTAAGTGCAGCAGAGGCACTAGTTAAGCTAATGGATGGTAAACCATTACCATTAGCTGCAGATGGGCAAGCTCCTTATGATGCTGTAACTGTAAGGCAACTATCTAATATTTCTGGTGGCGGAAACGCTAGCATGAGTGGAGTTATGAATAACTTCATAGGTGCTGTTGAGTGGTTTAATGGTAACCGTACTAAGCTACCCGCTGGGTATATTCCTGCAGATGGTCAACTAGTAAGTCGTACTGATACCAGAACTAGGGATTTATGGTCTGCAGTATCTGGAGGGTTATTTTATGCTGTTTCCGACGCTTTATGGATTAGCAGTGGGGATCCTGTTAGACCATATGCTTGGAGAGCTTCTTACTCTACTGGTGATGGTTCTACTACTTTCCGAATTCCTGACCTTAATGGTACTCAATTAAACAGTCTTAAGCATTTATTCTTATCCGGTAGTTCTGGGGCCACTAATGAACCTTCGGCCAATCAAGTATGGGCTCAATCCTCCCCTAATATGGTAGGTAGCTTCCCAACTGCTATTGCATCCCAGTTTGAATTAGGTATGAATAGATATTCGGAAAGATTTGGATTGCCTGGAGTTTTTGGAGCAGAAAATAGTATTACTACAGCTCCGGATGGTTCTCATAAGTCCCAAAGTGTTACCGGTGATAATCCTTATGGTATCACTTTTAACGCTGCTTTTGCTAGTAAAACATATGGGCGTGGTCCTGCTTACCAAACAGACCCTGGTGGTACTGGCCCTATTGGTGACTTATACCCTAATCATGCAACAGGTATTTGGATTATCCGTGCTAACGGTTCTTTTAACGCTGCTGGTTCACAGTATCATGTTATTAATGGGGACTCTACACGCCCTGCTGATGGAACTACTACTTATGGTGGCTTTGCACATAGCGACTACAGAATAGGGGAAACAGTTAATCACTCAGCTTTAATAGTAAGTGCCAAAAGATTTGGTAATCCTTATAGTGTTGCTGTTCTTCAGTCCTATAATAATGAGACAGGGGATCTTGCCAACCTAGAGGTACAATCTAATGGTGTTGTTAACTTACCTACTTCGATTAATAATAAATCCTGGATAAAAGCTTTTGGTGCCAACGGCCTACATTTAGATGCTGCTACTACTGCTCAAGATCTGCATACTATGCCTGGAGGAGGTTGTGGTGTATCTCAAGCAGAGAGAAATGCTATTGAGTTGAATAATAACCCAGGTGCTGTAGGTTCTGGTGGTTATGTTAACTGGCTGCAAGGATGGTGGTATGATGATAGGTTCCAGTTCGGTGCCATTCGTAGTGGTTCTACTGTACTGGATGCCGTAGCCTTATCTACATATTCTAATGCATTGGGGTTAAAACAGTGGTTCTTTAGAGCTAATGATGGTAGGATTGCTTCCACTGCAGGTACTATTGCTATTGAAGGTTCCGATATCAAACTTAA
GGAGAATATAGTTAGAGCACCAGAAGGGGCTTTAGATAGGATAACTAAAATCACTCCTAGAGAGTTTGACTGGAAAGCTGGGGGCAGACATGATAGAGGGTATATTGCACAGGAATTACGCGATGTAGATCCTACCTATGTATATTCATCAACTGTTGGGGAAGGAGAAGAGGTATTAAACGTTAGTACATCTGCTTTGATTTCTGACTTAATAGCTGCTGTTACAACTCTTAAACAAGAATTGGATTCTGCCAAATTAGAAATTAAAAAGCTAAAAGCTAAATAA
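The CDS range can be checked against the reported protein length. The sketch below assumes that the range is 1-based and inclusive and that the CDS ends with a single stop codon (the sequence above ends in TAA). The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a CDS, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(nt_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, optionally excluding one stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return codons - 1 if has_stop else codons


nt = cds_length(24151, 26436)
print(nt, expected_protein_length(nt))  # 2286 761
```

The result of 761 amino acids agrees with the protein length listed for this record.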

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
125bc5f1654e8b7eb390066fdeb3dbc52dde4b86fd72f668f49c40be49b872bc
Source ESMFold
Method ESMFold
Resolution 0,6303
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
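The confidence bands above can be expressed as a small lookup function for per-residue pLDDT scores. This is a minimal sketch: the legend uses strict inequalities, so values of exactly 90, 70 or 50 are not covered by it, and assigning them to the lower band is an assumption made here. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; this is
    an assumption, since the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```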