GenBank accession: NP_052117.1 [GenBank]
Protein name: tail fiber protein
RBP type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TF RBPdetect 0.90
TF RBPdetect2 0.96
TF UniProt/TrEMBL 1.00
Protein sequence
MATTIKTVMTYPLDGSTTDFNIPFEYLARKFVRVTLIGVDRKELILNQDYRFATKTTISTTRALGPADGYTLIEIRRFTSATDRLVDFTDGSILRAYDLNISQVQTLHVAEEARDLTADTIGVNNDGNLDARGRRIVNVADAQDVGDAINLGQIQRWNDSALNSANRAKQEADRATARANDANNSANASASSASSSAGSAELAKRWATSDTVVESDLESSRTYALHSMLYRNETKDSADRAAVSETNAKASEGGAANSAAAAKVSETNAKASEERAITEASKLGNMNDFAAAIESVTGNDVKMKGAVSSPGNITGGGLVSTGAASIQKGALVGEDLIVGRDITAKQDMYSQRNIAVAGVTYAQGGIEQTLATNIYNKLYRLHINSNPQHVGQRQGLHIGWNESGSGESNFITNRGAGSGGFVFRTVNAENSVETGRVDITGGGVIYANHLQVRSGARIEGNNNIVGQNLYAGMGSTMFEGNGNLTGGIWAQWGNLWSGLNNNSLFAKPPGGVQLFTARGGYYLEGRVDGTAVGFRWFQSDRRLKEDIKVVRSADDMLNIIRSYIPVSYKYKDASYTDNRGRTNTIEGKRSRAGFITQDLIRLWPEAVDVMSDGMQSPDPNQIIGGLMLLVKNLDARIQELEKDKT
Physico-chemical properties
protein length: 645 AA
molecular weight: 69439.22840 Da
isoelectric point: 6.02365
aromaticity: 0.06667
hydropathy: -0.40636
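
The sequence-derived values above can be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The page does not state which definitions it uses, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein, used here only as a short example;
# substitute the full 645-residue sequence to compare with the listed values.
peptide = "MATTIKTVMTYPLDGSTTDF"
print(len(peptide), round(aromaticity(peptide), 5), round(gravy(peptide), 5))
```

If the page uses other definitions (for example a different hydropathy scale), the recomputed numbers will differ from the listed ones.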

Domains [InterPro]

Domain ID Class Residues
IPR005604 ATT 1–131
DC_0657 STR 1–624
IPR030392 CHP 539–607
Architecture
ATT 1-131 | STR 132-624 | CHP 625-644
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
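
The architecture string can be checked against the protein length to see which residues fall outside every segment. This is a minimal sketch, assuming the ranges are 1-based and inclusive; `parse_architecture` and `unmapped` are hypothetical helper names, not part of any tool named on this page.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 1-131 | STR 132-624 | ...' into (label, start, end) tuples."""
    return [
        (m[1], int(m[2]), int(m[3]))
        for m in re.finditer(r"([A-Z]+)\s+(\d+)\s*-\s*(\d+)", text)
    ]


def unmapped(segments, protein_length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive residue ranges not covered by any segment."""
    gaps, next_pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > next_pos:
            gaps.append((next_pos, start - 1))
        next_pos = max(next_pos, end + 1)
    if next_pos <= protein_length:
        gaps.append((next_pos, protein_length))
    return gaps


segments = parse_architecture("ATT 1-131 | STR 132-624 | CHP 625-644")
print(segments)
print(unmapped(segments, 645))
```

For the string above and a length of 645 AA, the only residue left outside the three segments is position 645.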

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (NP_052117.1, residues 1-645)
Domain Start End Length (AA) Confidence
N-terminal 1 213 213 0.8747
Central domain 214 440 227 0.6513
C-terminal 441 645 205 0.8412
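
Segment lengths follow directly from the boundaries. The sketch below assumes the Start and End columns are 1-based and inclusive (the page does not say so explicitly) and checks that the three domains tile the protein without gaps or overlaps. The helper names are illustrative.

```python
def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def is_contiguous(domains, protein_length: int) -> bool:
    """True if the domains tile residues 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in domains:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


domains = [
    ("N-terminal", 1, 213),
    ("Central domain", 214, 440),
    ("C-terminal", 441, 645),
]
for name, start, end in domains:
    print(name, inclusive_length(start, end))
print(is_contiguous(domains, 645))
```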
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-213
Central: 214-440
C-terminal: 441-645

Taxonomy

Phage: Yersinia phage phiYeO3-12
Taxonomy ID: 110457 [NCBI]
Lineage: Uroviricota > Caudoviricetes > Autographivirales > Studiervirinae > Teetrevirus

Host: Yersinia enterocolitica
Taxonomy ID: 630 [NCBI]
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Host: Yersinia enterocolitica (type O:3)
Taxonomy ID: 34051 [NCBI]
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Yersinia

Coding sequence (CDS)

GenBank protein accession: NP_052117.1 [NCBI]
GenBank nucleotide accession: NC_001271.1 [NCBI]
CDS location: range 33972 -> 35909, strand +
CDS
ATGGCTACAACTATTAAGACCGTGATGACTTACCCGCTGGATGGCTCCACTACGGACTTTAATATTCCGTTCGAGTATCTGGCGCGTAAGTTTGTCCGAGTGACCCTTATCGGTGTTGACCGAAAGGAACTCATCTTGAATCAAGACTATCGTTTTGCAACTAAGACCACAATCTCCACAACGAGAGCGTTGGGGCCAGCGGACGGTTATACTTTGATTGAAATCCGTCGATTCACCTCTGCTACCGATAGGCTGGTGGACTTTACCGACGGCTCAATCCTGCGGGCATATGATTTGAACATATCTCAGGTTCAGACACTTCACGTTGCTGAGGAAGCCCGTGACCTTACCGCTGATACAATTGGCGTTAACAATGATGGGAACTTGGATGCTCGTGGTCGTCGTATTGTGAACGTTGCGGATGCGCAAGATGTAGGTGACGCAATCAACTTAGGTCAAATCCAACGGTGGAACGACTCTGCGTTGAACTCTGCGAATCGAGCGAAACAGGAAGCTGACCGTGCGACCGCTCGTGCAAACGATGCGAACAACTCTGCGAACGCATCTGCAAGCTCTGCAAGCTCTTCTGCTGGGTCTGCTGAGTTGGCGAAACGCTGGGCTACCAGTGATACAGTAGTTGAGAGTGACCTTGAGTCTTCTAGAACCTACGCGCTTCACTCCATGTTATATCGTAATGAGACTAAAGACTCTGCTGACCGTGCCGCTGTTTCCGAGACCAATGCTAAGGCCTCAGAAGGGGGCGCTGCTAACTCGGCTGCTGCCGCTAAGGTATCAGAGACCAACGCTAAGGCCTCCGAGGAGAGAGCAATCACTGAGGCGAGCAAGCTGGGAAACATGAATGACTTTGCGGCTGCCATAGAGTCAGTGACAGGGAACGATGTGAAGATGAAGGGCGCTGTGTCCTCACCCGGCAATATCACAGGTGGTGGTTTAGTCTCCACAGGTGCGGCGAGTATCCAGAAAGGTGCGCTTGTTGGAGAGGACTTGATTGTTGGAAGGGATATTACCGCAAAGCAGGATATGTACTCTCAGCGAAACATTGCCGTAGCTGGGGTCACGTATGCTCAGGGGGGCATTGAGCAAACACTAGCAACCAACATTTATAATAAGCTGTACCGACTTCACATTAACAGCAATCCACAACATGTGGGACAACGGCAGGGTCTACATATTGGATGGAACGAGAGTGGTAGCGGTGAGTCAAACTTTATCACCAACCGTGGTGCTGGTTCGGGCGGATTCGTGTTCCGCACAGTCAATGCTGAGAACTCAGTAGAGACTGGTAGAGTTGATATTACCGGAGGCGGTGTTATCTACGCAAACCACCTACAGGTTCGTTCAGGTGCCCGAATTGAGGGGAACAACAATATCGTTGGTCAGAACCTCTACGCAGGAATGGGAAGTACGATGTTTGAAGGTAACGGTAATCTAACTGGTGGCATCTGGGCGCAGTGGGGTAACTTATGGAGCGGGCTAAATAACAACTCACTCTTCGCCAAACCACCCGGCGGTGTTCAGTTATTTACCGCAAGGGGCGGTTATTATCTTGAAGGTAGGGTTGATGGTACAGCCGTTGGTTTCCGCTGGTTCCAGTCTGACAGAAGGCTGAAAGAAGACATTAAGGTTGTTCGCTCTGCTGACGACATGTTGAACATCATTCGGTCGTACATCCCGGTGTCCTACAAATATAAGGACGCATCCTATACAGATAACAGGGGTAGAACAAACACCATTGAAGGTAAGCGGTCACGGGCTGGCTTCATCACACAGGATTTAATACGCTTGTGGCCCGAGGCTGTGGACGTAATGTCAGATGGAATGCAGTCCCCTGACCCGAACCAGATTATTGGTGGACTCATGTTACTCGTTAAGAACCTAGATGCTCGCATTCAGGAGTTGGAGAAGGACAAGACTTAG
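
The CDS coordinates and the protein length can be cross-checked: an inclusive range of 33972 to 35909 spans 1938 nt, i.e. 646 codons, or 645 residues plus a stop codon. The sketch below assumes the standard codon assignments (which bacterial and phage translation table 11 shares for sense and stop codons). The function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def cds_protein_length(start: int, end: int) -> int:
    """Protein length implied by an inclusive CDS range that ends in a stop codon."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_nt // 3 - 1  # subtract the stop codon


print(cds_protein_length(33972, 35909))
print(translate("ATGGCTACAACTATTAAGACC"))  # first seven codons of the CDS above
print(translate("GACAAGACTTAG"))           # last four codons, including the stop
```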

Tertiary structure

PDB ID: 237c0c8d6d1fc671fe1d6b4fc132154cf7b8044ec87216cbfb35e32c9c2f3c0d
Source: ESMFold
Method: ESMFold
Resolution: 0.6502
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
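
The confidence bands above can be applied to per-residue pLDDT scores as in the following sketch. The legend does not say where scores of exactly 90, 70 or 50 belong; placing them in the lower band is an assumption made here. The function name and the example scores are hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band.
    """
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Hypothetical per-residue scores, for illustration only.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_band(score))
```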