GenBank accession
WPJ72496.1 [GenBank]
Protein name
L-shaped tail fiber protein
RBP type
Type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TF RBPdetect 0.88
TF RBPdetect2 0.95
TF UniProt/TrEMBL 1.00
Protein sequence
MALKTKIIVQQILNIDDTTTTASKYPKYTVVLGNSISSITAGELTAAVEAAAESAAAAKDSEIAAKDSENKAKDSEIQAGIHAGASEASATQSAASAAESERQANLSQGSAENSAASALESKNFKDASELAAQNAEQSKILAEQAQRAAEAAQSGAKASENKASAFATQAAASSASAGDFAAAAKQSELNAKTSETNAATSEVEAETQAETATTEANRAKAEADRAAQIVDSKLDKEDISGFIKVYKTKEEADADVSSRVLGEKILVWNQTDSKYGWYKVAGTAEAPVLELVETEQKLVSINNVRADDAGNVQITLPGGNPSLWLGEVTWFPYDKDSGVGYPGVLPADGREVLRVDYPDTWEAIEAGLIPSVTEEQWQAGATLYFSTGNGTTTFRLPDMMQGQAFRAAAKGEENAGNIKEQIPYITMINGKAPADDGTITLGNAADKNVWNGIDGEVLLRGAFGLGGTGLILNEPDAVSFFKAMRAFGSGYYRNDSESNPVIPKYSAGFYSKTADTHTFICSAYGNGVTFAATINDALLDGENPTVHTNILYGTANKPDLNTDTQGVLGVEKGGTGATTQKGARLNLDTPVGSRAIGMPNNSDVLAFMKSSAESGYYSSGNIVTGVPETAGWYMFDLHVHGKNAAGEMEYGNVYCTTSAGAIWYTLMEVGVWQPWRRLTTEHGIIPITSGGTGTNNANDARINLGLGPINAPTFSGMTLQGTNETTSGIAVFSNRNAEGTQLSYSRMYHEIQSGVGKTTIQTTREGGATNYFQIDEYGNIGNINSIIAYGYMGLGAANAMGNASIAIGDSDSGLKWNSDGNISTVADGVKIATWTPHGFYTHKIISSDVANTERGMYVNGVRTTGASALVAGVIEAGSHVGWRDRASGMLVELNTRGAAANIWKATRWGDQHAGASDIVIYDDGSPYYRTLVGGGEFGFNGLGQATCTSWISTSDIRLKAQLKEIVSAKDKVKSLQGYTYFKRNSLVEDEHSFYCEEAGLIAQDVQTVLPEAVYKIANSDLLGVNYSGVTALLANAVKEMLADAEAQEARISNLEEELAELKALIATLVNK
Physico-chemical properties
protein length: 1071 AA
molecular weight: 113026.69140 Da
isoelectric point: 4.76910
aromaticity: 0.07470
hydropathy: -0.27890
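
The aromaticity and hydropathy values above can be recomputed from the protein sequence. The record does not say which definitions were used, so the sketch below assumes the common ones: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name the scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the sequence above, used only as a demo input.
fragment = "MALKTKIIVQ"
print(len(fragment), round(aromaticity(fragment), 4), round(gravy(fragment), 4))
```

Passing the full 1071-residue sequence instead of the fragment gives values to compare with the listed ones. Small differences would point to a different scale or rounding.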

Domains

Domains [InterPro]
ID Type Range
DC_0608 ATT 2–580
Coil Unmapped 142–162
IPR030392 CHP 954–1014
Sequence: WPJ72496.1, residues 1–1071
Architecture
ATT 2-580 | STR 600-760 | ATT 846-1071
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
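
The architecture line can be read as a list of labelled, inclusive residue ranges. The following is a minimal sketch of parsing that string and listing the residues left uncovered along the 1071-residue protein. The `Segment` type and both function names are hypothetical, and the `LABEL start-end | ...` layout is assumed from the single line shown here.

```python
import re
from typing import List, NamedTuple, Tuple


class Segment(NamedTuple):
    label: str
    start: int
    end: int


def parse_architecture(text: str) -> List[Segment]:
    """Parse 'ATT 2-580 | STR 600-760' into labelled inclusive ranges."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append(Segment(match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered_ranges(segments: List[Segment], length: int) -> List[Tuple[int, int]]:
    """Return inclusive residue ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue not yet accounted for
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.start > position:
            gaps.append((position, seg.start - 1))
        position = max(position, seg.end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


architecture = parse_architecture("ATT 2-580 | STR 600-760 | ATT 846-1071")
print(uncovered_ranges(architecture, 1071))
```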

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (WPJ72496.1, residues 1–1071)
Domain Start End Length (AA) Confidence
N-terminal 1 446 446 0.9392
Central domain 447 645 200 0.2881
C-terminal 646 1071 425 0.7668
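
The listed lengths can be checked against the start and end coordinates. The sketch below assumes 1-based inclusive coordinates, which the record does not state. Under that assumption the central domain spans 199 residues and the C-terminal domain 426, each one off from the listed 200 and 425, while the listed lengths still sum to 1071. The boundary near residue 646 may therefore be assigned differently than the coordinates suggest.

```python
# (name, start, end, listed length) copied from the table above.
domains = [
    ("N-terminal", 1, 446, 446),
    ("Central domain", 447, 645, 200),
    ("C-terminal", 646, 1071, 425),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range (assumed convention)."""
    return end - start + 1


for name, start, end, listed in domains:
    computed = inclusive_length(start, end)
    status = "matches" if computed == listed else "differs"
    print(f"{name}: listed {listed}, computed {computed} ({status})")

print("sum of listed lengths:", sum(listed for *_, listed in domains))
```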
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-446
Central 447-645
C-terminal 646-1071

Taxonomy

Role Name Taxonomy ID Lineage
Phage Salmonella phage CRW-SP6 [NCBI] 3079602 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella enteritidis [NCBI] 149539 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
WPJ72496.1 [NCBI]
GenBank nucleotide accession
OR464144.1 [NCBI]
CDS location
range 20410 -> 23625
strand +
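
The CDS range is consistent with the protein length: 20410 to 23625 spans 3216 nt, which is 1072 codons, or 1071 residues plus a stop codon. A minimal sketch of that check, assuming 1-based inclusive coordinates and a stop codon inside the range. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotides in a 1-based inclusive CDS range (assumed convention)."""
    return end - start + 1


def protein_length(nt_length: int, includes_stop: bool = True) -> int:
    """Residues encoded by a CDS of the given length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    return codons - 1 if includes_stop else codons


nt = cds_length(20410, 23625)
print(nt, protein_length(nt))
```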
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTCTGAACATAGATGACACTACAACTACTGCTAGTAAGTATCCTAAATATACAGTAGTTTTAGGTAATTCTATTAGTTCTATTACTGCTGGTGAACTAACAGCGGCTGTTGAAGCCGCCGCAGAGTCTGCTGCTGCTGCTAAAGATTCTGAAATAGCAGCTAAAGACTCTGAAAATAAAGCTAAAGATTCGGAAATTCAAGCGGGTATTCATGCTGGTGCTTCTGAGGCTTCAGCAACCCAGTCTGCTGCTTCTGCTGCTGAATCTGAAAGACAAGCTAACTTATCTCAAGGTAGTGCGGAAAACTCTGCTGCTTCTGCTTTAGAATCTAAGAATTTTAAAGATGCTTCGGAACTTGCTGCTCAAAATGCAGAGCAGAGTAAGATTTTAGCAGAGCAAGCTCAAAGAGCGGCAGAAGCTGCCCAGTCTGGTGCTAAAGCTTCTGAAAATAAAGCATCAGCATTTGCTACACAAGCTGCTGCATCTTCAGCTTCCGCAGGAGATTTTGCTGCAGCCGCTAAACAATCTGAATTAAATGCTAAAACTTCTGAAACCAATGCCGCAACATCAGAAGTGGAAGCGGAAACCCAAGCTGAAACTGCTACTACTGAGGCAAATCGTGCTAAGGCTGAAGCCGATCGCGCAGCTCAGATTGTAGATAGTAAGTTAGATAAAGAAGATATATCTGGCTTTATCAAAGTCTACAAGACTAAAGAAGAAGCGGACGCCGACGTTAGTAGCCGCGTACTAGGTGAAAAGATCCTAGTGTGGAACCAAACTGACTCAAAATATGGATGGTATAAAGTAGCTGGAACTGCTGAGGCTCCAGTATTAGAGTTAGTAGAGACAGAGCAAAAGCTAGTTTCTATTAATAACGTTCGTGCAGATGACGCAGGTAACGTACAGATTACTCTTCCTGGTGGTAATCCTTCCTTATGGTTGGGTGAAGTTACTTGGTTCCCTTATGACAAAGATTCAGGTGTTGGCTATCCTGGTGTTCTCCCTGCTGATGGCCGCGAAGTCCTTCGTGTAGACTATCCAGATACGTGGGAGGCTATCGAAGCCGGTCTGATTCCTTCTGTTACTGAAGAACAATGGCAAGCTGGTGCAACTCTCTACTTCTCCACTGGTAATGGTACTACTACTTTCCGCCTACCTGATATGATGCAGGGCCAAGCATTCCGTGCTGCTGCAAAAGGAGAGGAAAACGCTGGTAATATTAAAGAGCAAATCCCGTACATCACTATGATTAATGGTAAAGCTCCTGCTGACGATGGTACAATTACTTTAGGTAATGCTGCGGATAAAAACGTATGGAATGGTATTGATGGTGAAGTACTGTTAAGAGGTGCTTTTGGTCTTGGAGGTACTGGTTTAATTCTTAATGAACCTGATGCTGTTTCCTTCTTTAAAGCAATGCGTGCTTTTGGTTCAGGATATTATAGAAATGACTCTGAAAGTAACCCAGTAATCCCTAAATACTCTGCAGGATTCTACTCCAAAACTGCCGACACTCATACTTTTATCTGTTCTGCTTATGGTAATGGTGTTACTTTCGCAGCTACTATAAATGATGCATTATTAGATGGAGAAAATCCTACTGTACATACAAATATTCTTTATGGTACAGCAAATAAACCTGATCTGAATACCGATACTCAAGGAGTTTTAGGAGTAGAGAAGGGCGGTACTGGTGCTACTACGCAGAAAGGTGCTAGACTAAATCTGGATACTCCTGTAGGCAGCAGAGCTATTGGAATGCCTAATAACTCTGATGTACTAGCTTTCATGAAATCTTCCGCAGAAAGCGGATATTATTCCTCTGGTAATATAGTTACTGGAGTTCCAGAAACTGCAGGATGGTATATGTTCGATCTCCATGTACATGGTAAGAATGCTGCGGGAGAAATGGAGTATGGTAATGTATACTGTACAACAAGTGCTGGTGCTATTTGGTACACCTTAAT
GGAGGTTGGTGTATGGCAGCCATGGAGACGTTTGACCACAGAACATGGTATTATTCCTATTACTTCAGGGGGTACTGGTACAAATAATGCAAATGACGCAAGAATAAATCTAGGTCTTGGTCCTATAAATGCACCTACTTTTAGTGGTATGACTCTTCAGGGTACTAATGAAACTACTTCAGGTATAGCGGTTTTTAGTAATAGAAATGCGGAAGGGACTCAACTTTCCTATTCTAGAATGTACCATGAAATTCAGAGTGGTGTTGGTAAAACTACTATTCAGACTACAAGAGAGGGCGGGGCGACTAACTATTTCCAAATTGATGAGTATGGTAATATTGGGAATATTAACTCAATTATTGCATATGGATACATGGGATTAGGTGCTGCTAATGCTATGGGAAACGCCTCTATTGCGATTGGTGACTCTGACTCTGGGCTAAAATGGAATAGTGATGGTAACATAAGTACTGTAGCAGATGGTGTAAAAATAGCCACATGGACACCTCATGGATTTTATACACATAAAATAATAAGCTCAGATGTTGCTAATACCGAAAGAGGGATGTATGTAAACGGGGTTAGGACTACCGGTGCCTCCGCTCTTGTAGCTGGGGTTATAGAAGCTGGATCTCATGTTGGTTGGAGAGATAGAGCTTCAGGTATGCTTGTTGAATTGAATACTAGAGGAGCTGCTGCCAATATCTGGAAAGCAACTAGATGGGGTGACCAACATGCTGGTGCATCTGACATCGTTATTTATGATGATGGATCTCCTTATTATAGAACTCTTGTAGGCGGTGGTGAATTTGGGTTCAATGGCCTTGGACAAGCTACCTGTACTTCTTGGATCAGTACATCTGATATTAGGCTTAAGGCACAGCTAAAAGAGATAGTATCTGCTAAAGATAAGGTAAAATCCCTACAGGGGTACACTTATTTTAAACGTAATAGTTTGGTTGAAGATGAGCATTCCTTTTATTGTGAAGAGGCAGGATTAATCGCACAAGATGTTCAAACTGTACTACCTGAAGCTGTATATAAAATAGCTAACTCAGATCTTCTCGGTGTTAATTACTCTGGTGTTACCGCATTATTGGCTAACGCAGTAAAAGAGATGTTGGCGGATGCGGAGGCTCAGGAAGCTCGTATCAGTAATCTAGAAGAAGAACTGGCAGAGTTAAAAGCTCTAATAGCCACTCTGGTAAATAAGTAA

Genome Context

Tertiary structure

PDB ID
cbd81e91f2f4e52201bb37b9534f0308512ccb2dc09c320df1faf69b73e3f712
Source ESMFold
Method ESMFold
Resolution 0.6074
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
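
The confidence bands above can be applied to per-residue pLDDT scores as in the following sketch. The legend uses strict inequalities, so scores of exactly 90, 70, or 50 are unassigned. Placing them in the lower band is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score on the 0-100 scale to its confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"  # boundary scores fall into the lower band (assumption)


for example in (95.0, 80.0, 60.0, 30.0):
    print(example, plddt_band(example))
```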