Genbank accession
WPJ72496.1 [GenBank]
Protein name
L-shaped tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,88
TF
Evidence RBPdetect2
Probability 0,76
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MALKTKIIVQQILNIDDTTTTASKYPKYTVVLGNSISSITAGELTAAVEAAAESAAAAKDSEIAAKDSENKAKDSEIQAGIHAGASEASATQSAASAAESERQANLSQGSAENSAASALESKNFKDASELAAQNAEQSKILAEQAQRAAEAAQSGAKASENKASAFATQAAASSASAGDFAAAAKQSELNAKTSETNAATSEVEAETQAETATTEANRAKAEADRAAQIVDSKLDKEDISGFIKVYKTKEEADADVSSRVLGEKILVWNQTDSKYGWYKVAGTAEAPVLELVETEQKLVSINNVRADDAGNVQITLPGGNPSLWLGEVTWFPYDKDSGVGYPGVLPADGREVLRVDYPDTWEAIEAGLIPSVTEEQWQAGATLYFSTGNGTTTFRLPDMMQGQAFRAAAKGEENAGNIKEQIPYITMINGKAPADDGTITLGNAADKNVWNGIDGEVLLRGAFGLGGTGLILNEPDAVSFFKAMRAFGSGYYRNDSESNPVIPKYSAGFYSKTADTHTFICSAYGNGVTFAATINDALLDGENPTVHTNILYGTANKPDLNTDTQGVLGVEKGGTGATTQKGARLNLDTPVGSRAIGMPNNSDVLAFMKSSAESGYYSSGNIVTGVPETAGWYMFDLHVHGKNAAGEMEYGNVYCTTSAGAIWYTLMEVGVWQPWRRLTTEHGIIPITSGGTGTNNANDARINLGLGPINAPTFSGMTLQGTNETTSGIAVFSNRNAEGTQLSYSRMYHEIQSGVGKTTIQTTREGGATNYFQIDEYGNIGNINSIIAYGYMGLGAANAMGNASIAIGDSDSGLKWNSDGNISTVADGVKIATWTPHGFYTHKIISSDVANTERGMYVNGVRTTGASALVAGVIEAGSHVGWRDRASGMLVELNTRGAAANIWKATRWGDQHAGASDIVIYDDGSPYYRTLVGGGEFGFNGLGQATCTSWISTSDIRLKAQLKEIVSAKDKVKSLQGYTYFKRNSLVEDEHSFYCEEAGLIAQDVQTVLPEAVYKIANSDLLGVNYSGVTALLANAVKEMLADAEAQEARISNLEEELAELKALIATLVNK
Physico‐chemical
properties
protein length:1071 AA
molecular weight: 113026,69140 Da
isoelectric point:4,76910
aromaticity:0,07470
hydropathy:-0,27890

Domains

View on InterPro
WPJ72496.1
1 1071 aa
CHP 954–1058 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

WPJ72496.1
1 1071 aa
Domain Start End Length (AA) Confidence
N-terminal 1 446 446 0,9392
Central domain 447 645 200 0,2881
C-terminal 646 1071 425 0,7668
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
WPJ72496.1 [NCBI]
Genbank nucleotide accession
OR464144.1 [NCBI]
CDS location
range 20410 -> 23625
strand +
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTCTGAACATAGATGACACTACAACTACTGCTAGTAAGTATCCTAAATATACAGTAGTTTTAGGTAATTCTATTAGTTCTATTACTGCTGGTGAACTAACAGCGGCTGTTGAAGCCGCCGCAGAGTCTGCTGCTGCTGCTAAAGATTCTGAAATAGCAGCTAAAGACTCTGAAAATAAAGCTAAAGATTCGGAAATTCAAGCGGGTATTCATGCTGGTGCTTCTGAGGCTTCAGCAACCCAGTCTGCTGCTTCTGCTGCTGAATCTGAAAGACAAGCTAACTTATCTCAAGGTAGTGCGGAAAACTCTGCTGCTTCTGCTTTAGAATCTAAGAATTTTAAAGATGCTTCGGAACTTGCTGCTCAAAATGCAGAGCAGAGTAAGATTTTAGCAGAGCAAGCTCAAAGAGCGGCAGAAGCTGCCCAGTCTGGTGCTAAAGCTTCTGAAAATAAAGCATCAGCATTTGCTACACAAGCTGCTGCATCTTCAGCTTCCGCAGGAGATTTTGCTGCAGCCGCTAAACAATCTGAATTAAATGCTAAAACTTCTGAAACCAATGCCGCAACATCAGAAGTGGAAGCGGAAACCCAAGCTGAAACTGCTACTACTGAGGCAAATCGTGCTAAGGCTGAAGCCGATCGCGCAGCTCAGATTGTAGATAGTAAGTTAGATAAAGAAGATATATCTGGCTTTATCAAAGTCTACAAGACTAAAGAAGAAGCGGACGCCGACGTTAGTAGCCGCGTACTAGGTGAAAAGATCCTAGTGTGGAACCAAACTGACTCAAAATATGGATGGTATAAAGTAGCTGGAACTGCTGAGGCTCCAGTATTAGAGTTAGTAGAGACAGAGCAAAAGCTAGTTTCTATTAATAACGTTCGTGCAGATGACGCAGGTAACGTACAGATTACTCTTCCTGGTGGTAATCCTTCCTTATGGTTGGGTGAAGTTACTTGGTTCCCTTATGACAAAGATTCAGGTGTTGGCTATCCTGGTGTTCTCCCTGCTGATGGCCGCGAAGTCCTTCGTGTAGACTATCCAGATACGTGGGAGGCTATCGAAGCCGGTCTGATTCCTTCTGTTACTGAAGAACAATGGCAAGCTGGTGCAACTCTCTACTTCTCCACTGGTAATGGTACTACTACTTTCCGCCTACCTGATATGATGCAGGGCCAAGCATTCCGTGCTGCTGCAAAAGGAGAGGAAAACGCTGGTAATATTAAAGAGCAAATCCCGTACATCACTATGATTAATGGTAAAGCTCCTGCTGACGATGGTACAATTACTTTAGGTAATGCTGCGGATAAAAACGTATGGAATGGTATTGATGGTGAAGTACTGTTAAGAGGTGCTTTTGGTCTTGGAGGTACTGGTTTAATTCTTAATGAACCTGATGCTGTTTCCTTCTTTAAAGCAATGCGTGCTTTTGGTTCAGGATATTATAGAAATGACTCTGAAAGTAACCCAGTAATCCCTAAATACTCTGCAGGATTCTACTCCAAAACTGCCGACACTCATACTTTTATCTGTTCTGCTTATGGTAATGGTGTTACTTTCGCAGCTACTATAAATGATGCATTATTAGATGGAGAAAATCCTACTGTACATACAAATATTCTTTATGGTACAGCAAATAAACCTGATCTGAATACCGATACTCAAGGAGTTTTAGGAGTAGAGAAGGGCGGTACTGGTGCTACTACGCAGAAAGGTGCTAGACTAAATCTGGATACTCCTGTAGGCAGCAGAGCTATTGGAATGCCTAATAACTCTGATGTACTAGCTTTCATGAAATCTTCCGCAGAAAGCGGATATTATTCCTCTGGTAATATAGTTACTGGAGTTCCAGAAACTGCAGGATGGTATATGTTCGATCTCCATGTACATGGTAAGAATGCTGCGGGAGAAATGGAGTATGGTAATGTATACTGTACAACAAGTGCTGGTGCTATTTGGTACACCTTAATGGAGGTTGGTGTATGGCAGCCATGGAGACGTTTGACCACAGAACATGGTATTATTCCTATTACTTCAGGGGGTACTGGTACAAATAATGCAAATGACGCAAGAATAAATCTAGGTCTTGGTCCTATAAATGCACCTACTTTTAGTGGTATGACTCTTCAGGGTACTAATGAAACTACTTCAGGTATAGCGGTTTTTAGTAATAGAAATGCGGAAGGGACTCAACTTTCCTATTCTAGAATGTACCATGAAATTCAGAGTGGTGTTGGTAAAACTACTATTCAGACTACAAGAGAGGGCGGGGCGACTAACTATTTCCAAATTGATGAGTATGGTAATATTGGGAATATTAACTCAATTATTGCATATGGATACATGGGATTAGGTGCTGCTAATGCTATGGGAAACGCCTCTATTGCGATTGGTGACTCTGACTCTGGGCTAAAATGGAATAGTGATGGTAACATAAGTACTGTAGCAGATGGTGTAAAAATAGCCACATGGACACCTCATGGATTTTATACACATAAAATAATAAGCTCAGATGTTGCTAATACCGAAAGAGGGATGTATGTAAACGGGGTTAGGACTACCGGTGCCTCCGCTCTTGTAGCTGGGGTTATAGAAGCTGGATCTCATGTTGGTTGGAGAGATAGAGCTTCAGGTATGCTTGTTGAATTGAATACTAGAGGAGCTGCTGCCAATATCTGGAAAGCAACTAGATGGGGTGACCAACATGCTGGTGCATCTGACATCGTTATTTATGATGATGGATCTCCTTATTATAGAACTCTTGTAGGCGGTGGTGAATTTGGGTTCAATGGCCTTGGACAAGCTACCTGTACTTCTTGGATCAGTACATCTGATATTAGGCTTAAGGCACAGCTAAAAGAGATAGTATCTGCTAAAGATAAGGTAAAATCCCTACAGGGGTACACTTATTTTAAACGTAATAGTTTGGTTGAAGATGAGCATTCCTTTTATTGTGAAGAGGCAGGATTAATCGCACAAGATGTTCAAACTGTACTACCTGAAGCTGTATATAAAATAGCTAACTCAGATCTTCTCGGTGTTAATTACTCTGGTGTTACCGCATTATTGGCTAACGCAGTAAAAGAGATGTTGGCGGATGCGGAGGCTCAGGAAGCTCGTATCAGTAATCTAGAAGAAGAACTGGCAGAGTTAAAAGCTCTAATAGCCACTCTGGTAAATAAGTAA

Genome Context

Tertiary structure

WPJ72496.1
ESMFold structure
Source ESMFold
pLDDT 60.7
Oligomeric state monomer