GenBank accession
WCZ57788.1
Protein name
tail fiber
RBP type
TF (evidence: GenBank; probability: 1.00)
TSP (evidence: RBPdetect; probability: 0.89)
Protein sequence
MYYSLMRESKVIVEYDGRAFHFDALSNYDIQTSYEEFKTLRRTVHRRTNYADSIINAQTPSSISLAVNFSNTLTEANFFEWLGFDRKGNTFLLPLYSSNIEPIMFNIYIVNKDNNCVYFENCYISTVDFSLDKNIPILNVGIESGKFSEVSTYREAASIIQGEVMSYSPVIASTNGSILPGLISASLSFQQQCSWREDKSVFDINKIYNNKRAYVTEMNASATISLYYLKRFAGDMVYNIEPEIDVPLNIRNNNISIDFPSARITKRLDFSDVYRVEWDIIPTASSDPVRIDFFGEIKND
Physico-chemical properties
protein length: 300 AA
molecular weight: 34459.35870 Da
isoelectric point: 4.91580
aromaticity: 0.13333
hydropathy: -0.22300
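
The aromaticity and hydropathy values above can be approximated directly from the protein sequence. The following is a minimal sketch, assuming that "aromaticity" means the fraction of Phe, Trp, and Tyr residues and that "hydropathy" is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, and the function names are illustrative. Under this definition, an aromaticity of 0.13333 would correspond to 40 aromatic residues out of 300.

```python
# Kyte-Doolittle hydropathy scale (assumed to underlie the "hydropathy" value).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W), or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the entry, used only as a short demonstration;
# pass the full 300-residue sequence to compare with the listed values.
prefix = "MYYSLMRESK"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Molecular weight and isoelectric point need residue mass and pKa tables, so they are left out of this sketch.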

Domains

WCZ57788.1 (1–300 aa)
ATT (Attachment Domain): 1–298

Domain legend: ATT, Attachment Domain; STR, Structural Domain; RBD, Receptor-Binding Domain; CBM, Carbohydrate-Binding Module; LEC, Lectin-like Domain; ENZ, Enzymatic Domain; CHP, Intramolecular Chaperone; LNK, Linker/Spacer Domain; TAS, Tail-Associated Structural; TTP, Tail Tubular Protein; UNK, Uncharacterized Domain; Unmapped.

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

WCZ57788.1 (1–300 aa)

Domain          Start  End  Length (AA)  Confidence
N-terminal      1      240  240          0.6523
Central domain  241    289  50           0.3023
C-terminal      290    300  10           0.6698
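
The segment coordinates can be checked against the listed lengths. The sketch below assumes 1-based coordinates with both ends included, which the record does not state. Under that assumption the central and C-terminal lengths computed from start and end (49 and 11) differ from the listed values (50 and 10), although the three segments still cover residues 1 to 300 without gaps. The helper names are illustrative.

```python
# (name, start, end, listed_length) copied from the segmentation table.
segments = [
    ("N-terminal", 1, 240, 240),
    ("Central domain", 241, 289, 50),
    ("C-terminal", 290, 300, 10),
]
PROTEIN_LENGTH = 300


def inclusive_length(start: int, end: int) -> int:
    """Residue count for 1-based coordinates with both ends included."""
    return end - start + 1


def mismatches(rows):
    """Return (name, computed, listed) for rows whose lengths disagree."""
    return [
        (name, inclusive_length(s, e), listed)
        for name, s, e, listed in rows
        if inclusive_length(s, e) != listed
    ]


def tiles_protein(rows, total: int) -> bool:
    """True if the segments are contiguous and cover residues 1..total."""
    expected_start = 1
    for _, s, e, _ in rows:
        if s != expected_start:
            return False
        expected_start = e + 1
    return expected_start == total + 1


print(mismatches(segments))
print(tiles_protein(segments, PROTEIN_LENGTH))
```

The listed lengths do sum to 300, so the boundaries may follow a different convention than the one assumed here.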


Taxonomy

Coding sequence (CDS)

GenBank protein accession
WCZ57788.1
GenBank nucleotide accession
OQ291034.1
CDS location
range 95902 -> 96804
strand -
CDS
ATGTACTACTCTCTAATGCGAGAGTCAAAAGTTATAGTTGAGTATGATGGTAGGGCATTTCATTTTGATGCCCTATCAAACTATGATATACAGACTTCCTACGAGGAATTCAAGACTCTTCGTAGGACTGTTCATCGTAGAACTAACTATGCAGACTCTATTATAAATGCTCAAACCCCCTCTTCTATCTCTCTAGCAGTAAATTTTAGTAATACTCTTACTGAAGCTAACTTCTTTGAATGGTTAGGTTTTGATAGAAAAGGTAATACTTTCTTACTACCACTATATAGTAGTAATATTGAACCTATTATGTTCAATATCTATATAGTAAATAAAGATAATAACTGTGTATATTTTGAAAACTGTTATATATCTACGGTAGATTTTTCTTTAGATAAGAACATACCAATTCTTAATGTTGGTATTGAGTCTGGGAAGTTCTCAGAAGTATCTACCTATAGAGAAGCAGCTTCTATTATACAGGGTGAAGTAATGTCTTACAGCCCAGTAATAGCTTCTACTAATGGCAGCATCTTACCCGGTCTTATTTCAGCCTCTTTATCTTTCCAACAGCAGTGCTCCTGGAGAGAGGATAAGAGTGTTTTTGATATAAATAAAATTTATAATAATAAAAGAGCTTATGTAACTGAAATGAATGCTTCGGCAACCATTTCTCTGTACTACTTAAAACGTTTTGCAGGAGACATGGTTTACAATATCGAACCGGAGATAGATGTACCCTTAAATATAAGAAATAATAATATTTCTATAGATTTTCCTTCAGCACGTATTACAAAACGCCTAGATTTCTCAGATGTGTATAGAGTTGAGTGGGATATTATACCTACTGCTTCTTCAGACCCAGTGAGAATAGATTTCTTTGGAGAAATTAAAAATGATTAA
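
The CDS coordinates, strand, and protein length can be cross-checked with a few lines of code. The sketch below assumes the coordinates are inclusive and uses the standard codon assignments, which are the same in the bacterial genetic code (NCBI translation table 11) apart from the set of permitted start codons. The function names are illustrative, and only the first 30 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def reverse_complement(dna: str) -> str:
    """Reverse complement, as needed to read a minus-strand feature."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def cds_length(start: int, end: int) -> int:
    """Nucleotide count for inclusive genome coordinates."""
    return end - start + 1


n_nt = cds_length(95902, 96804)
print(n_nt, n_nt // 3 - 1)  # 903 nucleotides, 300 codons before the stop
print(translate("ATGTACTACTCTCTAATGCGAGAGTCAAAA"))  # MYYSLMRESK
print(translate("AATGATTAA"))  # ND
```

Because the feature lies on the minus strand, the sequence shown is already in coding orientation (it starts with ATG and ends with TAA); applying `reverse_complement` to the forward-strand slice of OQ291034.1 at these coordinates should reproduce it.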

Genome Context

Tertiary structure

WCZ57788.1
ESMFold structure
Source ESMFold
pLDDT 82.6
Oligomeric state monomer

Literature

Title: Whole genome sequence of Salmonella phage Kenya-K37
Authors: Gunathilake, D., Makumi, A., Loignon, S., Trembley, D., Labrie, S., Svitek, N. and Moineau, S.
Date: 2024-01-11
PMID: none listed
Source: GenBank