Genbank accession
WGG14746.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 0,94
TSP
Evidence RBPdetect
Probability 0,89
Protein sequence
MGFFAGKYSDGKTVLSLNTESGGDINSHYSPNNNSIFHSDMPFVLVDGTYEAGLGDAGNGFFVCQMPSDIINIKSNDPGRVILTAIEINGTHRAFLNGTQSKVGQTIVATQADPFRSFASVSQTSGFAFGNSLASGTYNYNPSIGHEESISRNGTGGTTLHSTYHGIVRPGAGAPVGITVAQAFEQLGFPTNSSTVPVDGNNPYYWDPGWMSPLGAAHRGHDWFYVCSSNIRGYGGVRQGLPGSVNTMYHDGGNRFVCRGSTTNLANQSGNATILQDWYNITPTKVIWYVLNLRYSNGNMGISGNPFTGSDILISPSNFTIKGISLPNTGYKFINQNAFGNLGYRPDMEYIGNNAAYTGVFGDTTARCELVGSSNGGLWSPVDYGGAKSQISIYKFGAGKQWYVNSRDNTIGNEHGVVWGPSAVPLRLFGGNVGSSYMGDDITPSYPGTGDRYVGLSTIGLGIPGGNATVILTTEVISGNLNCAGVPANTWNNGVFQVQGRRAYSYSNGDAIFHQILTLPVGYLVPFHTTSAFRYTPNNALSRNSFIYTVKNLGNGNVELGVVMHVSLGSAVFLPRLRVTVQRLT
Physico‐chemical
properties
protein length:585 AA
molecular weight: 62446,73350 Da
isoelectric point:7,69005
aromaticity:0,11282
hydropathy:-0,18051

Domains

Domains [InterPro]
IPR059609
RBD
1–585
WGG14746.1
1 585
Architecture
RBD
RBD 1-585
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WGG14746.1
1 585
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,2429
Central domain 11 209 200 0,4143
C-terminal 210 585 375 0,7986
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-10
Central
11-209
C-terminal
210-585

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage phC17
[NCBI]
3038310 Uroviricota > Caudoviricetes > Demerecviridae > Tequintavirus > Tequintavirus phC17
Host Salmonella enterica subsp. enterica
[NCBI]
59201 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WGG14746.1 [NCBI]
Genbank nucleotide accession
OQ680481 [NCBI]
CDS location
range 83124 -> 84881
strand +
CDS
ATGGGTTTTTTCGCTGGAAAATATAGCGATGGTAAGACCGTACTATCTTTAAATACTGAATCTGGTGGTGACATTAATAGTCACTATAGTCCAAATAATAATAGTATTTTTCATAGTGATATGCCATTTGTCCTAGTTGATGGTACTTATGAGGCTGGGTTAGGTGATGCTGGAAATGGGTTTTTTGTATGTCAGATGCCCTCTGACATAATAAATATTAAATCTAATGACCCAGGTAGAGTTATACTAACTGCTATTGAGATTAATGGTACTCATAGGGCTTTTCTTAATGGTACTCAGAGTAAGGTGGGTCAAACTATAGTTGCCACTCAAGCAGATCCTTTTAGATCTTTTGCTAGCGTTTCCCAGACTAGTGGATTTGCATTTGGTAATAGCCTGGCATCTGGAACATATAACTATAACCCATCTATAGGGCATGAAGAATCTATCTCTAGAAATGGTACTGGTGGAACCACCCTACATAGTACTTATCATGGTATAGTTAGGCCAGGTGCAGGAGCTCCAGTAGGTATTACTGTTGCACAAGCTTTTGAGCAATTGGGTTTCCCTACTAATAGTAGCACAGTACCTGTAGATGGCAATAACCCATACTATTGGGATCCTGGATGGATGTCGCCTCTAGGAGCAGCACATAGAGGGCATGATTGGTTTTATGTTTGTAGCTCTAACATACGTGGGTATGGTGGGGTAAGACAAGGTCTTCCTGGTAGTGTAAATACTATGTACCACGATGGGGGTAACAGATTTGTTTGTAGGGGTTCTACAACCAATTTAGCTAACCAATCTGGTAATGCTACAATATTACAGGATTGGTATAACATAACTCCCACTAAGGTTATTTGGTACGTACTTAATCTGAGATATTCTAATGGTAATATGGGTATTTCTGGTAATCCCTTTACTGGTTCAGATATTCTTATCTCTCCCTCTAATTTTACTATAAAAGGGATTAGTTTACCGAATACTGGATATAAGTTTATCAACCAAAATGCTTTTGGTAACCTAGGTTACCGTCCTGATATGGAGTATATAGGGAATAACGCAGCGTATACCGGGGTTTTTGGGGATACCACTGCAAGGTGTGAACTTGTAGGATCTAGTAATGGAGGATTATGGTCTCCTGTAGACTATGGTGGAGCTAAATCCCAAATTAGCATTTACAAATTCGGAGCTGGTAAGCAATGGTATGTAAATTCTAGGGATAATACTATCGGTAATGAACACGGGGTTGTTTGGGGACCATCAGCAGTTCCATTACGACTTTTTGGTGGAAATGTAGGTAGCTCTTACATGGGAGATGATATTACTCCCAGCTACCCAGGAACGGGTGATAGATACGTTGGTTTATCAACTATTGGGCTAGGTATACCAGGCGGAAATGCTACAGTAATTCTTACTACTGAAGTTATATCAGGTAACCTTAACTGTGCAGGTGTTCCTGCTAATACATGGAATAACGGTGTGTTTCAAGTGCAGGGAAGAAGAGCATATAGTTATAGTAATGGAGATGCAATATTCCACCAGATTTTAACATTACCAGTAGGGTATTTAGTACCTTTTCATACTACATCTGCTTTTAGATACACACCTAACAATGCACTAAGTAGAAACAGTTTTATATATACCGTTAAAAATCTAGGAAATGGAAATGTAGAGTTAGGAGTAGTTATGCACGTGAGTTTGGGTAGCGCAGTTTTCCTACCTAGATTAAGAGTAACAGTTCAACGCCTTACCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
3fc24feca6d83052eff0a5e918271c7d1f7d4ce048a3a22adcc214f1a2aa59e0
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,2394
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Comparative analysis of effectiveness for phage cocktail development against multiple Salmonella serovars and its biofilm control activity Ribeiro,J.M., Pereira,G.N., Durli,I., Teixeira,G.M., Bertozzi,M.M., Verri,W.A., Kobayashi,R.K.T. and Nakazato,G. GenBank