Genbank accession
WGL32639.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,83
Protein sequence
MNQLTVVDRGGVKHPEIVTIDEAQALLDQYGTGGTAVKIPNGVNLRKFLGRCDGGFYYDNAALSGTTGGPIKDIVYYTVLNAGVPGSRAIIASSHGNDLWIAEVYGDVFRGWVNFARPDDVKNWIATAIDNHEKSRNHPYATEDEIGFVQLAKDWEAEAGQEDTHAMTPLRSQNLLSTYGLGSRSVYVPDGANLATFFNDKKPGFYRLNNPATSGYTNYPTDVPFTWAEIISANHEALNTRTLLMITNIGRTYTATVSSGAFSGWKLKLQMDDLPIASLLQRGIVQLTDSVTSTNVTTAATPNSVKMANDNANTRVPQARTINGKPLTGNIALTAADVDAWSKTEADNRFMFKWDSPLPPGTPIPWGKATAPDGFFIMSGQGISTAYPKLRSVYGDHLPDLRGLIIRGTDLGRGVDPGRAVLSYQEDAIQRIYGTFPCFTRWRAHEKITGVFTRIDGQWNTNVKNGNGDDWGMTFVFDSSRVVRSAGETRMKNLAFTYIVKGE
Physico‐chemical
properties
protein length:503 AA
molecular weight: 55071,13960 Da
isoelectric point:6,47290
aromaticity:0,09940
hydropathy:-0,33121

Domains

Domains [InterPro]
DC_0097
STR
102–250
DC_1872
RBD
225–503
IPR037053
ATT
356–407
IPR051934
Unmapped
359–431
IPR011083
ATT
361–405
WGL32639.1
1 503
Architecture
STR
RBD
STR
RBD
ATT
STR
STR 102-250 | RBD 251-277 | STR 278-318 | RBD 319-353 | ATT 354-407 | STR 408-503
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage Arash
[NCBI]
3038319 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella typhimurium
[NCBI]
90371 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WGL32639.1 [NCBI]
Genbank nucleotide accession
OQ632216.1 [NCBI]
CDS location
range 111139 -> 112650
strand +
CDS
ATGAATCAACTAACAGTGGTCGACCGCGGTGGTGTTAAGCACCCGGAGATCGTAACGATCGATGAAGCCCAAGCGCTGCTCGATCAGTATGGTACCGGTGGTACGGCTGTAAAAATCCCGAACGGCGTAAACCTGCGGAAATTCCTCGGGCGTTGCGACGGCGGATTCTATTACGATAACGCTGCTCTGTCCGGTACGACCGGCGGGCCGATCAAAGACATCGTCTACTACACCGTCCTGAACGCTGGCGTGCCGGGTAGCCGTGCCATCATCGCATCCTCTCACGGTAACGACCTGTGGATCGCGGAAGTTTATGGTGACGTGTTCCGCGGTTGGGTTAACTTCGCCCGTCCGGACGACGTGAAAAACTGGATTGCTACGGCCATCGATAACCACGAGAAATCCCGGAATCATCCGTACGCCACTGAAGATGAGATCGGCTTCGTTCAACTCGCGAAGGATTGGGAGGCAGAGGCTGGGCAAGAGGATACTCACGCCATGACGCCTCTGCGTTCACAGAACCTGCTGAGCACCTATGGTCTTGGCTCTCGTTCCGTTTACGTCCCAGATGGTGCAAACCTGGCAACCTTCTTCAACGATAAGAAGCCAGGCTTCTACCGACTGAATAACCCGGCGACGTCTGGTTACACCAACTACCCGACGGACGTACCGTTTACTTGGGCGGAAATCATCTCGGCAAACCACGAAGCTCTCAATACGCGTACTCTGCTGATGATCACCAACATCGGCCGTACGTACACGGCAACGGTCAGTTCCGGAGCGTTCTCGGGGTGGAAACTGAAGCTGCAAATGGACGATCTACCGATCGCGTCCTTGCTCCAACGGGGTATCGTTCAGCTGACAGATTCCGTCACCTCGACGAATGTCACCACCGCCGCCACGCCGAACTCCGTCAAAATGGCTAACGATAACGCAAATACCCGCGTGCCACAGGCTCGTACGATTAACGGTAAGCCACTGACGGGAAACATCGCCTTGACCGCAGCAGACGTTGACGCCTGGTCAAAAACCGAAGCGGATAACCGCTTTATGTTCAAGTGGGATTCTCCGCTGCCTCCTGGAACGCCTATTCCGTGGGGTAAAGCGACGGCTCCGGATGGCTTCTTCATCATGTCCGGGCAGGGGATTTCAACGGCGTATCCCAAACTGCGTAGCGTTTATGGTGACCACCTGCCAGACCTTCGCGGTCTGATTATCCGTGGTACTGACTTAGGCCGTGGCGTGGACCCAGGTCGTGCCGTACTGTCCTACCAGGAAGACGCCATTCAGCGTATCTACGGGACGTTCCCGTGCTTTACCCGCTGGCGTGCACACGAAAAAATTACTGGCGTATTCACCCGTATTGATGGACAATGGAACACCAACGTTAAGAATGGTAACGGCGATGACTGGGGTATGACTTTCGTGTTCGACTCCTCTCGCGTCGTACGCAGTGCTGGCGAAACCCGTATGAAAAACCTAGCGTTTACCTACATCGTTAAAGGAGAATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
779df98dcdf643a69ffef4e094ef8c12dc28451246e67789d82a153d57e1b7ee
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8066
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50