Genbank accession
WLJ70814.1 [GenBank]
Protein name
tail fiber protein host specificity
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,83
Protein sequence
MPIGVNVSGTNKQATMKVNVGGVWKLPIGWVRENGEWKKFQNPEFTYTISQNTANFNLATAVGSREEVIINLVINSGVSVYSTSVSTPAILIPDSFAGKTINIINNGNIYGQGGVGGTGAGQAGGPALRVQTSQKINLTNNGTIAGGGGGGGKGGTGGNGFFTTQSTQRDPSSGTWTYATGNHIEFRSRNCILRMGNSEIYRWYGPDFSTVVTYGITITVGEWTYYASNFYGDGVGVLKSNAMYRTRTVTNTTYTTGGAGGNGGNGQGFSQAATNGLAGANGGTNAGRGGNGGNGGTFGVAGATGATGASGNQTAGLGGQAGGAAGAAVDGTSKVNYVNAGKLLGPLIN
Physico‐chemical
properties
protein length:349 AA
molecular weight: 35149,25210 Da
isoelectric point:9,62747
aromaticity:0,08596
hydropathy:-0,24183

Domains

Domains [InterPro]
DC_1762
ATT
1–180
IPR007932
RBD
66–156
WLJ70814.1
1 349
Architecture
ATT
RBD
ATT 1-180 | RBD 196-348 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage MJW
[NCBI]
3068537 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Vibrio cholerae O1
[NCBI]
127906 Bacteria > Proteobacteria > Gammaproteobacteria > Vibrionales > Vibrionaceae > Vibrio

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WLJ70814.1 [NCBI]
Genbank nucleotide accession
OR248150.1 [NCBI]
CDS location
range 38185 -> 39234
strand +
CDS
ATGCCAATTGGAGTGAATGTTTCAGGAACAAACAAACAAGCCACAATGAAAGTTAATGTTGGTGGAGTTTGGAAGCTACCGATTGGTTGGGTTCGTGAAAACGGGGAATGGAAGAAATTCCAAAACCCCGAATTTACGTACACAATCAGTCAGAATACGGCAAACTTCAACCTAGCTACGGCAGTAGGTAGTAGAGAAGAGGTTATCATTAATCTAGTGATTAACTCTGGCGTGAGTGTGTACTCAACAAGTGTTAGCACTCCAGCAATCTTGATTCCAGATAGCTTCGCAGGAAAGACAATTAACATCATCAACAATGGTAATATTTATGGTCAGGGTGGCGTTGGTGGAACCGGAGCAGGTCAGGCTGGAGGCCCTGCCCTACGAGTACAGACTTCCCAAAAGATTAACTTAACCAATAACGGCACAATCGCCGGTGGAGGCGGTGGTGGTGGTAAAGGCGGTACAGGTGGTAACGGGTTTTTTACTACTCAATCTACACAACGTGACCCATCATCTGGAACTTGGACATACGCTACAGGGAACCATATCGAATTTCGTTCCCGTAACTGTATCCTACGTATGGGTAATTCAGAGATATATCGTTGGTATGGTCCAGACTTTAGTACTGTTGTTACCTATGGTATCACTATAACCGTGGGAGAATGGACGTACTATGCCTCCAACTTCTACGGTGACGGTGTGGGTGTTTTGAAGTCGAATGCAATGTACAGAACTAGAACAGTGACCAATACTACATACACTACCGGTGGTGCCGGTGGTAATGGAGGAAACGGTCAAGGATTCTCACAAGCAGCAACCAACGGTCTAGCTGGTGCTAACGGTGGGACTAACGCAGGTCGAGGTGGTAACGGAGGCAATGGTGGTACTTTTGGCGTAGCCGGTGCAACCGGAGCTACGGGGGCGTCGGGAAACCAGACTGCTGGACTTGGTGGTCAAGCTGGAGGTGCTGCTGGAGCTGCCGTTGATGGTACGTCTAAAGTTAACTACGTCAATGCTGGTAAACTGTTAGGTCCTTTAATTAACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e00edc3f36c96c5f795e3bd50aef6badf5af29a93b6983ded530824ba5c48183
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8138
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50