Genbank accession
WAX10833.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
Protein sequence
MKITYINTGNSVGNETNYVAPIVIPIAAGLKQELLFEGMSAVTNMQIGVYRNGTLVHQQELAPSSTMRIDLSTFAQVLPSLNELYKYTVGSPSSNVQDKIQFVLMDGNTIRSYSFYVCSLNCPYLNSRGWVHHDERNLVSTALASMVGVSLVTVPASGTKAAVTDPIFGTGTKIKGRGGWAASLPSLGYSVATGVETTMSIYVDNVGSTDFNLIFRFGSSRDISVRIPVGFKGRVSESFRPNEAANYEPRILLSSPDAAMEANFYCFMVNYGSEPKPWIKAGADAASPCVGAMTDYRSGKLFYPAQVMSAVNTTTNLDLAGERRYARLYYSVGSDPTMFGFSVNGGTPVNYATGVQFSEGVNVRVLSPNGDIYGWMEYENKLPKDVSPNCGVFLRWLNSRGFIDSMFFEKYSIKPVMRTDNSGGNGIDYYETTVQIDVNWDNGPCLNWIQRSSDIIARIPIDICQFGKSTLLATNVFQTQGGNKTKTIQFKFKVQIVEP
Physico‐chemical
properties
protein length:499 AA
molecular weight: 54731,47990 Da
isoelectric point:7,49936
aromaticity:0,11022
hydropathy:-0,11182

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Phocaeicola phage BV739P1
[NCBI]
2968726 No lineage information
Host Phocaeicola vulgatus
[NCBI]
821 cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WAX10833.1 [NCBI]
Genbank nucleotide accession
OP172747.1 [NCBI]
CDS location
range 21840 -> 23339
strand +
CDS
ATGAAAATAACGTATATAAACACAGGCAACAGCGTAGGCAACGAAACAAACTACGTAGCCCCAATAGTCATACCGATTGCGGCAGGTCTAAAGCAGGAACTACTGTTTGAGGGTATGTCGGCCGTAACCAATATGCAGATAGGTGTGTACCGTAACGGAACACTAGTACATCAACAGGAGTTAGCACCGTCCAGTACGATGCGCATAGACTTAAGTACGTTTGCACAGGTGTTACCATCACTTAACGAACTGTATAAGTATACAGTGGGTAGCCCATCAAGTAACGTACAGGACAAGATTCAGTTTGTATTAATGGACGGGAATACGATACGCTCATACTCTTTCTACGTTTGTTCGCTGAATTGCCCGTACCTGAACTCTAGAGGGTGGGTACACCATGACGAACGTAATTTGGTATCTACGGCATTGGCTAGTATGGTAGGCGTGTCATTAGTGACTGTCCCGGCCAGTGGAACTAAGGCAGCCGTTACTGATCCTATATTCGGGACGGGTACTAAGATTAAAGGGCGAGGAGGTTGGGCAGCAAGTTTGCCGTCTTTGGGGTACAGCGTGGCTACGGGAGTAGAAACTACGATGTCTATATACGTGGATAACGTAGGGTCTACGGACTTTAATCTTATTTTTAGGTTCGGCAGCTCACGAGATATTTCCGTAAGAATACCCGTAGGCTTTAAGGGGCGTGTTAGCGAGTCATTCAGGCCTAATGAAGCGGCCAACTATGAGCCAAGAATTTTGCTCAGTTCACCTGACGCAGCTATGGAAGCAAATTTTTATTGCTTCATGGTGAACTACGGAAGCGAGCCTAAACCGTGGATCAAGGCAGGGGCAGACGCTGCATCACCGTGTGTCGGTGCTATGACTGACTACCGTAGTGGGAAACTATTTTACCCTGCACAAGTGATGTCAGCCGTTAATACTACCACTAATTTAGACTTGGCAGGCGAACGAAGGTACGCGCGTTTGTATTATTCTGTCGGGTCTGATCCCACAATGTTCGGCTTTAGTGTGAACGGTGGTACACCAGTTAATTACGCAACGGGCGTACAGTTTTCAGAAGGGGTGAACGTAAGAGTGTTAAGCCCGAATGGCGATATATACGGATGGATGGAATACGAAAACAAACTACCTAAAGACGTTTCTCCTAATTGTGGTGTATTCCTTCGATGGTTGAATAGTAGGGGCTTTATAGATAGTATGTTCTTTGAAAAGTATTCAATCAAACCAGTTATGCGAACAGACAACAGCGGGGGCAACGGTATTGACTATTATGAAACGACAGTGCAAATAGATGTTAACTGGGATAATGGGCCGTGCTTAAATTGGATTCAGAGATCATCCGATATTATCGCACGTATTCCGATAGACATTTGCCAGTTCGGTAAATCTACTTTGTTGGCAACCAACGTTTTCCAAACACAGGGTGGCAACAAAACGAAAACAATTCAATTCAAATTTAAAGTTCAAATTGTAGAACCATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
c067c17f19e7b9cf6229b4d2d9ded7a55ead2cb2f06f3d767813f47ae7a2389c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4163
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50