Genbank accession
UZV40566.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
Protein sequence
MAEIKTGILLRRNLKKHFVNDAKPTQGEIVLATDTNEIGMLVNDEIQWTPIQGVVNTVAGKQGDVILNKKDVGLENVDNTADIDKPISNSTKLEFQRHYTAENPHNITKKTLGLENVDNTADIDKPVSNLTQIELNKKISWDDARKQAGGKDPVFTDTTYTIKDGELSEFNFNSYYKNFIDTFNTNSRVLPSTQALIANGRTITLRRADGSSESIETQDTLYDDSELRALIEQAKIDLHINIQDNLESDSTQDALSANQGKVLKGLIDEIKKVINITDDDFRNLQDIINYIEENREKFDDLTIANIKGLQAALDSKLNRDDSTYIAPNSALLESHPASDFVLNTNYNAKLIEIQDSLNSINSQIKLFETQAGVDSKINQAIRDLNFTEIIQSINEQITRLQGSLDDIDLDAITENLQKVQQDLTQRISQLETNTSKKLEEFEAIVNNFDMSEIQTSINNFKDQINQNIDSIQGVVDSVSESLSNIENNIQTSLENKVSKDELATEVQTINDNIADLSNIANEAKWQDNFYGKVDRRRQALWSIISISKDSFKGSYKKPNTYNYWEAKYKNLKYINDNFDSLETISGAPTYDVVTDQVIRLTFNDISQASFLIGSPKNKIQIAKIIAVNANTKKAVFIYGYPSFMTDDLSKIMVTIEKDSSFGNYLSRANKTDPETFQKIVTAQEFDLPDDTDDYSYFEASYEISGNTITLTIPENIFLEFYGNMTSVETTVTGASAAQVWSATLKSTDYEGKKLFSNGVLIGKDLTGAILTIYDDKSTYINDFDEARTNYNDFQIIREEYEDIMYTRDFLPGTIGPGEDDLEW
Physico‐chemical
properties
protein length:823 AA
molecular weight: 92764,71120 Da
isoelectric point:4,48615
aromaticity:0,08141
hydropathy:-0,53159

Domains

Domains [InterPro]
DC_0351
STR
2–93
IPR054500
STR
242–269
Coil
Unmapped
413–433
UZV40566.1
1 823
Architecture
STR
STR
STR 2-93 | STR 98-807 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Campylobacter phage vB_CjeM_WX1
[NCBI]
2995678 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Campylobacter jejuni
[NCBI]
197 cellular organisms > Bacteria > Pseudomonadati > Campylobacterota > Epsilonproteobacteria > Campylobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UZV40566.1 [NCBI]
Genbank nucleotide accession
OP654737.1 [NCBI]
CDS location
range 123052 -> 125523
strand +
CDS
ATGGCTGAAATAAAAACTGGTATTTTATTAAGACGTAATCTTAAAAAACATTTTGTAAATGATGCAAAACCAACACAAGGTGAAATCGTTCTTGCTACTGATACAAATGAAATTGGTATGCTTGTAAACGATGAAATACAATGGACTCCTATTCAAGGCGTCGTTAACACAGTTGCTGGTAAACAAGGTGACGTCATACTAAACAAAAAAGATGTAGGCCTTGAAAATGTTGATAACACTGCTGACATTGATAAGCCAATTTCTAACTCTACAAAATTAGAATTTCAAAGGCATTATACAGCCGAAAACCCACACAATATAACAAAGAAAACACTTGGTTTAGAAAACGTTGATAACACTGCTGATATAGATAAACCTGTATCTAACTTAACTCAAATTGAGTTAAATAAGAAAATTTCTTGGGATGATGCTAGAAAGCAAGCCGGTGGGAAAGACCCAGTTTTTACAGATACTACTTATACCATTAAAGATGGTGAACTTTCAGAATTTAACTTTAATTCGTATTATAAGAATTTCATAGATACTTTTAATACTAACTCTAGGGTCTTACCAAGTACACAAGCTCTTATAGCTAATGGTAGGACAATAACCTTAAGAAGAGCTGACGGCTCGAGTGAAAGTATAGAAACACAAGATACTTTATATGATGATTCTGAACTTAGAGCATTAATAGAACAAGCTAAAATAGATTTACATATAAACATACAAGATAATTTAGAATCAGATTCTACTCAAGATGCTTTGAGTGCTAATCAAGGCAAAGTTCTTAAAGGTTTAATAGACGAAATAAAGAAGGTAATTAATATCACAGATGATGATTTTAGAAATCTTCAAGATATTATTAATTATATAGAAGAAAACCGCGAAAAATTTGATGATTTAACAATTGCTAACATCAAAGGCTTACAAGCTGCTTTAGATTCTAAATTAAATAGAGATGATTCTACATATATTGCTCCAAATTCTGCATTATTAGAATCTCACCCAGCTAGTGATTTTGTATTAAATACAAATTATAATGCTAAGTTAATAGAAATACAAGATTCTTTAAATTCTATCAATTCACAAATTAAATTGTTTGAAACTCAAGCGGGTGTGGATTCTAAAATAAACCAAGCAATCAGAGATTTAAATTTCACTGAAATCATACAGAGTATCAATGAACAAATCACGAGACTACAAGGCTCTTTAGACGATATTGATTTGGATGCTATAACAGAAAATCTCCAAAAAGTTCAACAAGATCTAACTCAACGAATATCACAATTAGAAACTAACACATCTAAAAAATTAGAAGAATTTGAAGCTATTGTTAATAATTTTGATATGTCTGAAATACAAACCAGTATTAATAATTTTAAAGACCAAATCAATCAAAATATTGATAGTATACAAGGTGTTGTAGATTCTGTTTCTGAATCATTATCAAATATTGAAAATAATATCCAAACCAGTTTAGAAAATAAAGTTTCAAAAGATGAATTAGCTACTGAGGTTCAAACTATTAATGATAATATTGCTGATTTATCAAACATAGCAAATGAAGCAAAATGGCAAGATAATTTTTATGGTAAAGTTGATAGAAGACGCCAAGCTTTATGGTCAATTATATCAATTTCAAAAGACTCTTTCAAAGGTTCGTATAAAAAACCAAATACCTATAATTACTGGGAAGCAAAATATAAAAATCTTAAGTATATCAATGATAATTTTGATAGTTTAGAAACGATATCTGGTGCACCTACATATGACGTGGTTACTGATCAGGTTATTAGATTAACTTTTAATGATATTTCACAAGCATCATTTTTAATTGGAAGTCCTAAGAATAAAATTCAAATAGCTAAAATTATAGCAGTAAATGCTAATACTAAAAAAGCTGTGTTCATATACGGGTATCCTAGTTTTATGACAGACGATTTATCAAAAATTATGGTAACCATAGAAAAAGATTCAAGTTTTGGTAATTATCTATCAAGAGCTAATAAAACAGATCCAGAAACTTTTCAAAAAATTGTAACTGCACAAGAATTTGATTTGCCGGATGATACTGATGATTATTCATATTTTGAAGCTAGTTATGAAATATCTGGAAATACAATAACACTAACTATACCAGAAAACATATTTTTAGAATTCTATGGTAATATGACATCAGTAGAAACAACCGTTACGGGTGCTAGTGCTGCACAAGTTTGGTCAGCGACTTTAAAATCCACTGATTATGAAGGTAAGAAATTATTCTCAAACGGGGTGTTGATAGGAAAAGATTTGACTGGCGCAATCTTAACTATATATGATGATAAATCAACATACATAAATGACTTCGATGAAGCTAGAACTAATTATAATGATTTTCAAATAATACGGGAAGAATATGAGGATATTATGTATACAAGAGATTTTTTACCAGGAACTATAGGACCAGGCGAAGACGACCTAGAATGGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
f74ef3ee20f96afff0df24842e6b895e79e45cbd141c509ce6a2a72bed12eff4
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4748
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50