Genbank accession
QGF21243.1 [GenBank]
Protein name
central tail fiber J
RBP type
TF | Evidence: Phold | Probability: 1.00
TF | Evidence: RBPdetect2 | Probability: 0.96
Protein sequence
MNKQLPVPVGAKGGSSKPKAPYEMEDNLISIDKIKILLAVSDGEVDPSFSLMNLYLDDVPVQSSNGILNYPELRLNLGREHNIKITLKDSPIPQAKLHMARGVKAESDGDQVGVRVEYAVDMAVDGGAYSEYMVDVIDGKTTSGYDRSRRIDLPAFNTQVLLRIRRVTPDSQSGNVVDAIQLQSYAEVIDAKFRYPLTGLVYVEFDSKLFPNQIPNISIKKKWKLINVPVNYDPFHAHTPARGMEYGRKRGVITPLSFCTTVPNGKGGTEPRYLCDVVIQSQIEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRNPSYIFTNENVIGGEFTYTFASEKSMYTQCNVTFDDAQNFYAQDVEGVFDPEMTLRFGHNPTAITAIGCTRRSEANRRGRWILKTNVRSTTVNFATGLEGMIPTIGDVIIVADNFWSSALTMNLSGRVMEVSGLQVFLPFKVDARAGDRIIVNKPDGAPVGRTIASVTPDGKTITLNTTFGFDVQPDSIFAIERTDLAQQQYVVTEIKRGDGEEEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDILPAPENVKVSSYSRIVQGMSVETMRVTRDKVEYATLYEMQWRKDNGNWNNTPQTASKEIEVEGIYAGNYHVRVRSVSSNGSSSGWSKIVSVHLKGKVGEPGAPINMTASDNEVFGIRVKWGMPEGSGDTAYIELHRAPNSAEGHPIEDQATLLTLVPYPQYEYWHSILPAGQVIWYKARAIDRIGNVSQWTDFVRGMSSVDTSIITDHIKVDIENSEGINSKNGKFTAQIKESLRLIANETEARVTQVSQLEADFDGKITAQNSELREVIATGDEALSRSIDELRAEIGDDIQGQITEVKEAIVTETDARVTADTALSTRIGDNEAAINQKLDSDYGVNLGLKYNGQEYSAGMALSLVGDGTGVKSQMLFSADRFAIISNAQAGAFTLPFVVENNRVYARGVKAESDGDQVGVRVEYAVDMAVDGGAYSEYMVDVIDGKTTSGYDRSRRIDLPAFNTQVLLRIRRVTPDSQSGNVVDAIQLQSYAEVIDAKFRYPLTGLVYVEFDSKLFPNQIPNISIKKKWKLINVPVNYDPFHAHTPARGMEYGRKRGVITPLSFCTTFINSLLVKDGTITTAKIAQQINSTNWSSGSAGWMINKNGIAEFNRYGKGKPLRKQRGIFIFINSLLVKDGTITTAKIAQQINSTNWSSGSAGWMINKNGIAEFNRYGKGKPL
Physico-chemical properties
protein length: 1238 AA
molecular weight: 136991.98160 Da
isoelectric point: 5.59309
aromaticity: 0.08643
hydropathy: -0.31624
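
The record does not state how aromaticity and hydropathy were computed. The sketch below assumes the common definitions: aromaticity as the fraction of Phe, Trp, and Tyr residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names are illustrative, and the short prefix is used only to keep the example readable, so its output will not match the full-length values listed above.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp, or Tyr (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 30 residues of the protein sequence in this record (illustration only).
prefix = "MNKQLPVPVGAKGGSSKPKAPYEMEDNLIS"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Passing the full 1238-residue sequence instead of `prefix` would allow a comparison with the reported values, provided the database used the same definitions.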

Domains [InterPro]
IPR032876 | ATT | 282–446 (on QGF21243.1, residues 1–1238)
Architecture
STR 1-95 | ATT 96-191 | STR 192-281 | ATT 282-446 | STR 447-965 | ATT 966-1056 | STR 1057-1119 | RBD 1120-1235
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
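
The architecture string can be checked mechanically. The following is a minimal sketch, assuming the `LABEL start-end | ...` layout shown in this record and 1-based inclusive residue coordinates; `parse_architecture` is a hypothetical helper, not part of any database tooling.

```python
ARCHITECTURE = (
    "STR 1-95 | ATT 96-191 | STR 192-281 | ATT 282-446 | "
    "STR 447-965 | ATT 966-1056 | STR 1057-1119 | RBD 1120-1235 |"
)
PROTEIN_LENGTH = 1238  # from the physico-chemical properties of this record


def parse_architecture(text):
    """Return (label, start, end) tuples from a 'LABEL start-end | ...' string."""
    segments = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:  # tolerate a trailing separator
            continue
        label, span = chunk.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


segments = parse_architecture(ARCHITECTURE)
# Each segment should start right after the previous one ends.
contiguous = all(a[2] + 1 == b[1] for a, b in zip(segments, segments[1:]))
covered = sum(end - start + 1 for _, start, end in segments)
print(contiguous, covered, PROTEIN_LENGTH - covered)  # True 1235 3
```

Under these assumptions the eight segments tile residues 1 to 1235 without gaps, leaving the last three residues (1236 to 1238) outside any annotated segment.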

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Citrobacter phage HCF1 [NCBI] | 2849700 | Uroviricota > Caudoviricetes > Drexlerviridae > Hicfunavirus HCF1 >
Host | Citrobacter amalonaticus [NCBI] | 35703 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
QGF21243.1 [NCBI]
Genbank nucleotide accession
MN545971.1 [NCBI]
CDS location
range 25631 -> 29347
strand +
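
As a consistency check, the CDS range can be related to the protein length. This sketch assumes the coordinates are 1-based and inclusive, as in GenBank feature locations, and that the final codon is a stop codon; `cds_summary` is an illustrative name.

```python
def cds_summary(start, end):
    """Return (length in nt, whole codons, leftover nt) for an inclusive range."""
    length_nt = end - start + 1  # 1-based inclusive coordinates (assumption)
    codons, remainder = divmod(length_nt, 3)
    return length_nt, codons, remainder


length_nt, codons, remainder = cds_summary(25631, 29347)
print(length_nt, codons, remainder)  # 3717 1239 0
print(codons - 1)  # 1238 residues once the stop codon is excluded
```

The result, 1239 codons including the stop, agrees with the reported protein length of 1238 AA.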
CDS
ATGAATAAACAATTACCTGTTCCAGTTGGTGCAAAAGGCGGATCAAGCAAGCCTAAAGCGCCGTATGAAATGGAAGATAATCTGATCTCTATTGATAAGATTAAGATTTTATTAGCTGTTTCTGACGGTGAAGTAGATCCTAGCTTTTCGTTAATGAATTTATATCTTGATGATGTTCCAGTTCAAAGTTCAAATGGGATATTAAACTATCCCGAGTTAAGGCTGAATTTAGGCCGGGAACACAATATCAAGATCACATTAAAGGATTCACCGATACCGCAAGCGAAATTACATATGGCGCGAGGCGTTAAAGCAGAAAGTGACGGCGATCAGGTTGGCGTTCGCGTAGAATATGCGGTAGACATGGCTGTTGACGGTGGAGCGTATAGTGAATATATGGTTGATGTTATCGACGGCAAAACAACAAGCGGTTACGACAGAAGCCGTCGCATTGATTTGCCAGCGTTCAATACTCAAGTATTATTGCGCATTCGCCGTGTAACTCCAGATAGCCAGAGCGGGAACGTTGTTGATGCTATTCAGTTGCAAAGCTATGCAGAGGTTATTGATGCTAAATTCCGTTATCCTCTTACTGGACTTGTATATGTTGAATTTGACAGCAAATTATTCCCGAACCAGATCCCTAACATTTCAATCAAAAAGAAATGGAAGTTGATTAACGTTCCGGTTAATTACGATCCATTTCACGCACATACTCCGGCACGTGGAATGGAGTATGGAAGAAAGCGTGGAGTAATAACCCCGCTTTCATTCTGTACGACCGTTCCTAACGGTAAGGGCGGAACAGAGCCGCGTTATCTTTGCGACGTTGTGATCCAGTCTCAGATTGAAGCATATCAGCTTATTCGTGATATTTGCTCAATCTTCCGTGGCATGAGCTTTTGGAACGGTGAGAGCCTTTCAATTGTGATCGATAAGCCTCGCAATCCGTCTTACATTTTCACTAATGAAAACGTGATCGGCGGTGAGTTCACTTACACTTTCGCAAGCGAAAAGAGCATGTACACACAATGCAATGTGACGTTCGACGACGCTCAAAACTTTTACGCTCAGGATGTGGAAGGTGTTTTTGATCCAGAAATGACGTTGCGCTTTGGTCATAATCCGACAGCAATCACCGCTATCGGTTGCACACGACGCAGTGAAGCAAATAGACGTGGGCGCTGGATACTGAAAACCAACGTTCGCAGCACTACGGTAAACTTTGCTACTGGCCTTGAGGGTATGATCCCGACAATCGGTGACGTGATTATTGTTGCGGATAACTTCTGGAGCAGTGCGTTAACAATGAATCTTTCAGGGCGCGTAATGGAGGTAAGCGGGTTACAGGTTTTCTTACCGTTTAAAGTGGATGCGAGAGCAGGTGATCGCATTATCGTAAATAAGCCAGACGGCGCACCAGTAGGACGAACGATTGCCAGCGTAACGCCGGACGGAAAGACAATCACACTCAACACTACTTTCGGTTTTGACGTGCAGCCAGATTCAATCTTTGCCATTGAGCGCACCGATTTAGCGCAGCAGCAGTATGTTGTTACCGAAATCAAGCGCGGTGATGGTGAGGAAGAATTTACTTATAGCATCACGGCAGTCGAATACGATCCGAACAAGTACGACGAAATTGATTACGGCGTGAACATTGATGATCGACCGACTTCAATCGTGCAGCCGGATATTCTGCCAGCGCCGGAAAACGTTAAGGTTTCCAGCTACAGCCGAATTGTTCAGGGCATGAGTGTTGAAACGATGCGCGTTACCCGGGACAAGGTTGAATATGCAACGCTGTACGAAATGCAGTGGCGCAAGGATAACGGAAACTGGAACAACACGCCGCAGACCGCAAGCAAAGAAATTGAGGTTGAAGGTATTTACGCAGGTAACTATCACGTTCGCGTTCGTTCTGTTTCATCCAACGGCTCATCGTCTGGATGGTCTAAGATCGTTAGTGTTCACCTGAAAGGCAAGGTTGGCGAGCCCGG
CGCACCTATTAACATGACAGCCAGTGATAACGAGGTGTTTGGTATTCGTGTTAAATGGGGTATGCCGGAGGGAAGCGGTGACACTGCATACATTGAGTTGCACCGGGCACCAAACAGCGCGGAAGGACACCCGATTGAAGATCAAGCAACACTGTTGACGCTTGTACCGTATCCGCAATATGAATACTGGCACAGCATTTTGCCAGCAGGACAAGTGATCTGGTACAAGGCGCGAGCCATTGACCGAATCGGAAACGTGTCACAATGGACTGATTTTGTGCGCGGCATGTCGTCCGTTGATACAAGTATCATCACGGATCATATTAAGGTTGATATTGAGAATTCAGAGGGTATCAATTCCAAAAACGGAAAGTTCACGGCGCAGATCAAGGAGTCATTGCGACTTATCGCAAATGAAACCGAAGCGCGTGTAACTCAGGTGTCACAACTGGAAGCTGATTTTGACGGAAAAATTACCGCCCAAAACAGCGAATTGAGAGAGGTTATTGCAACTGGTGACGAAGCGTTAAGCCGTTCAATTGACGAACTTAGAGCCGAGATAGGAGACGATATTCAGGGGCAAATAACAGAAGTTAAGGAGGCAATAGTAACTGAAACTGATGCGCGTGTTACTGCTGATACTGCGTTATCTACACGCATTGGTGACAATGAGGCTGCAATTAACCAGAAGTTAGACTCGGATTACGGAGTTAATCTTGGCCTGAAATATAACGGTCAGGAATATAGCGCTGGTATGGCGTTGTCACTTGTTGGCGATGGTACTGGTGTGAAGTCGCAAATGCTTTTCTCCGCTGATCGGTTTGCAATCATCAGCAATGCACAGGCTGGCGCGTTTACGCTTCCGTTTGTGGTTGAGAATAACCGTGTTTATGCGCGAGGCGTTAAAGCAGAAAGTGACGGCGATCAGGTTGGCGTTCGCGTAGAATATGCGGTAGACATGGCTGTTGACGGTGGAGCGTATAGTGAATATATGGTTGATGTTATCGACGGCAAAACAACAAGCGGTTACGACAGAAGCCGTCGCATTGATTTGCCAGCGTTCAATACTCAAGTATTATTGCGCATTCGCCGTGTAACTCCAGATAGCCAGAGCGGGAACGTTGTTGATGCTATTCAGTTGCAAAGCTATGCAGAGGTTATTGATGCTAAATTCCGTTATCCTCTTACTGGACTTGTATATGTTGAATTTGACAGCAAATTATTCCCGAACCAGATCCCTAACATTTCAATCAAAAAGAAATGGAAGTTGATTAACGTTCCGGTTAATTACGATCCATTTCACGCACATACTCCGGCACGTGGAATGGAGTATGGAAGAAAGCGTGGAGTAATAACCCCGCTTTCATTCTGTACGACCTTTATTAACAGCCTTTTGGTGAAAGATGGAACGATTACCACGGCTAAGATTGCGCAGCAAATCAACTCTACAAACTGGAGTTCTGGATCGGCTGGCTGGATGATTAACAAGAACGGAATTGCGGAGTTTAACCGTTACGGTAAGGGGAAACCTTTACGCAAACAGCGGGGCATTTTCATTTTTATTAACAGCCTTTTGGTGAAAGATGGAACGATTACCACGGCTAAGATTGCGCAGCAAATCAACTCTACAAACTGGAGTTCTGGATCGGCTGGCTGGATGATTAACAAGAACGGAATTGCGGAGTTTAACCGTTACGGTAAGGGGAAACCTCTTTGA
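
The correspondence between the CDS and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes the standard codon assignments (which the bacterial table shares for elongation codons) and uses only the first 33 and last 21 nucleotides of the CDS above; `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, to_stop=True):
    """Translate a nucleotide string codon by codon, optionally halting at a stop."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATAAACAATTACCTGTTCCAGTTGGTGCA"))  # MNKQLPVPVGA
print(translate("GGTAAGGGGAAACCTCTTTGA"))              # GKGKPL
```

Both fragments match the start (MNKQLPVPVGA) and end (GKGKPL) of the protein sequence listed in this record.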

Genome Context

Tertiary structure

PDB ID
e83e1cf57ae793b65f9c780d9ceb4e876c69d9da9c29c5847e954453311272b3
Source ESMFold
Method ESMFold
Resolution 0.5943
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
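
The confidence bands can be applied to per-residue pLDDT scores with a small classifier. This is a sketch only: the legend uses strict inequalities and does not say where exactly 90, 70, or 50 fall, so assigning those boundary values to the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(plddt):
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # boundary values fall into the lower band (assumption)
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```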