UniProt accession
G9JXH9 [UniProt]
Protein name
Putative tail fiber protein
RBP type  Evidence    Probability
TF        GenBank     1.00
TF        Phold       1.00
TF        RBPdetect   0.90
TF        RBPdetect2  0.96
Protein sequence
MAAGTLSVTNNSKAVVGVGTTFTEYKAGDFLSLVVGQVPYTVAIASIESATALTLVLPFDGPTATGLAWDGIKRDTMSLATMGVTVQAQKALRLMIADENNWRAIFGEEEEITVTLPNGQVMQGMSWGYLSQLMKQIDPVEMRNLQQQAETAKNQAVTAKGQAESARDAANTAKTGAENARNQANTARDQANTAKTGAESARDAANTAKTGAENARSQAQGYRDEAEQFKNQINPSQFMLKSQNLSDVANKDTARDNLSLGRTQRAQFEGVDLSKSDWPGLRFITTSMSPTEVGYRVVFEHDSSDNRMALYWRNGSDANGQAAVHFTAPASGQTRFIAYKEEVNLPEITGWGVAAPSQGPRATNATNLAPGLYWGTVADVGNPISGTLGMSMLQTSGSSANYRCQLVFQDSAGGAMYLRSSNSSVFGNYKQVTTSAVSDERLKTVRGNLNLEGALDNINRMDFKIFSFLSDGPERSYRRGVISQQIRQIDKQYTKEIGGYYHLDQTPMLLDALAAIKALRARDEANKAEIAELKAAIAELKK
Physico-chemical properties
protein length: 542 AA
molecular weight: 58588.78350 Da
isoelectric point: 6.11567
aromaticity: 0.07011
hydropathy: -0.41956
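
The record does not say which tool produced these values. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), the two sequence-composition properties could be recomputed as below. The function names are illustrative and not part of the record. Under this assumption, the reported aromaticity of 0.07011 is consistent with 38 aromatic residues out of 542.

```python
# Kyte-Doolittle hydropathy scale (per-residue values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes, which is intentional here.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq.upper()) / len(seq)


if __name__ == "__main__":
    # Toy input for illustration; paste the protein sequence from this
    # record in its place to compare against the reported values.
    toy = "FWYA"
    print(aromaticity(toy), gravy(toy))
```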

Domains

Domains [InterPro]
DC_0162    STR       1–542
Coil       Unmapped  142–169
IPR030392  CHP       438–490
G9JXH9 (residues 1–542)
Architecture: STR 1–542
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

       Name                        Taxonomy ID  Lineage
Phage  Shigella phage EP23 [NCBI]  1109721      Uroviricota > Caudoviricetes > Dhillonvirus >
Host   Shigella sonnei [NCBI]      624          cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
AEV89341.1 [NCBI]
Genbank nucleotide accession
JN984867 [NCBI]
CDS location
range 22032 -> 23660
strand +
CDS
ATGGCAGCGGGTACGCTCTCCGTAACGAACAATAGCAAGGCTGTAGTAGGGGTTGGCACAACGTTTACCGAGTACAAGGCTGGTGACTTCTTATCGCTGGTGGTTGGGCAAGTGCCTTACACCGTGGCGATCGCGTCCATCGAAAGCGCAACCGCGCTCACACTGGTGTTGCCGTTCGACGGCCCAACGGCGACGGGTCTGGCCTGGGACGGCATCAAACGTGACACCATGTCATTGGCGACGATGGGCGTAACCGTCCAGGCGCAAAAAGCATTGCGATTGATGATCGCAGATGAGAACAACTGGCGCGCAATCTTCGGAGAAGAAGAGGAAATAACAGTGACGTTACCTAACGGGCAGGTTATGCAGGGCATGTCATGGGGCTATCTGTCGCAGCTAATGAAGCAGATCGACCCCGTAGAAATGCGCAACCTGCAACAACAGGCCGAGACGGCAAAAAACCAGGCTGTTACTGCGAAGGGCCAGGCAGAATCAGCGCGCGATGCGGCTAATACAGCAAAAACTGGCGCGGAGAACGCCCGCAACCAGGCCAACACCGCACGTGATCAGGCCAACACCGCCAAAACAGGCGCAGAATCAGCGCGCGATGCGGCTAATACAGCAAAAACTGGTGCGGAGAACGCCCGCAGCCAGGCGCAAGGGTATCGCGACGAAGCAGAGCAGTTTAAAAACCAAATCAACCCATCACAGTTCATGCTTAAGTCGCAAAACCTTAGCGACGTAGCGAACAAAGATACGGCGCGGGACAATCTGTCACTAGGCCGCACTCAGCGTGCGCAATTTGAGGGGGTTGACCTGTCTAAAAGCGACTGGCCCGGCTTAAGGTTTATTACAACCAGCATGTCACCAACAGAGGTCGGCTATCGCGTTGTCTTTGAGCATGATTCGAGTGATAACCGCATGGCGCTCTACTGGCGAAATGGCTCAGATGCCAATGGGCAGGCCGCCGTCCACTTCACCGCGCCTGCCAGCGGGCAGACGCGATTTATTGCATACAAGGAAGAAGTAAACCTTCCAGAAATTACAGGCTGGGGCGTAGCGGCTCCCTCGCAAGGACCGAGGGCGACGAACGCCACAAACCTTGCGCCGGGGCTTTATTGGGGTACCGTCGCTGACGTCGGCAATCCAATTTCAGGAACACTCGGCATGTCGATGCTTCAAACATCAGGATCATCCGCCAATTACCGCTGCCAGTTGGTGTTTCAGGATAGCGCGGGCGGGGCGATGTATCTCCGTTCCAGTAATAGTAGCGTTTTTGGTAACTACAAGCAGGTAACTACCTCTGCGGTGTCGGATGAGCGCCTGAAAACTGTCCGGGGAAATCTTAATCTAGAAGGTGCGCTGGATAACATAAACCGAATGGATTTTAAAATTTTCTCGTTCCTGAGTGATGGGCCGGAACGTAGCTACCGACGCGGCGTTATCTCGCAGCAGATCCGCCAGATTGATAAGCAGTACACGAAAGAGATCGGCGGGTACTATCATCTTGATCAGACACCAATGCTACTGGACGCCCTAGCGGCAATTAAAGCATTGCGGGCGCGTGACGAGGCTAATAAAGCAGAGATTGCAGAGTTAAAAGCGGCGATTGCAGAATTGAAAAAATAA
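
As a consistency check on the fields above, the CDS range 22032 -> 23660 should agree with the protein length of 542 AA. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the CDS ends in a stop codon; the helper names are illustrative only.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a CDS given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_nt: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_nt nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    nt = cds_length(22032, 23660)
    print(nt, protein_length(nt))  # 1629 542
```

Under these assumptions the range spans 1629 nucleotides, i.e. 543 codons, which gives 542 amino acids once the stop codon is excluded.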

Genome Context

Gene Ontology

Description            Category            Evidence (source)
GO:0098015 virus tail  Cellular Component  IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
da1147e0f6a0bcd5eda23aff10db572135eea750d731a7184a1283be4cc47379
Source ESMFold
Method ESMFold
Resolution 0.7737
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
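
The confidence legend above can be applied to per-residue pLDDT values with a small helper. This is a sketch under two assumptions that the record does not state: pLDDT is on a 0 to 100 scale, and a value sitting exactly on a threshold (90, 70 or 50) falls into the lower band, since the legend only uses strict inequalities. The function name and the sample values are hypothetical.

```python
from collections import Counter


def plddt_band(plddt: float) -> str:
    """Map a pLDDT value (assumed 0-100 scale) to the legend's confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie in [0, 100]")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: exact threshold values fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    scores = [95.2, 88.0, 71.5, 63.3, 42.0]
    print(Counter(plddt_band(s) for s in scores))
```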