UniProt accession
A0AAF0AAT8 [UniProt]
Protein name
Uncharacterized protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MQKFDTNLLQNLKWMQNNAPNITSLVQKKSDWYERYQDQFWIDWYKNIFNIDTCDAMGIYIWCDILCIPRDFVDLSNYEHNWAFGPTRDNFEGPDNIGGNFFGAGGKVVKNIEEARILLKLRYLTLTSDGRIESINRMLKWIFNRGEDWDYSSSRYAYVTDNTQGAVSSSESTNGSFLLYDWQGSTEIQHDYKQVLLSHGNNAGLTISGVAGTVSRMAHPDWYSWGNGSDAGIVGGDTRQYGGPDYYGQLCGDYPGEAIENFANAVRVIKTASSGEPVYSLANNPSAQSIALPSQAGVNVCYSFYIKVLSENVASVSVTENSDASSNGKLFDPVTGQFNLDPSLSYFITGKFDGVRRESSRGGIKAIGGDWYRIWIVEQTANATAGVKTSFSIELKATAAGSTQTRELPEGSDALLFGYSICVCDPVQNPNGPEPYFYGTQNPPADGTAILSVTAGTLNNISCANGPLTTRDGVVGESFMQPGSSLAYSGSIGGYPYPGNATLIAETDGIRRSWNVSRLDGVPPPPTAAYYMEYVIGPYFANNLSDNFKTLVGDRSEGFLPANAGIRWQLYQEEADLSDIDLTVSDITDTRIGFETEGVAWINTDGIIRVGGINEWPLNNFNGVIRQLESLKGVNQVIDGALVNDYSISKASRDDFGREIPRAAITNYWRYTQPNDGTSFGNIVKDGANTVTATPISGVPNQFPSFKLSATIGTVPASGYARAVLATKYTGTLTGPLSFIAIVKNNTVNGQKLFVTVPDGMGGYIPKFYNIGVGDLQRIEIDLAAEIVEGATWDGSTGGISIGLISGKDLPTEVGTTLELDILSLMLVSGDTMNALAISTSEDFAKKTATSIYIKNPGSLATGVQLIASNGDVVATVNFPRSAAGGYVSSMNITDLTGDWASTLLSKIKYIV
Physico‐chemical
properties
protein length:912 AA
molecular weight: 98655,81850 Da
isoelectric point:4,64638
aromaticity:0,10526
hydropathy:-0,19397

Domains

Domains [InterPro]
IPR021283
TAS
10–146
IPR021283
TAS
10–150
A0AAF0AAT8
1 912
Architecture
TAS
STR
TAS 10-149 | STR 150-912
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage ECO71P1
[NCBI]
2968662 Uroviricota > Caudoviricetes > Lindbergviridae > Wifcevirus ECO71P1 >
Host Shigella flexneri
[NCBI]
623 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WAX14116.1 [NCBI]
Genbank nucleotide accession
OP172789 [NCBI]
CDS location
range 32783 -> 35521
strand -
CDS
ATGCAAAAGTTTGACACAAACCTGTTGCAAAACCTTAAGTGGATGCAGAACAACGCGCCAAACATCACATCTTTGGTTCAAAAGAAAAGTGACTGGTATGAGCGATATCAAGATCAGTTTTGGATTGACTGGTACAAAAACATATTCAACATTGATACTTGCGATGCAATGGGCATCTATATTTGGTGTGACATCCTATGCATACCAAGGGATTTTGTTGACTTAAGCAACTATGAGCACAACTGGGCATTTGGACCCACACGCGACAACTTCGAAGGGCCGGACAACATCGGCGGCAACTTCTTTGGTGCTGGCGGAAAGGTTGTGAAAAACATTGAAGAAGCGCGAATCCTTCTTAAGCTTCGTTATTTGACGCTCACAAGCGATGGCCGCATAGAAAGCATCAACCGTATGCTTAAGTGGATTTTTAACCGTGGAGAAGATTGGGATTACTCTTCATCGCGCTACGCATACGTGACAGACAATACTCAAGGCGCAGTGTCTTCTTCTGAATCAACAAACGGTTCATTCCTTCTGTACGACTGGCAGGGGTCAACTGAGATTCAGCACGATTATAAACAAGTTCTTTTGAGTCACGGCAACAACGCAGGACTTACCATAAGTGGGGTGGCAGGGACCGTTTCGCGAATGGCACACCCAGACTGGTATTCATGGGGCAATGGTTCCGACGCTGGCATTGTCGGCGGTGATACACGACAATATGGCGGCCCTGACTATTACGGCCAACTGTGCGGTGATTACCCAGGAGAGGCCATAGAGAATTTCGCCAACGCAGTACGGGTCATCAAGACCGCTTCCAGCGGCGAGCCTGTGTATTCTTTGGCCAATAATCCAAGCGCCCAATCTATTGCGCTTCCTTCTCAGGCTGGTGTCAACGTTTGCTATAGCTTCTACATAAAAGTGCTTTCTGAAAACGTGGCTTCTGTTTCCGTAACAGAGAACAGTGATGCATCGAGTAATGGCAAGCTTTTCGACCCAGTTACTGGTCAATTCAACCTTGACCCTTCTTTGAGTTATTTCATCACAGGAAAGTTCGACGGTGTGAGAAGGGAATCAAGCCGTGGTGGCATCAAGGCCATCGGCGGCGATTGGTACAGAATCTGGATTGTAGAGCAAACTGCAAACGCGACAGCTGGGGTGAAAACAAGCTTTTCAATTGAGTTGAAAGCAACTGCCGCTGGATCAACCCAGACTCGCGAATTGCCTGAAGGGTCTGATGCACTTCTTTTTGGATACTCCATTTGCGTTTGCGACCCTGTTCAGAATCCTAATGGGCCTGAGCCTTATTTCTATGGGACTCAAAATCCTCCTGCAGATGGAACGGCAATCTTGTCTGTTACGGCAGGAACGCTCAACAACATCTCTTGCGCGAACGGGCCGTTGACAACAAGAGATGGCGTGGTAGGCGAGTCATTCATGCAGCCTGGCTCATCGCTTGCGTACTCTGGTTCAATCGGCGGCTATCCGTACCCAGGCAACGCCACTCTGATCGCAGAAACAGACGGCATCAGACGCTCATGGAACGTTAGCCGATTGGACGGAGTACCGCCGCCGCCGACCGCTGCCTATTACATGGAATATGTCATCGGGCCTTATTTTGCCAACAACTTGTCTGATAACTTTAAGACGCTAGTTGGCGACAGGAGTGAAGGATTCTTGCCAGCAAACGCTGGGATACGTTGGCAACTTTACCAGGAAGAAGCAGACCTTTCAGATATTGACTTGACAGTAAGCGACATCACAGACACCAGAATCGGATTTGAAACGGAAGGTGTTGCATGGATTAACACTGACGGGATAATCAGAGTTGGCGGCATTAACGAGTGGCCGTTGAACAACTTTAACGGTGTCATAAGACAGCTTGAATCTTTGAAAGGTGTAAACCAAGTCATCGACGGCGCACTTGTCAACGATTATTCAATCTCAAAAGCATCAAGAGATGACTTTGGTCGTGAAATCCCACGTGCCGCAATCACAAACTACTGGCGATACACACAGCCCAATGACGGCACATCTTTTGGCAATATCGTCAAGGACGGAGCTAACACGGTAACGGCAACGCCAATCAGCGGTGTTCCCAACCAGTTCCCATCATTCAAGCTTTCCGCTACAATAGGGACTGTGCCAGCATCCGGTTATGCACGCGCCGTACTGGCCACTAAATACACTGGCACCCTGACTGGCCCTTTGTCGTTTATCGCAATAGTCAAAAACAACACGGTTAACGGTCAAAAACTTTTTGTGACTGTACCGGATGGGATGGGCGGCTATATTCCTAAATTCTACAACATAGGCGTAGGCGATTTGCAGCGTATCGAAATTGACCTTGCCGCTGAGATTGTTGAAGGAGCGACTTGGGACGGGTCGACAGGTGGAATCAGTATTGGCTTGATATCAGGGAAAGACTTGCCGACCGAAGTTGGCACAACTCTTGAACTTGACATTTTAAGCTTGATGCTTGTGTCTGGTGATACGATGAATGCGCTTGCTATTTCAACAAGCGAAGATTTTGCAAAGAAAACTGCAACTTCTATATACATCAAAAACCCAGGGTCACTGGCTACTGGAGTTCAATTGATAGCTTCGAATGGGGATGTTGTAGCGACAGTAAACTTTCCAAGAAGTGCTGCTGGTGGTTATGTTTCAAGCATGAATATAACTGACTTAACTGGTGATTGGGCATCAACCCTTTTATCAAAAATTAAATACATCGTGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
cd2dd5438f4304a108b40890ca58d3c355bc8a822d24b32fc45d412294e08ca7
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,4279
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50