UniProt accession
A0AAE8B0H4 [UniProt]
Protein name
Lateral tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,84
TF
Evidence RBPdetect2
Probability 0,79
Protein sequence
MAITTKIIVQQILNIDDTKATASKFPRYTVTLGNSISSITANELVSSIEAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAGASEASATQSATSATQSAASADKSAESAAAAKVSETNSKTSEADAKTSETNAKTSETNAKSSETKAKASETNAKASETNAAASAAAAKLSETNAKVSETNAAASAADSSGFRNEAETFSTQAATSASAAKTSETKAKTSETNAKTSETKAKASETNAASSATSASQSVTTIQGLKSDVEQLKSDTQAIKNSAVTETTAIKADVAQLKTDTQGIKNSAVTETTALKNQAATSATNAASSATEAGKQATNAANSAKTAKTEADRSKTEADRSEAAANSTPDIQPLPDVWIPFNDSLDMLAGFAPGYKQITVGDDVIKMPSDKVVDFKRASGATYINKSGVLTVAEINEPRFERDGLLIEGQRTNYHLNSLTPSKWGATTSVTITESGVDEFGFTYGRFQIKDEKIGTNTTMNIAAIPGGRGVDVTGTEKYVTTSCRVKSDSANIQCRIRFERYDGSAYFYLADAYLNITDMSIRKTGEGAARITARAEKESNGWIYFEVTYQSEAIDNMVGSQIQIAPPVSPGTYSGGEYLQVATPQFEGGACASSFIITEATPVTRASDMVTLPIKNNLYNLPFTVLVEAHKNWSITPNAAPRVFDTSGHQTGAAIILAFGSAEGDNDGFPYCDIGKSNRRVYENAKLKKMIMGMRVKSDYNNCCVSNARISSETKTEWRYIQSTATIRIGGQTNTGERHLFGHIRNFRIWHKELTDRQMKEYV
Physico‐chemical
properties
protein length:796 AA
molecular weight: 84314,13870 Da
isoelectric point:5,96761
aromaticity:0,05905
hydropathy:-0,46520

Domains

Domains [InterPro]
DC_0608
ATT
1–203
DC_0166
STR
188–261
DC_0141
STR
240–760
Coil
Unmapped
254–274
A0AAE8B0H4
1 796
Architecture
ATT
STR
ATT 1-203 | STR 204-760 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage IrisVonRoten
[NCBI]
2852004 Uroviricota > Caudoviricetes > Demerecviridae > Tequintavirus > Tequintavirus irisvonroten
Host Escherichia coli K-12
[NCBI]
83333 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QXV80203.1 [NCBI]
Genbank nucleotide accession
MZ501075 [NCBI]
CDS location
range 22352 -> 24742
strand +
CDS
ATGGCAATAACTACTAAGATTATTGTACAACAAATATTAAATATTGATGATACTAAAGCTACTGCTAGTAAGTTTCCTAGATACACAGTAACTCTTGGAAATTCTATTAGCTCTATTACTGCTAATGAGTTAGTATCCTCTATAGAGGCCGCTGCTAAATCTGCTGCGGCTGCAAAAGATTCTGAAATAGCAGCTAAGACTTCAGAGCTTAATGCTAAGAACTCTGAACAGGAAGCTGCTATTTCTGCCGGAGCTTCAGAAGCTTCTGCTACTCAGTCTGCTACATCTGCTACTCAATCTGCTGCATCAGCGGATAAATCTGCAGAATCAGCCGCAGCAGCTAAAGTATCCGAGACTAACTCGAAGACTAGTGAGGCTGATGCAAAGACTAGCGAAACTAATGCAAAAACATCAGAAACTAATGCAAAGTCTAGTGAAACCAAAGCTAAGGCCTCCGAGACTAATGCTAAGGCTAGTGAAACTAATGCTGCTGCTTCGGCTGCAGCGGCAAAACTTAGTGAGACTAACGCAAAGGTCAGTGAGACTAATGCTGCCGCTTCTGCTGCGGATTCCAGTGGTTTTAGGAATGAGGCGGAAACATTCTCTACACAAGCTGCTACATCAGCGTCTGCTGCAAAAACCTCCGAAACAAAGGCTAAAACTAGCGAAACTAATGCAAAAACTAGTGAGACTAAAGCCAAAGCTAGTGAGACTAACGCAGCTAGTTCCGCAACCTCAGCTAGTCAGTCAGTAACTACTATACAAGGCTTAAAATCAGATGTTGAGCAACTAAAATCAGATACTCAGGCCATTAAAAACAGTGCTGTAACAGAGACAACAGCTATAAAGGCAGATGTTGCGCAGTTAAAAACAGACACACAAGGTATTAAGAATAGTGCGGTAACAGAAACTACCGCGTTAAAAAACCAAGCTGCAACTTCCGCAACTAATGCAGCTAGTTCTGCAACAGAAGCAGGAAAACAAGCTACTAATGCTGCCAATAGTGCTAAGACTGCCAAGACTGAGGCCGACCGTTCAAAAACTGAGGCTGATAGATCAGAAGCTGCTGCTAATTCTACCCCCGACATTCAACCTCTTCCAGATGTATGGATACCTTTCAACGACTCGCTGGACATGCTAGCAGGATTTGCACCAGGCTATAAGCAAATAACTGTAGGTGATGATGTTATTAAAATGCCATCCGATAAGGTTGTGGATTTCAAACGGGCGTCAGGTGCGACGTACATTAATAAATCTGGCGTTTTAACCGTTGCAGAGATTAACGAGCCTAGATTTGAACGAGACGGCTTATTAATTGAGGGGCAAAGAACTAACTATCATCTTAATTCACTTACGCCATCTAAGTGGGGAGCTACAACAAGTGTAACTATAACAGAAAGTGGTGTTGATGAGTTTGGCTTTACTTATGGACGGTTTCAAATAAAGGACGAAAAAATTGGGACAAATACGACAATGAATATCGCTGCGATTCCAGGAGGAAGAGGTGTCGATGTTACTGGAACCGAAAAGTATGTTACAACATCATGTCGTGTAAAAAGTGATAGTGCTAATATACAATGTCGTATAAGATTTGAAAGATATGACGGGTCCGCATATTTTTATCTGGCAGATGCATATCTTAATATAACAGATATGTCCATTAGAAAAACTGGAGAAGGGGCTGCAAGAATAACCGCCAGAGCGGAGAAAGAATCTAATGGATGGATTTATTTCGAGGTTACATATCAATCTGAAGCTATTGATAATATGGTTGGCTCTCAGATCCAAATTGCTCCACCTGTGTCTCCTGGCACTTATTCTGGTGGAGAGTATTTACAGGTAGCTACACCACAATTTGAAGGCGGGGCGTGCGCTTCATCGTTTATTATTACAGAAGCGACTCCAGTAACACGGGCAAGCGATATGGTCACATTACCAATAAAAAACAACCTTTACAATTTGCCGTTTACAGTACTTGTGGAAGCTCATAAAAACTGGAGTATAACGCCAAATGCAGCGCCACGTGTTTTTGATACCAGCGGTCATCAAACCGGAGCGGCTATTATTCTTGCTTTCGGATCTGCCGAAGGCGATAATGATGGATTCCCTTATTGCGATATAGGTAAATCAAATAGAAGGGTTTACGAGAATGCAAAACTAAAAAAGATGATCATGGGAATGAGAGTAAAGAGTGATTACAATAATTGCTGTGTAAGTAATGCCAGAATATCAAGCGAAACAAAAACGGAATGGAGATACATACAATCAACGGCTACTATAAGGATTGGAGGACAAACAAATACTGGAGAACGACATTTATTTGGCCATATCCGAAACTTTAGAATCTGGCATAAGGAACTAACTGATAGACAAATGAAGGAGTATGTATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
30a164e6557f67a0b27b7f957609805c855d03a4a579c306a8cd79e181905a51
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7182
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50