GenBank accession
URN70742.1 [GenBank]
Protein name
tail fiber
RBP type
TF
Evidence GenBank
Probability 1.00
Protein sequence
MTDKLIRELLIDVKQKGATRTAKSIENVSDALENAAAASELTNEQLGKMPKTLYSIERAADRAAKSLTKMQASRGMLSVTKSINDIGTKLDDLAITMIEVADKLEVGFDGVSRSVKTMGNDVAAATEKVQDRLYDTNRALGGTARGFNDTAGAAGRASRAIGNTSGSARGATRDFAAMAKIGGSLPIMYAALASNIFVLQSAFEQLKLGDQLNRLEKFGVIVGTQTGTPVQTLARSLQEAAGYAISFEEAMRQASSASAYGFDAEQLNKFGLVARRAAAVLGVDMTDALNRVIKGVSKQEIELLDELGVTIRLNDAYADYVKQLNAANTGITYNVNSLTTFQKQQAYANAVIAESTKRFGYLDEVLRATPWEQFAANADAALRKIQQAAAKYLGPVIDAINTVFYTSQASISAEAARAQEKTNRQIDPTNVGAVALSLAASEEGYNKALDMYKESLDKRNKLKSEFDKRMEQADFYTKLAIRQVGEGIPVGLAAAGASEANKKFVEETAAMGLQVTRLGKEVEDSTENLNAWKSAYQAAGAAAAKANPEFQKQINLQRDTTDPDAVYDFNSTVLKGLTEQQKAYNQTKKTASDLANDIQNVAQNTDTAAKTSATLADAIKNIESLSLGTGKSADEYVKNLNLGYNTLSEMKTASQALSEYVKLTGNETKNQLAVQQKIADVYNQTKDKEKAQEAGRRLELQQLEEQEAALRRVLQTNQGNKAVEKEIEKIQLEKLKLTNQGMEGQKKVKDYTDKILGVDREIALLNDRTMTDTQYRLAQLNLELTIEKEKYEWYTKQADKQKEAEQSRRSQAQIEREIWKFRQNQQAEMTSKRQEAFENTLTSMFPLAGEMQKMEMQLDFYTQMKELTKGNANEQMRWNAEIAKTRAQMSALTAQRNAQMQQQVGSSVGATYTPTTGLSGADKDFADMQNRMSSYDQAISKLSELNSEATAVAQSMGNLTNSMIQFSQGSLDTTSMIASGMQTVASMIQYSTSQQVSAIDQAIAAEQKRDGKSEASKAKLKKLEAEKLKIQQDAAKKQIIIQTAVAVMQAATAVPYPFSIPLMVAAGLAGALALAQASSASGMSSIADSGADTTSYLTLGERQKNIDVSMSANAGELSYIRGDKGIGGANSFVPRAEGGNMYPGVSYQMGEHGTEVVTPMVPMKATPNDELKTSSNSTSGRPIILNISAMDAASFREFASSNSSALRDAVELALNENGASLKTLGNS
Physico-chemical properties
protein length: 1227 AA
molecular weight: 132923.01500 Da
isoelectric point: 5.82522
aromaticity: 0.05542
hydropathy: -0.44344
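The record does not say how the aromaticity and hydropathy values were computed. A minimal sketch follows, assuming aromaticity is the fraction of Phe, Trp, and Tyr residues and hydropathy is the mean Kyte-Doolittle index (GRAVY). The function names `aromaticity` and `gravy` are illustrative and not part of the source database.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: "aromatic" means Phe, Trp, or Tyr.
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y."""
    seq = seq.upper()
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


if __name__ == "__main__":
    # First 10 residues of the protein sequence above, used as a toy input.
    fragment = "MTDKLIRELL"
    print(f"aromaticity: {aromaticity(fragment):.5f}")
    print(f"hydropathy:  {gravy(fragment):.5f}")
```

Under this assumed definition, the reported aromaticity of 0.05542 would correspond to 68 aromatic residues out of 1227 (68/1227 ≈ 0.05542).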

Domains

Domains [InterPro]
Coil Unmapped 686–740
URN70742.1 1-1227
Architecture
TAS 1-81 | STR 82-1189 | TAS 1190-1225
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
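The architecture line can be read programmatically. The sketch below assumes each segment is a three-letter class label followed by an inclusive, 1-based residue range. The helper names `parse_architecture` and `uncovered` are hypothetical.

```python
import re

# Assumption: each segment looks like "<3-letter label> <start>-<end>".
SEGMENT_RE = re.compile(r"([A-Z]{3})\s+(\d+)-(\d+)")


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Return (label, start, end) tuples in the order they appear."""
    return [(label, int(start), int(end))
            for label, start, end in SEGMENT_RE.findall(text)]


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[int]:
    """List residue positions (1-based) not covered by any segment."""
    covered = set()
    for _, start, end in segments:
        covered.update(range(start, end + 1))
    return [pos for pos in range(1, length + 1) if pos not in covered]


if __name__ == "__main__":
    segments = parse_architecture("TAS 1-81 | STR 82-1189 | TAS 1190-1225 |")
    print(segments)
    print(uncovered(segments, 1227))
```

For this record the segments are contiguous from residue 1 to 1225, which leaves residues 1226 and 1227 of the 1227 AA protein outside the annotated architecture.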

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Escherichia phage EC104 [NCBI] | 2936939 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
URN70742.1 [NCBI]
GenBank nucleotide accession
ON185581.1 [NCBI]
CDS location
range 89956 -> 93639
strand -
CDS
ATGACTGATAAGCTAATACGAGAATTACTAATAGACGTTAAACAGAAGGGGGCAACTCGTACTGCAAAGTCTATTGAAAACGTATCTGATGCATTAGAGAATGCTGCTGCGGCTTCTGAGCTAACGAATGAACAGTTAGGTAAAATGCCCAAAACTCTATACTCCATTGAGAGGGCAGCAGATAGAGCAGCAAAAAGTCTTACTAAAATGCAAGCTAGTAGGGGTATGCTTAGTGTTACTAAATCTATTAATGATATAGGAACTAAGCTGGACGACCTTGCTATTACAATGATAGAGGTAGCTGATAAGCTAGAAGTTGGATTCGATGGGGTTTCTAGATCTGTTAAAACAATGGGCAATGATGTTGCAGCTGCAACAGAGAAAGTTCAAGATAGGTTATATGATACTAATAGAGCATTAGGTGGCACAGCTAGGGGTTTTAATGACACTGCTGGTGCCGCTGGTAGAGCTTCTAGAGCTATTGGTAATACTTCTGGTTCAGCACGTGGTGCAACTCGTGATTTTGCAGCAATGGCTAAGATAGGTGGTAGTTTACCTATTATGTACGCAGCTCTTGCTTCCAACATCTTCGTTTTGCAATCAGCGTTCGAACAACTTAAACTAGGTGACCAGCTAAATCGTCTAGAAAAATTTGGTGTTATAGTAGGTACTCAGACAGGTACTCCTGTTCAGACCCTTGCTAGATCACTACAAGAAGCTGCTGGATATGCTATTTCTTTTGAGGAAGCAATGAGACAGGCATCTTCAGCATCTGCTTATGGATTTGATGCCGAACAACTTAATAAATTTGGTTTAGTAGCTCGTCGTGCTGCTGCTGTTCTTGGTGTTGATATGACTGATGCACTTAACCGTGTAATTAAGGGCGTATCTAAACAAGAAATCGAACTTCTGGATGAACTTGGTGTCACCATCCGTCTTAATGACGCATACGCCGATTATGTTAAACAGTTAAATGCTGCAAACACAGGTATAACATATAATGTTAATAGTCTTACCACTTTCCAGAAACAACAAGCATACGCTAACGCGGTAATTGCTGAATCTACTAAGAGGTTTGGCTACCTAGATGAAGTACTACGTGCAACTCCATGGGAGCAATTTGCTGCTAATGCAGATGCCGCCCTGAGAAAGATTCAGCAGGCTGCTGCTAAGTATTTAGGGCCAGTAATTGATGCTATCAATACTGTATTCTACACTTCTCAGGCTTCGATATCTGCTGAGGCAGCTAGAGCGCAAGAAAAAACTAATAGACAGATAGATCCTACCAACGTTGGTGCTGTTGCTTTAAGTTTGGCTGCTTCTGAAGAAGGCTATAATAAAGCTCTAGATATGTATAAGGAATCTCTTGATAAGCGTAATAAGCTAAAATCTGAGTTCGACAAACGAATGGAACAAGCAGATTTCTATACAAAACTAGCCATACGTCAAGTTGGTGAAGGTATTCCTGTTGGTCTTGCTGCTGCTGGTGCTTCCGAAGCTAATAAAAAGTTCGTAGAAGAAACTGCGGCTATGGGTCTACAAGTAACCAGACTAGGTAAGGAAGTAGAAGATTCTACTGAAAACCTTAATGCTTGGAAGTCAGCCTATCAAGCTGCGGGTGCTGCTGCTGCAAAAGCTAATCCAGAGTTTCAGAAACAGATTAATCTACAGAGAGATACTACTGATCCTGATGCTGTATACGATTTTAACTCTACTGTATTAAAAGGACTAACTGAGCAGCAGAAAGCGTACAATCAGACTAAGAAAACGGCTAGTGACTTAGCTAATGACATACAGAACGTTGCTCAGAATACAGATACTGCTGCTAAAACTAGTGCTACTTTAGCAGATGCTATAAAAAATATAGAATCTCTATCTCTAGGTACTGGTAAAAGTGCAGATGAGTACGTTAAAAATCTTAACCTAGGCTATAACACTCTGTCTGAAATGAAAACTGCGTCTCAGGCCTTATCTGAGTATGTTAAACTAACCGGTAATGA
AACTAAAAATCAGTTAGCAGTTCAACAGAAGATAGCTGACGTATACAACCAAACTAAAGATAAAGAAAAGGCACAGGAAGCCGGTAGGCGTTTAGAACTCCAACAGTTAGAAGAGCAAGAAGCTGCTTTACGCCGTGTACTTCAAACTAATCAGGGAAATAAAGCTGTTGAGAAAGAAATTGAAAAAATTCAGCTGGAGAAACTTAAACTTACTAATCAGGGTATGGAGGGTCAGAAGAAAGTTAAGGATTATACAGATAAAATTCTGGGTGTAGACCGTGAGATAGCTCTTCTGAATGACCGTACTATGACAGATACTCAGTATCGTCTAGCACAGTTAAATCTTGAATTGACCATTGAAAAAGAGAAGTACGAATGGTATACAAAACAAGCGGACAAACAGAAAGAGGCTGAACAGTCTAGACGTTCTCAAGCCCAAATTGAACGTGAAATATGGAAATTCCGTCAGAACCAGCAAGCTGAAATGACCAGTAAAAGGCAAGAAGCCTTTGAAAATACTCTAACTTCCATGTTCCCTCTGGCAGGGGAAATGCAAAAAATGGAAATGCAGTTAGATTTTTATACTCAGATGAAAGAACTTACCAAAGGTAACGCTAATGAGCAAATGCGTTGGAATGCTGAAATAGCTAAGACAAGGGCTCAGATGTCAGCTTTAACAGCACAACGTAATGCTCAGATGCAGCAACAGGTAGGTTCTTCTGTTGGAGCTACTTATACTCCTACAACTGGCCTATCTGGAGCAGATAAAGATTTTGCTGATATGCAGAATAGGATGTCTTCGTATGACCAAGCAATTTCTAAACTATCTGAGTTAAATTCCGAAGCAACTGCTGTAGCTCAAAGCATGGGTAATTTAACTAACTCTATGATTCAGTTCTCCCAGGGATCCCTAGATACTACGTCTATGATTGCTTCTGGTATGCAGACTGTAGCCTCGATGATTCAATATAGTACTAGCCAACAAGTTAGTGCAATTGATCAGGCTATTGCAGCAGAACAGAAACGTGATGGTAAATCAGAAGCATCTAAAGCTAAGTTGAAGAAGTTGGAAGCTGAAAAGCTGAAGATTCAACAAGACGCAGCTAAGAAGCAGATTATCATCCAAACTGCAGTAGCTGTAATGCAAGCGGCAACAGCTGTACCATACCCGTTCTCTATTCCTTTAATGGTTGCGGCAGGTTTAGCGGGTGCATTGGCGTTAGCACAAGCATCTTCTGCATCTGGCATGTCTTCTATTGCAGATTCTGGGGCGGATACAACTAGTTATTTAACCTTAGGAGAACGTCAGAAGAATATAGATGTGTCCATGTCTGCTAATGCTGGTGAATTATCTTATATTCGTGGCGATAAAGGCATAGGCGGTGCTAACTCTTTCGTTCCTCGTGCTGAGGGTGGTAATATGTACCCTGGGGTTAGCTATCAAATGGGTGAGCATGGTACAGAAGTAGTTACCCCTATGGTTCCTATGAAAGCTACACCTAATGATGAGTTAAAAACCTCATCTAACTCAACTTCAGGAAGACCTATCATCTTGAATATTAGTGCTATGGATGCTGCAAGTTTTAGAGAGTTTGCTTCTAGTAATAGTAGTGCTCTAAGAGACGCAGTAGAATTAGCTCTGAATGAGAACGGTGCTAGTCTGAAAACATTAGGAAATTCTTAA
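The CDS coordinates can be checked against the reported protein length. This sketch assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends in a single stop codon. The function names are illustrative. The CDS shown above starts with ATG and ends with TAA, so it is already displayed in coding orientation. Only a sequence taken directly from the genome record would need the reverse complement for the minus strand.

```python
# Translation table for complementing DNA bases (case preserved).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residue count, assuming one terminal stop codon is not translated."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive range and orient it by strand."""
    segment = genome[start - 1:end]
    return reverse_complement(segment) if strand == "-" else segment


if __name__ == "__main__":
    n = cds_length(89956, 93639)
    print(n, protein_length(n))
```

With the coordinates from this record, the range spans 3684 nucleotides, or 1228 codons including the stop, which matches the 1227 AA protein length.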

Tertiary structure

PDB ID 2312ed73f2722618a887395f04e23719546ec3e28b7fed154366256f970ccd9d
Source ESMFold
Method ESMFold
Resolution 0.6121
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
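The confidence bands can be applied to per-residue pLDDT scores as sketched below. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 fall. Assigning those boundary values to the lower band is an assumption, and the function name `classify_plddt` is illustrative.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: exact boundary values (90, 70, 50) go to the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 80.0, 61.5, 32.0):
        print(value, classify_plddt(value))
```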

Literature

Title: Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces
Authors: Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L.
Date: 2022-10-20
PMID: not listed
Source: GenBank