Genbank accession
UPW38328.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,76
TF
Evidence Phold
Probability 1,00
Protein sequence
MKKILDSARNYLKNNSRIKTASLISLELPGSTGTNTAFIYLTDYFRDVLYNGILYQAGKVKSISTHKQNRDLSIGSLSFTITGTAQDEVLKLVQNGVSFLDRIVSIHQAIITEDGSILPVDPDTNGPLLYFRGRITGGGIKDNISTSGVGTSTITWNCSNQFYDFDRVNGRFTDDASHRGLEVVAGQLVPSNGAKRLEYQEDYGFFHANKSISILAKYQVQEERYKLKSKKKLFGLSRSYSLKKYYETVTKEVDIDFNLAAKYIPVVYGVQKIPGIPIFADTELHNPNIVYVVYAFAEGEIDGFLDFSFGDNPMICMDSNDSSARTCFGVKKVAGDTMQRIASGISSSSPSVHGQEYKYNDGNGDIRIWTYHGKADQTASEVLVNIAKERGFYLQNMNGNGPEYWDARYKLLDTAYAVVRFTINENRTEIPEVSAEIQGKKVKVYHSDGRVTANSTSLNGIWQTLDYLTSDRYGANITIDQFPLQQLIQEAAILDIIDESYQVSWQPYWRYVGWTDPLAENRQIVQMNTILDTSESVFKNVQGLLESYGGAINNLSGQYRVTVEKYSNTPLEINFLDTYGDLELSDTTGRNKFNSVQASIVDPALSWKTNSITFYNSRYKEQDKNLDKKLQLSFANITNYYTARSFADRELKKSRYSRTLSFSLPYQFIGIEPNDAIAFTYDRYGWDKKYFLVDEVENSREGKINVTLQEYGEDVFINSEQVDNSGNDIPDISNNVLPPRDFKYTPTPGGLVGSIGKNGELSWLPSLTNNVVYYSIVHSGHAEPYIVQQLETNPNERMIQEIIGEPAGLAIFEIRAVDINGRRSSPVTLSIELNSAKNLSVVSNFRVTNTASGDVTEFVGPDVKLAWDKIPEEEIIPEIYYTLEIHDSQDRMLRSIRIEDAYTYDYLLTYNKADFALLNSGALGINRKLRFRIRAEGESGEQSVSWATI
Physico‐chemical
properties
protein length:949 AA
molecular weight: 106909,29360 Da
isoelectric point:5,23563
aromaticity:0,11064
hydropathy:-0,40179

Domains

Domains [InterPro]
UPW38328.1
1 949
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS_ESCO30
[NCBI]
2918879 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UPW38328.1 [NCBI]
Genbank nucleotide accession
OM386657.1 [NCBI]
CDS location
range 29537 -> 32386
strand +
CDS
ATGAAGAAAATACTAGATAGTGCTAGAAACTACTTAAAAAATAATAGCAGAATAAAAACTGCTAGTCTAATTTCCCTAGAATTACCTGGCTCTACTGGTACTAATACTGCTTTTATTTATTTAACGGATTATTTTAGAGATGTACTATATAATGGTATTTTGTACCAGGCCGGTAAAGTTAAGTCTATTAGCACACATAAACAAAATAGAGATTTATCTATTGGTAGTCTATCTTTTACTATTACTGGTACAGCGCAGGATGAAGTACTAAAATTAGTGCAGAATGGTGTGTCCTTTTTAGATAGAATAGTATCAATCCACCAAGCAATTATTACAGAGGATGGTTCTATTTTACCTGTAGACCCAGACACAAATGGCCCTTTATTATATTTTAGAGGGAGAATTACCGGTGGGGGTATTAAGGACAACATTAGTACTTCTGGGGTTGGAACTTCCACAATTACTTGGAATTGTTCTAACCAATTCTATGATTTTGATAGGGTTAATGGTAGATTTACTGATGATGCTTCTCATAGGGGGCTTGAGGTTGTAGCAGGACAATTAGTTCCATCTAACGGGGCTAAAAGACTTGAGTACCAAGAAGATTATGGGTTCTTTCACGCTAATAAAAGTATCTCTATTCTAGCAAAGTATCAGGTACAGGAAGAAAGATACAAGCTAAAGTCTAAGAAAAAGCTATTTGGACTATCTAGAAGTTATAGTCTTAAAAAGTATTATGAGACTGTTACTAAAGAAGTAGATATAGATTTTAACCTTGCTGCTAAGTATATACCAGTAGTTTATGGTGTACAGAAAATACCGGGAATACCTATTTTTGCGGATACGGAATTACACAATCCTAATATAGTTTACGTAGTATATGCTTTTGCTGAAGGAGAGATAGACGGTTTTCTTGACTTTTCCTTTGGTGATAACCCTATGATTTGTATGGACTCTAATGATAGCTCTGCTAGAACCTGTTTCGGTGTTAAAAAAGTAGCAGGAGACACCATGCAAAGAATAGCATCAGGAATATCTTCTAGTAGTCCTTCTGTGCATGGCCAGGAATATAAATATAACGATGGTAATGGTGATATAAGGATTTGGACTTATCATGGAAAAGCTGATCAAACGGCTTCTGAAGTATTAGTAAATATAGCGAAGGAACGTGGGTTCTATCTCCAAAATATGAATGGCAATGGCCCTGAATACTGGGATGCTAGATATAAGCTACTAGATACTGCTTATGCCGTAGTACGTTTTACTATTAATGAAAATAGAACTGAGATTCCAGAAGTTAGTGCTGAAATTCAAGGTAAAAAAGTAAAAGTCTATCATTCTGATGGTAGAGTAACTGCTAATAGTACTAGTTTAAATGGTATTTGGCAAACACTTGATTACTTAACCTCTGATAGATATGGCGCTAATATTACCATTGATCAGTTCCCTCTTCAGCAACTAATACAGGAAGCAGCTATTTTAGATATTATAGATGAATCCTATCAGGTATCTTGGCAGCCATATTGGAGATACGTTGGGTGGACTGATCCACTAGCAGAAAATAGACAAATAGTACAAATGAATACTATTCTGGATACATCTGAATCAGTATTTAAAAATGTGCAAGGTTTGTTAGAGTCCTATGGTGGGGCTATTAACAATTTATCTGGCCAGTATAGGGTTACTGTAGAAAAATACTCTAATACTCCATTAGAGATTAATTTTCTAGATACTTACGGTGATTTGGAGCTATCAGATACTACTGGTAGAAATAAATTTAACTCAGTTCAAGCATCTATCGTAGATCCCGCCCTTAGCTGGAAAACTAATTCCATTACATTCTATAACTCCAGGTATAAGGAACAGGATAAGAACCTAGATAAAAAATTACAACTATCTTTTGCTAATATTACTAATTACTATACTGCAAGAAGTTTTGCTGATAGGGAACTTAAGAAATCCAGATACTCAAGAACACTTTCTTTCTCATTGCCATATCAATTCATTGGTATTGAGCCTAATGACGCTATTGCATTTACATACGACCGTTATGGGTGGGATAAGAAGTACTTCCTAGTAGACGAAGTGGAAAACTCCAGGGAAGGAAAGATAAATGTTACCCTACAGGAGTATGGAGAAGATGTATTCATCAACTCCGAGCAGGTTGATAATAGTGGTAATGATATTCCTGATATTAGTAATAATGTCCTTCCTCCTAGAGACTTTAAATATACCCCTACTCCTGGCGGTTTAGTAGGCTCTATAGGTAAAAATGGTGAGTTATCCTGGCTTCCGAGTCTAACCAATAATGTAGTTTATTACTCTATTGTGCACTCAGGCCATGCCGAACCTTACATAGTACAACAGTTAGAGACCAATCCTAACGAACGTATGATCCAAGAAATAATTGGAGAACCAGCAGGTCTGGCTATATTTGAGATAAGGGCAGTAGATATTAATGGTAGAAGAAGTTCTCCAGTGACTCTGTCCATAGAACTTAACTCCGCTAAAAACCTTAGTGTAGTATCTAATTTTAGAGTAACTAATACTGCTTCTGGAGATGTAACTGAGTTTGTTGGCCCAGATGTGAAACTAGCCTGGGATAAGATACCTGAGGAAGAAATAATTCCAGAAATTTATTATACCCTAGAAATACATGATTCTCAGGATAGAATGCTTCGTAGTATTAGGATAGAGGATGCCTATACTTATGACTATCTATTAACATATAATAAGGCGGATTTTGCTTTGCTGAACTCAGGTGCTCTAGGTATCAATCGTAAATTACGATTTAGAATACGGGCTGAAGGAGAAAGCGGAGAACAATCTGTTAGTTGGGCCACTATTTAA

Tertiary structure

PDB ID
5b2f3a1b3ffca0074a3d8e235e20707158f43eee8897c77acbcd186649fc4891
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,8553
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Avian Pathogenic Escherichia coli bacteriophages Nicolas,M., Trotereau,A. and Schouler,C. 2015-06 GenBank