Genbank accession
URC25446.1 [GenBank]
Protein name
non-contractile tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,73
TF
Evidence RBPdetect2
Probability 0,93
Protein sequence
MSYTFTEHIANGTQVTYPFSFAGRDKGYLRASDVIVESLQGNTWIEVTSGWQLTGTHQITFDVAPVAGLKFRIRREVQKEYPYAEFDRGVTLDMKSLNGSFIHILEITQELLDGFYPEGYFIKQNVSWGGNKITDLADGTNPGDAVNKGQLDAIDKKHTDWNAKQDIEIAGLKAGMTSGIAHRTVPWYTIAQGGEISVKPPYEFQDALVFLNGVLQHQIVGAYSISNNTITFAEPLVAGTEVYVLIGSRVATSEPNIQLELNFDLVEGQQVVQIGSAFKYIEVYLDGLLQPKLAYQVDGDIVTFSEGVPECRMTAKIITA
Physico‐chemical
properties
protein length:320 AA
molecular weight: 35358,48640 Da
isoelectric point:4,83594
aromaticity:0,10625
hydropathy:-0,11188

Domains

Domains [InterPro]
DC_0041
STR
1–312
IPR011049
STR
128–154
G3DSA:6.10.250.2040
STR
133–185
URC25446.1
1 320
Architecture
STR
STR 1-312 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EC120
[NCBI]
2936938 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
URC25446.1 [NCBI]
Genbank nucleotide accession
ON185580.1 [NCBI]
CDS location
range 34156 -> 35118
strand +
CDS
ATGAGTTATACTTTCACAGAACATATAGCCAACGGTACGCAAGTAACCTATCCCTTTAGCTTTGCTGGTAGGGATAAAGGTTATCTTCGTGCCTCAGATGTGATAGTGGAGTCTCTTCAAGGTAACACTTGGATTGAGGTTACATCTGGCTGGCAACTAACTGGCACGCACCAGATTACTTTTGATGTAGCACCAGTTGCAGGTTTGAAGTTCCGTATTAGAAGGGAAGTACAAAAGGAGTATCCATACGCTGAGTTTGACCGTGGTGTTACCTTGGATATGAAGTCTTTAAATGGTTCTTTCATTCATATACTGGAGATTACACAGGAGTTACTTGACGGGTTTTATCCAGAAGGATACTTCATTAAACAGAATGTAAGCTGGGGCGGCAATAAGATTACTGATTTGGCTGATGGCACAAATCCGGGAGATGCAGTAAATAAAGGGCAGCTTGATGCCATCGACAAGAAGCATACAGATTGGAACGCCAAACAGGACATTGAGATTGCTGGCCTTAAGGCTGGTATGACTTCTGGTATTGCGCACAGAACTGTTCCTTGGTACACGATAGCCCAAGGTGGTGAGATTTCCGTAAAACCACCTTATGAATTTCAAGATGCACTAGTTTTCCTTAATGGGGTGTTGCAGCACCAAATTGTAGGCGCATACTCTATAAGCAACAACACTATCACTTTCGCAGAGCCGCTTGTGGCTGGTACAGAGGTGTATGTGCTGATTGGTAGTCGTGTGGCTACATCTGAACCTAATATTCAGTTAGAGTTGAACTTTGACTTAGTAGAAGGCCAACAAGTAGTACAGATTGGCTCTGCATTTAAGTACATTGAGGTCTACCTTGATGGATTATTACAACCTAAACTTGCTTATCAGGTAGACGGTGACATTGTTACTTTCTCAGAGGGAGTGCCAGAATGCCGGATGACTGCTAAGATTATCACAGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
aec4f2edb29c06af6575c85cad10d56a76ce3581b9affe7e744e848fc20c8635
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8540
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequences of 17 Escherichia coli bacteriophages isolated from wastewater, pond water, cow manure and bird feces Vitt,A.R., Ahern,S.J., Gambino,M., Holst Sorensen,M.C. and Brondsted,L. 2022-10-20 GenBank