GenBank accession: YP_010673456.1
Protein name: tail fiber protein
RBP type Evidence Probability
TSP DepoScope 1.00
TF GenBank 1.00
TF Phold 1.00
TSP RBPdetect 0.85
TF RBPdetect2 0.90
TF UniProt/TrEMBL 1.00
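
The evidence rows above pair an RBP type call (TF or TSP) with a source and a probability. One simple way to summarize them is to sum the probabilities per type. This is a hypothetical tally for illustration only; the page does not say how, or whether, the sources are combined, and the names `EVIDENCE` and `tally_by_type` are invented here.

```python
from collections import defaultdict

# (RBP type, evidence source, probability) as listed in the record.
EVIDENCE = [
    ("TSP", "DepoScope", 1.00),
    ("TF", "GenBank", 1.00),
    ("TF", "Phold", 1.00),
    ("TSP", "RBPdetect", 0.85),
    ("TF", "RBPdetect2", 0.90),
    ("TF", "UniProt/TrEMBL", 1.00),
]


def tally_by_type(evidence):
    """Sum the probabilities reported for each RBP type (illustrative only)."""
    totals = defaultdict(float)
    for rbp_type, _source, probability in evidence:
        totals[rbp_type] += probability
    return dict(totals)


if __name__ == "__main__":
    for rbp_type, total in sorted(tally_by_type(EVIDENCE).items()):
        print(f"{rbp_type}: {total:.2f}")
```

A plain sum treats every source as equally reliable, which is an assumption; weighting sources differently would change the picture.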
Protein sequence
MARELMPKSGIMMPHVVVTRDAAVVGVSTVDGVAGAVDLTGKYLQKTDAEDTYQTKTEGASKTYVLDSIQPIMEGALFKADPFVDNNIPFRSAGANGVESVDMIKVTTENSIKLGSYESSVQGVEIHSAGRVVVVDKNDSGVETKYPVYSKRFRPEIEDLPFAAIGSYVKDSKGRTIGVNRTGINSDIKQFTQKVTFQQPVTIADGVGDYDAVTMRQLRNSGGGSGGPTMSGISNFGIGDFHLRDSRAYIQPYEVVSDGQLLNRADWPELWAYAQMLSPISDTDWLADPAKRGKYSLGNGTTTFRVPDRNGVQSGSVPALFGRGDGGDSSHNGRVEESAAPDIKGGFTTSSFVYNGGVYSLVQNRSGAFFAGGSVTDSAINIGSVTSGQQSFGTDFSASKSSPVYGRQSVSEIRPNSFYGVWVIRASGGFVAANTSWSVKNADATAPSSGTTVRGGEVISEYSNQEGKRFARLYSDTVWGSPGAAVIEAEGAGKHAYKFYEDGTVDFRKTGSSNRVNMYTTTSSWDNVSTASHTDLTSGGVDRVLESTASDTLVLGFSIGAVRPDAYPTKVGFHQYIWPESSYNFGSAVISIGSDSSWCRYMFGVDGQFWGSTEQGEFTYTRNGVSDSRVKHDITPTTVDKAWQNLKALQFVTFIYNNDEQERQRRGLIAQQAETVDSLYVKTRMYPGKNIGDPMVEQKELDTTPMLLDTMHVVQKLIAEIDALKAEIADLKASK
Physico‐chemical properties
protein length: 735 AA
molecular weight: 79392.14890 Da
isoelectric point: 5.37557
aromaticity: 0.09524
hydropathy: -0.36585
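
The page does not state how these values were computed. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score, often called GRAVY), two of them can be computed from a sequence as follows. Molecular weight and isoelectric point are left out because they need residue mass and pKa tables.

```python
# Kyte-Doolittle hydropathy scale (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are aromatic (F, W or Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence listed in this record.
    fragment = "MARELMPKSGIMMPHVVVTR"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 735-residue sequence instead of the fragment gives whole-protein values to compare with the figures listed above.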

Domains

Source: InterPro
Accession Type Range (AA)
DC_1514 STR 20–315
SSF88874 STR 257–428
IPR030392 CHP 626–676
Sequence track: YP_010673456.1, residues 1–735
Architecture: STR 20–735
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (YP_010673456.1, residues 1–735)
Domain Start End Length (AA) Confidence
N-terminal 1 101 101 0.9773
Central domain 102 300 200 0.0897
C-terminal 301 735 434 0.8835
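
The segmentation table can be turned into a small lookup that maps a residue position to its domain and checks that the segments tile the protein without gaps or overlaps. This is a sketch that assumes 1-based, inclusive start and end coordinates; `Segment`, `domain_at` and `is_contiguous` are illustrative names, not part of the source database.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    name: str
    start: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)
    confidence: float


# Start, end and confidence values copied from the segmentation table.
SEGMENTS = [
    Segment("N-terminal", 1, 101, 0.9773),
    Segment("Central domain", 102, 300, 0.0897),
    Segment("C-terminal", 301, 735, 0.8835),
]


def domain_at(position: int, segments=SEGMENTS) -> str:
    """Return the name of the segment containing a residue position."""
    for seg in segments:
        if seg.start <= position <= seg.end:
            return seg.name
    raise ValueError(f"position {position} is outside all segments")


def is_contiguous(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.start != expected_start:
            return False
        expected_start = seg.end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    print(domain_at(150))
    print(is_contiguous(SEGMENTS, 735))
```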
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1–101
Central: 102–300
C-terminal: 301–735

Taxonomy

Phage
Name: Escherichia phage O18-011
Taxonomy ID: 2742113
Lineage: Uroviricota > Caudoviricetes > Mktvariviridae > Kuravirus > Kuravirus O18011
Host
Name: Escherichia coli
Taxonomy ID: 562
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession: YP_010673456.1
GenBank nucleotide accession: NC_070985.1
CDS location: range 28358 -> 30565, strand -
CDS
ATGGCTAGAGAGTTAATGCCCAAATCTGGCATAATGATGCCTCATGTAGTAGTAACCAGAGATGCTGCGGTTGTTGGTGTCTCCACTGTAGATGGAGTTGCTGGTGCAGTTGATCTTACTGGTAAATACCTACAGAAGACAGATGCGGAAGATACCTATCAAACAAAAACTGAAGGTGCTTCTAAGACTTATGTTTTAGATTCTATTCAACCTATCATGGAAGGTGCTTTGTTTAAAGCTGACCCTTTCGTAGATAATAATATCCCTTTCCGTTCTGCTGGAGCTAATGGTGTGGAATCTGTGGATATGATTAAGGTCACTACAGAGAACTCTATTAAACTTGGAAGTTACGAGTCTTCTGTTCAAGGAGTAGAGATTCACTCTGCTGGTCGTGTTGTGGTTGTTGATAAGAACGACTCTGGTGTAGAAACTAAATATCCAGTTTACTCTAAACGTTTCCGTCCTGAGATTGAAGATTTACCATTTGCAGCCATTGGCTCTTATGTTAAGGACTCTAAAGGTCGTACTATTGGTGTGAACCGTACAGGAATTAACTCTGATATCAAACAGTTTACTCAGAAGGTAACTTTCCAACAGCCTGTAACAATCGCAGACGGTGTGGGTGACTATGACGCTGTGACTATGAGACAGCTTCGTAACAGTGGAGGTGGCTCTGGTGGTCCTACCATGAGTGGTATTTCTAACTTTGGTATTGGTGATTTCCACTTACGCGATAGCCGTGCATACATTCAGCCATATGAGGTAGTGTCAGACGGGCAATTGCTTAATCGCGCAGACTGGCCTGAACTTTGGGCTTATGCGCAAATGCTGTCACCTATCAGTGATACAGACTGGCTGGCGGACCCAGCAAAACGCGGTAAATACTCGCTTGGTAACGGGACGACAACCTTCCGCGTACCTGACCGTAACGGTGTGCAGTCTGGTAGCGTGCCTGCACTTTTCGGTCGTGGTGACGGAGGTGATTCGTCGCATAATGGTCGCGTTGAAGAGTCAGCAGCACCAGATATAAAAGGTGGCTTCACCACGAGTAGCTTCGTCTACAACGGTGGCGTATACTCCTTAGTGCAAAACCGTAGCGGAGCCTTCTTTGCAGGCGGGTCAGTCACTGATTCAGCGATTAATATTGGTTCTGTCACTAGCGGTCAACAGTCATTCGGCACAGACTTTTCAGCAAGTAAAAGCAGTCCTGTGTACGGTCGACAGAGTGTGTCAGAGATACGACCTAATTCTTTTTATGGCGTCTGGGTAATCCGTGCTTCTGGTGGTTTCGTGGCGGCCAACACGTCGTGGAGTGTTAAGAACGCAGACGCGACCGCTCCTTCGTCCGGTACCACTGTTCGTGGTGGTGAGGTGATATCTGAGTATTCTAACCAGGAAGGTAAGCGGTTTGCTCGTCTATACTCGGACACTGTCTGGGGTTCTCCTGGCGCGGCTGTTATTGAGGCAGAAGGGGCAGGCAAACACGCCTATAAATTCTATGAAGATGGTACAGTGGACTTCAGGAAAACTGGAAGTTCAAACCGAGTTAACATGTACACTACCACCTCAAGTTGGGATAACGTTTCAACCGCATCACACACCGACTTAACATCGGGTGGGGTTGACAGGGTGCTGGAATCAACCGCTTCAGATACTTTAGTACTAGGATTTTCTATTGGTGCTGTACGTCCTGACGCCTACCCGACGAAAGTTGGTTTTCACCAATACATCTGGCCCGAGAGTTCGTACAATTTCGGTTCGGCGGTAATATCAATCGGCTCTGATAGCTCTTGGTGTAGATACATGTTTGGTGTGGACGGTCAATTCTGGGGGTCTACAGAACAAGGAGAGTTTACATACACGCGCAACGGTGTATCCGACTCGCGTGTTAAGCATGATATTACGCCAACAACTGTCGACAAAGCATGGCAGAATCTGAAGGCTTTGCAGTTCGTAACGTTCATCTACAACAACGATGAACAAGAGCGTCAGCGCCGTGG
CCTCATCGCACAACAAGCTGAGACGGTAGATAGCCTGTATGTGAAAACCAGAATGTATCCGGGTAAAAATATCGGCGATCCTATGGTTGAGCAGAAGGAACTGGACACCACGCCAATGCTTCTGGATACCATGCACGTTGTGCAGAAACTCATCGCTGAAATTGATGCGTTGAAAGCTGAGATTGCCGATCTAAAGGCAAGTAAATAA
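
The CDS coordinates can be checked against the protein length with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the GenBank feature-table convention) and a CDS that ends in a single stop codon. Because the feature lies on the minus strand, the coding sequence is the reverse complement of the genomic interval; the sequence listed above starts with ATG and ends with TAA, so it appears to be given already in coding orientation. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of amino acids encoded, excluding the terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (needed for minus-strand features)."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(cds_length(28358, 30565))               # 2208
    print(expected_protein_length(28358, 30565))  # 735
```

The result, 735 amino acids, matches the protein length reported under the physico-chemical properties.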

Genome Context

Tertiary structure

PDB ID: 4865d0ba2d6d468604e2b89704c7190528cd7fa9445100dae288584d595ee1f0
Source: ESMFold
Method: ESMFold
Resolution: 0.5897
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
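
The confidence legend can be expressed as a small classifier for per-residue pLDDT scores. The legend uses strict inequalities, so the boundary values 90, 70 and 50 are not assigned to any band; the sketch below places them in the lower band, which is an assumption. `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```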

Literature

Title: Genomic and proteomic analysis of O18-011, a novel Escherichia coli phage
Authors: Shahin, K., Bao, H., Soleimani-Delfan, A. and Wang, R.
Date: 2020-05-06
PMID: (none given)
Source: GenBank