Genbank accession
UHS65533.1 [GenBank]
Protein name
tail spike protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MTRNVESIFGAVVTAPHQIPYTYTATGGEKFISLPFYPVTGIVTINGGMQVPLDNFEIDGNTLNLGRALSKGDVVFCLFDKILSPEDYKTGIRIYKFQAIGGETEFTPDYTSYGVQSLYIGGEYKTPEIEYSYDSTTGKVSLQTALTAGVWVVAEMSVKQPNISPLFDRSIQEIARATNVKDSEVILSTDTTQSLNGKKIIYSVSEQKAYGLPTLPTNVYISSVNGDQLTYNPGGIVVDLLPAPNDVTPVENELQVYKTAMVTDGGTLVNSGLSGVSSAIKRTIDSHFKDYYNVKDFGAVGDGATDDTVAIKAAIAYASTTKGVVYFPDGNYKISSTLALPSNVSVIGRSRESVTITKTTSTTVTVLVTAPALSGGYTGTLPTDMNAVICLGDSSSSRWSGVITGVTLHGTKTTTENHAVEFGIVNAGVVSDATIEDVYIYDCKYGLILPVVFASRVGNNRVTACLGGIFINNGTSCVISTNYSNSCRDYGHCYRDLKYSVIEANACDHTNRNDYYPDRTRVCYGYILQNLLGVTVTGNGQEGTLGINWRLDNFDHSSFTNNTSIRLGSDYTGPSNISWLELNGVARNSIIENNVSYEYNSNGMLFGGAVAGQHHNIYISDITFFNAILRNNIVRSTRNGDPVEAGWLNNVTRTVANSSAKIRPDRTFLANDPGSPEITVQSGTNAVISYGEYTVHNINYVGDFAHIFGVFDVTIEWSSGTSQFISVTGFPVARDYAYIAVTGVNNGDIGFTSGNIPASFRMNIGNTGGAFFTEGEKTLRIDAVVSGKRLYISYDGWYRVSP
Physico‐chemical
properties
protein length:802 AA
molecular weight: 86629,67520 Da
isoelectric point:5,14071
aromaticity:0,10224
hydropathy:-0,09763

Domains

Domains [InterPro]
DC_0063
STR
1–793
G3DSA:3.30.2020.50
ATT
155–245
IPR011050
STR
290–635
IPR024535
ENZ
292–487
UHS65533.1
1 802
Architecture
STR
ATT
STR
STR 1-154 | ATT 155-245 | STR 246-793 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
UHS65533.1
1 802
Domain Start End Length (AA) Confidence
N-terminal 1 303 303 0,9865
Central domain 304 667 365 0,9910
C-terminal 668 802 134 0,9839
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-303
Central
304-667
C-terminal
668-802

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoM-RPN242
[NCBI]
2900331 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UHS65533.1 [NCBI]
Genbank nucleotide accession
OL656110.1 [NCBI]
CDS location
range 120540 -> 122948
strand -
CDS
ATGACCAGAAACGTCGAGAGCATCTTTGGAGCGGTCGTAACAGCTCCGCACCAGATTCCATACACTTACACAGCAACGGGCGGAGAGAAGTTTATCTCCCTCCCTTTCTACCCTGTCACTGGTATAGTCACAATCAACGGTGGTATGCAAGTTCCGTTAGACAACTTTGAAATCGATGGAAATACGTTAAATCTCGGACGCGCATTATCCAAAGGTGACGTTGTGTTCTGTCTGTTTGATAAGATTCTGTCTCCAGAAGATTATAAAACTGGGATTCGAATCTATAAGTTTCAGGCTATAGGAGGTGAAACCGAGTTCACTCCTGATTACACATCTTATGGAGTCCAATCTCTTTATATCGGTGGTGAGTACAAAACACCCGAAATTGAATATTCCTATGACAGCACGACAGGGAAAGTGTCTTTACAAACTGCGCTGACAGCAGGCGTTTGGGTAGTCGCTGAAATGTCTGTTAAACAACCGAATATCAGTCCGTTGTTTGACCGAAGTATTCAAGAAATCGCTCGTGCTACTAACGTTAAAGATAGTGAAGTAATTCTTAGCACCGATACTACTCAGTCCCTCAATGGTAAGAAGATTATTTATTCTGTTAGTGAGCAGAAGGCATATGGGTTGCCTACATTGCCTACTAACGTATATATTTCTTCCGTTAATGGTGACCAACTAACATACAATCCTGGTGGGATTGTTGTTGACCTGCTCCCTGCACCCAATGATGTAACTCCAGTGGAGAATGAGTTGCAGGTATATAAAACGGCTATGGTTACTGATGGGGGAACATTAGTAAATTCTGGCTTATCTGGTGTTTCTTCTGCAATTAAGCGAACCATTGATAGCCATTTTAAAGACTACTATAACGTTAAAGATTTTGGTGCTGTTGGTGACGGAGCTACTGATGATACGGTAGCTATAAAAGCAGCTATAGCTTATGCCAGCACCACAAAAGGCGTGGTTTATTTCCCGGATGGTAACTATAAAATAAGCAGTACTCTTGCTCTACCATCGAATGTATCAGTAATAGGGAGAAGTCGCGAATCAGTTACTATAACGAAAACGACATCCACTACCGTTACAGTTCTGGTGACGGCGCCTGCACTATCTGGAGGATATACCGGAACCCTGCCTACAGATATGAATGCGGTTATTTGTTTAGGGGATTCTAGTAGTTCAAGATGGTCCGGTGTCATCACAGGGGTTACTCTGCATGGGACAAAAACTACAACGGAAAACCACGCGGTAGAATTTGGTATCGTAAATGCAGGCGTGGTATCCGACGCCACGATAGAAGACGTCTATATTTACGATTGCAAGTATGGGCTAATTCTCCCTGTTGTATTTGCGTCGCGAGTAGGAAACAACCGCGTTACCGCATGTCTCGGTGGGATATTTATAAACAACGGCACTTCATGCGTAATATCTACTAACTATAGTAATAGTTGCCGTGATTATGGGCATTGCTATCGGGATCTTAAATACTCTGTTATAGAGGCTAATGCCTGTGATCACACTAACAGGAACGACTATTATCCGGACAGGACAAGAGTGTGTTATGGCTATATATTACAGAACCTGCTAGGGGTAACGGTAACTGGGAATGGTCAAGAAGGCACTCTGGGTATTAACTGGAGATTGGATAATTTCGACCATAGCTCATTCACCAATAACACCAGTATTAGACTTGGTTCTGATTACACAGGCCCGTCTAACATATCATGGCTAGAATTAAATGGAGTGGCGCGCAACAGCATCATTGAGAACAACGTATCTTATGAGTACAACTCTAATGGGATGTTGTTTGGCGGGGCCGTAGCCGGGCAGCACCATAATATTTACATCTCCGATATTACGTTTTTTAACGCAATATTGCGTAACAACATTGTAAGGTCTACGCGAAATGGGGATCCGGTGGAGGCCGGTTGGCTCAACAACGTTACCAGAACGGTGGCGAATTCATCCGCCAAAATCAGACCAGACCGTACTTTCTTAGCGAATGACCCCGGTTCGCCTGAAATTACAGTTCAATCAGGCACCAATGCGGTGATATCATACGGCGAGTATACCGTACATAACATAAATTATGTCGGAGATTTTGCTCATATATTCGGTGTGTTTGATGTGACAATTGAATGGAGTTCCGGCACCAGTCAGTTCATATCTGTGACAGGATTTCCTGTCGCGCGGGATTACGCATACATAGCAGTAACAGGGGTGAACAACGGGGACATCGGTTTTACCTCTGGAAATATTCCAGCCAGCTTTCGAATGAACATCGGGAACACTGGAGGAGCGTTCTTTACTGAAGGTGAAAAAACGCTTCGCATTGATGCAGTTGTGTCCGGTAAACGTTTGTATATTAGCTACGACGGGTGGTATCGGGTTAGCCCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
70f0027dec9723b0fdccc5ad46f6ff3ae3c588ce6020e0e2ebe890e75f85fc9a
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7274
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Characterization and complete genome analysis of a novel Escherichia phage, vB_EcoM-RPN242 Imklin,N., Sriprasong,P., Thanantong,N., Lekcharoensuk,P. and Nasanit,R. GenBank