GenBank accession: UGO52669.1 [GenBank]
Protein name: tail spike protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.90
TSP RBPdetect2 0.95
Protein sequence
MAIYDAGTASLAADGTVTGVGTTWRQPLTLIRVGATMIFNTTPASIVTIAEIISDTEIRVFNDKGFTAPTGTQYSILAHDGITVQGLAQDVAETLRYYQSNETEVAAAVDAFNQFDADAFQQNVTNVNNQSQQVASDALQVSADKADVQSSLSQAEAARDAAGLSASNAASSAQSAESAAQSVSGALIGSFQSGVTIQSATQQVIDISSGIAVPYIWAGALPKTVPANSTPESTGGVSSSGWVPLSFADRNVDNLSILRGLVPLRDGERVYVSSHSGTSMLGGGWFYYDANDSTSTDDDGVVIVTTGNHRWKRDLSLIEGLSPMMFGALMNAPYIDGNTIQGTPYPKAPSMGAADLTGVSNDGVAMQKCYTASIKLNKKILIDRPIYIGTTRVNVSGERFSGQTIRIEGTATPRRCLIYTSGNGGFIVSPWGHNMFVKNIGFRNADADYNGSPLISGNDSGQGGGGKQYTIDNVEFYHYKYALSTLTFVSKISNVYMYDCTYGIGLSGNTSTALDSVWAHHCDVGFLWGYGINTTTLEPVAGGFPVMYVTATNIAADGCLTPHKIGGQLRSVNIVGAGVEGVNGDTVFDFSDYGGDDDQFGFDVKGFSCWIQSSMNTGVLRMIKLPANESRMPVGSIRFSDGYFKSDYALMIMENTISNPTAEGNSVHFGDDFRIINSQYTGSFSKSTLRSTKVGNVTYGESPVEARNSYNGTTLSAVHVTSGMDFNQVRTEDATLLLPYNRALDILLTTVGEESRYGSTFIAGELSLIPINKNGLGGQESGGIIQFSLSGSTKANVSSGIPWYNKIAKSTGSKSTSLDGVSITKYVTGGQTFLRILTPTASVSTFLCHLKLTYSGFAHFYDKRWQVRAI
Physico-chemical properties
protein length: 870 AA
molecular weight: 92570.11490 Da
isoelectric point: 5.16970
aromaticity: 0.09310
hydropathy: -0.10253
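
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues, hydropathy as the mean Kyte-Doolittle value, often called GRAVY), could look like this. The function names are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of residues that are F, W or Y (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy (assumed definition).

    Raises KeyError on non-standard residue codes such as X or B.
    """
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input; pass the full protein sequence from the record instead.
print(aromaticity("FWYA"))  # 0.75
```

If the database uses different definitions, the values for the full sequence will not match the ones listed above.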

Domains

Domains [InterPro]
ID Label Residues
DC_1966 ATT 1–425
IPR040775 RBD 183–246
G3DSA:2.10.10.80 ATT 186–244
UGO52669.1: residues 1–870
Architecture
ATT 1-425 | STR 426-870
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
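
The architecture string can be read mechanically. The sketch below assumes the `LABEL start-end | LABEL start-end` layout shown above and 1-based inclusive residue numbers. `parse_architecture` and `covers_protein` are hypothetical helper names, not part of any published tool.

```python
import re


def parse_architecture(text):
    """Parse 'ATT 1-425 | STR 426-870' into [(label, start, end), ...]."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between the coordinates.
        m = re.fullmatch(r"\s*([A-Z]+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if m is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return segments


def covers_protein(segments, length):
    """True if the segments tile residues 1..length without gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


segments = parse_architecture("ATT 1-425 | STR 426-870")
print(segments)                       # [('ATT', 1, 425), ('STR', 426, 870)]
print(covers_protein(segments, 870))  # True
```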

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (UGO52669.1, residues 1-870): N-terminal, Central, C-terminal
Domain Start End Length (AA) Confidence
N-terminal 1 362 362 0.9605
Central domain 363 699 338 0.9879
C-terminal 700 870 170 0.9563
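
A quick consistency check on the segmentation table, assuming the Start and End columns are 1-based inclusive residue numbers, so that a segment's length is end - start + 1. Under that assumption the central and C-terminal rows come out one residue off from the listed lengths, although both versions sum to 870. The record does not say which convention the Length column uses, so the sketch only reports the difference.

```python
# Rows copied from the segmentation table: (name, start, end, listed length).
domains = [
    ("N-terminal", 1, 362, 362),
    ("Central domain", 363, 699, 338),
    ("C-terminal", 700, 870, 170),
]


def inclusive_length(start, end):
    """Length of a segment given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


for name, start, end, listed in domains:
    computed = inclusive_length(start, end)
    note = "" if computed == listed else f" (listed: {listed})"
    print(f"{name}: {start}-{end} -> {computed} AA{note}")
# N-terminal: 1-362 -> 362 AA
# Central domain: 363-699 -> 337 AA (listed: 338)
# C-terminal: 700-870 -> 171 AA (listed: 170)
```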
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-362
Central: 363-699
C-terminal: 700-870

Taxonomy

Phage: Escherichia phage vB_EcoD_Opt-719 [NCBI]
Taxonomy ID: 2902671
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession: UGO52669.1 [NCBI]
GenBank nucleotide accession: OL539451.1 [NCBI]
CDS location: range 27490 -> 30102, strand -
CDS
ATGGCAATTTATGACGCGGGAACAGCATCGCTTGCAGCTGACGGAACTGTAACTGGTGTTGGTACTACCTGGAGGCAGCCGTTAACTCTAATTCGAGTTGGCGCAACAATGATTTTCAATACCACGCCAGCAAGTATTGTTACCATTGCTGAAATAATCAGTGATACCGAAATTCGAGTTTTTAACGACAAGGGATTTACCGCTCCGACTGGTACTCAGTACTCAATTCTTGCTCACGACGGGATTACTGTTCAGGGACTTGCTCAGGATGTTGCTGAAACCTTGCGTTACTATCAGTCAAATGAGACTGAAGTCGCAGCAGCTGTTGATGCTTTTAATCAGTTTGATGCTGATGCATTCCAGCAAAATGTCACAAACGTAAACAACCAGTCTCAGCAGGTTGCCAGTGATGCCTTGCAAGTATCGGCAGATAAGGCAGATGTTCAGTCCTCATTATCTCAAGCTGAGGCGGCGAGGGATGCAGCGGGTTTAAGTGCATCTAACGCAGCATCCTCTGCTCAAAGTGCTGAGTCAGCAGCTCAGTCAGTTTCTGGTGCTCTCATTGGCTCATTTCAGTCTGGCGTAACTATTCAATCAGCAACTCAACAGGTAATTGACATTTCATCTGGTATTGCTGTCCCTTACATTTGGGCGGGCGCGCTTCCTAAAACCGTCCCGGCTAACTCAACCCCTGAGTCAACTGGAGGGGTTTCATCTTCTGGATGGGTGCCTTTGTCATTTGCTGATCGTAATGTTGATAACTTATCTATTTTGAGGGGCTTGGTGCCACTTCGTGACGGTGAGCGGGTTTATGTATCTAGTCACTCAGGGACTAGCATGCTTGGCGGTGGCTGGTTCTATTACGATGCCAATGATTCAACCAGCACAGATGATGATGGTGTTGTTATTGTCACTACCGGAAACCACAGATGGAAGCGAGATCTTTCATTGATTGAAGGCTTGTCACCAATGATGTTTGGTGCTCTGATGAACGCGCCATATATTGATGGAAATACAATCCAAGGTACGCCATATCCAAAAGCACCAAGCATGGGCGCCGCTGATTTGACTGGCGTGTCTAACGATGGCGTTGCTATGCAGAAGTGCTACACAGCATCAATCAAATTAAACAAGAAGATTCTCATTGATAGGCCTATCTACATTGGCACAACAAGGGTCAACGTATCTGGTGAGAGGTTCTCAGGACAGACAATTAGGATTGAAGGTACAGCAACTCCTAGGAGGTGTTTGATTTACACATCTGGCAATGGTGGCTTCATTGTTTCGCCTTGGGGTCACAATATGTTTGTTAAAAACATTGGCTTCAGGAACGCAGATGCTGATTATAACGGCTCACCTCTAATTAGCGGAAATGACTCAGGTCAAGGTGGTGGCGGCAAGCAGTACACAATAGATAACGTTGAGTTTTATCATTACAAGTATGCATTATCAACACTGACATTTGTTTCTAAAATTTCAAATGTATACATGTACGATTGCACATACGGCATTGGACTTTCAGGTAATACAAGTACCGCCCTTGATTCTGTATGGGCACACCACTGTGATGTTGGCTTTCTTTGGGGTTACGGCATTAACACAACAACCCTGGAGCCTGTAGCTGGTGGTTTTCCTGTTATGTATGTCACTGCGACAAACATAGCTGCTGATGGTTGCTTAACTCCACACAAGATTGGTGGCCAGTTAAGGTCTGTTAATATCGTAGGTGCTGGGGTTGAGGGTGTTAATGGAGATACTGTATTTGATTTCTCCGACTATGGTGGTGATGACGATCAGTTTGGATTTGATGTAAAGGGGTTTTCTTGCTGGATTCAATCAAGCATGAACACTGGAGTCCTTAGGATGATAAAATTGCCAGCTAATGAAAGTAGAATGCCAGTTGGATCGATACGTTTTAGCGATGGTTATTTCAAGTCTGATTACGCCTTGATGATTATGGAAAACACCATTTCAAACCCAACTGCAGAAGGTAACTCTGT
TCATTTTGGTGATGACTTCAGGATTATTAACAGTCAATACACTGGTTCATTCTCAAAGAGCACGTTGAGGAGCACTAAAGTTGGCAACGTGACATATGGCGAGTCACCAGTTGAAGCAAGGAACTCATATAACGGAACTACGTTATCTGCTGTTCATGTTACTAGTGGAATGGACTTCAATCAGGTCAGAACTGAGGATGCTACTTTATTGCTGCCTTATAACAGGGCGCTTGATATCCTTCTTACTACAGTTGGTGAGGAGTCCAGGTATGGGTCAACATTTATTGCTGGAGAGTTGAGCCTTATACCAATTAATAAGAATGGATTAGGCGGTCAGGAATCAGGAGGTATTATTCAATTTAGCTTATCTGGAAGCACTAAGGCCAATGTCTCATCAGGAATACCTTGGTATAACAAGATAGCTAAGTCGACTGGCTCAAAAAGCACTTCTCTTGATGGGGTTTCTATTACTAAGTACGTGACTGGTGGTCAGACATTCCTGAGGATTTTAACACCAACTGCATCGGTAAGCACATTCCTTTGCCACCTTAAATTAACATATAGTGGATTTGCTCACTTTTATGATAAAAGGTGGCAGGTTAGAGCAATATAA
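
The CDS coordinates can be cross-checked against the protein length. This sketch assumes the range is 1-based and inclusive on the forward strand of the nucleotide record, and that the CDS ends with a single stop codon. Since the feature is on the minus strand, the coding sequence shown above would be the reverse complement of that genomic slice. Both function names are illustrative.

```python
def cds_summary(start, end):
    """Return (nucleotides, codons, residues) for an inclusive CDS range.

    Assumes one terminal stop codon, so residues = codons - 1.
    """
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = n_nt // 3
    return n_nt, codons, codons - 1


COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


print(cds_summary(27490, 30102))     # (2613, 871, 870)
print(reverse_complement("ATGGCA"))  # TGCCAT
```

The 870 residues obtained this way agree with the protein length reported in the record.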

Genome Context

Tertiary structure

PDB ID: d70d9091e844a5ea97a92a23f3627ca56e316d31c577c4d507b8be153b6c98cf
Source: ESMFold
Method: ESMFold
Resolution: 0.7118
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
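
The confidence bands above map directly onto a small classifier. The legend uses strict inequalities and so leaves the exact values 90, 70 and 50 unassigned. This sketch puts each boundary value in the lower band, which is an assumption rather than something the record states. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt):
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 90.0, 60.0, 42.5):
    print(score, plddt_band(score))
# 95.0 Very high
# 90.0 High
# 60.0 Low
# 42.5 Very low
```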