GenBank accession
AOT23598.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1.00
Protein sequence
MTTPGASAPNGAYENGSDYGFDLTEETARQMVTQPFHNAYGTLPTQFRNGMGSFIQRALDGNPENDPTLGEIADWFTGWNKVHTANAAKIVQIENRLAEGSTFMDDFARPNNRTVLGNGWQQGGKGQGLGIIDNAARVDNTEGLIGVDSGRRWAVCPVLAAENNVAVTAVVNPKGVATGTMTSLFVRANAGLTSFVYANVYGKSCYLGYGTRSGDSWTFTDWKSNTNYRLSEGASVELSAVENVYSLTIDGAVVLQYEDTVGVIPVDATRRTVGFASETKIVNLLPSYSWGVAAFSHRPVLFVDVAGNKEAIDQANEVASNAEAAAAAAMSAVVGLQNENTGASVDGVVIRDGFDVPGSNLGASWAQTGSGNFGVGSDPGRAMIVAGTPPSFNTTLTYVARYITPLSTDNFNVSAVLANSGADYAEARSYVIGRCNADLSSFVYVRWSRSGGVEMGRGSASESNVNLTKWTSDSRTVEGGAHVELRCVGNEYRAYINGSALLYYKDTANTVPVGPSNRYAGIGVTRYADYFAQWRDSAAIDSFAASDTLAKGSIQGTGWSLVRTNTNSTAVGNGAAPVPANTWDTERVRNGVTVEDLGAGIVKINREGWYAISLRLGHTQTAYQSGEYGSSRFDRQAVLYVAEPDSSSFSIARKGASGADWDDGTQVSAMLYLKAGSRVRAGTNHNGYGPNIVGDAWNRCYFDGALTSSPQGLRGEQGEKGDKGDRGPVGEGLRFDYFVDLVSQLPAAGPEGAHALVKENGRAYLFSEGAWHQGAVLIGPKGDPGSKGDPGSKGDPGSKGDPGSKGDPGSKGDPGSKGDPGVKGDKGDTGTAWTGTQAAYDALPSATRYAQGFVAVIV
Physico-chemical properties
protein length: 858 AA
molecular weight: 90125.02290 Da
isoelectric point: 5.13628
aromaticity: 0.09207
hydropathy: -0.32611
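The page does not say how these descriptors were computed. As a rough sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value (GRAVY), the length, aromaticity and hydropathy of a sequence could be computed as below. The function names are hypothetical, and whether this reproduces the listed values exactly has not been checked here.

```python
# Kyte-Doolittle hydropathy values per residue. Using this scale is an
# assumption: the page does not name the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(residue in AROMATIC for residue in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[residue] for residue in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Return length, aromaticity and hydropathy for a protein sequence."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return {
        "length": len(seq),
        "aromaticity": aromaticity(seq),
        "hydropathy": gravy(seq),
    }


if __name__ == "__main__":
    # First 21 residues of AOT23598.1, used only to keep the example short.
    print(summarize("MTTPGASAPNGAYENGSDYGF"))
```

Passing the full 858-residue sequence to `summarize` would give values directly comparable with the list above.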

Domains [InterPro]
DC_1994 ATT 65–295
IPR055681 STR 327–547
G3DSA:2.60.120.560 RBD 359–535
Sequence AOT23598.1, residues 1–858
Architecture: ATT 65–295 | STR 296–855
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (AOT23598.1, residues 1–858)
Domain Start End Length (AA) Confidence
N-terminal 1 354 354 0.4437
Central domain 355 574 221 0.7188
C-terminal 575 858 283 0.4320
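The Length column can be cross-checked against the Start and End columns. The sketch below assumes 1-based, inclusive coordinates, so length = end - start + 1. The helper names are hypothetical. Under that assumption the central and C-terminal rows each differ from the listed length by one residue, although the listed lengths still sum to 858. The page does not say which boundary convention is intended.

```python
# (name, start, end, listed_length) copied from the segmentation table.
SEGMENTS = [
    ("N-terminal", 1, 354, 354),
    ("Central domain", 355, 574, 221),
    ("C-terminal", 575, 858, 283),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    if start < 1 or end < start:
        raise ValueError(f"invalid range {start}-{end}")
    return end - start + 1


def length_mismatches(segments):
    """Return (name, listed, computed) for rows whose lengths disagree."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in segments
        if listed != inclusive_length(start, end)
    ]


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end, _ in segments:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, listed, computed in length_mismatches(SEGMENTS):
        print(f"{name}: listed {listed} AA, computed {computed} AA")
    print("covers 1-858 without gaps:", tiles_protein(SEGMENTS, 858))
```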
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1–354
Central 355–574
C-terminal 575–858

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhodococcus phage Harlequin [NCBI] 1897551 No lineage information
Host No host information

Coding sequence (CDS)

GenBank protein accession
AOT23598.1 [NCBI]
GenBank nucleotide accession
KX611788 [NCBI]
CDS location
range 19044 -> 21620
strand +
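The CDS range can be checked against the protein length. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the range includes the stop codon (the listed sequence ends in TGA). The function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Three nucleotides per residue, plus one codon for the stop if included."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


def is_consistent(start: int, end: int, protein_length: int) -> bool:
    """True if the CDS range encodes exactly protein_length residues plus a stop."""
    return cds_length(start, end) == expected_cds_length(protein_length)


if __name__ == "__main__":
    print("CDS length (nt):", cds_length(19044, 21620))
    print("expected for 858 AA + stop:", expected_cds_length(858))
    print("consistent:", is_consistent(19044, 21620, 858))
```

The listed sequence starts with GTG rather than ATG. GTG is a known alternative start codon in bacterial and phage genes and is read as methionine at the initiation position, which fits the leading M of the protein sequence.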
CDS
GTGACTACCCCAGGCGCATCGGCTCCGAACGGAGCATATGAAAATGGCAGTGACTATGGTTTCGACCTCACGGAAGAGACCGCCAGGCAGATGGTCACTCAGCCTTTCCACAACGCATACGGCACTCTGCCAACACAATTCAGGAACGGGATGGGCTCGTTCATTCAGCGGGCACTAGACGGTAACCCTGAGAATGACCCGACCCTGGGAGAGATCGCAGACTGGTTCACCGGCTGGAACAAGGTTCACACCGCGAACGCGGCGAAGATCGTACAGATTGAGAACCGACTGGCAGAGGGTTCCACGTTCATGGACGACTTTGCTCGTCCGAACAACCGCACAGTGCTCGGCAACGGCTGGCAGCAAGGCGGTAAGGGCCAGGGGCTCGGCATCATCGACAACGCAGCTCGGGTGGATAACACCGAGGGTCTGATCGGTGTGGACTCTGGCCGGCGTTGGGCTGTCTGCCCTGTGCTCGCAGCAGAGAACAACGTAGCTGTAACCGCCGTCGTAAACCCCAAAGGTGTTGCGACAGGCACCATGACCTCACTGTTTGTCCGGGCCAACGCGGGACTGACTTCGTTCGTCTATGCCAACGTCTACGGCAAGTCCTGTTACCTGGGCTACGGCACTCGTTCAGGTGACTCGTGGACGTTCACAGACTGGAAGTCCAATACGAACTACCGACTGTCGGAGGGTGCCAGCGTTGAGCTGTCCGCAGTTGAGAACGTCTACAGCTTGACAATAGACGGTGCTGTCGTCCTACAATACGAAGACACCGTTGGAGTGATCCCGGTGGACGCCACTCGGCGTACGGTCGGCTTCGCATCTGAGACGAAGATCGTGAATCTGCTCCCCTCTTACTCGTGGGGTGTAGCAGCCTTCTCACACCGCCCCGTCCTGTTCGTGGACGTTGCAGGTAACAAGGAAGCGATTGACCAAGCTAACGAGGTAGCGAGCAACGCGGAAGCGGCTGCCGCAGCGGCAATGTCGGCTGTAGTCGGCTTGCAGAACGAGAACACCGGAGCGTCGGTTGACGGTGTCGTGATCCGTGACGGATTCGACGTGCCGGGTAGCAACCTGGGCGCTTCGTGGGCACAGACCGGTTCCGGCAACTTCGGTGTCGGATCTGATCCGGGACGAGCGATGATCGTTGCGGGAACACCGCCGTCGTTCAACACGACCCTGACTTACGTGGCTCGGTACATCACCCCGTTGTCCACGGACAACTTCAACGTCTCTGCCGTCCTAGCGAACAGTGGCGCTGACTACGCAGAGGCTCGTTCGTACGTCATCGGTCGGTGCAACGCGGATCTCTCGTCGTTCGTGTACGTACGTTGGTCCCGTTCCGGTGGTGTGGAGATGGGACGCGGTTCGGCTTCCGAGTCGAACGTGAACCTGACCAAGTGGACCTCTGATTCTCGCACTGTCGAAGGTGGAGCTCACGTCGAATTGCGTTGTGTTGGTAACGAGTACCGGGCCTACATCAACGGCAGTGCTCTGCTCTACTACAAGGACACTGCCAACACCGTCCCTGTGGGACCGTCCAACCGGTACGCCGGCATCGGCGTCACCCGGTACGCGGACTACTTTGCACAGTGGCGTGACTCGGCGGCTATCGACAGTTTCGCAGCATCCGACACTCTGGCTAAGGGTTCGATCCAGGGCACAGGCTGGTCGTTGGTCCGCACGAACACAAACAGCACAGCAGTTGGTAACGGAGCTGCCCCGGTTCCAGCCAACACCTGGGATACCGAGCGGGTCCGCAACGGTGTGACGGTCGAGGATCTCGGCGCGGGCATCGTGAAGATCAACAGGGAGGGTTGGTACGCAATCTCCTTGCGCCTGGGGCACACCCAAACGGCCTACCAGTCAGGTGAATACGGCTCCAGCCGATTCGATCGACAAGCGGTTCTCTACGTAGCAGAGCCCGACTCAAGTTCGTTCAGCATCGCCCGCAAGGGCGCATCTGGCGCTGACTGGGACGACGGTACACAGGT
CTCCGCGATGCTCTACCTCAAGGCCGGATCTCGAGTCCGTGCCGGCACGAACCACAACGGCTACGGCCCCAATATCGTTGGTGACGCTTGGAACCGTTGCTACTTCGACGGTGCTCTCACGTCCAGCCCCCAGGGCTTGCGTGGTGAACAGGGCGAGAAGGGAGACAAGGGTGACCGGGGTCCAGTAGGTGAAGGGCTTCGGTTCGACTACTTCGTGGATCTGGTCTCGCAGCTCCCGGCTGCCGGCCCCGAAGGTGCTCACGCCCTGGTGAAGGAAAACGGACGGGCTTACCTGTTCAGCGAGGGTGCTTGGCACCAAGGCGCAGTGCTCATCGGCCCCAAGGGCGATCCTGGCAGTAAGGGTGACCCAGGCTCGAAGGGTGATCCCGGAAGCAAGGGCGATCCTGGCAGTAAGGGTGACCCAGGCTCGAAGGGTGATCCCGGAAGCAAGGGTGATCCTGGTGTGAAGGGTGACAAGGGAGACACCGGTACAGCTTGGACCGGTACTCAGGCTGCCTACGATGCTCTGCCGTCCGCTACGCGGTACGCACAGGGCTTTGTGGCGGTGATCGTATGA

Genome Context

Tertiary structure

PDB ID
5f9d7114ca27babcc4e2be99bc7aaee30ae495e4a64b2f0fc5a2dc5e8e7a0ea5
Source ESMFold
Method ESMFold
Resolution 0.7488
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
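The confidence bands above can be expressed as a small lookup. This is a sketch, assuming pLDDT is given on a 0 to 100 scale. The legend uses strict inequalities, so scores of exactly 90, 70 or 50 are not assigned to any band. Placing them in the lower band, as done here, is an assumption. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Scores of 50 or below; exactly 50 is placed here by assumption.
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 82.5, 61.0, 35.0):
        print(value, "->", plddt_band(value))
```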