GenBank accession: AOT24091.1 [GenBank]
Protein name: hypothetical protein
RBP type: TSP
Evidence: DepoScope
Probability: 1.00
Protein sequence
MTLVHFDLRHPGASGRKPAAGRIAWVPTERKVDGSIVVLPTRVSITLDPLSSAQIEPGIYLFHEEVVGGISAYRVVPNSLEADYSSLVAIDPATLDPASQPEAAWYAFVETLNAANADMLASALASQHSAELAQLSATGSQTAASASALAASNSAGAAASSATQAGTARDGAVSAQGSAAGYASAASGSAVAASDSASSAAASASSASTSAGTAITKAAEATAAVLGFSVGTVSTVAPSEAASATITGPAGSRVLNLSIPRGAVPVFSVAETTTGPETPGATGLQGQKGDKGDPGGWTAATDLGTMDLNTVLTAGLYRVTQGANVSTTLNYPITLNATAVLHVMMVSATNVIQQFEFVLSTPAARGFWQRSTSTGGTTWTPWRFVATQRVDSTAGRAIYTWDDTAVPGREQLVYGDTGSREISAYLNTTNWAAGNIKIRRVGWEVELRAYGLDNVAGVVGSLGILNQQLPAGFRNQHTVAGFAQIGANPGQVVVAFSATQATIVGVVDNVICGFTAKWQTTDPWPTSLPGNADGAIPNL
Physico-chemical properties
protein length: 539 AA
molecular weight: 54700.16940 Da
isoelectric point: 5.11320
aromaticity: 0.06494
hydropathy: 0.10557
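The page does not state how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), the two statistics could be recomputed from the protein sequence as follows. The function names are illustrative, and the demo peptide is only the first ten residues of the sequence above.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue; assumed definition."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence, used only as a short demo.
peptide = "MTLVHFDLRH"
print(len(peptide), aromaticity(peptide), round(gravy(peptide), 4))
```

Passing the full 539-residue sequence instead of the demo peptide would give values to compare against the ones listed.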

Domains

Domains [InterPro]
DC_0554 STR 42–539
cd19958 STR 299–355
Architecture (AOT24091.1, residues 1-539)
STR 42-539
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (AOT24091.1, residues 1-539): N-terminal, Central, C-terminal
Domain Start End Length (AA) Confidence
N-terminal 1 187 187 0.9670
Central domain 188 386 200 0.0595
C-terminal 387 539 152 0.9993
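With 1-based inclusive coordinates, a segment's length is end - start + 1. The sketch below, using illustrative names, recomputes each length from the listed boundaries. Under that assumption the central and C-terminal rows come out as 199 and 153 rather than the listed 200 and 152, which suggests the boundary between them is reported off by one somewhere. The page does not say which coordinate convention it uses.

```python
# Rows as listed in the table: (name, start, end, listed_length).
# Coordinates are assumed to be 1-based and inclusive.
segments = [
    ("N-terminal", 1, 187, 187),
    ("Central domain", 188, 386, 200),
    ("C-terminal", 387, 539, 152),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def check_segments(rows):
    """Return (name, listed, computed) for rows whose lengths disagree."""
    return [
        (name, listed, span_length(start, end))
        for name, start, end, listed in rows
        if listed != span_length(start, end)
    ]


mismatches = check_segments(segments)
for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, computed {computed}")
```

Both the listed and the recomputed lengths sum to 539, so the total matches the protein length either way.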
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-187
Central 188-386
C-terminal 387-539

Taxonomy

  Name Taxonomy ID Lineage
Phage Arthrobacter phage Vallejo [NCBI] 1897554 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

GenBank protein accession: AOT24091.1 [NCBI]
GenBank nucleotide accession: KX621005 [NCBI]
CDS location: range 77 -> 1696, strand +
CDS
GTGACTCTCGTCCACTTCGATTTGCGTCACCCCGGAGCCTCCGGACGCAAGCCAGCAGCGGGGCGTATTGCTTGGGTGCCCACCGAGCGCAAGGTCGACGGGTCAATCGTGGTCCTCCCGACCCGCGTTTCGATCACCCTGGACCCTCTCTCCTCCGCTCAGATTGAACCCGGAATCTACCTCTTCCACGAGGAAGTAGTCGGCGGCATCTCGGCTTACCGCGTCGTCCCGAACTCGCTGGAGGCTGACTACTCGTCTTTGGTGGCGATTGACCCGGCTACTTTGGACCCCGCGTCCCAGCCCGAGGCCGCGTGGTACGCCTTCGTTGAGACCCTGAATGCCGCCAACGCTGACATGCTGGCCTCTGCGCTCGCCTCTCAGCACTCCGCCGAACTCGCTCAGCTCTCCGCGACCGGCTCTCAGACCGCTGCCAGCGCATCCGCTCTCGCAGCGTCCAATTCGGCTGGTGCTGCTGCATCCTCCGCGACCCAAGCGGGAACGGCCCGTGATGGGGCTGTCTCGGCTCAGGGGTCGGCGGCTGGGTACGCGTCGGCGGCTTCGGGGTCGGCTGTTGCGGCTTCTGACTCCGCCTCCAGCGCAGCGGCCAGCGCTTCGAGCGCTTCCACCAGCGCCGGGACGGCCATCACGAAGGCCGCAGAGGCTACGGCAGCGGTTCTTGGCTTCTCGGTCGGCACGGTGTCCACGGTGGCCCCCTCGGAGGCCGCGTCTGCCACGATCACAGGTCCTGCCGGGTCTCGTGTGCTTAATCTGTCGATCCCCCGGGGTGCGGTGCCGGTATTCTCGGTCGCCGAAACCACTACCGGACCCGAGACCCCGGGTGCAACTGGCCTGCAGGGCCAAAAGGGCGACAAGGGTGACCCGGGCGGCTGGACTGCAGCTACTGATCTCGGTACGATGGACCTCAACACGGTCCTGACAGCGGGTCTGTACCGAGTAACCCAGGGTGCCAACGTCTCCACGACGTTGAACTACCCAATCACCCTGAATGCAACTGCTGTATTGCACGTGATGATGGTATCCGCCACCAACGTCATTCAGCAGTTCGAGTTCGTCCTGTCCACCCCAGCAGCTCGTGGGTTCTGGCAGCGCTCAACCTCTACCGGTGGCACCACTTGGACGCCTTGGAGGTTCGTCGCCACCCAGCGCGTCGACAGTACTGCTGGTCGGGCCATTTACACGTGGGACGACACGGCAGTTCCCGGTCGCGAACAGCTCGTATACGGCGACACTGGAAGCCGCGAAATCAGCGCGTACCTGAATACTACCAACTGGGCCGCAGGGAATATCAAAATTCGGCGGGTAGGCTGGGAGGTCGAACTCCGGGCTTACGGTCTGGACAACGTGGCTGGAGTTGTGGGAAGCTTGGGTATCCTCAACCAGCAGTTGCCTGCTGGGTTCCGAAACCAACACACGGTGGCTGGATTCGCTCAGATCGGGGCTAACCCTGGCCAAGTCGTTGTCGCGTTCAGCGCCACCCAAGCTACCATTGTCGGGGTTGTGGACAACGTGATTTGCGGGTTCACCGCCAAATGGCAGACCACCGACCCATGGCCCACCAGCCTCCCCGGCAACGCCGATGGGGCAATCCCGAACCTGTAA
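The CDS coordinates can be cross-checked against the protein length. The CDS shown begins with GTG, an alternative start codon read as methionine under the bacterial translation table, and ends with the stop codon TAA. The sketch below, with hypothetical helper names, assumes inclusive 1-based coordinates and a CDS that includes its stop codon.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of an inclusive 1-based range."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the range.

    Assumes the range is a whole number of codons; the terminal stop
    codon, if present, does not contribute a residue.
    """
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


print(cds_length(77, 1696))               # 1620
print(expected_protein_length(77, 1696))  # 539
```

The result, 539 residues, agrees with the protein length listed for AOT24091.1.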

Genome Context

Tertiary structure

PDB ID: 249693531e29ed3698ce1bca23c710c858ae37594f64cdf9d34bef0d5d5b8339
Source: ESMFold
Method: ESMFold
Resolution: 0.7769
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
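The confidence bands above can be expressed as a small classifier. This is a sketch with an illustrative function name. The legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band here is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's band name.

    Boundary values 90, 70 and 50 fall into the lower band; the legend
    does not specify this, so it is an assumption of this sketch.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```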