Genbank accession
ABU97052.1 [GenBank]
Protein name
WD40-like beta-propeller protein
RBP type
TSP
Evidence RBPdetect
Probability 0,55
Protein sequence
MLNADEYPLWHFVRKPVPQRQALAISLGPEEYGQNPPFTKTYVLVKSGNRVELYRSTGSLDESQVSDYPIPTNAVRNSYQGRFALAITPTQALVAVQTGNPGGARGEPSTIEVYAGSVQVYATQGYDPQLVYSALLVPPSLLNTYPECVVKLGAVILFYMHPQESKLIAEYLSPPYTAASSRVEFPLSEPLQLVAAVPVEGKAQLWFVNGKGEWVAAQFSAGMLLPRLSGEVPGPEVWNDPQDGSYAKVFGQTWTWRVDKANNRFVFTKSGSQTAYFVPADHSLQDVCFAAAAFDQAGYPAVAYQIGDQTYVKYWNILERRYVNTGPLPLRFPLMLQEATVLGWRFVPEADVVLLGQGSSGNLVSRRQKDAYGVEYVLATESGLIPESVDLSQALRYSVQAVDRGTVYRTSLFPYDSYAYRKFKQATPEYPIEHNLFAWLTKAGISTRELTTQYNPPRIEIVAQLNGSGLSVRDLVTVYTLSFGITAQLITAEVNTRTVTTVYTPPTLNMSANLTSGDVSTKEVVKPYNPPTISFQAVLTNASVSVFPDGPSTDPISPNN
Physico‐chemical
properties
protein length:560 AA
molecular weight: 61586,85400 Da
isoelectric point:5,49510
aromaticity:0,11250
hydropathy:-0,14893

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
ABU97052.1
1 560
Domain Start End Length (AA) Confidence
N-terminal 1 559 559 0,9862
Central domain 560 559 1 0,3973
C-terminal 560 560 0 0,0000
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-559
Central
560-559
C-terminal
560-560

Taxonomy

  Name Taxonomy ID Lineage
Phage Thermus phage P74-26
[NCBI]
2914007 Uroviricota > Caudoviricetes > Oshimavirus >
Host Thermus thermophilus
[NCBI]
274 cellular organisms > Bacteria > Thermotogati > Deinococcota > Deinococci > Thermales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ABU97052.1 [NCBI]
Genbank nucleotide accession
EU100884 [NCBI]
CDS location
range 75501 -> 77183
strand +
CDS
GTGCTAAACGCGGACGAGTACCCGTTGTGGCACTTCGTCAGGAAACCGGTACCCCAGCGCCAAGCTTTGGCGATAAGTTTGGGACCGGAAGAATACGGCCAAAACCCGCCCTTCACCAAGACCTACGTCCTGGTAAAGAGCGGAAACCGGGTAGAACTATACCGAAGTACCGGAAGCCTAGACGAAAGCCAAGTATCGGATTACCCTATACCGACCAACGCGGTTAGAAATAGCTACCAAGGGCGCTTCGCCTTGGCTATAACCCCCACGCAAGCCCTGGTAGCGGTACAGACCGGAAACCCTGGTGGCGCTAGAGGCGAACCTTCCACCATAGAGGTCTACGCCGGAAGCGTCCAGGTCTACGCCACCCAGGGCTACGACCCTCAGTTGGTCTATAGCGCTCTCCTGGTACCGCCAAGCCTTCTCAATACCTACCCGGAATGCGTGGTGAAGCTCGGCGCGGTAATCCTCTTCTACATGCACCCGCAAGAAAGCAAGCTAATAGCGGAATACCTAAGCCCGCCATACACCGCGGCTTCGTCCCGCGTTGAGTTTCCCCTAAGTGAACCCCTTCAGTTGGTAGCCGCCGTTCCGGTGGAGGGCAAGGCCCAGCTATGGTTCGTAAACGGCAAAGGGGAATGGGTGGCGGCTCAGTTCTCCGCGGGCATGCTCCTGCCCAGGCTATCCGGAGAAGTGCCGGGGCCGGAGGTGTGGAACGACCCGCAAGACGGAAGCTACGCCAAGGTCTTCGGCCAGACCTGGACCTGGAGGGTGGACAAGGCCAACAACCGCTTCGTGTTCACCAAGTCGGGAAGCCAGACGGCGTACTTCGTGCCCGCGGACCATAGCCTGCAGGACGTGTGCTTCGCCGCGGCGGCCTTCGACCAGGCGGGCTACCCGGCGGTGGCCTATCAGATAGGCGACCAGACGTATGTCAAGTACTGGAACATTCTAGAACGTAGGTACGTCAACACCGGACCCTTGCCGTTGCGCTTCCCGCTCATGCTCCAGGAGGCCACGGTGCTAGGGTGGCGCTTCGTGCCCGAAGCCGACGTGGTGCTCTTGGGACAGGGAAGCTCAGGCAATCTGGTCTCGCGAAGGCAGAAAGACGCTTACGGGGTGGAATACGTGCTAGCGACGGAAAGCGGGCTCATCCCGGAGAGCGTGGACCTGAGCCAAGCCTTGCGCTATAGCGTACAGGCGGTGGACCGGGGTACGGTATACCGCACATCGCTTTTCCCCTATGATAGCTACGCGTACCGCAAGTTCAAACAGGCAACGCCGGAATACCCGATAGAGCACAACCTGTTTGCTTGGCTTACTAAAGCAGGCATTAGCACTAGAGAGCTAACGACTCAATATAACCCGCCACGCATAGAGATAGTGGCTCAACTTAATGGAAGTGGTCTTAGCGTACGCGACTTAGTTACGGTTTATACCTTAAGTTTCGGAATAACCGCCCAGCTCATTACAGCCGAAGTTAACACTCGAACTGTAACCACCGTTTATACGCCGCCAACCCTAAACATGTCGGCTAACCTCACGAGCGGCGATGTGAGTACCAAGGAAGTTGTTAAACCTTATAACCCACCTACCATAAGCTTCCAAGCCGTACTAACTAATGCAAGCGTGTCTGTGTTTCCAGATGGTCCTAGTACAGACCCGATTAGCCCTAATAACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2e12c4b294e5f2d7c47cb9e7ad775108267cc0fcd633a7857690b17e073ca442
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5356
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50