GenBank accession
AYP68600.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence RBPdetect
Probability 0.88
Protein sequence
MPTPKVKVNGKWYSLGMTANDKIEIINSANQYANQLNQEIQEEIDDILRSLEDLDGLIDDAFKDGIISETEARIIKSEIRNLATKKAELDREYTDLMGNVHTDANSQSLLSIAKSNYDAKYNLLVNTITDVIADSKADESEVALVDAAFIEYRNAIQLLVSAMRGAIDQIGESHGDKARKDAIDYTDGQLFPIVQQLTSASTEIQRNAFEISLKADAEYVTTEFARVEGAVLNEANQYADSIRTDVTTAIDSLEESLQNTEFYIENSFKDGIISSLEASGIGNYINTLNSAKSGFNANYDSLIVNPYLKDTPESALRSAKLNLDAKHTALISAIQTAIEDSKATAEEVTDVNTKFSEYAQAVNLYEASANKAIDAIAREKAQESYDNAVQYTDGELVPIRNTVSSHSSELSVQAQAILLRVTSEKHAYDISQALTSANQYTDAAKDILEQDILDLTGALEQTEDYIETSFKDGILYDAEYNRLRGFLQSLDESKQLFDARYTTLYQNANLEGQPKTDLASAKTLYDSKYTSLVDTVNTAIADQLATPAEAEAVRTSFSEYKGAVALLGAVMEQAIDAISKKVADKAKEEAKAYTDQIKTQINSDLTSLSNDITATNNYINEAFRDGVISNTESNRISSYINLLSQAKDTFDTRVYELIGDTDLQQIYRLSLQNAKGNYDTAYSNLITAINDALADEVASPIEVSNVDTKYRLYNETVKDLNIAIDRALKSIAQSKATKAYQDAMGYTDDIIAPIAEMVTLNNASIEVLEDSIVKKVSQETFSQVTDNLDTKIDKIEIGGRNLLRNSKKTEGIVTNDDFGGHPVLIAMNTEDGLPYARITIAPTNTAKTIAVYQDVTIARLTDLNWNGRDLTVSTRFRSDAVGRNYEFRSYIYNPNWSIVMTAPAQLIPITGQWQTASYTYKNVPVSLDPTTQYTALTLESRVIGDMTGKRLDLRLSKVEYGNKVTDWSPAPEDIQMDIEENLAYRLEVYSTNGNIFRNGDINTVLQATLYKGQTDITDETPASRFYWERTSLNSTSDAVWNANNVGKKNVIVTNSDVNKQATFSCYLKDQ
Physico-chemical properties
protein length: 1070 AA
molecular weight: 118504.93490 Da
isoelectric point: 4.52390
aromaticity: 0.07570
hydropathy: -0.40757
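The record does not state how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value per residue (GRAVY), they could be derived from the protein sequence as follows. The function names `aromaticity` and `gravy` are hypothetical, and the example runs on the first 20 residues of the sequence only, so its output is not expected to match the full-length figures above.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence in this record.
fragment = "MPTPKVKVNGKWYSLGMTAN"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

A negative hydropathy, as reported for the full protein, indicates a sequence that is hydrophilic on average under this scale.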

Domains

Domains [InterPro]
DC_1272 STR 3–231
Coil Unmapped 26–57
DC_1272 STR 588–817
AYP68600.1 1–1070
Architecture
STR 3-817 | RBD 948-1068 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
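The architecture line uses a compact `LABEL start-end |` notation. A small sketch of how it could be parsed into segments with their lengths is given below. The helper `parse_architecture` is hypothetical and assumes inclusive, 1-based residue ranges.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 3-817 | RBD 948-1068 |' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:
            continue  # skip the empty piece after the trailing '|'
        label, span = part.split()
        # Accept both an ASCII hyphen and an en dash as the range separator.
        start, end = (int(x) for x in span.replace("\u2013", "-").split("-"))
        segments.append((label, start, end))
    return segments


for label, start, end in parse_architecture("STR 3-817 | RBD 948-1068 |"):
    # Inclusive ranges: length is end - start + 1.
    print(label, start, end, end - start + 1)
```

Under the inclusive-range assumption, the STR segment spans 815 residues and the RBD segment 121 residues of the 1070-residue protein.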

Taxonomy

  Name Taxonomy ID Lineage
Phage Exiguobacterium phage vB_EalM-132 [NCBI] 2419623 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Exiguobacterium alkaliphilum [NCBI] 1428684 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
AYP68600.1 [NCBI]
GenBank nucleotide accession
MH884511 [NCBI]
CDS location
range 70842 -> 74054
strand +
CDS
GTGCCTACTCCTAAGGTTAAAGTTAATGGCAAATGGTATTCTTTAGGTATGACTGCTAATGATAAGATAGAAATTATCAATAGCGCTAATCAGTATGCAAATCAACTTAACCAAGAAATACAGGAGGAGATAGATGATATACTCCGTAGTCTAGAAGACCTTGATGGTCTTATTGATGATGCCTTTAAGGACGGAATCATATCAGAAACTGAAGCCCGTATTATCAAGTCTGAGATACGTAATTTAGCTACTAAAAAAGCTGAGTTAGACCGAGAGTATACTGACTTGATGGGGAACGTCCATACGGACGCTAACAGCCAGTCTCTCCTTTCCATAGCTAAAAGTAACTATGACGCTAAGTATAATTTGCTTGTTAACACCATCACAGATGTTATAGCAGACTCTAAGGCAGATGAAAGTGAAGTAGCCTTAGTAGATGCGGCGTTTATAGAATACAGAAATGCTATACAACTTCTTGTTTCAGCTATGCGTGGAGCGATTGACCAGATTGGGGAATCTCATGGAGATAAAGCTAGGAAAGATGCCATTGATTATACTGATGGTCAGCTATTCCCAATCGTTCAGCAACTTACTTCTGCTAGTACCGAGATTCAACGAAATGCTTTTGAGATATCCCTAAAAGCAGACGCAGAGTATGTTACTACAGAGTTTGCACGCGTAGAAGGAGCTGTCCTAAATGAAGCTAATCAATACGCAGATAGCATCAGAACGGATGTAACCACAGCTATTGACTCTTTAGAGGAGAGTCTCCAGAATACTGAGTTTTATATTGAAAATAGTTTTAAAGATGGTATTATCTCCTCTCTAGAAGCATCTGGTATTGGTAACTATATCAATACATTAAATAGTGCAAAAAGTGGATTCAATGCCAATTATGACAGTCTCATCGTCAACCCATATCTCAAAGATACGCCAGAATCTGCGCTCCGTTCCGCAAAATTAAACCTCGATGCCAAGCACACAGCGTTGATTTCTGCTATACAAACTGCTATTGAGGATTCAAAAGCAACGGCTGAGGAAGTGACAGATGTCAACACAAAATTCTCAGAGTACGCACAAGCAGTAAACCTGTACGAAGCATCTGCTAATAAAGCTATTGATGCTATTGCTAGGGAGAAAGCACAAGAATCTTATGATAATGCTGTTCAATACACTGACGGGGAACTCGTTCCAATCCGTAACACCGTTAGTAGCCACTCTTCAGAACTTTCTGTACAAGCCCAAGCTATTTTATTACGAGTTACTTCAGAAAAGCACGCTTATGACATCTCACAAGCACTTACGTCAGCTAATCAGTATACCGATGCGGCTAAAGACATTCTAGAACAAGACATTCTAGACCTGACTGGAGCACTAGAACAGACTGAGGACTATATCGAAACTAGCTTTAAAGACGGTATTCTGTATGACGCTGAGTATAATCGGTTACGGGGATTCTTACAGAGTTTAGATGAATCTAAGCAATTGTTTGATGCTAGATACACAACTCTATACCAGAACGCTAACTTAGAAGGACAACCTAAGACTGACTTAGCTAGTGCAAAAACTTTGTATGATAGCAAGTATACATCATTGGTTGATACAGTTAACACCGCTATTGCAGACCAATTAGCTACTCCTGCTGAAGCGGAAGCCGTCAGAACGTCATTTTCTGAGTACAAGGGTGCAGTAGCCTTGTTAGGAGCTGTAATGGAGCAAGCTATAGATGCCATCTCCAAGAAAGTAGCAGACAAGGCTAAAGAAGAAGCTAAAGCATATACAGACCAGATTAAAACTCAGATTAACAGTGACCTAACTTCTCTATCTAATGATATAACTGCTACTAATAACTATATCAATGAAGCATTTCGTGATGGGGTTATTTCTAATACGGAATCTAATCGTATTAGTAGTTATATTAATTTGCTTAGTCAGGCTAAGGACACTTTTGATACTAGAGTTTATGAGCTGATTGGGGATACTGACCTACAACAGATTTA
CAGACTCAGCCTACAGAATGCTAAAGGTAACTATGACACAGCCTACAGTAACTTAATTACTGCCATTAATGACGCTCTAGCGGATGAGGTGGCAAGTCCTATTGAAGTGTCTAATGTCGATACTAAATACAGGCTGTATAATGAAACTGTTAAAGACCTCAATATAGCTATCGACAGAGCATTAAAGAGCATAGCTCAATCAAAAGCTACCAAAGCATATCAGGATGCAATGGGATACACTGATGATATTATTGCTCCTATTGCTGAGATGGTTACCTTGAATAATGCTAGTATTGAGGTACTAGAAGACAGTATTGTTAAAAAGGTATCCCAAGAGACATTCAGTCAAGTAACTGATAATCTGGATACTAAGATTGATAAGATTGAAATAGGTGGTCGCAATCTTCTACGAAACTCTAAGAAGACTGAGGGTATTGTTACTAACGATGATTTTGGAGGACACCCTGTATTAATAGCAATGAATACTGAGGATGGGCTTCCTTATGCTAGAATCACAATAGCTCCAACAAACACAGCTAAAACAATAGCGGTGTATCAGGATGTTACTATAGCTAGGCTGACTGACCTTAACTGGAACGGTAGAGATTTAACTGTATCAACGAGGTTCCGTTCAGATGCAGTAGGTAGAAACTATGAGTTTAGGTCTTATATCTATAATCCTAACTGGAGTATCGTGATGACTGCTCCTGCTCAGTTGATTCCTATAACAGGTCAATGGCAGACTGCTAGCTACACCTATAAAAATGTGCCTGTTAGCTTAGACCCTACTACACAATACACCGCTCTTACCTTAGAGTCTAGAGTTATTGGTGATATGACAGGAAAACGTTTGGACTTGAGATTGTCTAAGGTAGAATATGGTAACAAGGTTACAGACTGGTCTCCTGCTCCTGAGGATATCCAGATGGATATAGAGGAAAATTTAGCCTACCGTTTGGAAGTTTATAGTACAAATGGAAATATTTTCCGTAACGGAGACATTAATACAGTTTTGCAGGCTACCTTATACAAGGGGCAAACGGATATTACAGATGAGACACCTGCTAGCAGATTCTATTGGGAACGTACTTCACTCAATAGCACATCCGATGCGGTGTGGAATGCTAATAACGTTGGTAAGAAGAACGTTATAGTAACAAACTCAGATGTGAATAAACAAGCCACTTTCTCTTGTTATTTAAAAGACCAATAA
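Two consistency checks follow from the fields above. The CDS range should cover three nucleotides per residue plus a stop codon, and the CDS begins with GTG while the protein begins with M. The sketch below assumes the range is inclusive and that the phage gene is translated with NCBI translation table 11, which shares the standard codon assignments but reads alternative start codons such as GTG as methionine. The helpers `cds_length` and `translate_cds` are hypothetical names.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 uses the same assignments.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Subset of initiator codons, assumed sufficient for this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive coordinate range."""
    return end - start + 1


def translate_cds(cds: str) -> str:
    """Translate codon by codon; an initiator codon at position 1 becomes M."""
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


n_nt = cds_length(70842, 74054)
n_codons = n_nt // 3
print(n_nt, n_codons, n_codons - 1)  # nucleotides, codons, residues without stop

# First 30 nucleotides of the CDS in this record.
print(translate_cds("GTGCCTACTCCTAAGGTTAAAGTTAATGGC"))
```

The range gives 3213 nucleotides, or 1071 codons, which matches the 1070-residue protein plus one stop codon. The translated prefix is MPTPKVKVNG, matching the start of the protein sequence.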

Genome Context

Tertiary structure

PDB ID
22dbff29658aebf786128eafa28b5c2e4f17aaa4e6e7265ec8c441f6c50c12d1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.7353
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
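The confidence bands can be applied to per-residue pLDDT scores with a simple lookup. In the sketch below, the function `plddt_band` is a hypothetical name, and the handling of scores that fall exactly on 90, 70 or 50 is an assumption, since the legend only gives strict inequalities.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) are assigned to the lower band;
    this choice is an assumption, not something the legend specifies.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Illustrative scores only; these are not taken from the model in this record.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_band(score))
```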

Literature

Title Authors Date PMID Source
Comparative Genomic Analysis of Eight Novel Haloalkaliphilic Bacteriophages from Lake Elmenteita, Kenya Akhwale,J.K. GenBank