GenBank accession
XFD06871.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence GenBank
Probability 1.00
Protein sequence
MASGSFIIGTTNQYVEGRCYWDAWSDPNRNVSRINVSVYFYRTNNWSGATKGTFTFYLYSSFGEQARNTQYFTFTNPNGGQGTQVLNASWEQAHDANGELRFRLSVGKDTDVFALHNNGGDVVADKIARESTVASQPSATLPNDIWIDLNVSNSTFYHTVELWAKNTSGADTLIDTQTRVGTRAYFSAVPALRQKLAQALGNRSEMALWATAVTYDANGNQIGGKRWGPEGRYYRPSLGFISCPALFVNEVIRARIGAYDSRLQYRVHVCVHMNYDGNGNPPAADTAKFRKIYTPTTEQFDISFTQAEQDKIATDTMPDRTARLVTFQVDTMCEGVVLNTNNVIPSYSNGGIDISRMNIAPTFSGSFPAVDTNSVTTAITGNSSMIIQGQSKVTVTVPAANRATAKLGATITRYDFTINGVTVPVTPPATGDVVAIFGTVNASANTSCTVTAIDTRGMSAPVSTGVTMIPYTPPTVIGTGARQNNFEASTKVTASGSYAPLTISGQNKNDIVTRTFRKRVLGGTFDAAVSFATITKGTNTYEATTTSVIFDVDKIYEVEVTIKDKVSPEVKTTFILNKGTPIAFMDAKTQSLGVNMIPVAGNGENKLQVTGSAYVSDNVKTQVLLFPKTGSENNPSAANSQYVSFRIKDDRIMMNDKNIFYQLGNTSDLRLGGNLYTSSTQGAFIDAYGNIKPQTGVIWENTWGVFQKKSDGSYNAAILLNLVADADSNVTTVTLGGVNYKMNKNASNLAPNGAMSFTNSTNGGRMTAYFENIFAYNAGANPSRPEYKENLELLDISATDIINNNNVYAYDYKDEYRQPNHNKLKKDIGVLINESLSLMTNDDDTAIKDYAMSSILWKGLQETNARLKKLEEATK
Physico-chemical properties
protein length: 875 AA
molecular weight: 95328.34040 Da
isoelectric point: 7.01412
aromaticity: 0.09600
hydropathy: -0.32217
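The record does not say how these values were derived. As a minimal sketch, assuming the conventional definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), two of the properties can be recomputed from the protein sequence. The function names are illustrative and are not part of the database.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 30 residues of the record's protein sequence, used as a short input;
# substitute the full 875-residue sequence to compare with the reported values.
example = "MASGSFIIGTTNQYVEGRCYWDAWSDPNRN"
print(len(example), round(aromaticity(example), 5), round(gravy(example), 5))
```

Under this assumed definition, the reported aromaticity of 0.096 over 875 residues would correspond to 84 aromatic residues.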

Domains

Domains [InterPro]
DC_1676 ATT 3–131
IPR030392 CHP 783–874
XFD06871.1 1–875
Architecture
ATT 3-131 | RBD 735-875
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
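The architecture string can be turned into structured segments for downstream use. The sketch below assumes the `LABEL start-end | LABEL start-end` layout seen in this record and 1-based inclusive residue positions; `parse_architecture` and `segment_lengths` are hypothetical helper names.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 3-131 | RBD 735-875' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        # Accept either a plain hyphen or an en dash between the positions.
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def segment_lengths(segments: list[tuple[str, int, int]]) -> dict[str, int]:
    """Length of each segment, treating both end positions as inclusive."""
    return {label: end - start + 1 for label, start, end in segments}


segments = parse_architecture("ATT 3-131 | RBD 735-875")
print(segment_lengths(segments))  # {'ATT': 129, 'RBD': 141}
```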

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Azalea [NCBI] 3289875 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Bacillus thuringiensis [NCBI] 1428 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
XFD06871.1 [NCBI]
GenBank nucleotide accession
PQ217616.1 [NCBI]
CDS location
range 61635 -> 64262
strand +
CDS
ATGGCAAGTGGTTCATTTATAATAGGTACAACGAACCAATATGTCGAGGGTAGATGTTACTGGGATGCTTGGTCAGACCCCAACCGAAACGTTAGTAGGATTAACGTTTCGGTGTACTTCTACCGTACGAACAACTGGTCAGGTGCTACAAAAGGTACATTTACATTTTATCTTTACTCATCTTTTGGTGAGCAAGCAAGAAACACGCAATATTTCACATTCACCAACCCTAACGGAGGGCAAGGTACACAGGTACTTAATGCATCGTGGGAACAGGCGCATGATGCCAATGGTGAGTTAAGGTTTAGATTATCTGTAGGTAAGGATACCGATGTATTCGCCTTACACAATAACGGTGGTGATGTAGTAGCAGATAAGATAGCCCGAGAGAGTACAGTAGCATCACAGCCATCTGCGACCCTACCGAATGATATCTGGATTGATTTAAACGTAAGCAATAGTACATTTTACCACACAGTTGAGCTGTGGGCTAAAAACACTAGTGGTGCGGATACGCTAATTGATACGCAAACAAGAGTAGGCACACGTGCTTACTTTAGTGCTGTACCAGCACTTCGACAAAAACTAGCACAGGCTCTAGGAAACCGAAGTGAGATGGCACTATGGGCAACCGCCGTAACCTACGATGCTAACGGAAACCAGATAGGAGGTAAACGGTGGGGACCAGAAGGAAGGTACTATCGACCTAGCTTAGGTTTCATATCATGTCCCGCATTATTTGTAAATGAAGTTATAAGAGCTAGGATAGGAGCGTATGATAGTAGATTGCAGTATCGAGTTCATGTTTGTGTGCATATGAACTACGATGGTAACGGTAATCCACCAGCGGCAGACACTGCAAAATTTAGAAAGATTTATACACCTACTACCGAGCAGTTTGACATATCCTTTACTCAAGCGGAACAGGATAAAATAGCAACAGACACAATGCCAGACAGAACCGCTAGATTAGTAACATTCCAAGTTGATACGATGTGTGAGGGTGTTGTTCTTAATACCAATAACGTTATCCCATCATACTCAAATGGTGGAATAGACATCTCAAGAATGAATATCGCACCTACATTTTCTGGTTCATTCCCTGCTGTTGATACAAACTCCGTTACCACTGCTATAACAGGTAACTCATCTATGATTATCCAAGGACAGTCAAAAGTTACAGTTACTGTTCCGGCAGCAAATAGAGCAACAGCTAAGTTGGGCGCGACAATAACACGTTATGATTTTACGATTAATGGTGTGACAGTGCCAGTAACGCCACCAGCAACAGGTGACGTAGTAGCTATCTTCGGTACGGTAAATGCATCCGCAAATACATCGTGTACCGTTACCGCTATAGACACAAGAGGAATGTCTGCACCTGTTTCTACTGGTGTTACTATGATACCGTACACGCCTCCAACAGTTATAGGTACTGGAGCCAGACAGAACAACTTCGAAGCCAGTACAAAAGTAACGGCATCAGGTTCCTACGCTCCGCTAACAATAAGTGGGCAGAACAAGAATGATATTGTTACTAGAACATTTAGAAAACGGGTATTAGGCGGAACATTCGATGCTGCTGTTAGTTTCGCAACCATTACGAAAGGAACGAACACTTACGAGGCTACGACAACATCTGTAATATTTGACGTTGATAAGATTTACGAAGTAGAGGTTACAATAAAGGACAAAGTATCTCCAGAAGTAAAAACTACCTTCATTCTAAACAAAGGTACTCCTATTGCGTTCATGGATGCTAAGACACAGTCATTAGGTGTTAACATGATACCTGTTGCAGGTAATGGAGAGAATAAGTTACAGGTAACAGGCTCAGCATACGTTAGTGATAACGTTAAGACACAGGTTCTATTATTCCCAAAAACTGGTTCAGAGAATAATCCATCCGCAGCCAATTCACAGTATGTTTCGTTTAGAATAAAAGATGATAGAATCATGATGAACGATAAGAACATTTTCTACCAACTAGGAAACACATC
AGATTTGCGACTAGGCGGGAACTTATACACTAGTAGTACACAAGGTGCTTTCATCGATGCTTACGGTAATATAAAACCGCAGACAGGAGTTATCTGGGAAAACACGTGGGGAGTATTCCAGAAGAAGTCTGACGGTTCCTATAATGCGGCGATACTACTAAATTTAGTAGCTGATGCTGACTCTAACGTAACTACGGTTACACTTGGCGGCGTTAACTACAAGATGAATAAAAACGCATCTAACTTGGCACCAAACGGCGCAATGTCATTCACAAACTCAACAAACGGCGGGAGGATGACAGCTTACTTCGAGAATATATTCGCATATAATGCAGGTGCAAACCCATCTAGACCAGAGTATAAGGAAAATCTAGAACTACTAGATATTAGCGCAACAGATATCATCAATAACAACAACGTATATGCGTATGATTACAAAGATGAATATCGACAACCTAACCATAATAAACTCAAAAAAGACATAGGTGTCCTAATCAATGAATCTCTTAGTCTCATGACAAATGACGACGACACTGCAATCAAAGATTACGCTATGTCAAGTATTCTATGGAAAGGTCTGCAAGAAACAAATGCAAGACTTAAAAAATTAGAGGAGGCAACAAAATGA
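The CDS coordinates can be checked against the protein length with simple arithmetic. This is a minimal sketch, assuming the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends in a stop codon that is not translated. The helper names are illustrative, and the `ATG` start check is a simplification, since phage genes can also use alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of residues encoded, excluding the terminal stop codon."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length // 3 - 1


def looks_like_complete_cds(cds: str) -> bool:
    """Check reading frame, ATG start, and terminal stop codon."""
    # Strip whitespace first, so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    return len(cds) % 3 == 0 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


print(cds_length(61635, 64262))               # 2628
print(expected_protein_length(61635, 64262))  # 875
```

The result, 2628 nt or 876 codons including the stop, agrees with the 875 AA protein length listed in the record.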

Tertiary structure

PDB ID
3b5208d47901f57af1c1b1de243a7224daae0af0564e324f5c805d95b704b651
Source ESMFold
Method ESMFold
Resolution 0.7149
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
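The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows, assuming pLDDT is on a 0 to 100 scale. The legend uses strict inequalities, so the exact values 90, 70 and 50 are not assigned to any band; placing them in the lower band here is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence band for a pLDDT score on a 0-100 scale."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


for score in (95.2, 80.0, 61.5, 32.0):
    print(score, plddt_band(score))
```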