Genbank accession
XFD06871.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
Protein sequence
MASGSFIIGTTNQYVEGRCYWDAWSDPNRNVSRINVSVYFYRTNNWSGATKGTFTFYLYSSFGEQARNTQYFTFTNPNGGQGTQVLNASWEQAHDANGELRFRLSVGKDTDVFALHNNGGDVVADKIARESTVASQPSATLPNDIWIDLNVSNSTFYHTVELWAKNTSGADTLIDTQTRVGTRAYFSAVPALRQKLAQALGNRSEMALWATAVTYDANGNQIGGKRWGPEGRYYRPSLGFISCPALFVNEVIRARIGAYDSRLQYRVHVCVHMNYDGNGNPPAADTAKFRKIYTPTTEQFDISFTQAEQDKIATDTMPDRTARLVTFQVDTMCEGVVLNTNNVIPSYSNGGIDISRMNIAPTFSGSFPAVDTNSVTTAITGNSSMIIQGQSKVTVTVPAANRATAKLGATITRYDFTINGVTVPVTPPATGDVVAIFGTVNASANTSCTVTAIDTRGMSAPVSTGVTMIPYTPPTVIGTGARQNNFEASTKVTASGSYAPLTISGQNKNDIVTRTFRKRVLGGTFDAAVSFATITKGTNTYEATTTSVIFDVDKIYEVEVTIKDKVSPEVKTTFILNKGTPIAFMDAKTQSLGVNMIPVAGNGENKLQVTGSAYVSDNVKTQVLLFPKTGSENNPSAANSQYVSFRIKDDRIMMNDKNIFYQLGNTSDLRLGGNLYTSSTQGAFIDAYGNIKPQTGVIWENTWGVFQKKSDGSYNAAILLNLVADADSNVTTVTLGGVNYKMNKNASNLAPNGAMSFTNSTNGGRMTAYFENIFAYNAGANPSRPEYKENLELLDISATDIINNNNVYAYDYKDEYRQPNHNKLKKDIGVLINESLSLMTNDDDTAIKDYAMSSILWKGLQETNARLKKLEEATK
Physico‐chemical
properties
protein length:875 AA
molecular weight: 95328,34040 Da
isoelectric point:7,01412
aromaticity:0,09600
hydropathy:-0,32217

Domains

Domains [InterPro]
XFD06871.1
1 875
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Azalea
[NCBI]
3289875 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XFD06871.1 [NCBI]
Genbank nucleotide accession
PQ217616.1 [NCBI]
CDS location
range 61635 -> 64262
strand +
CDS
ATGGCAAGTGGTTCATTTATAATAGGTACAACGAACCAATATGTCGAGGGTAGATGTTACTGGGATGCTTGGTCAGACCCCAACCGAAACGTTAGTAGGATTAACGTTTCGGTGTACTTCTACCGTACGAACAACTGGTCAGGTGCTACAAAAGGTACATTTACATTTTATCTTTACTCATCTTTTGGTGAGCAAGCAAGAAACACGCAATATTTCACATTCACCAACCCTAACGGAGGGCAAGGTACACAGGTACTTAATGCATCGTGGGAACAGGCGCATGATGCCAATGGTGAGTTAAGGTTTAGATTATCTGTAGGTAAGGATACCGATGTATTCGCCTTACACAATAACGGTGGTGATGTAGTAGCAGATAAGATAGCCCGAGAGAGTACAGTAGCATCACAGCCATCTGCGACCCTACCGAATGATATCTGGATTGATTTAAACGTAAGCAATAGTACATTTTACCACACAGTTGAGCTGTGGGCTAAAAACACTAGTGGTGCGGATACGCTAATTGATACGCAAACAAGAGTAGGCACACGTGCTTACTTTAGTGCTGTACCAGCACTTCGACAAAAACTAGCACAGGCTCTAGGAAACCGAAGTGAGATGGCACTATGGGCAACCGCCGTAACCTACGATGCTAACGGAAACCAGATAGGAGGTAAACGGTGGGGACCAGAAGGAAGGTACTATCGACCTAGCTTAGGTTTCATATCATGTCCCGCATTATTTGTAAATGAAGTTATAAGAGCTAGGATAGGAGCGTATGATAGTAGATTGCAGTATCGAGTTCATGTTTGTGTGCATATGAACTACGATGGTAACGGTAATCCACCAGCGGCAGACACTGCAAAATTTAGAAAGATTTATACACCTACTACCGAGCAGTTTGACATATCCTTTACTCAAGCGGAACAGGATAAAATAGCAACAGACACAATGCCAGACAGAACCGCTAGATTAGTAACATTCCAAGTTGATACGATGTGTGAGGGTGTTGTTCTTAATACCAATAACGTTATCCCATCATACTCAAATGGTGGAATAGACATCTCAAGAATGAATATCGCACCTACATTTTCTGGTTCATTCCCTGCTGTTGATACAAACTCCGTTACCACTGCTATAACAGGTAACTCATCTATGATTATCCAAGGACAGTCAAAAGTTACAGTTACTGTTCCGGCAGCAAATAGAGCAACAGCTAAGTTGGGCGCGACAATAACACGTTATGATTTTACGATTAATGGTGTGACAGTGCCAGTAACGCCACCAGCAACAGGTGACGTAGTAGCTATCTTCGGTACGGTAAATGCATCCGCAAATACATCGTGTACCGTTACCGCTATAGACACAAGAGGAATGTCTGCACCTGTTTCTACTGGTGTTACTATGATACCGTACACGCCTCCAACAGTTATAGGTACTGGAGCCAGACAGAACAACTTCGAAGCCAGTACAAAAGTAACGGCATCAGGTTCCTACGCTCCGCTAACAATAAGTGGGCAGAACAAGAATGATATTGTTACTAGAACATTTAGAAAACGGGTATTAGGCGGAACATTCGATGCTGCTGTTAGTTTCGCAACCATTACGAAAGGAACGAACACTTACGAGGCTACGACAACATCTGTAATATTTGACGTTGATAAGATTTACGAAGTAGAGGTTACAATAAAGGACAAAGTATCTCCAGAAGTAAAAACTACCTTCATTCTAAACAAAGGTACTCCTATTGCGTTCATGGATGCTAAGACACAGTCATTAGGTGTTAACATGATACCTGTTGCAGGTAATGGAGAGAATAAGTTACAGGTAACAGGCTCAGCATACGTTAGTGATAACGTTAAGACACAGGTTCTATTATTCCCAAAAACTGGTTCAGAGAATAATCCATCCGCAGCCAATTCACAGTATGTTTCGTTTAGAATAAAAGATGATAGAATCATGATGAACGATAAGAACATTTTCTACCAACTAGGAAACACATCAGATTTGCGACTAGGCGGGAACTTATACACTAGTAGTACACAAGGTGCTTTCATCGATGCTTACGGTAATATAAAACCGCAGACAGGAGTTATCTGGGAAAACACGTGGGGAGTATTCCAGAAGAAGTCTGACGGTTCCTATAATGCGGCGATACTACTAAATTTAGTAGCTGATGCTGACTCTAACGTAACTACGGTTACACTTGGCGGCGTTAACTACAAGATGAATAAAAACGCATCTAACTTGGCACCAAACGGCGCAATGTCATTCACAAACTCAACAAACGGCGGGAGGATGACAGCTTACTTCGAGAATATATTCGCATATAATGCAGGTGCAAACCCATCTAGACCAGAGTATAAGGAAAATCTAGAACTACTAGATATTAGCGCAACAGATATCATCAATAACAACAACGTATATGCGTATGATTACAAAGATGAATATCGACAACCTAACCATAATAAACTCAAAAAAGACATAGGTGTCCTAATCAATGAATCTCTTAGTCTCATGACAAATGACGACGACACTGCAATCAAAGATTACGCTATGTCAAGTATTCTATGGAAAGGTCTGCAAGAAACAAATGCAAGACTTAAAAAATTAGAGGAGGCAACAAAATGA

Tertiary structure

PDB ID
3b5208d47901f57af1c1b1de243a7224daae0af0564e324f5c805d95b704b651
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7149
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50