Genbank accession
UNA01499.1 [GenBank]
Protein name
minor structural protein
RBP type
TF
Evidence RBPdetect2
Probability 0,63
Protein sequence
MASGSFNTSTGNKYVQGTVTWSSTPNTGGNYSDVYVEWRFSRTNSGYETYGNGTFGIYVDGQQSVNTLRFSFTQNSRTLVVSGNFRVNHNSDGTKNLRIGVSGYTDVVSINEGVVYVDLDRIPRASSISSNISWTAAIEGLPLSISRASGSFTHSLTLQIKNPTNNNWVSVAARYNIGDYTTIYFDQNEMTIIYREMSKWENTDVWIKLDTFNGGTYIGSSEKYGKVYCATPATPVVSDFNIGTKSVDVTLDYYYDTFNYSLEFTFGSFKKVFPNMGKSNKMDFSDAEVIQMYQQVPNQQSAQANVYASTKYNGIEIQDNIPKDQNKKITLRVVNSEPQYDGGFTYLDSNTTTSGLTGNNQYIIQGRSTLQVKLPVAKKAKPTNYATITRYEVAVNGAVKSINFSDTADLTLDFGTVDVSTNTSIVVSAVDSRGLKKSVSSVILVLPYSLPSFNFNAERVNNFETTTNLKVTGVASPLNVSNANKNRIKTAKYKTRKVGGAWNPEVDLPITGTFPSFASNNASVQLDNTQAWEVSVTITDMIGFATIVSAVAVGTPILFIDTNKKSIGVNKFPTGTKTFEVAGDWVIDGMVNLKADQWWTTSGKAALDLKNSDIIGMNGLYFNDASDSGDEGLNFLKSGKPVGSKVLTDYDNLCVLDGALKLNNKVMFWQEGGTTSSNLRFGGDAYSYSSGGAIFDVWGNIKGQPTAGAANTWSVSAADGKLRFLCGIGNGSTASTELYAYSGGIKFFHDGINSWNFWQNGGGAYAHFDMGAGRMKWNGDRGTFQFLTNAGAWTGIEAQNVSFPSAREYKKNIETFEESALTLIEASKAYLYHYVEEDEELVERHLGLIVDESPEIIRGNGDKSINNYAMNTLLWKAVQELSHKVNNLDRRISIR
Physico‐chemical
properties
protein length:895 AA
molecular weight: 98185,07640 Da
isoelectric point:5,85796
aromaticity:0,11397
hydropathy:-0,33028

Domains

Domains [InterPro]
DC_1676
ATT
3–126
IPR030392
CHP
805–895
IPR030392
CHP
805–857
UNA01499.1
1 895
Architecture
ATT
STR
ATT 3-126 | STR 511-895
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage vB_BcgM
[NCBI]
2918264 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Bacillus weihenstephanensis
[NCBI]
1405 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UNA01499.1 [NCBI]
Genbank nucleotide accession
OM743306 [NCBI]
CDS location
range 56881 -> 59568
strand +
CDS
ATGGCATCAGGTTCATTTAATACTTCCACTGGTAATAAATATGTACAGGGTACTGTGACTTGGAGCAGTACCCCTAATACTGGTGGTAACTATAGTGACGTTTATGTAGAATGGCGATTCTCACGTACCAACTCAGGGTATGAGACATATGGTAACGGTACATTCGGAATATATGTAGACGGACAACAATCAGTGAATACGCTTAGATTTAGCTTCACTCAAAACTCAAGAACATTGGTAGTAAGTGGGAACTTCCGAGTAAACCATAACTCTGATGGTACTAAAAACTTACGTATCGGTGTATCAGGTTACACGGACGTAGTTAGCATTAATGAAGGTGTTGTATATGTAGACTTAGACCGTATACCACGGGCAAGTAGTATATCATCTAACATAAGTTGGACAGCAGCTATAGAGGGATTACCCTTATCTATTAGTCGGGCTTCTGGTTCCTTTACGCACTCTCTTACTTTGCAAATAAAGAACCCAACGAATAACAACTGGGTAAGTGTAGCTGCTAGATATAATATCGGAGATTACACTACGATATACTTCGACCAAAACGAAATGACAATCATTTATCGAGAGATGTCGAAGTGGGAAAATACAGATGTATGGATAAAGCTAGACACATTTAATGGTGGAACATATATAGGCTCGTCAGAAAAGTATGGGAAAGTATATTGCGCTACTCCAGCTACTCCGGTAGTATCTGACTTTAACATTGGAACAAAATCAGTAGATGTAACGCTAGATTACTACTACGATACATTCAACTACTCACTAGAGTTTACTTTTGGTAGTTTCAAAAAAGTATTCCCTAACATGGGAAAATCTAACAAGATGGATTTCTCAGATGCAGAAGTAATTCAAATGTATCAGCAGGTTCCTAACCAGCAATCGGCACAAGCCAATGTATACGCTAGTACGAAGTATAATGGCATTGAGATTCAAGATAACATCCCTAAAGACCAGAATAAAAAAATCACGTTACGAGTAGTTAATAGTGAGCCTCAGTACGACGGAGGTTTCACATACTTAGACTCTAACACGACTACATCAGGTTTAACAGGAAATAACCAGTACATAATTCAAGGTAGGTCAACACTTCAAGTGAAACTACCTGTAGCTAAGAAAGCAAAACCAACAAACTACGCTACAATTACTCGTTATGAGGTAGCTGTCAATGGTGCGGTAAAATCTATTAACTTCTCTGATACAGCAGATTTAACACTGGATTTCGGTACTGTAGATGTAAGCACAAATACCAGTATTGTAGTATCCGCAGTAGATAGTCGAGGGTTAAAGAAATCTGTATCTTCTGTTATATTAGTATTGCCTTATTCGCTACCATCGTTTAATTTCAATGCGGAACGTGTCAACAACTTCGAAACAACAACGAATTTGAAAGTAACAGGTGTGGCATCTCCGCTAAATGTGAGTAACGCGAATAAGAATAGAATCAAGACCGCTAAATACAAGACTAGAAAGGTCGGTGGTGCTTGGAATCCAGAAGTAGATTTACCAATAACAGGTACATTCCCTAGCTTCGCATCTAACAACGCTTCTGTTCAATTAGATAATACACAAGCATGGGAAGTATCAGTAACAATCACAGATATGATAGGTTTCGCTACAATCGTTAGTGCAGTAGCCGTAGGTACCCCTATCCTATTCATCGATACGAATAAGAAGTCCATCGGGGTTAATAAATTCCCTACTGGTACTAAAACTTTTGAAGTTGCAGGTGACTGGGTAATAGATGGTATGGTAAATCTAAAAGCTGACCAATGGTGGACAACTAGTGGTAAAGCTGCATTAGACCTTAAAAACTCTGACATCATTGGTATGAATGGTTTGTACTTTAATGACGCATCTGATTCTGGTGATGAAGGATTAAACTTCCTTAAAAGCGGTAAACCAGTAGGTTCTAAAGTCCTTACAGATTATGATAACCTATGTGTTTTAGATGGCGCTTTAAAGTTGAATAATAAAGTTATGTTTTGGCAAGAAGGTGGAACGACCTCATCAAACCTTCGTTTCGGTGGAGATGCTTATTCATACTCTAGCGGTGGAGCTATCTTCGATGTCTGGGGTAACATTAAAGGTCAACCAACAGCAGGAGCAGCTAATACTTGGTCTGTATCCGCTGCTGACGGTAAACTTCGATTCCTTTGCGGTATCGGTAATGGTTCAACCGCATCTACAGAACTATACGCCTACTCTGGTGGTATTAAGTTCTTCCATGATGGTATAAACTCTTGGAACTTCTGGCAAAATGGCGGAGGTGCATACGCTCACTTCGATATGGGTGCAGGAAGAATGAAGTGGAACGGGGACAGAGGTACATTCCAGTTCTTAACAAACGCAGGGGCATGGACTGGCATCGAAGCCCAAAACGTATCATTTCCTTCCGCAAGAGAATACAAGAAAAATATCGAGACATTTGAAGAGAGCGCATTAACTCTTATAGAAGCTTCAAAAGCGTATCTGTACCATTACGTAGAGGAAGACGAAGAATTAGTAGAGAGACATTTAGGTTTAATCGTAGATGAATCTCCAGAAATTATACGAGGAAACGGTGACAAATCTATTAACAACTACGCCATGAATACTTTGCTTTGGAAAGCAGTACAAGAATTGTCACATAAAGTAAACAATCTAGACAGAAGAATATCAATTAGATAG

Genome Context

Genome Context

Tertiary structure

PDB ID
359d85bbfa92205df52c80a736809d98b477757ce7d00d504a1a54fe8b7cf646
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2984
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50