Protein
View in Explore- Genbank accession
- AID50243.1 [GenBank]
- Protein name
- structural protein
- RBP type
-
TF
- Protein sequence
-
MRTPSGLLHVVDFKTDQIISAIQPKDYWADNRHWEIKNNIDIFDFTTFDGTPHAITLQQQNLVLKEVRDGRIVPYVINNEVEKDSNDRSLTVHSSGAWVQIAKDGIIKPQRIESETVNTFIDIALADSKWQRGITDYSSFHTMTIDEFIDPLTFLKKIASLFELEIQYRVEVMGSKITGWYVDMIKKRGRETGKEVTLGKDLVEVRRIEHSRDICTALVGFVRGEGDKLITVESINKGLPYIVDNDAFQRWNAHGKHKFGFYTPETEDQNMTPERLMTLMKTELRKRVNTSVSYVVEAQSIGRIFGLAHELINEGDTIRIKDTGFTPKLYLEARVIAGDESFTDPTQDKYVFGDYRELTDPNEELKKIYNRILSKFGEKQEILDQLDKLVKDANETASDAKKESEAAKTLAEKVQENLKNNTVDIIEAKNPPTTGLRPYKTLWRDISNGKPGILKIWTGAAWESVVPDVDSVKKETLEQVTKDINASAEKLDARVKEAETKADNLQKDFNAVKGEQERVSQVTKALEESDEATKETITRIQGTQEDMNKTIVETTKGVEGLQSTVSDIKKDQNGITDRVVKTEQNINGISSSIEQINKTSSQTIQKLNKVEEDANGTKQTIERIEKNVNNLDDDVINLVRGTKTLTTNEELSLKGARLSVIKDTYNGNAIAQTDTEWQGIAVKPSELIKRGKIKIGDTVTFSVTARMIGGESTQVFFPNSAGKTTVNGEWKRVSVTIPVGSDAADPNVVYRFEAESIPKGALYQQTSPMLSLTKKVYPWRPAPEDQADSNEFIKVTTEIKAEAGKISTKLEQVEARTVGVENWLINTGPNERPQTIGMIGGAVLNKVTSFVQPGEYVAIECQDHTDAFYQFHLDNTKIGDFEKGKDITISLDIQNDVLLDFILFQYINGSWSESVQKPVPAKDWRRESWTFKIDVRATGWGFRIRFARNEASKGKRFRFKKAKLEKGSVPTDFSKSTYELEQSVDGVKSTVTKVQDNQAGFDKRVATVEKDATTIKQNVSFIQNTQTEQGRQLQEAKAGWENTAKALQGKVELKQVEDYVAGFKIPELKQIVDKNKQDLLGELKYICYRTYVGARIYP
- Physico‐chemical
properties -
protein length: 1098 AA molecular weight: 123826,17650 Da isoelectric point: 5,78521 aromaticity: 0,07559 hydropathy: -0,58352
Domains
Domains [InterPro]
Legend:
Pfam
SMART
CDD
TIGRFAM
HAMAP
SUPFAM
PRINTS
Gene3D
PANTHER
Other
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage Waukesha92 [NCBI] |
1510440 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Bacillus thuringiensis serovar kurstaki [NCBI] |
29339 | Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AID50243.1
[NCBI]
Genbank nucleotide accession
KJ920400
[NCBI]
CDS location
range 28045 -> 31341
strand -
strand -
CDS
ATGAGAACACCAAGTGGATTACTTCATGTTGTTGATTTTAAAACAGATCAAATTATATCAGCTATTCAACCAAAAGATTACTGGGCTGATAACCGTCATTGGGAAATCAAAAATAACATTGATATATTTGATTTTACAACTTTTGACGGCACTCCACATGCAATTACATTACAACAGCAGAACTTGGTTTTAAAGGAAGTACGAGATGGTCGCATTGTTCCGTATGTTATTAACAATGAAGTAGAAAAAGATTCAAATGATAGATCTTTAACTGTACACTCGTCTGGTGCTTGGGTTCAAATAGCGAAAGATGGGATTATTAAACCTCAACGTATAGAGAGCGAAACAGTTAATACGTTTATTGATATCGCTCTTGCCGATTCAAAATGGCAACGTGGAATAACGGATTATTCTTCATTCCACACTATGACTATCGACGAATTTATTGATCCTCTCACTTTTTTAAAGAAAATTGCTTCTTTGTTTGAGTTAGAAATCCAATATCGCGTCGAAGTAATGGGTTCCAAGATTACCGGCTGGTACGTAGATATGATAAAGAAGCGAGGGCGAGAAACTGGCAAGGAAGTAACTCTAGGAAAAGATTTAGTTGAGGTTAGAAGAATTGAACATTCTAGAGATATTTGCACAGCACTTGTCGGATTTGTACGAGGTGAAGGTGACAAACTTATCACGGTGGAGAGCATAAATAAAGGACTACCTTACATCGTCGATAATGACGCATTTCAGCGATGGAATGCGCATGGTAAACATAAATTTGGTTTCTACACTCCAGAAACAGAAGATCAAAATATGACACCAGAGCGGCTTATGACTCTCATGAAAACAGAATTAAGGAAACGAGTCAATACTTCCGTTTCTTATGTAGTAGAAGCACAATCGATTGGACGTATTTTCGGACTAGCACATGAACTAATTAACGAGGGTGATACAATCCGAATCAAAGATACAGGCTTCACTCCTAAGTTATACCTAGAAGCACGTGTAATTGCTGGTGATGAATCTTTTACGGATCCTACACAGGATAAATATGTGTTTGGTGATTATCGTGAACTTACTGATCCGAACGAGGAATTAAAAAAGATTTACAATCGAATCCTTAGTAAATTCGGTGAGAAACAAGAAATACTGGATCAGTTAGATAAATTAGTGAAAGATGCAAATGAAACAGCTAGTGATGCTAAGAAAGAATCCGAAGCAGCGAAAACATTGGCTGAAAAAGTACAAGAGAATCTTAAAAATAACACGGTAGACATCATTGAAGCAAAGAATCCACCGACAACAGGGCTTAGACCTTATAAGACACTTTGGCGTGATATTAGCAATGGAAAGCCTGGTATTTTGAAAATATGGACAGGCGCAGCTTGGGAATCGGTCGTACCTGATGTTGATTCAGTTAAGAAAGAAACGTTGGAACAGGTGACAAAAGATATTAATGCTAGTGCGGAAAAGCTAGATGCTAGAGTTAAAGAAGCTGAAACGAAAGCAGACAATCTACAAAAAGACTTCAATGCTGTTAAAGGAGAACAAGAAAGAGTCAGTCAGGTAACTAAGGCGCTTGAAGAAAGTGACGAGGCAACTAAAGAAACAATCACACGTATACAAGGTACTCAAGAAGATATGAACAAAACAATCGTTGAGACTACAAAAGGCGTTGAAGGGCTACAAAGTACTGTATCCGATATTAAAAAAGATCAAAACGGTATTACAGATCGTGTAGTAAAAACAGAACAAAACATTAACGGTATTTCCAGTTCCATTGAACAAATCAACAAAACGTCATCACAAACAATTCAAAAGTTAAATAAAGTTGAAGAAGATGCAAATGGAACAAAGCAAACCATCGAACGAATCGAAAAGAATGTAAATAACCTTGATGATGATGTTATCAACCTTGTAAGAGGAACTAAGACGTTAACAACTAACGAAGAATTATCTTTAAAAGGTGCTCGCCTATCTGTTATTAAGGACACGTACAATGGGAATGCTATCGCGCAAACTGATACAGAATGGCAAGGTATAGCCGTTAAACCCAGTGAATTAATCAAGCGAGGGAAAATAAAAATTGGTGATACTGTAACATTTTCTGTAACAGCTAGAATGATAGGTGGGGAATCTACACAAGTATTCTTTCCTAACAGTGCTGGTAAAACAACAGTTAATGGAGAATGGAAAAGAGTTTCGGTAACAATTCCAGTCGGATCTGATGCCGCTGATCCAAATGTTGTATATAGATTCGAAGCTGAATCAATACCTAAAGGCGCTTTATACCAACAAACTTCACCTATGCTATCGTTAACTAAAAAAGTTTATCCGTGGAGACCTGCACCTGAAGACCAGGCAGACAGTAACGAATTTATCAAAGTTACAACTGAAATTAAAGCAGAAGCTGGGAAAATCTCGACAAAGTTAGAACAGGTTGAGGCTCGTACTGTAGGCGTTGAAAACTGGCTAATCAATACAGGCCCAAACGAAAGACCTCAAACAATCGGGATGATTGGTGGCGCGGTGTTGAATAAAGTTACATCATTTGTTCAGCCTGGCGAATACGTAGCGATTGAATGTCAAGATCATACAGACGCCTTTTATCAATTCCATCTAGATAATACTAAGATAGGAGACTTTGAAAAAGGGAAAGATATAACAATATCTTTAGACATTCAAAATGATGTGCTTTTAGATTTTATTTTATTCCAATACATCAACGGATCGTGGAGTGAGTCAGTACAAAAGCCTGTGCCAGCTAAAGACTGGCGTCGTGAGTCATGGACGTTTAAAATCGATGTTCGTGCTACTGGTTGGGGATTTAGAATTCGTTTTGCTAGAAATGAAGCATCTAAAGGGAAAAGGTTTCGTTTCAAGAAAGCTAAACTAGAAAAGGGTTCTGTTCCAACAGACTTCAGCAAGTCAACATATGAATTAGAGCAAAGTGTGGATGGTGTAAAATCTACTGTAACTAAGGTGCAGGATAACCAAGCCGGATTTGATAAGCGTGTTGCTACTGTAGAAAAAGATGCAACTACCATTAAACAAAATGTCTCTTTCATACAAAATACGCAGACAGAACAAGGAAGACAATTACAAGAGGCGAAAGCTGGATGGGAAAATACTGCGAAAGCACTGCAAGGTAAAGTTGAACTTAAGCAAGTAGAAGATTATGTTGCTGGGTTTAAGATACCTGAGTTGAAGCAAATAGTTGATAAAAATAAACAAGATTTGTTGGGAGAACTAAAATATATCTGCTACAGGACGTATGTGGGCGCAAGAATTTATCCCTAA
Tertiary structure
PDB ID
11a0cc4c1294610a46db496a9716d23e61c9128d844493ca962f64fb282d9c49
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Complete genome sequence of a mosaic bacteriophage, waukesha92 | Sauder,A.B., Carter,B., Langouet Astrie,C. and Temple,L. | 2014 | 25146131 | GenBank |