Protein
- GenBank accession
- YP_001429654.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
- TF
- Protein sequence
-
MATRNFNMTLQLDAKYIGERWWAGEEIHQLWNVENMNTLQRSEWGLDRNGWIQYRSQLDSYSIRYSAWDTILNDYYFETKMQPAGQGNYEIGLAFRMKDRYSFYYLTYNGGYADWGGKNIRLMKVIGTQHVKLAEYECPVFDTKTVYKVRVDLKGNNIKVTWDDQVIFNFNDPNPIPNGAFGPLVMGQEFAKWEGFQAKNVTKFILQERYNDLGVESNYNSPETGKLVDSKTVEDLMRKQLDTYLNGRVYERIIYDAFRLVSSEPRVKLVFDRVPNKNVTSDPTSRIYAYQDAPSVPPVAPINLLGKALSTSSIQLDWTHVDDSEDGFYILDENGETKGIVGVDTFQFIEEGLEENTSYKRKVVAFNVAGRSKESNTVIVKTLQSVPIAPSDFEGKPTGDGEITWTWKDHSFNEASFEIIEWDQYGNIQVIGTVGENVTVFVEKGLIPLQEYTRAVRAKNPAGVSPPSNKATVKTKKDVPPKPNKAPINFYGVGISDDTIVWSWEDENTVPIDGYELLDPDDNVIATIKGKITNFYESGLYAKMKYRRKIRAFNEGGVGPASLIAEAETLDYGSNFKDKPVAPFNLYIMEVGTDRCKLRWEYNDHPLMPAVGFNLYNELDQVVDTVPLEVRERNILHLNPDSVYYFYVTAYNKNGDSLPSNKVRVKTLKVYVDENENNEDDKTPSKPLDDPLGDLTYDQEKPGMEKIRAFQSGIGDRLDLAVRNVMHTGGVNMEEFDCAMYIKGLYQKEEPKFVDVQFKFRVTCNGIDVRNRTAYSKVSEWMGATVRGGEDGITFNTPTKMETPDYVTEKIYTIEVQDMKGNTIPMDGTEKGDVKVHWTLDPYEAKDKIGWVQEIYMRNVFNNWKKFSHKGVFQPANSLEMNAWRYKEDTDELICTQNTDTYVGAVSPDVYDDYYIKAQFRSNNWDDDAMAFVVAYTLDEYSREHTISAVRTHSSAITWSLVYNYSSTGQKTLASNIEVDKELVSNDNTKGGWAGFYPNGTIIEVVREGTKITARTSAAGKTELGYELKIDLDSLPELAKFKKPCHIGVASLSQQDSALKIQEFRGNKTEVYTIYNLRAWTNYKDNVGNQTEWIGSISKTKRMKVKAFEQLRFTGRIRSPKYEIPWQKLNEKYFFDPTGYKVVVNCSNPNVDVTLEFNITDFFPKESTYIDIPMIAKILNHTQTPWNPSIHHGYYYLNHKEHFLFSNPEVLPKETSNSGVYVYNFPYSIRLVGQREYAGKDMVYSDDTMARFQLGIMKDTKYDVDKKVLTLDTEPTGTFISRIFDFQYDLDMFRAVDVKLKKALAGSSYTLEVGIPDENGDVATWVKQNGNLPVEFRDKFSRVRYKLTLNEGQSKEPYAATFPLDALALEKGKKKYITVDGKTIYIDDPTFRDVGTFLTSPMEYGKQIEEMGIVELDMEIPDGGRVELYSVTADEETADFENPSRDMPWIPLELVSQKGKRFTYRVKSAMKRYVVLVAKLFRGLTYTSQNTVTLSADTYVPQWADNVTWGVNGMELVDASKPGTYQSKATNLGFIKFYDKVQLETTMNNANHKIDIYTVTGWTAEEVEEAAKTESNWKKVVDGQIQSDIRYCIMYRVVLTSGTGTTSPQLQKMTITPLVDTYVSPKLDNIVASATLFNWVRDVPEITSVNVSGYIEKGIVAEEYFMPMTGELIANGSEQTITALPTERLAWEWVRREGIPNPDELVLKDFFAEIDPAYPVDLYTDIAGTGYVTGKTTASVGELVKKKEKLYFDEKTQAIEVKPIPQAGTPILITNAQGTRLRQVHFRDSTTGKPTLTNMENLKADETRYLFLEHTGIDPRSAHVWVMIDGKWTEILNIKVVENRIVLPHLLIPGSDVRVTYKLLESFTVDYNYSPKTDVAQIKLHTTFDVNEKESRLLDIQYETNKESAYYQATEVNLNPLHTKISSGFLYLTDEIYPPHKLDIKANPTTLYHGKKDRVTVHGYVKDEHGNPVVGERVDFDVKYGSLEIMNPLSDVNGMV
MAIYTAPNNNSLTSDTIEMHVVSRDTTYHVTNKVTINLVKETFEEKLAIKLEKHMATPGDVVQLKVIAMGINYERLMNKAIEVSCDIGSVSPAKGTTNADGELFVKYTVPNTQEKFATITVKAKREDGLQLIEQNILGISGVS
- Physico-chemical properties
- protein length: 2143 AA; molecular weight: 243689.27920 Da; isoelectric point: 5.28702; aromaticity: 0.10919; hydropathy: -0.47466
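The aromaticity and hydropathy values listed above can be recomputed from the protein sequence. The record does not say how they were calculated, so the following is only a minimal sketch under two assumptions: aromaticity is taken as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names are illustrative, and the demo runs on the first ten residues only, not on the full sequence.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name a scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


# First ten residues of the protein sequence in this record.
prefix = "MATRNFNMTL"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Passing the full 2143-residue sequence to the same functions gives values that can be compared with the ones listed in the record, provided the assumed definitions match those the database used.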
Domains
Domains [InterPro]
| ID | Type | Range |
|---|---|---|
| G3DSA:2.60.120.560 | RBD | 31–196 |
| IPR050964 | Unmapped | 291–1212 |
| IPR036116 | STR | 297–480 |
| IPR003961 | STR | 298–382 |
| IPR003961 | STR | 298–372 |
| IPR003961 | STR | 300–385 |
| IPR003961 | STR | 582–665 |
| DC_1887 | STR | 610–988 |

Sequence scale: 1–2143
Architecture
RBD 31–196 | STR 282–572 | STR 574–988 | STR 1043–1161 | STR 1938–2037
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
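The architecture line above is a pipe-separated list of `TYPE start-end` segments. As a rough illustration, it can be parsed into typed intervals to see how much of the 2143-residue protein the annotated segments cover. `Segment` and `parse_architecture` are hypothetical names introduced here, and residue ranges are assumed to be 1-based and inclusive.

```python
from typing import List, NamedTuple


class Segment(NamedTuple):
    kind: str   # domain class code, e.g. "RBD" or "STR"
    start: int  # first residue (assumed 1-based, inclusive)
    end: int    # last residue (assumed inclusive)

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def parse_architecture(text: str) -> List[Segment]:
    """Parse 'RBD 31-196 | STR 282-572 | ...' into Segment tuples."""
    segments = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:  # tolerate a trailing pipe
            continue
        kind, span = chunk.split()
        # Accept either an ASCII hyphen or an en dash between coordinates.
        start, end = span.replace("\u2013", "-").split("-")
        segments.append(Segment(kind, int(start), int(end)))
    return segments


ARCHITECTURE = "RBD 31-196 | STR 282-572 | STR 574-988 | STR 1043-1161 | STR 1938-2037 |"
PROTEIN_LENGTH = 2143

segments = parse_architecture(ARCHITECTURE)
covered = sum(seg.length for seg in segments)
for seg in segments:
    print(f"{seg.kind}\t{seg.start}-{seg.end}\t{seg.length} aa")
print(f"annotated: {covered} of {PROTEIN_LENGTH} residues")
```

Summing the lengths directly is valid here only because the listed segments do not overlap; overlapping intervals would need to be merged first.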
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Bacillus phage 0305phi8-36 [NCBI] | 458639 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Bacillus thuringiensis [NCBI] | 1428 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales |
Coding sequence (CDS)
GenBank protein accession
YP_001429654.1 [NCBI]
GenBank nucleotide accession
NC_009760.1 [NCBI]
CDS location
range 81728 -> 88159
strand +
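The CDS coordinates can be checked against the reported protein length. The sketch below assumes the range is 1-based and inclusive, as in GenBank feature locations, and that the CDS ends with a stop codon that is not counted as a residue. The helper names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by a CDS range, optionally excluding a stop codon."""
    nt = cds_length(start, end)
    if nt % 3 != 0:
        raise ValueError(f"CDS length {nt} is not a multiple of 3")
    codons = nt // 3
    return codons - 1 if has_stop else codons


# Coordinates from this record: range 81728 -> 88159 on the + strand.
nt = cds_length(81728, 88159)
aa = expected_protein_length(81728, 88159)
print(f"{nt} nt -> {nt // 3} codons -> {aa} residues")
```

With these coordinates the range spans 6432 nt, or 2144 codons, which leaves 2143 residues once the stop codon is excluded. That agrees with the protein length given in the record.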
CDS
ATGGCTACTCGTAATTTTAATATGACGCTACAACTTGATGCGAAATACATCGGTGAGCGTTGGTGGGCGGGTGAAGAGATTCACCAGTTATGGAATGTAGAAAACATGAATACACTGCAACGTTCTGAATGGGGATTAGATAGAAACGGTTGGATACAATATCGTAGCCAACTGGATTCCTATTCGATTCGTTATAGTGCATGGGATACCATCTTAAACGATTACTACTTCGAAACAAAGATGCAACCTGCTGGTCAAGGGAACTATGAGATTGGGTTAGCATTCCGTATGAAAGACCGTTACTCATTCTACTACTTAACATATAACGGTGGATATGCAGACTGGGGCGGTAAGAACATTCGCCTGATGAAAGTAATCGGTACGCAACATGTAAAGCTGGCTGAGTATGAATGTCCTGTATTCGATACAAAGACCGTTTACAAAGTTCGTGTTGATTTAAAAGGTAACAACATTAAGGTTACATGGGATGACCAAGTTATCTTTAACTTCAACGACCCAAACCCAATTCCAAACGGTGCATTCGGGCCACTTGTAATGGGACAAGAGTTTGCGAAGTGGGAAGGCTTCCAAGCGAAGAACGTAACGAAGTTCATCTTGCAAGAACGCTACAATGATTTAGGAGTAGAGTCCAACTACAATTCACCTGAAACTGGTAAGCTAGTAGACTCGAAAACTGTTGAAGACTTAATGCGTAAGCAACTTGATACATACTTAAACGGACGAGTATATGAACGTATCATCTATGATGCATTCCGTCTTGTATCGTCTGAACCAAGAGTCAAACTAGTCTTTGACCGTGTACCAAATAAAAACGTTACATCTGACCCGACATCTCGAATCTATGCTTATCAAGATGCGCCGTCAGTACCGCCTGTCGCTCCTATTAACTTACTGGGTAAAGCATTAAGTACATCTAGTATTCAGCTTGACTGGACGCATGTCGATGACAGCGAAGATGGATTCTATATACTAGATGAGAACGGTGAGACAAAAGGAATTGTCGGGGTAGACACATTCCAATTTATCGAGGAAGGCTTGGAGGAGAATACCAGTTACAAACGTAAAGTGGTTGCCTTCAACGTAGCGGGACGAAGTAAAGAATCAAATACAGTTATCGTGAAGACATTGCAGTCCGTACCAATTGCGCCATCTGACTTCGAGGGGAAACCAACAGGTGATGGTGAAATTACATGGACATGGAAAGACCATTCATTCAACGAAGCATCATTTGAAATTATTGAGTGGGATCAGTACGGAAACATTCAAGTCATTGGTACAGTTGGTGAGAACGTAACAGTATTTGTTGAGAAGGGATTAATTCCGTTACAAGAATATACTCGTGCTGTAAGGGCAAAGAACCCTGCGGGCGTGAGTCCTCCTTCTAATAAAGCAACAGTAAAGACGAAAAAGGACGTACCACCAAAACCAAACAAAGCACCAATCAACTTCTACGGCGTAGGGATTTCAGACGATACAATCGTTTGGTCATGGGAGGATGAAAATACAGTACCGATTGATGGCTACGAGTTATTGGATCCGGACGATAATGTTATCGCAACAATTAAAGGCAAGATTACGAACTTCTACGAAAGCGGACTATATGCAAAGATGAAGTACCGCAGAAAGATTCGTGCTTTCAATGAAGGCGGAGTGGGGCCAGCATCTTTAATTGCAGAGGCGGAAACATTAGACTACGGTAGTAACTTTAAAGATAAGCCAGTTGCTCCGTTTAATCTATACATCATGGAAGTCGGTACAGACCGATGTAAGTTACGTTGGGAATACAATGACCATCCATTAATGCCGGCAGTCGGATTCAACTTATACAATGAGTTAGACCAAGTAGTAGATACTGTACCACTTGAAGTACGTGAACGTAATATCCTGCACTTAAATCCTGACTCGGTTTATTACTTCTACGTAACGGCGTACAACAAGAATGGCGACAGTTTACCGTCGAATAAAGTACGTGTTAAAAC
ACTCAAGGTATATGTTGACGAAAATGAGAATAACGAAGATGACAAGACTCCTTCGAAACCTCTCGATGATCCGCTAGGAGATTTAACATATGACCAAGAGAAACCGGGCATGGAAAAGATTCGTGCATTCCAATCCGGTATTGGTGACCGCTTAGATTTAGCCGTACGTAATGTAATGCATACAGGCGGAGTCAATATGGAAGAGTTCGATTGTGCGATGTACATCAAAGGGTTGTATCAAAAAGAAGAACCGAAGTTTGTTGACGTGCAATTTAAGTTCCGTGTTACATGTAACGGTATTGATGTTCGTAACCGCACCGCTTACTCAAAAGTATCCGAATGGATGGGCGCAACAGTTCGCGGCGGTGAAGATGGTATTACATTCAATACACCAACAAAGATGGAAACGCCGGACTATGTAACAGAGAAAATATACACAATCGAAGTGCAGGACATGAAAGGTAATACAATTCCGATGGACGGCACAGAGAAAGGCGATGTAAAAGTACACTGGACTCTTGACCCGTATGAAGCGAAAGACAAGATTGGATGGGTACAAGAAATCTATATGCGTAACGTATTTAACAACTGGAAGAAGTTCTCGCATAAAGGTGTATTTCAACCAGCGAATAGTTTAGAGATGAATGCATGGCGTTACAAAGAAGATACGGATGAGTTAATCTGTACGCAGAATACAGATACATATGTTGGTGCAGTAAGTCCTGATGTATATGATGACTATTACATCAAAGCGCAGTTCCGTTCAAACAACTGGGACGATGATGCAATGGCATTCGTCGTTGCTTATACGCTAGATGAGTACAGCCGTGAACATACCATTTCAGCAGTACGTACACATTCATCTGCTATCACATGGTCGTTAGTTTATAACTACTCATCAACTGGTCAGAAGACATTAGCTTCTAACATCGAGGTTGATAAGGAACTTGTAAGTAATGACAATACAAAAGGCGGTTGGGCTGGCTTCTATCCAAACGGTACAATCATTGAGGTTGTTCGTGAAGGTACGAAGATTACAGCAAGAACATCGGCTGCGGGTAAAACAGAATTAGGCTACGAGTTAAAGATAGACCTAGACAGCTTACCTGAACTTGCGAAATTCAAGAAGCCTTGTCATATCGGCGTAGCATCTTTATCCCAGCAGGATTCAGCACTCAAGATTCAAGAGTTCCGTGGGAATAAGACAGAAGTGTACACGATATATAACTTACGTGCATGGACGAACTACAAAGATAATGTAGGGAATCAAACAGAATGGATTGGATCTATTTCGAAAACGAAGCGCATGAAAGTAAAAGCATTTGAGCAATTACGATTCACTGGACGCATTCGTTCACCGAAGTATGAGATTCCGTGGCAGAAGTTAAATGAGAAATACTTCTTCGACCCAACGGGATATAAGGTTGTTGTGAACTGTAGCAATCCAAACGTGGATGTAACGCTAGAGTTTAACATAACAGACTTCTTCCCGAAAGAATCAACGTACATTGACATTCCGATGATAGCGAAGATACTGAACCACACGCAAACTCCGTGGAATCCAAGTATCCATCATGGCTATTACTATCTGAATCATAAGGAACATTTCTTATTCTCGAATCCGGAAGTGTTACCGAAAGAGACATCCAATAGTGGCGTCTATGTCTACAACTTCCCGTACAGCATTCGTCTTGTCGGACAGCGTGAGTATGCAGGAAAAGACATGGTGTACAGTGATGATACAATGGCACGATTCCAATTAGGGATAATGAAGGACACGAAGTATGATGTCGATAAGAAAGTACTAACGTTAGATACAGAACCGACTGGTACATTCATCTCTCGCATCTTTGACTTCCAATATGACCTTGACATGTTCCGTGCAGTTGACGTCAAGCTCAAGAAAGCATTAGCAGGTTCTAGCTATACACTTGAGGTTGGTATCCCTGATGAGAACGGTGATGTAGCGACATGGGTAAAACAGAACGGTAATTTACCGG
TAGAGTTTCGTGACAAGTTCAGCCGAGTTCGTTACAAGCTAACGCTAAACGAAGGACAAAGTAAAGAGCCTTATGCTGCTACCTTCCCGTTAGATGCCTTAGCACTTGAGAAGGGTAAGAAGAAATACATCACTGTTGACGGTAAAACAATTTACATCGACGATCCAACATTCCGTGACGTTGGTACATTCTTAACAAGTCCAATGGAATACGGAAAGCAAATCGAAGAGATGGGTATTGTGGAACTGGATATGGAAATCCCTGACGGCGGACGAGTAGAGTTATACTCTGTCACGGCTGATGAAGAAACGGCTGACTTTGAAAATCCATCAAGAGATATGCCGTGGATTCCGCTAGAACTGGTATCTCAAAAAGGCAAACGATTCACATACCGTGTGAAGTCGGCAATGAAGAGATACGTGGTACTTGTGGCGAAGCTATTCCGTGGACTGACTTATACCTCTCAAAACACTGTCACACTAAGTGCAGATACATACGTACCGCAGTGGGCTGATAATGTAACATGGGGCGTTAATGGTATGGAGTTAGTTGACGCAAGTAAACCGGGTACATATCAATCAAAGGCAACGAACTTAGGCTTCATCAAGTTCTACGATAAAGTTCAGCTAGAAACAACGATGAACAATGCGAACCATAAGATTGATATTTATACGGTAACTGGTTGGACGGCAGAAGAGGTAGAAGAAGCTGCTAAAACAGAATCCAACTGGAAGAAGGTTGTAGATGGACAGATACAAAGTGACATTCGTTACTGCATTATGTATCGTGTTGTTTTAACATCGGGTACAGGCACAACATCACCACAGCTACAGAAGATGACAATCACGCCACTGGTTGATACATACGTTTCACCAAAGCTCGATAACATCGTAGCAAGTGCAACACTATTCAACTGGGTGAGAGATGTGCCGGAGATTACAAGCGTGAATGTATCGGGCTACATTGAAAAGGGTATCGTTGCAGAAGAATACTTCATGCCAATGACAGGAGAGTTAATTGCCAACGGTAGTGAACAAACAATCACGGCATTACCTACAGAACGTCTAGCATGGGAATGGGTACGTAGAGAAGGGATTCCAAACCCTGATGAACTCGTACTAAAAGACTTCTTCGCAGAGATTGACCCAGCTTATCCAGTAGACCTATACACAGACATTGCTGGTACAGGATACGTAACAGGTAAGACGACGGCATCTGTTGGTGAGCTTGTGAAGAAGAAAGAGAAGTTATACTTCGACGAGAAGACACAAGCTATTGAAGTGAAACCAATCCCGCAAGCAGGTACACCAATCTTAATCACAAATGCACAAGGCACAAGACTTCGCCAAGTTCATTTCCGTGATAGTACGACTGGTAAACCAACGCTGACGAATATGGAAAACTTAAAAGCAGATGAGACACGTTACCTATTCTTAGAACATACAGGCATTGATCCACGCTCTGCTCATGTATGGGTAATGATTGACGGGAAGTGGACAGAGATTCTAAACATCAAGGTTGTAGAGAATCGTATTGTGCTGCCTCACTTACTCATTCCGGGGTCGGATGTACGTGTAACATATAAACTGCTTGAGTCCTTTACCGTGGATTATAACTATAGTCCGAAGACGGATGTAGCGCAGATCAAACTGCATACTACATTTGATGTGAATGAAAAGGAAAGTCGCTTACTGGATATTCAGTACGAAACGAATAAAGAGTCGGCGTACTATCAAGCGACAGAGGTAAATTTAAATCCGTTACATACAAAGATTAGCTCCGGCTTCTTATACTTAACGGACGAGATTTACCCGCCTCATAAGTTAGACATCAAAGCAAATCCGACGACGCTATATCACGGTAAGAAGGATAGAGTAACCGTACATGGATATGTGAAAGATGAACACGGTAATCCAGTCGTTGGCGAACGAGTAGACTTCGATGTGAAGTACGGCTCACTAGAGATTATGAATCCTTTATCAGATGTGAACGGAATGGTT
ATGGCAATCTACACTGCGCCAAACAATAACTCATTAACATCTGACACAATTGAGATGCATGTGGTCAGTCGTGACACAACATATCATGTGACAAACAAAGTGACGATTAATCTTGTCAAAGAAACGTTCGAGGAGAAATTAGCCATTAAGCTAGAAAAACATATGGCAACTCCGGGAGATGTTGTACAATTGAAAGTGATAGCGATGGGTATCAACTACGAACGCCTGATGAATAAAGCTATTGAAGTAAGCTGTGACATTGGCTCTGTCAGTCCAGCGAAAGGTACAACCAACGCCGATGGCGAATTGTTCGTCAAATATACAGTTCCAAATACACAAGAGAAGTTCGCGACCATCACCGTGAAAGCAAAGCGTGAAGATGGACTGCAACTTATAGAACAAAACATACTAGGTATAAGTGGGGTGTCATAA
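To confirm that the CDS and the protein sequence agree, the nucleotides can be translated codon by codon. The following is a minimal sketch that assumes the standard codon assignments and uses only the first 30 and last 12 nucleotides of the CDS above; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


cds_head = "ATGGCTACTCGTAATTTTAATATGACGCTA"  # first 30 nt of the CDS
cds_tail = "GGGGTGTCATAA"                    # last 12 nt, ending in TAA
print(translate(cds_head))
print(translate(cds_tail))
```

The two fragments translate to the first ten residues (MATRNFNMTL) and the last three residues (GVS) of the protein sequence in this record. The same function can be applied to the full CDS once its lines are joined.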
Genome Context
Tertiary structure
PDB ID
96130ae124be26394caf99be6eeae938b93e5348adedfb6bd07cf683fa8363de
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
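The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The record does not say which band the exact boundary values 90, 70 and 50 belong to, so placing them in the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence label for a pLDDT score in the range 0 to 100."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT out of range: {score}")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 falls in "High"
        return "High"
    if score > 50:  # assumption: exactly 70 falls in "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```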