Protein
- UniProt accession: H0USX5_9CAUD [UniProt]
- Protein name: Phage minor structural protein
- RBP type: TF (evidence: RBPdetect2; probability: 0.9513)
- Protein sequence:
MRTPSGILHVVDFKTDQIVAAIQPQDYWDDKRQWEIKNNVDILDFTVFDGTPHSATLQQQNLVLKEVRDGRIVPYVIRETEKNSDNRSITTYASGAWVQIAKSGIIKPQRIEGETVNKYIDMALVGMKWKRGKTDYAGFHTMTIDEFIDPLTFLKKIASLFKLEIQYRVEVQGSQIIGWYVDMIQRRGRDTGKEIELGKDLIGVTRIEHSRDICTALVGFVKGEGDNVITIESINRGLPYIVDNDAFQRWNERGKHKFGFYTPETEELNMTPERLMTLMEIELKKRVNSSVSYEVEAQSIGRIFGLAHELINEGDTIRIKDTGFTPKLYLEARVIAGDESFTDPTQDKYVFGDYREITDPNEELRKIYNRILSKFGEKQEMLDQLDKLVKEANETASNAKKESEAAKTLAEKVQENIKNNTVEIIEAKNSPTTGLKPNKTLWRDISNGKPGILKIWTGTVWESVVPDVESVKKETLEQANKNIESTKAELNKKVQEAQNQATGQFNEVQEGLQGVSRTISNIENKQGEIDKKVTKFEQDSNGFKTSIESLTKKDNDISNKLNTVEQTVEGTNKNISDVQQTTSELKKTTTDIKEEAGKISEKLTSVEKKVNSDKAGGRNLLLKSNVKYEKTDYLINQYSLTENFFASEEYTFVMKGSVPQGQKFGIWQNGGSSNVGYATSVYANGITYVTFKAVAATSGNERKLSLYNYPSSNTKSIVEWVALYKGNKPQDWTAPPEEQVTTDEFTQKTTEITKSVDGIKETITKVENNQNGFDKRVATVEKDATTIKQNVSFIQNTQTEQGRQLQEAKAGWENTAKALQGKVELKQVEDYVAGFKIPELKQTVDKNKQDLLGELANKLATEQFNQKMTMIDNRFTINEEGINAAAKKKEVYTIEQANGQFATSSYVRDMETRLQLTEKGVSLSVKENDVIAAFNMSKENITLNANRINLVGFITANHIKGKVLEGVTLKTSGNRFVEINKQDMKIFDLDKPRGYIGFMETNDGSIQPSLVLGSDNRKYAGTGSFYIYQVMPRINGVDQPSKAYAKFGVSKGENAEGTNIWSNYIQMQNDGGHLSVYSDGQFRFQNLNDIIFESEGWAPGYGYFSVTTTEPHIFTNNAGQFTFKRKGSDYKIHFVNGATDHDLIMGNAMIRSSFVQGYNNGLQIKDMMGKGWKDIELRTLRAQENISATGRMWAQEFIPNSSRKLKTDIEDLPFSALDKINSVNIKQYHFIRDVERFESGESITLPINYGMIAEDSDDVFTTPQKDAVTLYSSVAISIQSIQEVDFKVKNLQFDHGMLKQEVDTLKEQLEAEKLEKVSMKAEIYELKLLVQQLINEEPKQP
- Physico-chemical properties:
  - protein length: 1341 AA
  - molecular weight: 151592.85600 Da
  - isoelectric point: 5.63078
  - aromaticity: 0.08427
  - hydropathy: -0.61477
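As a rough illustration of how two of these descriptors are commonly derived from a sequence, the sketch below computes aromaticity as the fraction of Phe, Trp and Tyr residues and hydropathy as the mean Kyte-Doolittle value (GRAVY). Whether this record was produced with exactly these definitions is an assumption; the function names `aromaticity` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only as a short demo input.
demo = "MRTPSGILHVVDFKTDQIVA"
print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Running the same functions on the full 1341-residue sequence would give values to compare against the listed aromaticity and hydropathy, under the assumption stated above.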
Domains
Domains [InterPro]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Bacillus phage phIS3501 [NCBI] | 1124578 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Bacillus thuringiensis serovar israelensis ATCC 35646 [NCBI] | 339854 | Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus |
Coding sequence (CDS)
GenBank protein accession: AEV89308.1 [NCBI]
GenBank nucleotide accession: JQ062992 [NCBI]
CDS location: range 32699 -> 36724, strand +
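The CDS range can be cross-checked against the protein length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the CDS is one uninterrupted interval ending in a single stop codon; `cds_lengths` is a hypothetical helper name.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (nucleotides, codons, residues) for a 1-based inclusive range.

    Assumes a single uninterrupted interval that ends with one stop
    codon, which is not counted as a residue.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nucleotides // 3
    return nucleotides, codons, codons - 1


print(cds_lengths(32699, 36724))  # (4026, 1342, 1341)
```

The 1341 residues obtained this way agree with the protein length of 1341 AA given for this entry.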
CDS
ATGAGAACACCAAGCGGCATTTTGCATGTTGTGGATTTTAAAACAGATCAAATCGTTGCAGCTATTCAGCCGCAGGACTATTGGGATGATAAAAGGCAGTGGGAAATCAAAAACAATGTTGATATCTTGGATTTTACTGTTTTTGATGGAACACCTCATTCGGCTACTTTACAACAACAAAATCTTGTTTTAAAAGAAGTTCGTGACGGAAGAATTGTACCATATGTTATTAGAGAAACAGAAAAGAATTCAGACAACCGATCCATTACCACATATGCTTCAGGAGCTTGGGTTCAAATTGCTAAGTCAGGTATTATAAAACCGCAAAGAATAGAAGGGGAAACGGTAAACAAATATATTGATATGGCCCTAGTAGGCATGAAATGGAAACGTGGGAAAACGGATTATGCAGGATTCCATACGATGACCATTGATGAATTTATTGATCCACTAACATTTTTAAAGAAAATAGCTTCTTTATTCAAATTAGAAATTCAGTACCGTGTTGAGGTTCAAGGATCACAAATCATTGGATGGTATGTTGATATGATTCAAAGACGTGGTCGAGACACAGGCAAAGAAATAGAGCTCGGGAAAGATTTGATAGGCGTTACACGTATTGAACATTCAAGAGATATTTGTACAGCACTAGTTGGATTTGTGAAAGGTGAAGGCGACAATGTAATTACCATCGAAAGTATCAACAGGGGACTTCCGTATATTGTTGATAATGATGCATTTCAACGATGGAACGAACGTGGTAAGCATAAGTTTGGTTTTTATACGCCAGAAACAGAAGAGTTAAATATGACGCCAGAACGTTTAATGACGTTAATGGAAATAGAATTAAAAAAACGTGTTAATTCTTCCGTTTCGTATGAAGTAGAAGCACAATCGATTGGACGTATTTTCGGACTAGCACATGAACTAATTAACGAGGGTGATACTATCCGAATCAAAGATACAGGGTTCACACCTAAGTTATACCTTGAAGCGCGTGTAATTGCCGGTGATGAATCTTTTACGGATCCTACACAAGATAAATATGTGTTTGGTGATTATCGTGAAATTACGGATCCGAACGAGGAATTACGAAAGATTTACAATCGAATTCTTAGTAAATTCGGTGAAAAACAAGAAATGCTGGATCAGCTAGATAAATTGGTGAAAGAAGCGAATGAAACAGCAAGTAACGCTAAGAAAGAATCAGAAGCCGCGAAAACACTTGCCGAAAAGGTACAAGAGAATATTAAAAATAATACTGTTGAAATTATAGAAGCTAAGAATTCGCCAACAACAGGGCTTAAACCTAATAAAACGCTTTGGCGTGATATTAGTAACGGAAAGCCCGGCATTTTAAAAATATGGACAGGTACAGTTTGGGAATCGGTTGTACCAGATGTTGAATCAGTTAAGAAAGAAACACTTGAACAGGCTAATAAAAATATCGAGTCCACAAAAGCAGAATTAAACAAAAAGGTACAAGAAGCACAGAATCAAGCTACAGGACAATTCAATGAAGTACAGGAAGGTTTACAAGGTGTCAGTCGTACAATTTCTAATATCGAAAATAAACAAGGTGAAATCGATAAAAAAGTAACTAAGTTTGAACAGGATTCTAATGGATTTAAAACTTCTATTGAATCGTTAACAAAAAAAGATAATGATATCAGCAATAAATTAAATACAGTCGAACAAACTGTAGAAGGTACAAATAAGAATATTTCTGATGTGCAGCAAACAACAAGTGAGCTCAAGAAAACAACAACTGACATTAAAGAAGAAGCTGGGAAAATCAGTGAGAAGTTAACGAGTGTAGAAAAAAAGGTTAATAGTGATAAAGCTGGTGGACGTAACCTTTTATTAAAATCAAATGTTAAATATGAAAAAACAGACTATCTAATCAATCAATATTCTCTAACTGAAAATTTCTTTGCGAGTGAGGAATATACCTTTGTAATGAAAGGGAGTGTCCCGCAAGGTCAGAAATTTGGAATTTG
GCAGAATGGTGGGTCTAGCAATGTTGGATATGCAACAAGTGTTTACGCGAATGGAATAACGTATGTAACCTTCAAAGCTGTTGCGGCTACAAGTGGAAATGAACGAAAGTTAAGCTTATATAATTATCCGAGTAGTAATACGAAATCTATTGTGGAATGGGTTGCCTTGTATAAAGGGAATAAACCGCAGGATTGGACAGCACCGCCTGAAGAGCAGGTAACAACGGATGAGTTCACCCAGAAAACAACTGAAATTACAAAAAGTGTGGATGGAATTAAAGAAACAATTACAAAAGTGGAAAATAATCAAAATGGGTTTGATAAACGTGTTGCTACTGTAGAAAAAGATGCAACTACTATTAAACAAAATGTCTCTTTCATACAAAATACGCAGACAGAACAAGGAAGACAATTACAAGAGGCGAAAGCTGGATGGGAAAATACTGCGAAAGCACTTCAAGGTAAAGTTGAGCTTAAACAAGTAGAGGATTATGTTGCGGGATTTAAGATTCCAGAGTTGAAGCAAACAGTTGATAAAAATAAACAAGATTTGTTGGGCGAATTAGCTAACAAACTTGCAACGGAACAATTTAATCAGAAAATGACTATGATCGACAACCGTTTCACTATAAATGAAGAGGGTATCAATGCCGCAGCAAAAAAGAAAGAAGTATACACAATAGAGCAAGCAAATGGGCAATTTGCCACATCATCTTATGTAAGAGATATGGAAACCCGTCTTCAGTTAACTGAAAAGGGCGTTAGTCTATCTGTAAAAGAAAATGATGTAATCGCAGCATTCAATATGAGTAAAGAAAACATTACCTTGAATGCGAACAGGATTAACTTAGTAGGTTTTATAACAGCAAATCATATTAAAGGAAAAGTTTTAGAAGGAGTAACACTTAAAACGAGTGGAAACAGATTTGTTGAAATAAATAAACAAGACATGAAGATTTTCGATTTAGATAAGCCACGTGGTTATATAGGATTTATGGAGACAAATGATGGAAGTATTCAACCTTCATTAGTCCTTGGTTCTGATAATAGAAAATACGCTGGTACAGGATCATTTTATATTTATCAAGTCATGCCGCGAATTAATGGAGTCGATCAACCTTCTAAAGCGTATGCAAAATTTGGGGTTTCTAAAGGAGAAAATGCAGAAGGAACTAATATTTGGTCAAATTATATTCAAATGCAGAATGACGGTGGACATCTGAGTGTATATTCAGATGGACAATTTCGTTTTCAAAACTTGAATGATATTATCTTTGAATCTGAAGGATGGGCTCCAGGATATGGTTACTTCTCTGTAACTACAACTGAACCACATATTTTTACAAATAACGCGGGGCAGTTTACTTTCAAAAGAAAAGGCAGTGACTATAAAATACATTTCGTAAACGGCGCCACCGATCATGATTTAATTATGGGTAATGCAATGATAAGGTCAAGTTTTGTACAAGGTTATAACAATGGCTTGCAGATTAAAGATATGATGGGTAAGGGATGGAAAGATATAGAATTAAGAACATTACGAGCGCAAGAGAATATTAGCGCTACAGGGCGTATGTGGGCGCAAGAATTTATCCCTAATTCTTCTCGTAAGCTTAAAACGGACATAGAAGACCTTCCATTCTCTGCTTTAGATAAAATCAACTCTGTAAACATCAAACAGTATCACTTTATAAGAGATGTTGAACGCTTCGAGTCAGGGGAGTCTATTACACTTCCAATTAATTACGGTATGATTGCGGAGGACTCTGACGATGTATTCACCACACCACAGAAAGACGCTGTAACACTTTATAGCTCGGTTGCAATTTCTATTCAATCAATACAAGAAGTTGACTTTAAAGTTAAAAATCTTCAATTTGACCACGGTATGTTGAAGCAGGAAGTTGACACTCTTAAAGAACAACTTGAAGCAGAAAAACTTGAGAAAGTTTCAATGAAAGCTGAAATTTATGAATTAAAGTTATTAGTACAACAATTAA
TAAATGAGGAACCAAAGCAGCCATAA
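To relate the nucleotide sequence to the protein sequence, the CDS can be translated codon by codon. The following is a minimal sketch that assumes the standard codon-to-amino-acid assignments (which the bacterial and phage genetic code, NCBI table 11, shares with the standard code) and strips any line breaks from the wrapped sequence before translating; `translate` is an illustrative helper, not part of the database.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order
# at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 18 and last 15 nucleotides of the CDS above.
print(translate("ATGAGAACACCAAGCGGC"))  # MRTPSG
print(translate("CCAAAGCAGCCATAA"))     # PKQP*
```

Both fragments match the start (MRTPSG) and end (PKQP) of the listed protein sequence, with the final TAA read as the stop codon.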
Gene Ontology
GO ID | Description | Category | Evidence (source) |
---|---|---|---|
GO:0098015 | virus tail | Cellular Component | IEA:UniProtKB-KW (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.
Literature
No literature entries available.