Genbank accession
AFQ96521.1 [GenBank]
Protein name
putative minor structural protein
RBP type
TSP
Evidence DepoScope
Probability 1.00
Protein sequence
MASGGFDLSTSNTYVKGRVNWSSTANTNENYSNVYVEMRFSRTNSGYTTYGSGTFGIYVDGQAVENTTSFSITQNSNTLVVSGTVRVNHNADGTKNFRIGASGFTNVFSINEGSTTAYMDNIPRASTISSNVSWTAGVNSLPVTINRASSAFVHYVELQVKNPINGSWAIVASRSNVGDSVTFDFSKDEITKMYQQIVAYEETQAWMKVETWNGGTFVGRNEKYGTVYAAPTGTASYNQWGSFDIGQSITGWVNNYVNGFTYNLTMNFGSFSRTWNNVPKDYTLSFSAAEIQTLYGQTSTANSKTGTITCRTLYNGVYAEDGAPVSNVTTFTLNVKSSDPTYAGGFTYLDTNGTTTTLTGNNQYIVQGKSKLQIKLPAANKATANNGATMSRYDVTVNGVTQSVNYATTDLTVNFNEVNASSNATATVTAVDSRGNKTSASSVILMLPYSPPVISASADRLNNFEDSTTIKLSGSISPLTISSANKNSLTVVKFQRRQVGGTYDSPGTNFTITGNPNFTATDVKVNLANTLAWEILITATDKVGSTVTLTRTVPVGTPIFFIDTVKKTVGVNKFPTSAANGLEIAGDLDVDGIIKAKADQWMGNSKVGLDMRNSDMKGLNALYFNDASDSGDEGINFLRSGKGVGSTNIADYDNFCVYDGAFRLNNRNLFWQEGGQASNNIRLAGDMYSQTTGGVIFDVWGNIKGQPTAVDYNTWSVQDTAGRTRFICGIGRGSTAATEVKSYTGGIKLVHDDIQVAAFWQQGQGASHIFQLGAGILQYYNNWFEFKGPGNSGWGNVYGNWVAPSSAAYKKDIQVFDDSALSYVNSVKPVLYQYLEQDDSEPYTLGLIAEESPVIIQGANGKGVNAYSMITLLWKAMQEIDGKVENINRRITLR
Physico-chemical properties
protein length: 894 AA
molecular weight: 96648.80340 Da
isoelectric point: 5.69290
aromaticity: 0.10850
hydropathy: -0.28803
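The record does not state how the aromaticity and hydropathy values were derived. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), which is how such descriptors are commonly computed. The function names and the 20-residue demo input are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (published per-residue values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KD[res] for res in seq) / len(seq)


# Demo input: the first 20 residues of the protein sequence in this record.
# Assumption: the full 894-residue sequence would be passed in the same way.
prefix = "MASGGFDLSTSNTYVKGRVN"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Running both functions on the full sequence should land close to the listed values if the record used the same definitions; a mismatch would suggest a different scale or method.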

Domains

Domains [InterPro]

Taxonomy

Phage
Name: Bacillus phage vB_BceM_Bc431v3 [NCBI]
Taxonomy ID: 1195072
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Bacillus cereus [NCBI]
Taxonomy ID: 1396
Lineage: Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus

Coding sequence (CDS)

Genbank protein accession
AFQ96521.1 [NCBI]
Genbank nucleotide accession
JX094431 [NCBI]
CDS location
range 120388 -> 123072
strand -
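The coordinates can be cross-checked against the protein length. A small sketch, assuming the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes the stop codon. The helper names are hypothetical, and `reverse_complement` only illustrates how a minus-strand feature would be read off the genome's forward strand.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, optionally discounting the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - (1 if has_stop else 0)


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase DNA string (for '-' strand features)."""
    return dna.translate(_COMPLEMENT)[::-1]


print(cds_length(120388, 123072))      # 2685
print(protein_length(120388, 123072))  # 894
```

The result, 894 amino acids, agrees with the protein length listed in this record.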
CDS
ATGGCAAGTGGCGGTTTTGACTTATCGACTAGTAATACCTACGTCAAAGGTAGGGTTAACTGGTCTAGTACGGCAAATACAAATGAAAACTATAGTAATGTGTACGTGGAGATGCGATTCTCCCGTACCAACTCAGGGTATACAACATACGGTAGTGGTACATTCGGTATTTACGTAGACGGACAAGCAGTAGAGAATACTACAAGCTTCTCAATTACACAGAACTCTAACACACTAGTTGTAAGTGGTACAGTTCGAGTTAACCATAATGCTGACGGTACGAAGAACTTTCGTATCGGGGCTAGTGGTTTCACAAACGTATTCAGTATCAACGAAGGTAGTACAACTGCTTATATGGATAACATCCCACGAGCAAGTACTATCTCGTCTAACGTAAGTTGGACCGCAGGAGTTAACAGTTTACCTGTAACGATTAACAGGGCGTCTAGTGCGTTCGTCCATTACGTAGAGTTACAAGTTAAGAACCCTATTAACGGATCGTGGGCTATTGTAGCTTCACGTTCTAACGTAGGTGACTCTGTAACATTCGATTTCAGTAAAGACGAAATAACAAAGATGTACCAACAAATCGTAGCTTACGAGGAAACCCAAGCATGGATGAAAGTTGAGACATGGAACGGTGGAACATTCGTAGGTCGTAATGAGAAGTATGGGACTGTATATGCCGCACCTACGGGAACTGCATCCTACAACCAATGGGGTTCATTTGATATCGGTCAGAGCATTACAGGTTGGGTTAATAACTACGTAAACGGATTCACTTACAACCTGACAATGAACTTCGGATCATTCTCACGTACTTGGAATAACGTACCTAAGGACTACACTTTATCGTTCTCTGCTGCCGAGATACAGACACTGTACGGGCAGACATCAACTGCGAATAGCAAGACAGGTACGATTACATGTCGAACACTCTATAACGGAGTGTACGCAGAGGACGGGGCACCAGTAAGTAACGTTACTACATTCACACTAAATGTAAAGAGTAGCGATCCGACTTACGCAGGTGGGTTTACATACTTAGATACAAACGGTACCACTACAACACTAACAGGTAATAACCAATATATCGTACAAGGTAAATCTAAGTTACAGATTAAGTTACCTGCGGCCAATAAAGCTACGGCTAATAATGGGGCTACAATGTCTCGTTACGATGTTACAGTCAACGGGGTTACGCAATCGGTTAACTACGCTACGACAGACCTTACGGTTAACTTTAACGAGGTAAACGCATCGTCAAATGCTACTGCAACAGTTACGGCAGTAGACAGTCGAGGTAACAAGACATCTGCATCTTCTGTTATATTAATGCTACCTTATTCACCACCTGTTATTTCAGCTAGTGCGGACCGCTTAAATAACTTCGAGGACTCAACTACGATTAAGTTAAGTGGTTCAATATCTCCATTGACTATCAGCAGTGCGAATAAGAACTCACTGACAGTTGTTAAATTCCAAAGAAGACAGGTAGGTGGTACATACGATAGTCCAGGAACTAACTTTACGATTACTGGGAATCCGAACTTCACTGCGACAGACGTTAAAGTAAACTTAGCTAATACTCTTGCTTGGGAGATTCTAATTACGGCTACAGACAAAGTAGGTTCTACTGTAACGTTAACACGTACGGTTCCAGTCGGTACTCCAATCTTCTTCATTGATACAGTGAAGAAAACAGTCGGGGTTAATAAGTTCCCTACTAGTGCGGCCAATGGTCTCGAAATAGCAGGTGACTTGGACGTTGATGGTATAATTAAAGCGAAAGCAGATCAGTGGATGGGTAATAGTAAAGTAGGGTTAGATATGCGTAACTCTGATATGAAAGGGCTAAACGCCCTATATTTCAATGACGCATCGGACTCAGGTGACGAAGGAATTAACTTCCTACGTTCAGGTAAGGGTGTAGGCTCTACAAACATCGCAGACTATGATAACTTCTGTGTCTATGACGGTGCATTTAGATTGAACAACAG
AAATCTGTTTTGGCAAGAGGGCGGCCAAGCTTCAAACAACATCCGTTTAGCTGGTGATATGTACTCACAAACTACAGGTGGAGTTATCTTCGATGTATGGGGTAACATCAAAGGTCAACCAACTGCGGTAGATTATAACACTTGGTCAGTACAAGATACAGCGGGTAGAACACGTTTCATCTGTGGTATCGGTAGAGGGTCTACAGCGGCTACAGAGGTTAAGTCGTATACTGGTGGTATTAAACTAGTACATGATGACATACAGGTGGCAGCCTTTTGGCAACAAGGTCAAGGTGCGTCCCACATCTTCCAATTAGGTGCAGGAATACTTCAGTATTACAACAACTGGTTCGAGTTCAAGGGACCTGGTAACTCAGGTTGGGGTAACGTGTATGGTAACTGGGTAGCACCTTCGTCAGCAGCGTATAAAAAGGACATTCAGGTATTTGACGATAGCGCATTATCCTACGTCAACTCTGTAAAACCAGTACTCTATCAATACCTGGAGCAAGACGATTCGGAACCTTATACACTAGGGTTAATTGCAGAGGAGTCTCCTGTAATCATTCAAGGAGCTAACGGTAAAGGTGTCAACGCCTACTCGATGATAACGCTACTTTGGAAAGCCATGCAAGAGATTGACGGAAAAGTAGAAAACATTAACAGAAGAATAACACTAAGATAA
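
The link between the CDS and the protein sequence can be checked by translation. A sketch, assuming the standard codon assignments (the bacterial table uses the same sense codons) and a sequence given in coding orientation. `translate` is an illustrative helper, not part of any tool named in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 27 nucleotides of the CDS in this record.
print(translate("ATGGCAAGTGGCGGTTTTGACTTATCG"))  # MASGGFDLS
```

The output matches the first nine residues of the protein sequence above. Any line breaks in the CDS would need to be removed before translating the full sequence.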

Tertiary structure

PDB ID
bc90c096ae798e1bacc15e4c0646ba3ebbd0eb63bd2b12a2d4f0635e7bc23b64
Source ESMFold
Method ESMFold
Resolution 0.7575
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
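
The confidence bands can be applied to per-residue pLDDT scores with a simple threshold function. A sketch with a hypothetical function name. The legend uses strict inequalities and leaves the exact values 90, 70 and 50 unassigned, so placing them in the lower band here is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```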