GenBank accession
WQN07538.1 [GenBank]
Protein name
minor head protein with inserted intein
RBP type
TSP (evidence: DepoScope, probability: 1.00)
TSP (evidence: RBPdetect2, probability: 0.94)
Protein sequence
MLLTLAPQGSSMSLLTSLISHQIWLQRTASGEVKDLAPFIKEMRDEIKRQVLLFGDDGRSTARLNKLLRDLEEALTALTGDWRTKLTEDLKELAAYEAEWNVKTLTANVNAEFVTPTAEQVWSAAEFQPLSLSDKPVDFTKLMSGWGETEVARLVTGVKMGFVQGQTTRQIVKNVVGAGGLADISERNAATVIRTALSHVSNEARNETYRQNDDIIEKYEWVSTLDSRTSTTCFTAETELAPIAGMDAVMRGLYSGKILTIELSNGEKFSGTPKHPVLTQYGWTPLDELDPTKHVLYTTVDKVTVLETVKNINMPARADYIFNTLADFPIRKMVSTSPAATDFYGDGVGLNGKIDIVRADCKLRNYVHASGIKQFKSEGFSLIHSAALLSDDSSGDFLLRGECPVSVPALSEAKCFDHRIENCFAYLGPSYALNRRDAGLEQFYSPFLIQNGVVRNPTTGSIMHESELLEKCCYRGGSDAVILGYDAGGSAISVQRTNIIRISVEFRTCHVYTLSSSQGYYTAGSAIVKNCRARDGMTWEIGKGPMPPAHFGCRSTTAPVISSEFDFLDKGAKRAAKGAEGGTQVSADTTYYEFLKQQPAWFQDEALGPVRGKIFRNSGISPEEFRVISVDGFGNPLTLKQMAELDKRVADYLKED
Physico-chemical properties
protein length: 656 AA
molecular weight: 72214.95790 Da
isoelectric point: 5.60054
aromaticity: 0.08537
hydropathy: -0.24405

Domains [InterPro]

Taxonomy

Phage
Name: Escherichia phage vB-Eco-KMB36 [NCBI]
Taxonomy ID: 3093646
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
WQN07538.1 [NCBI]
GenBank nucleotide accession
OR525703 [NCBI]
CDS location
range 2784 -> 4754
strand +
CDS
ATGCTATTAACTTTAGCGCCACAGGGTTCATCTATGAGCTTACTTACATCTCTAATCAGCCACCAGATATGGCTGCAACGCACTGCATCCGGTGAGGTGAAAGACCTCGCGCCGTTCATTAAGGAGATGCGGGACGAAATAAAACGGCAGGTGCTGTTATTCGGCGACGACGGGCGAAGCACCGCGCGACTGAATAAACTGTTACGCGACCTTGAAGAAGCACTTACCGCGCTTACCGGTGACTGGCGGACAAAGCTGACGGAAGACCTTAAGGAGCTTGCAGCGTATGAGGCTGAGTGGAATGTGAAGACACTCACGGCTAACGTTAATGCGGAATTTGTTACCCCGACCGCTGAACAGGTGTGGTCCGCCGCCGAGTTTCAGCCATTATCATTAAGCGACAAGCCAGTTGACTTCACTAAGCTGATGTCGGGCTGGGGGGAAACAGAAGTCGCGCGCCTTGTAACAGGCGTTAAGATGGGCTTTGTACAAGGCCAGACAACACGGCAGATTGTTAAGAATGTTGTTGGTGCTGGTGGTTTGGCGGACATTTCAGAACGTAACGCGGCTACTGTAATCCGCACCGCGCTGTCTCATGTATCCAACGAGGCCCGTAACGAGACGTACCGCCAGAACGACGACATCATCGAGAAGTACGAGTGGGTGTCGACGCTGGACAGCCGTACCAGTACAACGTGTTTCACGGCTGAAACAGAATTAGCGCCCATCGCGGGGATGGACGCGGTAATGCGCGGACTCTATTCAGGAAAAATCCTTACCATCGAACTTTCCAACGGCGAGAAGTTCAGTGGCACCCCGAAACACCCTGTACTCACGCAATACGGATGGACTCCGCTTGATGAACTCGACCCAACTAAGCATGTCTTGTATACCACTGTTGACAAAGTGACTGTGCTCGAAACAGTGAAGAACATAAACATGCCAGCCAGAGCGGATTATATTTTTAATACGCTCGCTGATTTCCCCATCAGGAAAATGGTAAGCACGAGTCCCGCGGCAACAGATTTCTATGGCGATGGAGTGGGACTCAATGGCAAAATCGACATTGTACGGGCCGATTGCAAACTGCGGAACTACGTCCACGCCAGCGGAATTAAACAATTCAAAAGCGAGGGTTTCAGTCTTATTCATAGCGCCGCTTTGCTGTCGGACGATAGCTCTGGCGATTTTCTCCTCCGGGGTGAATGTCCTGTGAGTGTGCCCGCGCTTAGCGAGGCCAAGTGTTTTGACCATAGAATAGAAAACTGTTTTGCTTACTTGGGTCCTTCTTATGCACTCAATAGGCGGGACGCCGGACTCGAACAATTCTATAGCCCTTTTTTGATTCAAAATGGTGTCGTTAGAAACCCAACCACGGGGTCGATCATGCATGAGTCCGAGCTTCTTGAGAAATGTTGTTACCGTGGTGGTAGTGACGCCGTAATTCTTGGCTATGATGCTGGCGGGAGTGCCATCTCGGTACAAAGAACAAATATCATCCGCATAAGCGTCGAGTTTAGAACGTGCCATGTTTATACCCTCTCTAGTAGTCAAGGATATTATACAGCAGGTAGTGCGATAGTCAAGAATTGCAGGGCCAGAGACGGAATGACGTGGGAAATCGGTAAAGGACCGATGCCCCCGGCCCATTTTGGGTGCAGAAGTACTACCGCACCAGTAATCAGTTCCGAGTTCGACTTCCTCGATAAAGGCGCGAAACGCGCAGCTAAGGGCGCAGAAGGTGGTACCCAGGTAAGCGCAGACACCACCTATTACGAGTTCCTTAAACAACAACCGGCATGGTTTCAGGATGAAGCATTAGGGCCGGTTCGCGGTAAGATTTTCCGTAACAGCGGTATATCGCCGGAAGAGTTTCGCGTAATATCTGTAGATGGTTTCGGGAATCCGCTGACGCTTAAGCAGATGGCGGAACTCGATAAACGTGTTGCTGATTATCTGAAAGAGGATTAA
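
The CDS coordinates and the reported protein length can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_protein_length` is hypothetical, and it assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends with a single stop codon, as the trailing TAA in the sequence above suggests.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Return the number of residues encoded by a CDS spanning start..end.

    Assumptions (not stated by the record itself): coordinates are
    1-based and inclusive, and the last codon is a stop codon.
    """
    n_nt = end - start + 1          # inclusive range length in nucleotides
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    n_codons = n_nt // 3
    return n_codons - 1             # drop the terminal stop codon


# Range taken from the record: 2784 -> 4754 on the + strand.
print(cds_protein_length(2784, 4754))  # 656
```

The range covers 1971 nucleotides, that is 657 codons, which leaves 656 residues once the stop codon is excluded. This agrees with the protein length listed under the physico-chemical properties.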

Tertiary structure

PDB ID
8f4970284ecddb4f4ee440b432827d8b49219d447b2a9bb9ca2f940a1f6a18a6
Source ESMFold
Method ESMFold
Resolution 0.6082
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
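
The confidence bands above can be applied to per-residue pLDDT values with a small helper. This is a minimal sketch, not part of the record: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly equal to 90, 70 or 50 fall into the
    lower band, since the legend does not say where they belong.
    """
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```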