GenBank accession
AEX56199.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence RBPdetect
Probability 0.90
TF
Evidence RBPdetect2
Probability 0.77
Protein sequence
MALVIHYTRNEDGTFDVKRYRDNPMNFVVNHVPDGVPVRVFIDEIGEDNDVTEDFEALKENATFHIVESAGGGAIKGVMKIFSVILKPLAKLLSPSVKGASSNLANSQADSPNNSLTDRNNKARPYERSYDICGTVQTIPNNLMSTYKVFNAAGKIVEYGYYDAGRGYLDIHPDGITDGDTRVSDITGTSVAVYAPYTSPNNTSTPQVMVGDPIEQGLYITVESNEVDGVVLKAPNGLGISFSYMSGYPSLSGNIGTIYDPTGGSDFSGVLVPNDTFSLVSAWTNTDVDLSGGGYQVVSVSEGTVTFIVPGGLIGRWQEIRPGSFFRGDGEASLQPDNTYEKTLTDWVSINRTEVERIVANIAAANGMYKDNGKSKTLASVTAEIQYQLLDENSVPYGPIYTAQGTVSGRTPDYNGVTIYADLPVVSRVRVRARRVTDLDFNFEGSVVDEITYVNLYGQTRDNTPHYGNRTTVHSMHKQTPRAAEVKQPQLRMIATEMVYKYLGNGVFEDTMTPNTQAVQSLIRLARDPDVGGLNLTVRNMDKLLAVQNEVEAYFGDKQAGEFCYTFDDYKTTMQDIVSTIADAIFCTPYRRGADILLDFERPRMGPEMVFTHRSKAGTSEKWTRTFNDSQVFDSLKFSYIDPKTNVKETITIPETGGLKTETYDSKGIRNYKQAFWAANRRHQKNILKKISVSFTATEEGIFALPNRAVSVVKGSRMSTYDGYVTAVNGLTVELSQPVKFTSGDDHYLVLKLRDGGVQSVRVVPGAHDRQVIMTSVPQEAIYTGNSALKTEFSFGNEARHNAQMILVSTVDPGDDRTVKITGFNYDKDFYKFDNVPPFGRAFSSGFDNGFN
Physico‐chemical properties
protein length: 852 AA
molecular weight: 93898.83950 Da
isoelectric point: 5.26195
aromaticity: 0.09859
hydropathy: -0.36385
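
The record does not say how the aromaticity and hydropathy values were computed. As a minimal sketch, assuming aromaticity is the fraction of Phe, Trp and Tyr residues and hydropathy is the mean Kyte-Doolittle value (GRAVY), both can be derived from a protein sequence as follows. The function names and the 20-residue fragment (the start of the sequence above) are illustrative only.

```python
# Kyte-Doolittle hydropathy value for each standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: aromatic residues are Phe, Trp and Tyr.
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Return the fraction of residues that are F, W or Y."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence in this record.
    fragment = "MALVIHYTRNEDGTFDVKRY"
    print(f"aromaticity: {aromaticity(fragment):.5f}")
    print(f"hydropathy:  {gravy(fragment):.5f}")
```

Running the same functions on the full 852-residue sequence would allow a comparison against the values listed above, provided the database used the same definitions.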

Domains

Domains [InterPro]
AEX56199.1: residues 1-852

Taxonomy

Phage
Name: Salmonella phage SE2 [NCBI]
Taxonomy ID: 1115478
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Salmonella enterica subsp. enterica serovar Enteritidis [NCBI]
Taxonomy ID: 149539
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)
GenBank protein accession
AEX56199.1 [NCBI]
GenBank nucleotide accession
JQ007353 [NCBI]
CDS location
range 38359 -> 40917
strand +
CDS
TTGGCGCTAGTAATTCACTATACCCGTAACGAAGACGGCACATTTGACGTTAAACGTTATCGCGATAATCCGATGAACTTTGTCGTGAACCACGTTCCCGACGGGGTACCGGTTCGTGTTTTCATCGACGAAATTGGAGAAGATAACGACGTAACAGAAGACTTCGAAGCACTGAAAGAAAACGCGACTTTCCACATTGTGGAATCCGCCGGGGGTGGCGCCATTAAAGGCGTCATGAAGATTTTTAGCGTTATTCTTAAACCGTTGGCGAAGCTCTTGTCGCCGTCCGTGAAAGGGGCATCATCTAACCTCGCGAACTCACAGGCGGATTCCCCGAACAACAGTCTCACCGACCGTAACAACAAGGCCCGCCCATACGAGCGCAGCTACGACATCTGTGGGACGGTGCAAACTATCCCCAATAACCTTATGTCTACTTACAAGGTGTTTAACGCTGCCGGTAAAATCGTAGAGTACGGCTATTACGACGCCGGGCGTGGTTACCTCGACATACACCCGGATGGCATAACGGACGGGGATACCCGTGTATCAGATATAACAGGCACGTCGGTTGCGGTGTACGCGCCGTATACGTCACCAAATAATACATCCACACCACAGGTCATGGTTGGCGACCCGATAGAGCAAGGCCTGTACATCACCGTAGAATCTAACGAAGTAGACGGCGTGGTTCTGAAAGCACCTAACGGCCTGGGTATTTCTTTCTCTTACATGTCAGGGTATCCGTCTTTGTCAGGAAACATTGGCACTATATACGACCCAACAGGCGGCTCGGATTTTTCGGGGGTACTGGTGCCTAATGACACGTTTTCGCTGGTGTCCGCGTGGACAAATACAGACGTTGACCTCTCCGGTGGAGGATATCAGGTAGTCAGCGTGTCCGAAGGGACCGTTACCTTTATAGTACCCGGTGGTCTCATTGGTAGGTGGCAAGAAATAAGACCCGGTTCGTTTTTCCGCGGTGACGGGGAAGCGTCGCTGCAACCTGATAACACGTACGAGAAAACATTAACCGATTGGGTGTCTATAAACCGCACCGAGGTTGAGCGCATAGTCGCCAATATCGCGGCTGCCAATGGTATGTATAAAGACAACGGCAAGTCAAAAACACTGGCATCCGTTACCGCTGAGATACAGTACCAGCTACTTGATGAAAATAGCGTTCCTTACGGGCCAATATACACGGCGCAAGGAACCGTGTCCGGGCGAACCCCAGACTACAACGGTGTCACTATTTACGCCGACCTGCCGGTTGTGTCGCGGGTGCGGGTGCGCGCCAGAAGGGTGACAGACCTCGACTTTAATTTCGAGGGGTCTGTAGTTGATGAGATAACGTACGTTAACTTGTACGGGCAAACACGCGATAACACTCCGCACTACGGCAACAGAACTACCGTACACTCGATGCACAAGCAGACCCCGCGTGCCGCGGAAGTAAAGCAACCGCAGTTGCGCATGATTGCTACTGAAATGGTGTACAAATACCTCGGTAATGGTGTTTTTGAAGACACGATGACCCCCAATACGCAAGCCGTGCAATCTCTTATCCGCCTGGCGCGTGATCCGGATGTGGGGGGTTTAAACCTGACGGTACGCAACATGGATAAGTTACTTGCTGTGCAGAACGAGGTCGAAGCGTATTTTGGCGACAAACAGGCTGGAGAATTTTGTTACACGTTTGATGACTATAAAACCACCATGCAGGACATAGTTAGTACTATAGCAGACGCCATATTCTGCACCCCATATCGGCGTGGGGCGGATATCCTTCTCGATTTTGAGCGCCCTCGCATGGGCCCCGAGATGGTGTTCACCCACCGAAGCAAGGCCGGTACTTCTGAAAAGTGGACCAGAACATTTAACGATTCTCAGGTTTTTGACAGCCTTAAATTCTCGTACATAGACCCTAAGACGAACGTCAAAGAGACTATAACCATACCCGAAACCGGGGGCCTTAAAACGGAGACTTACGACTCAAA
AGGAATCCGCAACTATAAGCAGGCTTTCTGGGCGGCAAACCGTCGCCACCAGAAGAACATTTTAAAGAAAATTTCGGTGTCGTTTACCGCCACTGAAGAGGGTATTTTTGCCCTTCCGAATCGTGCCGTTAGTGTGGTTAAGGGTTCGCGTATGTCTACTTACGACGGCTACGTAACCGCGGTTAACGGTCTTACCGTAGAGCTATCCCAGCCGGTTAAGTTCACATCCGGAGATGACCATTATTTGGTTCTGAAGTTACGTGATGGCGGAGTCCAAAGTGTTCGTGTTGTCCCTGGCGCACATGACCGACAAGTAATTATGACGTCTGTGCCGCAAGAAGCCATTTACACTGGTAATAGCGCTTTGAAAACTGAATTTTCATTCGGCAACGAAGCAAGGCATAATGCTCAGATGATTCTTGTTTCTACGGTAGACCCTGGCGATGACAGAACAGTCAAAATAACCGGGTTTAACTATGACAAGGATTTCTATAAGTTTGACAACGTGCCTCCTTTCGGTCGTGCGTTCTCCAGCGGATTCGATAACGGTTTTAACTAA
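
The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes the terminal stop codon; the helper names are hypothetical.

```python
# The three standard stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of amino acids encoded, excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - 1


def ends_with_stop(cds: str) -> bool:
    """True if the last three nucleotides form a stop codon."""
    return cds[-3:].upper() in STOP_CODONS


if __name__ == "__main__":
    # Coordinates from the "CDS location" field of this record.
    print(cds_length(38359, 40917))               # 2559
    print(expected_protein_length(38359, 40917))  # 852
    # Last six nucleotides of the CDS shown above.
    print(ends_with_stop("AACTAA"))               # True
```

The 2559 nucleotides correspond to 853 codons, that is, 852 amino acids plus the TAA stop codon, which agrees with the protein length given under the physico-chemical properties. The CDS starts with TTG, an alternative start codon in bacterial and phage genomes that is translated as methionine at the initiation position.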

Tertiary structure

PDB ID
68d82a6c57bbb99fdd1fc5ac0828601c9f0c1daa12e6156d74e053aecf2b49eb
Source ESMFold
Method ESMFold
Resolution 0.8294
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
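
The confidence bands listed above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend leaves the exact boundary values (90, 70, 50) unassigned, so placing each of them in the lower band is an assumption, as are the function name and the 0-100 scale.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (assumed 0-100 scale) to a confidence band.

    Assumption: a score exactly on a boundary (90, 70 or 50)
    falls into the lower of the two adjacent bands.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    for s in (95.2, 82.0, 61.5, 34.8):
        print(s, classify_plddt(s))
```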

Literature

Title: Complete Genomic Sequence of Salmonella enterica Serovar Enteritidis Phage SE2
Authors: Tiwari, B.R., Kim, S. and Kim, J.
Date: 2012
PMID: 22733878
Source: GenBank