UniProt accession
E3SML3 [UniProt]
Protein name
Uncharacterized protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
Protein sequence
MSIPYSTSIAQVLDKHLRIRHNGTWEYVEDVRINNSGTWEDVKEVYIRHSGSWQLVHEGEHFLFNHTLNGNSQSEFSLASWISGQGYSGNKIKGALTVNNLQQRVNLGSWSSDSRVYLRINNNDRISGKGGNGGQRGQNAASNGQNGQRALYTRVGFHLDNGGIIAGGGGGGAGGRNGQVTQQVTETNNCMKGNQCQNTYNVTNNTNGGGGGGGAGYPGGSNGGNGAQNGQSNGGGQGGSSGAGSARNGGNGGGLGQNGSNAENNQGGTRGTAGNAIDGWSYRISGEGSGNGDRRGNSVN
Physico‐chemical
properties
protein length:300 AA
molecular weight: 30878,49130 Da
isoelectric point:9,32459
aromaticity:0,06333
hydropathy:-0,88400

Domains

Domains [InterPro]
DC_1762
ATT
14–193
IPR007932
RBD
66–297
E3SML3
1 300
Architecture
ATT
RBD
ATT 14-193 | RBD 194-297 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage P-HM1
[NCBI]
445700 Uroviricota > Caudoviricetes > Eurybiavirus >
Host Prochlorococcus marinus subsp. pastoris str. CCMP1986
[NCBI]
59919 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus > Prochlorococcus marinus

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ADO98656.1 [NCBI]
Genbank nucleotide accession
GU071101 [NCBI]
CDS location
range 26834 -> 27736
strand +
CDS
ATGTCAATACCCTATAGTACTTCAATTGCCCAAGTTCTTGATAAGCATTTGCGTATCAGACACAATGGCACATGGGAATACGTTGAAGATGTAAGAATAAACAACAGTGGCACATGGGAGGATGTTAAAGAAGTATACATTCGTCATAGTGGATCATGGCAACTTGTTCATGAGGGTGAACACTTCTTGTTCAACCATACATTAAATGGTAACTCACAAAGTGAATTTAGTTTAGCAAGTTGGATATCTGGTCAAGGTTACAGTGGTAACAAGATAAAAGGTGCATTGACAGTTAATAATTTACAACAACGAGTGAACTTAGGATCATGGTCTAGTGACTCTAGAGTATATCTTAGGATCAATAATAATGATAGAATATCTGGTAAAGGTGGTAATGGTGGTCAACGTGGTCAAAATGCAGCATCTAATGGTCAAAATGGACAACGTGCATTATATACAAGAGTTGGTTTTCATTTAGATAATGGTGGCATCATCGCAGGAGGCGGTGGTGGCGGTGCAGGAGGTCGTAATGGACAAGTTACACAACAGGTAACAGAAACTAATAACTGTATGAAAGGAAATCAGTGCCAGAATACATATAATGTAACCAACAACACTAACGGTGGTGGAGGTGGCGGTGGAGCTGGTTATCCTGGCGGTTCTAACGGTGGTAATGGAGCACAAAACGGTCAGTCAAATGGCGGTGGACAAGGTGGTTCATCAGGTGCAGGATCTGCTAGAAACGGTGGTAACGGTGGAGGTCTTGGTCAAAACGGTAGTAATGCTGAAAATAACCAAGGTGGAACACGAGGAACCGCAGGAAATGCCATCGACGGATGGAGTTATAGAATATCAGGAGAGGGTTCTGGTAATGGAGACCGTAGAGGTAACTCAGTAAACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
2c9af9d159cd79fb711cf2edd18da68bce4b1321e170025d54645753440c12d1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8435
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50