UniProt accession
M4QDG1 [UniProt]
Protein name
Uncharacterized protein
RBP type
Type  Evidence   Probability
TF    GenBank    1.00
TF    Phold      1.00
TF    RBPdetect  0.90
Protein sequence
MSVPYSNTIAQVLDKHLRIRHSNSWQYVEDVRIRHNNTWEDVKEVYIRHSGSWQLVHEGEHFLFNHTLNGNSQSEFSLASWISGQGYSGNKIKGALTVNNLQQRVNLGSWSSDSRVYLRINNNDRISGKGGNGGQRGQNAASNGQNGQRALYTRVGFHLDNGGIIAGGGGGGAGGRNGQVTQQVTEQNNCMKGNKCQNTYNVTNNTNGGGGGGGAGYPGGTNGGNGAQNGQSNGGGSGGNSGAGTARDGGDGGDLGQNGDNAQNNQGGSPGSAGNAIDGWSYRLSGEGSGNGDRRGNSVN
Physico-chemical properties
protein length: 300 AA
molecular weight: 31096.70330 Da
isoelectric point: 9.11810
aromaticity: 0.06333
hydropathy: -0.94000
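
Two of the properties above have simple conventional definitions and can be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value (often called GRAVY). The record does not state which tool or scale produced its figures, so both definitions and the function names are assumptions.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration on the first residues of the sequence listed above;
# pass the full 300-residue string to compare with the record.
prefix = "MSVPYSNTIAQVLDKHLRIRHSNSWQYVEDV"
print(len(prefix), aromaticity(prefix), gravy(prefix))
```

A strongly negative mean hydropathy, as reported here, indicates a predominantly hydrophilic sequence under this scale.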

Domains

Domains [InterPro]
ID         Type  Range
DC_1762    ATT   14–193
IPR007932  RBD   66–297
Sequence: M4QDG1, residues 1–300
Architecture
ATT 14-193 | RBD 194-297
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
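
The domain table lists RBD as 66–297, which overlaps ATT (14–193), while the architecture line shows RBD starting at 194. One way to obtain such a non-overlapping layout is to let the earlier domain keep its full span and trim the later one. The sketch below assumes that rule; it is a guess that happens to reproduce the line above, not the database's documented method, and `resolve_architecture` is a hypothetical name.

```python
def resolve_architecture(domains):
    """Turn possibly overlapping (label, start, end) domains into
    non-overlapping segments. Coordinates are 1-based and inclusive.
    Assumed rule: domains are taken in order of start position, and each
    later domain is trimmed to begin right after the previous segment ends.
    """
    segments = []
    prev_end = 0
    for label, start, end in sorted(domains, key=lambda d: d[1]):
        start = max(start, prev_end + 1)
        if start <= end:  # drop domains fully covered by earlier ones
            segments.append((label, start, end))
            prev_end = end
    return segments


def format_architecture(segments):
    """Render segments in the 'LABEL start-end | LABEL start-end' style."""
    return " | ".join(f"{label} {start}-{end}" for label, start, end in segments)


hits = [("ATT", 14, 193), ("RBD", 66, 297)]
print(format_architecture(resolve_architecture(hits)))
```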

Taxonomy

       Name                                   Taxonomy ID  Lineage
Phage  Prochlorococcus phage MED4-213 [NCBI]  889956       Uroviricota > Caudoviricetes > Eurybiavirus >
Host   Prochlorococcus [NCBI]                 1218         Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae >

Coding sequence (CDS)

GenBank protein accession
AGH26183.1 [NCBI]
GenBank nucleotide accession
HQ634174 [NCBI]
CDS location
range 86012 -> 86914
strand -
CDS
ATGTCAGTTCCATATAGTAATACAATTGCACAAGTTCTTGATAAGCATTTGCGTATCAGACATAGTAACTCATGGCAATATGTTGAGGATGTAAGAATACGTCATAATAACACATGGGAGGATGTTAAAGAAGTATACATCCGTCATAGTGGATCATGGCAACTTGTTCATGAGGGTGAGCACTTCTTGTTTAACCATACATTAAATGGTAACTCACAAAGTGAATTTAGTTTAGCAAGTTGGATATCTGGTCAAGGTTACAGTGGTAACAAAATAAAAGGTGCATTGACAGTTAATAATTTACAACAACGAGTGAACTTAGGATCATGGTCTAGTGACTCTAGAGTATATCTTAGGATCAATAATAATGATAGAATATCTGGTAAAGGTGGTAATGGTGGTCAACGTGGTCAAAATGCAGCATCTAATGGTCAAAATGGACAACGTGCTTTATATACTAGAGTTGGTTTTCATTTAGATAATGGAGGAATCATCGCAGGAGGCGGTGGCGGTGGTGCAGGAGGTCGTAATGGACAGGTTACACAACAGGTAACAGAACAAAACAATTGCATGAAAGGAAATAAGTGCCAGAATACATATAATGTAACCAACAACACTAACGGTGGTGGCGGTGGCGGTGGAGCTGGTTATCCTGGCGGTACCAATGGTGGTAATGGAGCACAAAATGGACAGTCAAATGGTGGCGGTTCGGGTGGAAATAGTGGTGCAGGAACTGCTAGAGATGGTGGAGATGGTGGAGATCTTGGTCAAAACGGTGATAATGCCCAAAATAACCAAGGTGGATCACCAGGATCTGCAGGAAATGCCATTGATGGATGGAGTTACCGATTATCAGGAGAAGGTTCTGGTAATGGAGACCGCAGAGGCAACTCAGTAAATTAA
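
The CDS coordinates can be checked against the stated protein length. The sketch below assumes the range is 1-based and inclusive and that the final codon is a stop codon (the CDS listed above does end in TAA). It also includes a reverse-complement helper, since a feature on the minus strand is read from the reverse complement of the plus-strand genomic slice. The CDS shown above begins with ATG, so it appears to be given already in coding orientation. All function names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Number of encoded residues, assuming the last codon is a stop."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string (needed for minus-strand features)."""
    return dna.translate(_COMPLEMENT)[::-1]


print(cds_length(86012, 86914), protein_length(86012, 86914))
```

For this record the range spans 903 nt, or 301 codons, which agrees with the 300 AA protein length once the stop codon is excluded.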

Tertiary structure

PDB ID
6231abc0b88a289b39bdb109c6816232dd9e185357dc61c309e837d903460985
Source ESMFold
Method ESMFold
Resolution 0.7689
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
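
The confidence bands above map directly to a small lookup. The sketch below assumes pLDDT is expressed on a 0–100 scale; because the legend uses strict inequalities, the treatment of the exact boundary values (90, 70, 50) is not specified, and assigning them to the lower band here is an arbitrary choice. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (assumed 0-100) to its confidence band.
    Boundary values 90, 70 and 50 fall into the lower band (an assumption).
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.0, 80.0, 60.0, 30.0):
    print(s, plddt_band(s))
```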