UniProt accession
A0A6G8R5I4 [UniProt]
Protein name
Uncharacterized protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 0,56
Protein sequence
MAITFPTDPGAQTPANTFSPSSTPVANSANGNTYVWNGTAWTTQQANLEQYVNVAGDTMTGPLTGTSATYSGNIAAANIPTQGSIVGYQQGLWTPTAPGTASIAPTAERCVWTRTGNEVTVWAYISQINSGSTDGTTLYFKNLPYPILFNGVIGPAMIQNCGRASSSSYVSSTSQGIAFYESASATWQSVTLQSINSAGNNGSAFFGGTYITDDTTWTPINGATVD
Physico‐chemical
properties
protein length:226 AA
molecular weight: 23520,42360 Da
isoelectric point:4,15581
aromaticity:0,10619
hydropathy:-0,15929

Domains

Domains [InterPro]
DC_1616
RBD
10–214
A0A6G8R5I4
1 226
Architecture
RBD
RBD 10-214 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A6G8R5I4
1 226
Domain Start End Length (AA) Confidence
N-terminal 1 71 71 0,9904
Central domain 72 215 145 0,0076
C-terminal 216 226 10 0,9979
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-71
Central
72-215
C-terminal
216-226

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-N03
[NCBI]
2718943 Uroviricota > Caudoviricetes > Pantevenvirales > Huanghaivirus > Huanghaivirus snothree
Host Synechococcus sp. MW02
[NCBI]
1620844 Cyanobacteriota > Cyanophyceae > Synechococcales > Synechococcaceae > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QIN96642.1 [NCBI]
Genbank nucleotide accession
MT162466 [NCBI]
CDS location
range 5793 -> 6473
strand -
CDS
ATGGCTATTACCTTCCCCACCGACCCAGGAGCACAAACTCCTGCTAATACTTTTAGTCCTAGCAGTACTCCCGTTGCCAACTCGGCTAATGGGAATACGTATGTGTGGAATGGCACGGCCTGGACTACTCAACAGGCAAACCTGGAGCAATACGTAAATGTTGCCGGGGATACGATGACTGGTCCTTTGACTGGTACCAGTGCAACTTATAGTGGTAATATTGCTGCGGCTAACATCCCAACTCAAGGTTCCATTGTTGGTTATCAACAGGGGCTTTGGACTCCAACAGCTCCAGGTACTGCCAGTATTGCCCCTACTGCTGAACGGTGTGTTTGGACCCGTACAGGAAATGAAGTCACTGTCTGGGCTTACATCTCGCAGATCAACTCTGGATCAACAGACGGCACTACCCTTTACTTCAAGAACCTTCCTTACCCAATTCTCTTTAATGGAGTTATAGGACCAGCGATGATTCAGAACTGCGGTAGGGCTTCAAGCTCTAGCTACGTTTCGTCAACGTCTCAAGGTATTGCTTTTTACGAAAGCGCGAGTGCGACTTGGCAATCCGTGACGCTTCAAAGTATTAACTCCGCTGGCAATAACGGCAGTGCATTTTTCGGCGGCACATACATCACCGACGACACCACCTGGACACCAATTAACGGTGCTACTGTCGACTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
6e5b170503b810e9783bda56dc17c5ffdd3062eec3913618f9f2f8af87765b42
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6567
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50