Genbank accession
AIR93428.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,93
Protein sequence
MAKLGISTGTTPNDGSGDSLLDGAVKVNSNFDEVYNKIGDGTDLFVGIVSSITVSGPLSISTTFGAPVITGLANTANINATNFQVTGVGTITGTTRLAGINTFSAAGYTVAGLVTASNIISNETIKVAGIVTTSEDGINVSAAVTARSLAIQDVTQTSHFVGLNTVFIDHTGVGATAINITDTATIGFGSISSANITTINANTANINSGILTTATVGTAITIHSKGIDAGQAGIITASRLFGAVTGAVTGTASSATEADLAYGLTGTPSIVVGVATLGGHIFNAPGAFISGILTATSFSVGTNEIVSSARQLKNIATLDATTKLTIENAISDPPNDFDDLNVVGLATVNRLFISGDTRGLNILGVTTGLSAPGITTLGIITGATSLQATDVYSNFLHGDGSNISNVTGGVTVQDEGSALSTVATTLNFVGSGVVASGSGATKTITISGGGGGGSIAGISTTGTSGFNQLNVTGVSTFGANIDLNANIDVSGSSTLHNGLVVNGGIVDINHQIVGLATDNVIPFYYANVSDFPSASTYHGAVAHGHNTGLLYFAHAGAWLELVSKDSSGVVNKIVVGAAVTIDQNNIDTVGIITASEFHGDGSNLTGITGSTVAGISTTGTSGFNQLSISGVSTFTGNIDANGALDVDGQTDLDVVNVAELATFSSRVQVGTGLTLDQNNIDAGSYVGIITAKEFHGDGTNVATSRWAVTNASSNHYVFNGPGNLVNANDPTIYLARGQKYEFDINASGHPFRIQTSSGASGYNSGNEYTTGITNVGAASSLLTFDVPFDANNTLYYVCQNHSSMNGTIIIYPSI
Physico‐chemical
properties
protein length:814 AA
molecular weight: 81821,29990 Da
isoelectric point:4,48951
aromaticity:0,06143
hydropathy:0,24988

Domains

Domains [InterPro]
DC_0533
ATT
1–65
IPR036240
STR
2–44
IPR008987
ATT
6–41
G3DSA:1.20.5.960
Unmapped
13–42
AIR93428.1
1 814
Architecture
ATT
STR
ATT 1-65 | STR 569-814
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AIR93428.1
1 814
Domain Start End Length (AA) Confidence
N-terminal 1 162 162 0,6173
Central domain 163 509 348 0,4500
C-terminal 510 814 304 0,3983
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-162
Central
163-509
C-terminal
510-814

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage P-TIM68
[NCBI]
1542477 Uroviricota > Caudoviricetes > Pantevenvirales > Haifavirus > Haifavirus tim68
Host Prochlorococcus sp.
[NCBI]
1220 cellular organisms > Bacteria > Bacillati > Cyanobacteriota/Melainabacteria group > Cyanobacteriota > Cyanophyceae

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AIR93428.1 [NCBI]
Genbank nucleotide accession
KM359505 [NCBI]
CDS location
range 75788 -> 78232
strand +
CDS
ATGGCTAAATTAGGAATTAGTACAGGAACCACGCCCAATGATGGATCAGGTGACAGTCTGTTGGATGGTGCTGTTAAGGTTAATTCAAATTTTGATGAAGTCTATAATAAAATAGGAGATGGGACAGATTTATTTGTTGGAATTGTTAGTTCTATTACTGTATCAGGACCTTTAAGTATAAGCACTACTTTTGGTGCACCTGTCATAACTGGATTAGCAAACACTGCAAATATAAATGCAACCAACTTTCAAGTAACTGGTGTAGGAACAATAACAGGAACAACAAGGTTGGCAGGTATCAATACATTCTCTGCTGCTGGATATACTGTGGCAGGTTTAGTAACTGCGAGTAATATAATATCAAATGAGACTATTAAGGTAGCAGGTATAGTTACAACATCTGAAGATGGTATAAATGTATCAGCTGCTGTAACTGCTAGATCACTAGCAATTCAGGATGTAACTCAAACCTCTCATTTTGTTGGTTTAAACACTGTATTCATAGATCATACTGGTGTTGGTGCTACTGCTATCAATATAACTGACACTGCAACCATAGGATTTGGATCTATATCTAGTGCAAACATCACTACTATAAATGCAAACACAGCAAATATAAACAGTGGTATATTAACAACTGCCACAGTTGGAACAGCAATTACTATACATTCAAAAGGAATTGATGCAGGTCAAGCTGGTATCATAACTGCAAGTAGATTATTTGGTGCTGTAACTGGTGCTGTAACTGGAACAGCATCCTCTGCTACTGAAGCAGACTTAGCATATGGGTTAACAGGAACACCTAGCATTGTTGTAGGTGTTGCAACTCTTGGTGGACATATATTCAATGCACCTGGTGCCTTTATATCAGGTATTCTTACTGCTACTTCATTCTCTGTTGGTACTAATGAGATTGTAAGTAGTGCTAGACAATTAAAAAATATAGCAACCTTAGATGCTACAACCAAACTTACAATAGAAAATGCCATATCTGATCCTCCAAATGATTTTGATGATTTAAATGTAGTAGGTCTTGCCACTGTCAATAGATTGTTCATAAGTGGAGATACAAGAGGTTTAAACATTCTTGGTGTCACCACTGGTTTAAGTGCACCAGGTATTACAACTCTTGGTATTATTACAGGTGCCACATCATTACAGGCAACTGATGTTTATTCAAACTTTTTACATGGTGATGGATCTAATATCTCAAATGTCACTGGTGGTGTTACTGTTCAAGATGAAGGAAGTGCATTATCAACTGTTGCAACCACATTAAACTTTGTAGGATCTGGTGTAGTGGCATCTGGATCTGGTGCGACTAAAACAATTACTATCTCTGGTGGTGGAGGTGGTGGTAGTATTGCTGGTATCAGTACTACAGGAACATCTGGATTCAATCAACTCAATGTAACTGGGGTATCAACCTTTGGTGCTAATATTGATCTCAATGCAAATATAGATGTCAGTGGATCATCAACTCTTCATAATGGATTGGTTGTAAATGGTGGTATTGTTGATATTAATCATCAGATAGTTGGTCTTGCAACAGACAATGTAATTCCATTCTACTATGCTAATGTAAGTGATTTCCCATCTGCATCTACATATCATGGTGCAGTTGCTCATGGACATAATACTGGTTTACTATATTTTGCACATGCTGGTGCTTGGTTAGAATTAGTCAGTAAAGATAGTAGTGGAGTGGTCAATAAGATTGTAGTTGGTGCTGCTGTTACTATTGATCAAAATAATATTGATACTGTAGGAATTATAACTGCTTCTGAATTTCATGGTGATGGTTCAAACTTAACTGGTATCACTGGTTCAACTGTTGCAGGTATCAGCACTACAGGAACATCTGGATTTAATCAACTAAGTATTTCTGGAGTTTCCACATTTACAGGTAATATAGATGCTAATGGTGCCTTAGATGTTGATGGACAAACTGATTTAGATGTAGTTAATGTTGCTGAACTTGCTACATTTAGTTCAAGAGTTCAAGTTGGAACTGGTCTTACACTTGATCAAAATAATATTGATGCTGGTTCTTATGTTGGTATTATAACTGCCAAAGAGTTTCATGGTGATGGAACAAATGTGGCAACATCTAGATGGGCAGTGACAAATGCTAGTTCAAATCATTATGTGTTCAATGGACCTGGTAATTTAGTTAATGCTAATGATCCAACTATATACCTTGCAAGAGGTCAGAAGTATGAATTTGACATTAATGCTAGCGGTCATCCATTTAGAATACAAACTAGCTCTGGTGCATCAGGTTATAATTCTGGAAATGAGTATACTACTGGTATTACTAATGTAGGTGCTGCATCTAGTTTGCTTACATTTGATGTTCCATTTGATGCTAACAATACTTTATACTATGTTTGTCAAAACCATTCTAGTATGAATGGAACTATTATCATCTATCCATCAATATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
8822b2a3c647f84c8ee79e24f003d3c16b1889a9a67d81f9a1375f59d636e90e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5341
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50