Genbank accession
AGH26128.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
TSP
Evidence RBPdetect2
Probability 0,81
Protein sequence
MATRIKLKRSTTATVVPTTSNLEDGEVALNIQDRKLYARNGSNIIEVANQKPNTGEVTTTMFATDVTNGPGNTFFVASTGNNNTTLANGGANGKHADTPFLTITKALETATSGDTIHIAAGEYQEVFPMTVPDGVTLRGANLRSTSVKPTGATNTNNAFILSGDCHVSDLTIKEFFYDSGNDKGYAFVVVSNMNSTQSPYVERVTVNTKGSVVSGSDPYGYTQGDAGRGALLDGANIAAASQHSSVLFNECTFITPNQVGLKITNGMRVEWLNCFNYFASIGIEGAQGATGKSGTGSTRLKFGGTSGTFSSSEVAYQLEDSFQSGTYARSGSTVTLTRTGHGLVTGDYIYADHISGGATDGFYQVTLVDANNVTYSSGSGTISSSNVTYKKAVATGTVASNDGTYVFITGKGTGEFTTVNKPTKTLSRFGDSQLSTAQKKFGTASILLDGTEDNVKVPTDEDFGFGSANFCIEAFIRPGSVTGTQRIFDLRDNSATDTAPTVYLDGTTLHYAVGNTSQINGGTLSTNTWYHVAVARSNGTTRLFLDGTQLGTYTDNNDYGSTKPVIIGSNYAASPVEAFNGYVDEVRISKASARFTAAFTPTTTEYGSDLNTVLLLHANGDNASTTFTDVSGGISDIRSSGGDSATSVITADYSAFGAELRSVASACVYGQKGVQADGSGVKLILTAHNFGYVGSGDDFTNDPSLAIQANEVVELNGGKVLYSSTDQDGDFRVGDAFTVDQETGNVQFQATSSAQSAANITLSDATGTTNIFPAYIETGNLRFAGNSMTSTAGQVIVDPAGEEDFVVNAETIVKEAVYFDVNKSISFGSTIQGALKIAGFGGSTVFGSSEASSFSTRSFVLLKNGLGTVNLTGAGSGYLSGQQTVDVTTNPFQTAQATAVLGTSGGLKTFTVTNRGIGYTALPTVTIDGSGNGAATAAFGVSGDIRSVTIGNGGSNYASPTGAIDAPPTNVFTGGATYEDANEVSYPVVDTSANTIYIPSHTFETGMEAIFDASTLDATATPVGGLTSSQSYYAIRVDQNLLKLASSLSDANAGNAISLTGQGTGDQFFQGRQATVNVGQTGGVIDTVTVTDIGSGYGAQPDLTITDSAGSNATFTVNVGRAINAVTVDTIGSYSSVPNITFTNASGDTTGSGAAATVALGYAVASVTLNNQGLGYRNLPTLSADGTPVAAAAFTVVLNEQEGRIGSIVVQNGGSGYDTAPTLTFTGGGGSGGQLLADVQSLTGNISANGSGYAPGVYPDVGFTVVTAAGTVSTVATATFTVPGFDGTITTAGSGYADGTYTSVPLVNTPTATYTVTVVTRDKN
Physico‐chemical
properties
protein length:1324 AA
molecular weight: 135796,85010 Da
isoelectric point:4,53316
aromaticity:0,08384
hydropathy:-0,09350

Domains

Domains [InterPro]
DC_0763
STR
3–780
IPR023366
STR
321–400
PF13385
LEC
449–590
AGH26128.1
1 1324
Architecture
STR
STR
STR 3-780 | STR 801-1322 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AGH26128.1
1 1324
Domain Start End Length (AA) Confidence
N-terminal 1 130 130 0,7176
Central domain 131 434 305 0,9785
C-terminal 435 1324 889 0,0975
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-130
Central
131-434
C-terminal
435-1324

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage MED4-213
[NCBI]
889956 Uroviricota > Caudoviricetes > Eurybiavirus >
Host Prochlorococcus
[NCBI]
1218 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae >

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGH26128.1 [NCBI]
Genbank nucleotide accession
HQ634174 [NCBI]
CDS location
range 38645 -> 42619
strand -
CDS
ATGGCAACGAGAATCAAGCTAAAGAGATCGACAACAGCAACAGTAGTCCCGACGACTTCTAACCTAGAAGACGGTGAGGTCGCTCTTAATATACAAGACCGAAAACTGTATGCTAGAAATGGATCAAATATAATAGAGGTCGCCAACCAGAAACCTAATACTGGTGAGGTGACTACAACTATGTTTGCTACGGACGTGACGAACGGTCCTGGCAATACTTTTTTTGTTGCGTCTACTGGAAATAATAATACAACTCTTGCTAATGGTGGTGCTAATGGTAAACATGCAGATACACCATTTTTAACTATTACAAAAGCACTTGAGACTGCTACATCTGGCGATACAATTCACATTGCAGCTGGAGAATATCAGGAAGTCTTCCCAATGACAGTTCCTGATGGTGTCACATTACGTGGAGCAAACTTAAGATCAACATCTGTAAAACCTACAGGTGCTACAAACACTAATAACGCATTTATATTATCTGGAGACTGTCATGTTTCCGACTTAACAATCAAAGAATTTTTCTATGACAGTGGCAATGATAAAGGATATGCCTTTGTTGTAGTATCAAATATGAACTCTACACAGAGTCCTTATGTTGAGAGAGTAACAGTCAATACAAAAGGTAGTGTAGTATCTGGTTCAGATCCTTATGGATATACACAAGGAGATGCAGGACGTGGTGCTTTATTGGATGGTGCAAATATTGCAGCTGCATCACAACATAGTTCTGTTCTATTCAACGAGTGCACCTTTATAACACCTAATCAGGTTGGTCTAAAAATTACCAATGGTATGCGTGTAGAGTGGTTGAATTGCTTCAACTATTTTGCATCTATTGGTATTGAGGGTGCTCAAGGTGCTACAGGTAAATCTGGCACAGGTAGCACTAGATTAAAGTTTGGTGGAACTAGTGGAACATTCTCATCATCTGAGGTTGCGTATCAATTAGAAGATAGTTTTCAGTCAGGAACTTATGCAAGATCTGGATCTACAGTTACATTAACAAGAACTGGACATGGTTTAGTAACAGGCGATTACATATATGCAGATCATATCAGTGGTGGTGCTACAGATGGATTTTATCAAGTCACTTTAGTAGATGCTAATAATGTAACTTACTCTAGTGGATCTGGAACTATATCATCTAGCAATGTAACTTATAAAAAAGCAGTAGCAACTGGAACTGTTGCTAGTAACGATGGCACATATGTATTCATTACTGGTAAGGGAACTGGAGAATTTACAACAGTCAATAAACCAACTAAGACTCTAAGTAGATTTGGTGACTCACAATTAAGCACAGCACAAAAGAAATTTGGAACAGCATCCATATTATTAGATGGAACTGAGGATAACGTAAAAGTTCCTACTGATGAAGACTTTGGATTTGGTTCTGCAAACTTCTGTATAGAAGCATTTATTAGACCTGGCAGTGTAACAGGCACACAAAGAATATTTGATCTTAGAGATAATTCTGCTACAGATACAGCACCTACAGTATATCTTGATGGAACTACTCTACATTATGCAGTAGGAAATACATCACAAATTAATGGTGGAACTTTATCAACTAACACATGGTATCATGTTGCTGTTGCTAGAAGCAATGGAACTACAAGACTATTCTTAGATGGAACTCAATTAGGAACATACACAGATAATAATGACTATGGATCTACAAAACCAGTTATCATAGGATCTAACTATGCTGCATCTCCTGTAGAAGCATTCAATGGATATGTTGACGAAGTAAGAATTAGTAAAGCATCTGCTCGTTTCACTGCAGCATTTACTCCTACAACAACTGAGTATGGTTCTGACTTAAATACTGTTCTATTACTACATGCAAACGGTGACAACGCCTCTACGACCTTTACAGACGTCTCTGGTGGTATATCTGATATTAGATCTAGCGGTGGAGATTCTGCTACATCTGTTATCACTGCTGACTACTCAGCATTTGGTGCTGAACTACGTTCTGTAGCATCTGCATGTGTTTACGGACAGAAAGGTGTACAAGCAGATGGTTCTGGTGTAAAACTCATACTTACTGCACATAACTTTGGTTATGTTGGATCTGGTGATGATTTCACCAATGACCCATCATTAGCAATACAAGCAAATGAGGTAGTAGAACTTAATGGTGGTAAAGTATTATATTCATCTACAGACCAAGATGGTGACTTCCGTGTTGGTGATGCATTTACAGTAGATCAAGAAACTGGTAATGTTCAATTCCAAGCAACATCTTCAGCTCAATCAGCAGCAAACATTACGTTAAGTGATGCTACTGGAACAACTAATATATTCCCTGCATATATTGAAACTGGCAACTTAAGATTTGCGGGTAACAGTATGACTTCTACAGCGGGTCAGGTAATTGTTGACCCAGCTGGTGAAGAAGATTTCGTTGTTAACGCTGAAACAATCGTTAAAGAAGCAGTTTATTTTGATGTTAATAAGTCAATATCATTTGGTAGCACAATTCAAGGTGCTCTAAAAATTGCAGGATTTGGTGGATCTACAGTATTTGGATCTTCAGAAGCATCCAGTTTTTCTACTAGATCATTTGTTCTACTCAAGAATGGATTAGGAACTGTCAACCTAACAGGTGCAGGATCAGGTTATCTTAGTGGACAACAAACAGTAGATGTAACCACAAACCCATTTCAAACTGCACAAGCAACAGCAGTTTTAGGCACTTCGGGTGGATTAAAAACATTTACAGTAACCAATAGAGGAATTGGATACACCGCACTTCCAACAGTAACAATTGATGGATCTGGAAATGGAGCTGCAACCGCAGCGTTTGGTGTCAGTGGTGATATTCGTTCAGTAACGATTGGTAATGGAGGAAGCAACTATGCATCTCCTACAGGAGCTATAGACGCTCCACCAACTAATGTATTTACAGGTGGTGCTACATACGAAGACGCAAATGAAGTTAGTTATCCAGTCGTTGATACATCAGCAAACACAATTTATATCCCAAGTCATACTTTTGAGACAGGGATGGAAGCAATTTTTGATGCATCAACATTAGATGCAACTGCTACTCCCGTAGGTGGTTTAACCTCCAGTCAATCTTACTATGCTATTCGAGTTGACCAAAATCTCCTTAAATTAGCATCTAGTCTATCAGATGCAAATGCAGGAAATGCAATATCATTAACAGGTCAAGGAACAGGAGATCAGTTCTTCCAAGGTAGACAAGCAACAGTTAACGTTGGACAGACTGGTGGTGTTATTGATACCGTGACTGTTACTGATATTGGTTCTGGTTATGGTGCTCAACCAGATCTTACAATTACCGACTCTGCAGGATCAAATGCTACATTTACAGTTAATGTTGGACGTGCAATTAATGCAGTCACTGTAGATACTATTGGATCTTATTCATCTGTACCAAACATCACATTTACAAATGCATCAGGAGATACCACTGGATCAGGTGCTGCTGCTACTGTTGCATTAGGATACGCTGTTGCATCGGTTACATTAAACAATCAAGGTTTAGGTTATAGAAATCTTCCAACTTTAAGTGCTGATGGAACTCCAGTTGCAGCTGCAGCATTCACTGTAGTTTTAAACGAACAAGAAGGTAGAATTGGATCTATAGTTGTTCAAAACGGAGGATCAGGATATGACACTGCACCAACACTAACATTTACTGGTGGAGGTGGTAGTGGTGGTCAACTATTAGCAGATGTTCAATCACTAACTGGAAATATCTCAGCAAATGGATCTGGATATGCACCTGGCGTTTATCCTGACGTAGGATTTACTGTTGTTACCGCTGCAGGTACAGTCTCAACCGTTGCAACTGCTACGTTCACAGTTCCTGGTTTTGACGGAACTATTACAACAGCTGGATCTGGTTATGCAGACGGAACTTATACTAGCGTTCCACTCGTAAACACTCCAACTGCAACTTATACAGTAACTGTTGTAACGAGAGACAAAAATTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
581473bb945e8ef11d7980378818c50611a8c0f72811f595fca11eaaceae94e1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5632
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
The Genome Sequence of Cyanophage MED4-213 Henn,M.R., Sullivan,M.S., Osburne,M.S., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Yu,Q., Coleman,M.L., Huang,K.H., Weigele,P.R., DeFrancesco,A.S., Kern,S.E., Thompson,L.R., Fu,R., Hombeck,B., Chisholm,S.W., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank