Genbank accession
AGH26128.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
TSP
Evidence RBPdetect2
Probability 0,47
Protein sequence
MATRIKLKRSTTATVVPTTSNLEDGEVALNIQDRKLYARNGSNIIEVANQKPNTGEVTTTMFATDVTNGPGNTFFVASTGNNNTTLANGGANGKHADTPFLTITKALETATSGDTIHIAAGEYQEVFPMTVPDGVTLRGANLRSTSVKPTGATNTNNAFILSGDCHVSDLTIKEFFYDSGNDKGYAFVVVSNMNSTQSPYVERVTVNTKGSVVSGSDPYGYTQGDAGRGALLDGANIAAASQHSSVLFNECTFITPNQVGLKITNGMRVEWLNCFNYFASIGIEGAQGATGKSGTGSTRLKFGGTSGTFSSSEVAYQLEDSFQSGTYARSGSTVTLTRTGHGLVTGDYIYADHISGGATDGFYQVTLVDANNVTYSSGSGTISSSNVTYKKAVATGTVASNDGTYVFITGKGTGEFTTVNKPTKTLSRFGDSQLSTAQKKFGTASILLDGTEDNVKVPTDEDFGFGSANFCIEAFIRPGSVTGTQRIFDLRDNSATDTAPTVYLDGTTLHYAVGNTSQINGGTLSTNTWYHVAVARSNGTTRLFLDGTQLGTYTDNNDYGSTKPVIIGSNYAASPVEAFNGYVDEVRISKASARFTAAFTPTTTEYGSDLNTVLLLHANGDNASTTFTDVSGGISDIRSSGGDSATSVITADYSAFGAELRSVASACVYGQKGVQADGSGVKLILTAHNFGYVGSGDDFTNDPSLAIQANEVVELNGGKVLYSSTDQDGDFRVGDAFTVDQETGNVQFQATSSAQSAANITLSDATGTTNIFPAYIETGNLRFAGNSMTSTAGQVIVDPAGEEDFVVNAETIVKEAVYFDVNKSISFGSTIQGALKIAGFGGSTVFGSSEASSFSTRSFVLLKNGLGTVNLTGAGSGYLSGQQTVDVTTNPFQTAQATAVLGTSGGLKTFTVTNRGIGYTALPTVTIDGSGNGAATAAFGVSGDIRSVTIGNGGSNYASPTGAIDAPPTNVFTGGATYEDANEVSYPVVDTSANTIYIPSHTFETGMEAIFDASTLDATATPVGGLTSSQSYYAIRVDQNLLKLASSLSDANAGNAISLTGQGTGDQFFQGRQATVNVGQTGGVIDTVTVTDIGSGYGAQPDLTITDSAGSNATFTVNVGRAINAVTVDTIGSYSSVPNITFTNASGDTTGSGAAATVALGYAVASVTLNNQGLGYRNLPTLSADGTPVAAAAFTVVLNEQEGRIGSIVVQNGGSGYDTAPTLTFTGGGGSGGQLLADVQSLTGNISANGSGYAPGVYPDVGFTVVTAAGTVSTVATATFTVPGFDGTITTAGSGYADGTYTSVPLVNTPTATYTVTVVTRDKN
Physico‐chemical
properties
protein length:1324 AA
molecular weight: 135796,85010 Da
isoelectric point:4,53316
aromaticity:0,08384
hydropathy:-0,09350

Domains

View on InterPro
AGH26128.1
1 1324 aa
STR 97–405 · STR 417–593 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

AGH26128.1
1 1324 aa
Domain Start End Length (AA) Confidence
N-terminal 1 130 130 0,7176
Central domain 131 434 305 0,9785
C-terminal 435 1324 889 0,0975
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Prochlorococcus phage MED4-213 [NCBI] · taxon 889956
Host

Coding sequence (CDS)

Genbank protein accession
AGH26128.1 [NCBI]
Genbank nucleotide accession
HQ634174 [NCBI]
CDS location
range 38645 -> 42619
strand -
CDS
ATGGCAACGAGAATCAAGCTAAAGAGATCGACAACAGCAACAGTAGTCCCGACGACTTCTAACCTAGAAGACGGTGAGGTCGCTCTTAATATACAAGACCGAAAACTGTATGCTAGAAATGGATCAAATATAATAGAGGTCGCCAACCAGAAACCTAATACTGGTGAGGTGACTACAACTATGTTTGCTACGGACGTGACGAACGGTCCTGGCAATACTTTTTTTGTTGCGTCTACTGGAAATAATAATACAACTCTTGCTAATGGTGGTGCTAATGGTAAACATGCAGATACACCATTTTTAACTATTACAAAAGCACTTGAGACTGCTACATCTGGCGATACAATTCACATTGCAGCTGGAGAATATCAGGAAGTCTTCCCAATGACAGTTCCTGATGGTGTCACATTACGTGGAGCAAACTTAAGATCAACATCTGTAAAACCTACAGGTGCTACAAACACTAATAACGCATTTATATTATCTGGAGACTGTCATGTTTCCGACTTAACAATCAAAGAATTTTTCTATGACAGTGGCAATGATAAAGGATATGCCTTTGTTGTAGTATCAAATATGAACTCTACACAGAGTCCTTATGTTGAGAGAGTAACAGTCAATACAAAAGGTAGTGTAGTATCTGGTTCAGATCCTTATGGATATACACAAGGAGATGCAGGACGTGGTGCTTTATTGGATGGTGCAAATATTGCAGCTGCATCACAACATAGTTCTGTTCTATTCAACGAGTGCACCTTTATAACACCTAATCAGGTTGGTCTAAAAATTACCAATGGTATGCGTGTAGAGTGGTTGAATTGCTTCAACTATTTTGCATCTATTGGTATTGAGGGTGCTCAAGGTGCTACAGGTAAATCTGGCACAGGTAGCACTAGATTAAAGTTTGGTGGAACTAGTGGAACATTCTCATCATCTGAGGTTGCGTATCAATTAGAAGATAGTTTTCAGTCAGGAACTTATGCAAGATCTGGATCTACAGTTACATTAACAAGAACTGGACATGGTTTAGTAACAGGCGATTACATATATGCAGATCATATCAGTGGTGGTGCTACAGATGGATTTTATCAAGTCACTTTAGTAGATGCTAATAATGTAACTTACTCTAGTGGATCTGGAACTATATCATCTAGCAATGTAACTTATAAAAAAGCAGTAGCAACTGGAACTGTTGCTAGTAACGATGGCACATATGTATTCATTACTGGTAAGGGAACTGGAGAATTTACAACAGTCAATAAACCAACTAAGACTCTAAGTAGATTTGGTGACTCACAATTAAGCACAGCACAAAAGAAATTTGGAACAGCATCCATATTATTAGATGGAACTGAGGATAACGTAAAAGTTCCTACTGATGAAGACTTTGGATTTGGTTCTGCAAACTTCTGTATAGAAGCATTTATTAGACCTGGCAGTGTAACAGGCACACAAAGAATATTTGATCTTAGAGATAATTCTGCTACAGATACAGCACCTACAGTATATCTTGATGGAACTACTCTACATTATGCAGTAGGAAATACATCACAAATTAATGGTGGAACTTTATCAACTAACACATGGTATCATGTTGCTGTTGCTAGAAGCAATGGAACTACAAGACTATTCTTAGATGGAACTCAATTAGGAACATACACAGATAATAATGACTATGGATCTACAAAACCAGTTATCATAGGATCTAACTATGCTGCATCTCCTGTAGAAGCATTCAATGGATATGTTGACGAAGTAAGAATTAGTAAAGCATCTGCTCGTTTCACTGCAGCATTTACTCCTACAACAACTGAGTATGGTTCTGACTTAAATACTGTTCTATTACTACATGCAAACGGTGACAACGCCTCTACGACCTTTACAGACGTCTCTGGTGGTATATCTGATATTAGATCTAGCGGTGGAGATTCTGCTACATCTGTTATCACTGCTGACTACTCAGCATTTGGTGCTGAACTACGTTCTGTAGCATCTGCATGTGTTTACGGACAGAAAGGTGTACAAGCAGATGGTTCTGGTGTAAAACTCATACTTACTGCACATAACTTTGGTTATGTTGGATCTGGTGATGATTTCACCAATGACCCATCATTAGCAATACAAGCAAATGAGGTAGTAGAACTTAATGGTGGTAAAGTATTATATTCATCTACAGACCAAGATGGTGACTTCCGTGTTGGTGATGCATTTACAGTAGATCAAGAAACTGGTAATGTTCAATTCCAAGCAACATCTTCAGCTCAATCAGCAGCAAACATTACGTTAAGTGATGCTACTGGAACAACTAATATATTCCCTGCATATATTGAAACTGGCAACTTAAGATTTGCGGGTAACAGTATGACTTCTACAGCGGGTCAGGTAATTGTTGACCCAGCTGGTGAAGAAGATTTCGTTGTTAACGCTGAAACAATCGTTAAAGAAGCAGTTTATTTTGATGTTAATAAGTCAATATCATTTGGTAGCACAATTCAAGGTGCTCTAAAAATTGCAGGATTTGGTGGATCTACAGTATTTGGATCTTCAGAAGCATCCAGTTTTTCTACTAGATCATTTGTTCTACTCAAGAATGGATTAGGAACTGTCAACCTAACAGGTGCAGGATCAGGTTATCTTAGTGGACAACAAACAGTAGATGTAACCACAAACCCATTTCAAACTGCACAAGCAACAGCAGTTTTAGGCACTTCGGGTGGATTAAAAACATTTACAGTAACCAATAGAGGAATTGGATACACCGCACTTCCAACAGTAACAATTGATGGATCTGGAAATGGAGCTGCAACCGCAGCGTTTGGTGTCAGTGGTGATATTCGTTCAGTAACGATTGGTAATGGAGGAAGCAACTATGCATCTCCTACAGGAGCTATAGACGCTCCACCAACTAATGTATTTACAGGTGGTGCTACATACGAAGACGCAAATGAAGTTAGTTATCCAGTCGTTGATACATCAGCAAACACAATTTATATCCCAAGTCATACTTTTGAGACAGGGATGGAAGCAATTTTTGATGCATCAACATTAGATGCAACTGCTACTCCCGTAGGTGGTTTAACCTCCAGTCAATCTTACTATGCTATTCGAGTTGACCAAAATCTCCTTAAATTAGCATCTAGTCTATCAGATGCAAATGCAGGAAATGCAATATCATTAACAGGTCAAGGAACAGGAGATCAGTTCTTCCAAGGTAGACAAGCAACAGTTAACGTTGGACAGACTGGTGGTGTTATTGATACCGTGACTGTTACTGATATTGGTTCTGGTTATGGTGCTCAACCAGATCTTACAATTACCGACTCTGCAGGATCAAATGCTACATTTACAGTTAATGTTGGACGTGCAATTAATGCAGTCACTGTAGATACTATTGGATCTTATTCATCTGTACCAAACATCACATTTACAAATGCATCAGGAGATACCACTGGATCAGGTGCTGCTGCTACTGTTGCATTAGGATACGCTGTTGCATCGGTTACATTAAACAATCAAGGTTTAGGTTATAGAAATCTTCCAACTTTAAGTGCTGATGGAACTCCAGTTGCAGCTGCAGCATTCACTGTAGTTTTAAACGAACAAGAAGGTAGAATTGGATCTATAGTTGTTCAAAACGGAGGATCAGGATATGACACTGCACCAACACTAACATTTACTGGTGGAGGTGGTAGTGGTGGTCAACTATTAGCAGATGTTCAATCACTAACTGGAAATATCTCAGCAAATGGATCTGGATATGCACCTGGCGTTTATCCTGACGTAGGATTTACTGTTGTTACCGCTGCAGGTACAGTCTCAACCGTTGCAACTGCTACGTTCACAGTTCCTGGTTTTGACGGAACTATTACAACAGCTGGATCTGGTTATGCAGACGGAACTTATACTAGCGTTCCACTCGTAAACACTCCAACTGCAACTTATACAGTAACTGTTGTAACGAGAGACAAAAATTGA

Genome Context

Tertiary structure

AGH26128.1
ESMFold structure
Source ESMFold
pLDDT 56.3
Oligomeric state monomer

Literature

Title Authors Date PMID Source
The Genome Sequence of Cyanophage MED4-213 Henn,M.R., Sullivan,M.S., Osburne,M.S., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Yu,Q., Coleman,M.L., Huang,K.H., Weigele,P.R., DeFrancesco,A.S., Kern,S.E., Thompson,L.R., Fu,R., Hombeck,B., Chisholm,S.W., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank