Genbank accession
AIK68327.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,93
Protein sequence
MARKTIDELPVATPEAVATGSLIIADENGLEYRATTAIIGGVHLYFTAEDDQTPWIFGRLTSPGRTVEEITVTFVAVPRGGGTTLTKTTKTDSLGNFTYNFTTLVRGVTYDITASVAPYLAEVTVKYVAPLSAPIDTPVATSTGKVGDTITVSAPNFDGSPTSFKYRYLRGPNYIIPGANESVYTLTPDDDADVITAQVQAVNAGGESIWYSANPIGPVSNKTSVAISDPVIIGDNTLGGRLVLTSPGTATENAIVKSDEWQLDGVKIPSTKAPLKRYLAGAANGSASASYPSGNNHFGVALISTMYTGSNTYSYSIAANATENDYIALNVGASGGNEGPQLYDLYMRHLNTNAAIRIKMYAAKYRGGVLQGEEIQMAKVGAAAGQEYSALAVTTNSVSRWTLTADFGEWVSGDYLVFRQQSTNVSSAGSNNYQWDARSGASYLMAPIGSVQDIQKTLKTFTSKADMGTKAARLKQTLSNGNNDVVLYSNEIVYEAAVTQTEVPENLTLPTIVTNSDRANTTWQADLGAWSNQPTAYNVQWTDNGVDIAGATLMNYKSTTAQEGHDITFYVIAQNAIGNSEPAIAVAHTVQAAATYYNIATGNDLNDGSTEALAKQSLASTESIPSGSTAILAGDFGTRLKLGNSRNYEGLGAGLTSIGSLTTQYAIDHYTDDGSLGRSNVKISKMTINSADRAILARQGANWVLEDIVIDSAGFGAGVIAENASGIMFYKNNGITMRRCSLLNIKSDGLYLDTIDRAIVEDCLFLPVNTAEGDTIQTRADRTSSSGAPGPHQKGFIMSGTFLDMHSKKTGSGKGCLVTNMADYVYCHDNQLDGNNFVHGTDEGDCQVYCRNTNRYARMNSYSFGYGIGGYDNQPASYNHQLYDNSWYDINRALTFTGISVTGYSGTKSGRVDIVAHDETIVKCASGIRIDRPTSGIFRGFVFHNVTRPVDRVLTTLPPGGEVQAFISDSHYTYNGSILVPPTVTTHAVITGDRTVGSELTGPNSVFDTTAVLTAFPGAVITRSYQWRRHKPQVQWAGWSQHLGWKCEWIAGATSASYTITDDDQGCLISRVDRIHLSFTEASVAKVVTALSYDASYATSTPIPRSGDLAVPLPVIPSSGTVSFAAAEGAVALNLPALITGETRTLLNPSRAHEVYTGNLLTVDSNGDILKGAGTFTLGQTFGMKLRQKRGIDTVDSTISFTVVA
Physico‐chemical
properties
protein length:1205 AA
molecular weight: 128508,61570 Da
isoelectric point:5,31811
aromaticity:0,08548
hydropathy:-0,16680

Domains

Domains [InterPro]
DC_0400
STR
216–601
IPR011050
STR
584–947
AIK68327.1
1 1205
Architecture
ATT
STR
RBD
ATT 111-252 | STR 253-947 | RBD 949-1205
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AIK68327.1
1 1205
Domain Start End Length (AA) Confidence
N-terminal 1 291 291 0,7671
Central domain 292 490 200 0,4020
C-terminal 491 1205 714 0,2262
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-291
Central
292-490
C-terminal
491-1205

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage vB_RleM_P10VF
[NCBI]
1527770 Uroviricota > Caudoviricetes > Pootjesviridae > Innesvirus P10VF >
Host Rhizobium leguminosarum
[NCBI]
384 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Hyphomicrobiales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AIK68327.1 [NCBI]
Genbank nucleotide accession
KM199770 [NCBI]
CDS location
range 76348 -> 79965
strand -
CDS
ATGGCACGCAAAACAATCGATGAACTTCCAGTTGCGACTCCTGAAGCCGTAGCTACCGGAAGTCTGATCATCGCTGATGAGAACGGCCTAGAGTATCGAGCAACGACTGCTATTATCGGCGGCGTTCATCTGTACTTCACTGCCGAAGACGACCAGACTCCGTGGATCTTTGGACGACTCACGAGTCCTGGCCGTACTGTCGAAGAAATCACTGTGACCTTCGTCGCTGTTCCTCGTGGAGGCGGTACGACTCTCACGAAGACTACAAAGACAGACTCACTCGGCAACTTCACATACAACTTCACAACTCTTGTTCGTGGTGTCACATACGACATCACAGCGAGTGTTGCTCCGTATCTCGCGGAAGTGACTGTCAAGTATGTCGCTCCACTGTCTGCACCTATCGACACACCAGTTGCAACTTCGACCGGTAAGGTCGGCGACACAATCACCGTTTCAGCTCCGAATTTCGACGGAAGCCCGACATCGTTCAAATATCGGTATCTCCGCGGTCCGAACTACATCATCCCCGGAGCAAACGAATCAGTCTATACTCTCACTCCTGACGACGACGCGGACGTCATCACGGCTCAAGTCCAGGCAGTCAACGCAGGAGGCGAGAGCATCTGGTATAGCGCAAATCCGATCGGTCCTGTCTCGAACAAAACGTCTGTGGCGATCTCGGATCCGGTCATCATAGGAGACAACACTCTCGGCGGACGTCTGGTTCTCACGTCTCCAGGAACTGCAACAGAGAATGCTATTGTCAAGTCTGACGAATGGCAACTTGACGGAGTCAAGATCCCGTCGACGAAAGCTCCTTTGAAGCGATATCTCGCTGGTGCTGCGAATGGATCAGCTTCAGCATCGTATCCTTCAGGCAACAACCACTTCGGCGTAGCTCTTATAAGCACGATGTACACAGGATCGAACACTTACAGTTATTCGATTGCCGCGAATGCTACAGAAAACGATTACATCGCTTTGAACGTAGGAGCTTCGGGCGGAAACGAAGGACCGCAACTCTACGATCTTTACATGAGACATTTGAATACGAACGCTGCTATTCGTATCAAGATGTATGCAGCTAAATATCGCGGTGGTGTTCTACAAGGCGAAGAAATCCAGATGGCGAAAGTCGGAGCCGCAGCTGGACAAGAGTATTCGGCTCTTGCAGTCACTACGAATTCTGTGTCTAGATGGACGCTGACTGCTGATTTCGGCGAATGGGTCTCTGGTGATTATCTCGTCTTCCGTCAGCAATCTACGAACGTCTCGTCAGCTGGTTCAAACAACTACCAGTGGGATGCACGATCTGGCGCTTCGTATCTGATGGCTCCGATCGGCTCCGTCCAAGACATTCAGAAGACTCTGAAGACGTTCACGAGTAAAGCAGACATGGGTACAAAGGCAGCTCGGCTCAAGCAGACTCTATCGAACGGAAACAACGATGTAGTGTTGTATTCGAATGAGATCGTCTACGAAGCTGCTGTAACTCAGACTGAGGTTCCTGAAAATCTTACTCTTCCGACTATCGTCACAAATTCGGACAGAGCTAACACTACGTGGCAAGCTGATCTCGGTGCATGGAGCAATCAGCCAACAGCATACAATGTTCAGTGGACCGACAACGGTGTCGACATCGCTGGTGCTACACTGATGAACTACAAGTCAACGACTGCTCAAGAAGGACACGACATCACGTTCTACGTGATCGCCCAAAACGCAATCGGTAATTCTGAGCCAGCGATTGCTGTAGCTCATACTGTTCAAGCAGCTGCAACGTACTACAACATCGCTACTGGTAACGATCTCAACGATGGTTCGACTGAAGCACTTGCGAAACAGTCTCTTGCAAGTACTGAATCTATTCCATCTGGATCGACAGCAATTCTCGCTGGTGACTTCGGCACTCGGTTAAAACTAGGTAACAGCAGAAACTACGAAGGTCTTGGTGCTGGACTCACGTCTATCGGTTCGTTGACGACTCAATACGCGATCGATCATTACACCGACGACGGCTCGCTCGGTCGATCCAACGTCAAGATTTCGAAGATGACGATCAACTCGGCGGACCGTGCTATCTTGGCTCGCCAAGGAGCAAACTGGGTTCTCGAAGACATCGTTATCGACAGCGCCGGCTTCGGAGCTGGTGTCATTGCCGAAAACGCTTCTGGCATCATGTTCTACAAGAATAACGGCATCACGATGCGTCGATGCTCGCTTCTGAACATCAAGTCTGACGGCCTCTATCTCGACACGATCGACAGAGCCATTGTCGAAGACTGCCTCTTTTTGCCAGTCAACACTGCAGAAGGCGACACGATTCAGACTCGCGCTGACAGAACTTCGAGCAGTGGTGCACCAGGTCCTCACCAAAAGGGCTTCATCATGAGCGGCACCTTCCTCGACATGCATTCGAAGAAGACTGGATCTGGTAAAGGCTGCTTGGTCACCAACATGGCCGATTACGTCTACTGCCATGACAACCAACTCGACGGCAATAACTTCGTTCACGGTACTGACGAAGGTGACTGCCAGGTCTACTGCCGCAACACCAACCGCTACGCTCGAATGAACAGCTACTCATTCGGCTACGGCATCGGTGGCTACGACAACCAGCCTGCTTCGTACAATCACCAGCTCTACGACAACTCGTGGTACGACATCAACCGTGCTCTGACCTTCACCGGTATCTCGGTGACTGGCTACTCTGGTACGAAATCCGGTCGTGTCGATATCGTGGCTCACGACGAAACCATCGTCAAGTGCGCGTCCGGTATCCGTATTGACCGTCCGACATCTGGTATCTTCAGAGGCTTCGTCTTTCATAACGTGACGAGACCTGTTGATCGAGTGCTTACGACTCTGCCTCCTGGCGGTGAGGTTCAGGCATTCATCTCTGACAGTCACTACACGTACAATGGATCGATCCTCGTACCTCCGACTGTAACGACTCATGCTGTTATCACTGGTGATCGAACCGTCGGTTCTGAACTGACTGGTCCGAACAGCGTCTTCGATACGACAGCGGTGCTGACTGCATTTCCAGGTGCTGTGATCACTCGTTCGTATCAGTGGCGTCGACACAAGCCTCAAGTCCAGTGGGCTGGATGGTCTCAGCATCTCGGCTGGAAGTGCGAATGGATCGCTGGTGCAACGTCTGCGAGCTACACGATCACAGATGACGATCAAGGTTGCCTCATCAGTCGTGTCGATCGTATTCATCTGTCGTTCACTGAAGCGAGTGTCGCGAAGGTTGTCACGGCTCTCTCGTATGACGCATCCTACGCAACTTCGACTCCGATCCCTCGCAGTGGTGATCTTGCTGTTCCGCTTCCGGTCATTCCGTCGTCTGGAACAGTCTCGTTCGCTGCAGCAGAAGGTGCAGTGGCTCTCAATCTTCCTGCTCTCATCACAGGCGAGACTCGGACACTGCTCAATCCTTCGAGAGCTCATGAAGTCTACACCGGCAACTTGCTGACTGTCGACTCCAACGGAGACATCTTGAAAGGAGCTGGCACGTTCACTCTCGGACAGACGTTCGGTATGAAGCTGCGCCAGAAACGAGGCATCGATACTGTCGATAGCACGATCTCCTTCACGGTCGTAGCGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
9ef8d201b4d93e802b256328331eaaab60f10bcbf9aa4cebd92f78fe7b12e32d
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6503
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Isolation and characterization of Rhizobium leguminosarum phages from western Canadian soils and complete genome sequences of rhizobiophages vB_RleS_L338C and vB_RleM_P10VF Restrepo-Cordoba,M., Halmillawewa,A.P., Perry,B., Hynes,M.F. and Yost,C.K. 2015-09-16 GenBank