GenBank accession
YP_009910765.1 [GenBank]
Protein name
hydrolase
RBP type Evidence Probability
TF UniProt/TrEMBL 1.00
TF GenBank 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.91
Protein sequence
MDRIQLRRDTSTRWKELNPVLLEGEPAFETDTRLRKIGDGVNPYNDLDYLAAENVVQGLGDSTTGAISQNTIAKLAKKQLVLFEGFVNIPTSSVINERYEGLDGTVVYNNTSRHFVYSVNGSYYSSWSDCELYGTPYSYGRVPYINKLYSYSGRTYHVTGFGFAIIEDPVVIPVADELKTSDYLYNIFYNPTTGIETFEPSRLTTDFIAVDEGFIFDYTLTGYSSNIAVAAFDANKNILIDKSVIVNVPSASAPNTGRYIVDSTVRFIRVTRLNNANGLVQTFIHNKADYSSLAKESFNGIKTLMNADFTVEGVYINPSTGLESPNPTYSATPFISIVPKETIYYSGLNGTNGAAIIGIYDKNKTPINLIRGESTYEGRVYVAPEGAAYIRFSYLVTDNPVVIKLISETSKANMTTKNKNNLIINGFDEFVVSGTWSRNGKNLKVTGGGFTAARLFSETFEDEFVMSCKLRTNSAGYFECGIGKGSNAGNYVGVWVTICSNASGSFLKLYYNINGSYVEQTNLRQTIAKGLQGSNWYVLQLSKTTDAASTLSAKVYDNTTGELLASLSTSPNATYAWGYPTCVNINNTNEFSGFTMYYPKNVYPSVAVYGDSFVEGDTVRTTKNVRWSALLQAQLGKKNCLLYGHGGASSKSDTTRILFQINRISSQYAIIALGQNDASFEAWYQYIDYMLNLCKMCDTVPILVTTCPQVNQSESYITKMTSINTWIRNSGYNYIDANMAVAINGVTWKDGYVLSDGIHPSALGHQAIYDRIAFDCPFLFNN
Physico-chemical properties
Protein length: 782 AA
Molecular weight: 86499.03460 Da
Isoelectric point: 5.64050
Aromaticity: 0.12404
Hydropathy: -0.18146
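
The page does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of Phe, Trp and Tyr residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), two of the properties can be recomputed from a protein sequence as follows. The function names are illustrative and not part of any tool cited on this page. Molecular weight and isoelectric point are left out because they depend on additional parameter tables.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence listed above.
    demo = "MDRIQLRRDTSTRWKELNPV"
    print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Passing the full 782-residue sequence instead of the 20-residue excerpt would allow a comparison with the listed values, provided the page used the same definitions.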

Domains

Domains [InterPro]
ID Type Range
SSF69349 STR 3–74
IPR041352 ATT 4–41
IPR041352 ATT 4–45
cd00229 ENZ 606–773
IPR013830 ENZ 608–767
Architecture (YP_009910765.1, residues 1–782)
ATT 3-45 | STR 46-774
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (YP_009910765.1, residues 1–782): N-terminal, Central, C-terminal
Domain Start End Length (AA) Confidence
N-terminal 1 173 173 0.8963
Central domain 174 381 209 0.8873
C-terminal 382 782 400 0.3961
Legend: N-terminal Central domain C-terminal
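
The segmentation table can be sanity-checked with a few lines of arithmetic. The sketch below assumes that Start and End are 1-based, inclusive residue positions, which the page does not state. It verifies that the three domains tile residues 1 to 782 without gaps or overlaps and returns the computed length next to the listed one. All names are illustrative.

```python
PROTEIN_LENGTH = 782

# (name, start, end, listed_length) copied from the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 173, 173),
    ("Central domain", 174, 381, 209),
    ("C-terminal", 382, 782, 400),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def check_segmentation(domains, protein_length):
    """Return (name, computed_length, listed_length) rows.

    Raises ValueError if the domains do not tile 1..protein_length exactly.
    """
    rows = []
    expected_start = 1
    for name, start, end, listed in domains:
        if start != expected_start:
            raise ValueError(f"{name}: expected start {expected_start}, got {start}")
        rows.append((name, inclusive_length(start, end), listed))
        expected_start = end + 1
    if expected_start != protein_length + 1:
        raise ValueError("domains do not reach the end of the protein")
    return rows


if __name__ == "__main__":
    for name, computed, listed in check_segmentation(DOMAINS, PROTEIN_LENGTH):
        flag = "" if computed == listed else "  (differs from listed length)"
        print(f"{name}: computed {computed}, listed {listed}{flag}")
```

Under the inclusive assumption the central and C-terminal lengths come out as 208 and 401, one residue off the listed 209 and 400. Both sets of lengths sum to 782, so the listed lengths may simply place the central/C-terminal boundary one residue later than the listed coordinates.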
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-173
Central 174-381
C-terminal 382-782

Taxonomy

Role Name Taxonomy ID Lineage
Phage Bacteroides phage crAss001 [NCBI] 2301731 Uroviricota > Caudoviricetes > Crassvirales > Asinivirinae > Kehishuvirus
Host Bacteroides intestinalis [NCBI] 329854 cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota

Coding sequence (CDS)

GenBank protein accession
YP_009910765.1 [NCBI]
GenBank nucleotide accession
NC_049977 [NCBI]
CDS location
range 20095 -> 22443
strand +
CDS
ATGGATAGAATACAATTAAGAAGAGACACATCAACAAGATGGAAAGAATTAAACCCTGTATTACTTGAGGGAGAACCAGCTTTTGAAACTGATACAAGACTAAGAAAGATTGGGGATGGTGTTAATCCTTATAATGACTTAGACTATTTAGCAGCAGAGAATGTTGTTCAAGGTTTAGGAGATAGTACTACAGGAGCTATATCACAAAATACTATAGCTAAATTAGCTAAGAAACAATTAGTTTTATTTGAAGGATTTGTGAATATTCCTACTTCCTCAGTTATAAATGAAAGATATGAAGGTCTTGATGGAACTGTTGTTTACAATAATACTTCCAGACACTTTGTTTACTCAGTAAATGGTAGCTACTATAGTAGTTGGAGTGATTGTGAACTATATGGTACTCCTTACTCTTATGGTAGAGTACCCTATATTAATAAACTTTATAGTTATTCAGGTAGAACATACCATGTTACAGGTTTTGGTTTTGCAATTATAGAAGACCCTGTAGTTATTCCTGTTGCAGATGAACTTAAGACATCTGATTATTTATATAATATATTTTATAATCCAACAACAGGTATTGAAACTTTTGAACCTTCAAGATTAACTACAGATTTTATAGCAGTTGATGAAGGTTTTATATTTGATTACACTTTAACAGGGTATAGTTCAAATATTGCAGTTGCAGCATTTGATGCTAATAAGAATATCCTAATAGATAAGAGTGTTATAGTAAATGTCCCATCAGCATCAGCTCCTAATACAGGAAGGTATATAGTAGATAGTACTGTTAGGTTTATTAGAGTTACAAGATTGAATAATGCTAATGGATTAGTTCAAACCTTTATTCATAATAAAGCAGATTACTCTTCTTTGGCAAAAGAATCATTCAATGGTATTAAAACACTTATGAATGCTGACTTCACTGTAGAAGGTGTATATATTAATCCAAGTACAGGGTTGGAATCTCCTAATCCAACTTATTCAGCTACTCCTTTTATATCTATAGTACCTAAAGAAACTATATACTATAGTGGGCTTAATGGTACAAATGGTGCAGCTATTATAGGAATATATGATAAGAATAAGACACCAATTAATCTAATTAGAGGAGAAAGTACTTATGAAGGAAGGGTTTATGTAGCTCCAGAAGGAGCAGCTTATATTAGATTTAGTTATTTAGTAACCGATAATCCTGTGGTTATTAAACTTATAAGTGAAACTTCTAAGGCTAATATGACTACTAAAAACAAGAATAATCTTATTATTAATGGATTTGATGAGTTTGTAGTTAGTGGAACATGGTCAAGGAATGGTAAGAACCTTAAAGTAACTGGAGGTGGATTTACTGCTGCAAGATTGTTCTCAGAAACATTTGAGGATGAATTTGTTATGAGTTGTAAGCTAAGAACAAATAGTGCAGGCTATTTTGAATGTGGTATTGGTAAAGGTTCTAATGCAGGTAATTATGTAGGTGTTTGGGTTACTATATGTAGTAATGCAAGTGGTTCATTCTTGAAGCTATATTATAATATAAATGGTTCTTATGTAGAACAGACTAATTTAAGACAAACTATTGCAAAGGGTTTACAAGGTAGCAACTGGTATGTACTACAACTAAGTAAAACTACTGATGCAGCTTCTACATTATCAGCTAAGGTTTATGATAATACAACTGGTGAACTGTTAGCAAGTTTATCTACTTCTCCTAATGCTACTTATGCTTGGGGTTATCCAACTTGTGTTAATATTAATAATACAAATGAATTCTCAGGATTCACTATGTATTATCCTAAGAATGTTTATCCATCAGTAGCAGTATATGGAGATAGCTTTGTTGAAGGAGATACAGTAAGAACTACTAAGAATGTAAGATGGAGTGCTCTACTACAAGCTCAACTTGGAAAGAAAAACTGTTTATTGTATGGTCATGGAGGTGCTTCTTCTAAGAGTGATACTACAAGAATACTGTTTCAGATAAATAGAATAAGCAGTCA
GTATGCTATAATAGCACTTGGTCAGAATGATGCTTCCTTTGAAGCATGGTATCAATATATAGATTATATGCTTAATCTATGTAAGATGTGTGATACTGTGCCAATACTTGTAACAACTTGTCCTCAAGTTAATCAGAGTGAAAGTTATATTACAAAAATGACTTCAATCAACACATGGATTAGAAATTCAGGGTATAATTATATTGATGCAAACATGGCTGTAGCAATAAATGGGGTAACATGGAAAGATGGGTATGTATTATCTGATGGTATTCATCCTTCTGCATTAGGACATCAAGCTATCTATGATAGAATAGCTTTTGATTGTCCATTCTTGTTTAACAATTAA
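
The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range 20095 -> 22443 is 1-based and inclusive, as is usual for GenBank feature locations, and that the final codon is a stop codon (the listed sequence ends in TAA). The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive CDS range."""
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the CDS.

    Raises ValueError if the length is not a whole number of codons.
    """
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(cds_length(20095, 22443))        # 2349 nucleotides
    print(encoded_residues(20095, 22443))  # 782 residues
```

The result, 782 residues, agrees with the protein length listed under the physico-chemical properties.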

Genome Context

Tertiary structure

PDB ID
1394f9e40781625002732912357272421948040009119555cbe766152f954d17
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.6922
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
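
The confidence bands above translate directly into a small lookup. This is a minimal sketch with an illustrative function name. The legend uses strict inequalities and does not say where a score of exactly 90, 70 or 50 belongs, so assigning those values to the lower band is an assumption made here.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```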

Literature

Title: PhiCrAss001, a member of the most abundant bacteriophage family in the human gut, infects Bacteroides
Authors: Shkoporov,A.N., Khokhlova,E.V., Fitzgerald,C.B., Stockdale,S.R., Draper,L.A., Ross,R.P. and Hill,C.
Date: 2018-06-26
PMID: not listed
Source: GenBank