Genbank accession
YP_007675684.1 [GenBank]
Protein name
hydrolase
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect2
Probability 0,86
Protein sequence
MMKTILLLAFSLCSFFAVAQTQDVTIAVPTPDQIIDTINNRPEGKKISASAIEGTFEDSVLSAEAQASLARANESITSELTNEPKGSIKSDNAVSISYLEYLIAVNSQKNKSNTIYNVSSEEDVLNQDLVNSTFIDLATPYDYISETHMITDDIHDYDVRGGFSHDVTSQSVAFPVSLAKTNPTNVSGSGYNATTWEAAFVVKGDFAIRVAGNDDSMSRQRVLINGKPYQFINDTGLEDGGTRRWAVFQMRKGKEYEIRIQFSQAGSFYGFKTEVGGFIKALDPDETILFFGDSITASTGATKGVYGYALSVFAGKNANVLLSGIGGTGYVNTVGGTQYNLPERIVQNYTDLVAESGVPDKVVIAMGINDLSLSGIEVAANSAFDLLRTVYSGEVYVLNAFNANAPTKSTAYQSFESNLLAAVDGRDGFTFIDVSELSYTKFDAVHPDDAGHATIAKFLYPYLVDESKYIKKIPASAIEDESLTELKLSESLVDRLNEVNSTISEDIKNLHSSSDPAGGNESNIINAWVPATSATIESNNLDSYLGSLSIKATKTNSGSAVIYSPNYTVEVGKTYIFKCRVKVGGGFVSGSNMYLSDKTGAFLQSFINEADLGWQELTQTLSFSKTRTRVNISLSAQPANATLLIDAVELYEVGEGIKAFPEAYGFGSVSTGGRGGSVRKVTNLNNSGLGSLRAACELDNSIVIFETAGTIDLDSPISVGNNVSIYGQTAFRNGGQGITLKASNTNESSLMVFSGKDNLIVQYIRFRRGVTPFVTSATAGQNLALANAATKIMIDHCSFGWDEDESLTIWDASEVTVQNSIVTNSLMVNDYGRQKTSKSLIVGNSADKVSIYKTLIGNADQRNALFGGSTSPIEQFEFKNNLIFNWGSIGTDFAGSQLPFKVNIIGNKWKAGHNSNISRHGLRATGNTGDLFYLEGNITISRPTNDLDEWLAIGDSATPSSTLATTFQQNTPFDYPLQYAPTYGIEELESDVLKQMGVNLYVDYPDMLAKSHYTHGNGFIMNKPADIGGYPVLSGFKKAIQDSNNDGIPDDFAKVNGISSSNQIIPFYKFGTWQFDNSFKYTAIEVYAYYLSKN
Physico‐chemical
properties
protein length:1094 AA
molecular weight: 118452,51850 Da
isoelectric point:4,79672
aromaticity:0,10055
hydropathy:-0,17733

Domains

Domains [InterPro]
cd00229
ENZ
288–462
IPR013830
ENZ
290–453
YP_007675684.1
1 1094
Architecture
ATT
STR
ATT 18-278 | STR 279-997 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007675684.1
1 1094
Domain Start End Length (AA) Confidence
N-terminal 1 121 121 0,8625
Central domain 122 322 202 0,7820
C-terminal 323 1094 771 0,2889
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-121
Central
122-322
C-terminal
323-1094

Taxonomy

  Name Taxonomy ID Lineage
Phage Cellulophaga phage phiSM
[NCBI]
756280 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Cellulophaga baltica
[NCBI]
76594 cellular organisms > Bacteria > Pseudomonadati > FCB group > Bacteroidota/Chlorobiota group > Bacteroidota
Host Cellulophaga sp. MM#3
[NCBI]
192170 Bacteria > Bacteroidetes > Flavobacteriia > Flavobacteriales > Flavobacteriaceae > Cellulophaga

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007675684.1 [NCBI]
Genbank nucleotide accession
NC_020860 [NCBI]
CDS location
range 7209 -> 10493
strand +
CDS
ATGATGAAAACAATTTTATTACTGGCGTTTTCGCTTTGCAGCTTTTTTGCAGTAGCTCAGACTCAAGACGTGACAATAGCTGTTCCGACACCTGACCAGATTATTGACACGATAAACAATAGGCCTGAAGGAAAAAAAATCTCAGCTTCAGCTATTGAAGGAACTTTTGAAGATAGTGTTTTAAGCGCTGAAGCTCAAGCATCTTTAGCTAGAGCAAATGAATCTATAACCAGCGAATTAACTAACGAACCGAAAGGCTCAATTAAATCTGATAACGCTGTATCTATAAGCTACTTAGAGTATCTTATAGCTGTAAATTCGCAAAAAAATAAATCTAATACGATTTACAATGTTAGCAGTGAGGAAGATGTTTTAAATCAGGATTTAGTTAATTCAACTTTTATTGATTTAGCAACTCCTTACGACTACATATCAGAGACACACATGATAACAGATGATATACATGACTACGATGTTCGTGGTGGTTTTTCTCACGATGTTACGAGCCAATCTGTAGCATTCCCCGTATCTTTAGCTAAGACAAATCCCACCAATGTATCTGGATCAGGATACAACGCGACAACGTGGGAAGCTGCTTTTGTTGTTAAAGGTGATTTCGCTATTAGAGTAGCTGGAAACGACGACAGTATGTCTAGGCAAAGAGTTTTGATAAATGGAAAGCCTTACCAGTTCATAAATGATACTGGATTAGAAGATGGAGGCACTAGAAGATGGGCTGTTTTCCAAATGAGAAAAGGAAAGGAATATGAAATTAGGATACAGTTTTCTCAAGCCGGTTCTTTTTATGGTTTTAAAACTGAAGTTGGTGGATTTATAAAAGCTCTTGACCCTGATGAGACTATTCTTTTCTTTGGTGATAGCATAACAGCTTCAACAGGAGCAACGAAGGGAGTTTATGGTTATGCTCTTAGTGTCTTTGCTGGTAAGAACGCGAATGTACTTCTTTCAGGTATAGGAGGAACTGGTTATGTTAATACAGTTGGAGGCACTCAATACAATTTACCTGAAAGGATTGTCCAGAATTACACTGATTTAGTGGCTGAGTCAGGGGTTCCTGATAAAGTTGTTATAGCGATGGGAATTAACGATTTAAGCTTGTCAGGCATAGAAGTCGCAGCGAATAGCGCTTTTGATTTGCTTAGAACTGTCTATAGTGGCGAAGTTTATGTGTTAAATGCTTTTAATGCTAACGCACCAACTAAATCAACAGCTTACCAGAGTTTCGAAAGCAATCTTTTAGCTGCTGTAGATGGAAGAGATGGGTTTACTTTTATAGATGTATCAGAGCTTTCCTATACAAAGTTTGACGCTGTTCATCCTGATGACGCTGGTCACGCTACTATAGCGAAATTTTTATACCCATATTTGGTTGATGAAAGTAAATACATCAAAAAGATACCAGCTTCAGCGATAGAGGATGAAAGCTTAACTGAATTAAAGCTTTCTGAATCCTTAGTAGATAGGTTGAATGAAGTGAACTCAACTATATCCGAAGACATAAAAAATTTACACAGTTCAAGCGACCCTGCAGGTGGTAATGAATCAAATATAATTAACGCATGGGTTCCAGCGACTTCAGCTACAATAGAGTCAAATAACCTAGATAGTTATTTAGGCAGCTTATCTATTAAAGCTACGAAAACAAATTCAGGGTCTGCTGTTATTTATTCACCTAATTACACAGTTGAAGTAGGTAAAACATATATTTTTAAATGTAGAGTTAAAGTAGGTGGTGGTTTTGTTTCAGGTAGTAATATGTATTTATCTGATAAAACAGGTGCTTTTTTGCAATCATTTATTAATGAAGCTGATTTAGGTTGGCAGGAACTAACGCAAACCTTAAGTTTTAGCAAAACTAGAACTAGAGTTAATATATCATTAAGTGCGCAGCCCGCGAATGCAACATTATTAATTGATGCTGTAGAATTATATGAGGTGGGTGAAGGAATTAAAGCTTTTCCTGAAGCTTATGGTTTTGGTTCTGTATCTACAGGAGGTAGAGGTGGTTCTGTTAGAAAAGTTACTAATCTAAATAATAGTGGGTTAGGTAGCTTGAGAGCAGCGTGTGAGCTTGATAATTCTATAGTAATATTTGAAACTGCAGGTACAATAGATTTAGATTCTCCTATAAGTGTAGGTAATAACGTTTCTATTTATGGTCAAACAGCTTTTAGAAACGGGGGTCAAGGAATAACGTTAAAAGCTTCAAATACTAATGAATCTAGTTTAATGGTTTTTAGTGGTAAAGATAATTTGATAGTTCAATACATAAGATTTAGAAGAGGAGTTACGCCTTTCGTTACATCTGCCACAGCTGGTCAGAATTTAGCTCTAGCCAATGCAGCTACTAAAATAATGATAGACCATTGCTCTTTTGGATGGGATGAAGACGAAAGCCTTACTATCTGGGATGCGTCCGAAGTTACTGTACAAAACTCTATTGTTACAAATTCTTTAATGGTTAATGATTACGGTAGACAAAAAACAAGTAAGAGTTTGATAGTGGGGAATAGTGCAGATAAAGTTTCTATATATAAAACACTTATAGGAAACGCTGATCAAAGGAACGCTTTATTTGGTGGTAGTACTTCTCCTATAGAACAATTTGAATTTAAGAATAATCTTATATTTAATTGGGGTTCTATTGGAACTGATTTTGCGGGTTCTCAATTACCTTTCAAGGTTAATATTATTGGGAATAAATGGAAAGCGGGTCATAACTCCAACATAAGTAGGCATGGACTTAGAGCTACTGGTAATACTGGTGATTTATTTTACCTAGAAGGCAATATAACTATATCACGCCCTACAAATGATTTAGATGAATGGTTAGCTATTGGTGATTCAGCGACTCCATCATCTACTTTAGCTACCACGTTTCAACAAAACACTCCATTTGACTACCCGCTACAATACGCACCTACTTATGGTATAGAAGAATTAGAAAGTGATGTATTGAAGCAAATGGGTGTAAATTTATACGTAGACTACCCTGATATGTTAGCGAAATCACATTATACTCACGGAAACGGGTTTATTATGAATAAACCCGCAGACATAGGTGGTTACCCTGTTTTATCAGGCTTCAAAAAAGCTATACAAGATTCTAATAATGATGGTATTCCTGATGATTTTGCTAAAGTTAATGGAATAAGTTCTAGTAATCAAATAATACCTTTTTATAAATTCGGAACTTGGCAATTTGACAACTCTTTTAAGTATACAGCGATAGAGGTTTATGCTTATTACTTAAGTAAAAATTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
747b216abd5ea96d9ad8927041cd766195ed67f75764cd3d1394527908827b5e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6780
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
The Genome Sequence of Cellulophaga phage phiSM Henn,M.R., Reimann,L., Holmfelt,K., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Fitzgerald,M.F., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Neiman,D., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B., Nusbaum,C. and Birren,B. 2011-09-23 GenBank