Genbank accession
CEK40234.1 [GenBank]
Protein name
Tail protein/endopeptidase (modular protein)
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MLRLYKANETNFKNNGLGILKDAFETKVTEELNGLFELEFTYYVGSFLFDEIDYNKIVMADASPRLKNQLFRIYYISKELDGKILVKAEHISYDLLNNFIENLELKNVTCEEALNQIFRSCTEENRFVGYSDITGNKDFSISCVSPHNAISDIQELFNNKSKLKRDNFNISLLNNIGESNNVLLAYRKNIIGLTATYDTQEVITKIYPYATKKSGKNKKITLPEKYIESKYINNYPTQRIVAVDFSDDDVKNEESLRNKCKDYFIENNVDLPKVTYKVEFVDLSTTEDYKSYKMLETVNMDDEIIVRDYNLGINATARVVKTEYNPVLKKYNSVEIGDLVNHFKDERIDDLEEKIDKVQNNVDNIVIESDNFPDTLPEPSNVTALGLWSMIQLDWTFDNKLYYNYEVYASQIKGFEPDTAGYTNRIYVGQASSLLHEVKPMQTWYYRVRAGNTHDNYNEFSNEVSATTRKLSDAAEYFEEAAIGHAVIRDLDADKINVGKVKGQYIEAKNLVVVDGNSQTTLNIDSFGNVHIGATTFTLKGKSLESIIGGEIDDITQLEIFNKLTNNGLAKGLYMVGNELYLNASYIKTGTLEGQFINGRNLTVRDNDGYTTLQVDSNGKVNIRANELSIGDKNNYESVLTSDQKAVFDALTGNRNCGIYLSGSRLYINADYIDTGTILCDRIGASSSNPFILLFEGNGAKCALDATAQFGVGIGKAMRMKYNDYSYIYVSDDVISGYLDGEEVFEFGYQDEKSYINTGLNTQHINPQIDSWYNCGSNRKAWDYLVCNNLNQLARTAATSTRMMKSTFNEEISNNCIDFVKSSLVQSDVFTPYNLKSMKNNNIEHRLQVDIDSSLDNPISKYIFKDVSDEAGEGVYAQDITSHLAVLQLSLQKTISNFENYKDITSKKIEELTKRIEVLESR
Physico‐chemical
properties
protein length:922 AA
molecular weight: 104679,95280 Da
isoelectric point:4,89380
aromaticity:0,10412
hydropathy:-0,44870

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage phiCD24-1
[NCBI]
1582149 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CEK40234.1 [NCBI]
Genbank nucleotide accession
LN681534 [NCBI]
CDS location
range 18640 -> 21408
strand +
CDS
TTGCTTAGATTATATAAAGCAAATGAAACAAATTTTAAAAATAATGGTTTGGGGATTCTAAAAGATGCATTTGAAACAAAAGTTACAGAAGAATTAAATGGACTATTTGAGTTAGAGTTTACATATTATGTGGGCTCTTTTTTATTTGATGAAATTGATTATAATAAAATAGTTATGGCTGATGCTTCTCCAAGGCTTAAAAATCAATTATTTAGGATTTACTATATTTCTAAGGAACTTGATGGAAAAATATTAGTTAAAGCAGAACACATTAGTTATGATTTATTAAATAATTTTATAGAAAATTTAGAACTTAAAAATGTTACCTGTGAAGAAGCTCTTAATCAAATATTTAGGTCATGTACTGAAGAAAATAGATTTGTAGGATATTCAGATATAACAGGAAATAAAGACTTTTCCATCTCTTGTGTAAGCCCACATAATGCTATATCTGACATACAAGAATTATTTAATAATAAGTCTAAACTAAAAAGAGATAATTTCAATATAAGTTTATTAAATAATATTGGAGAAAGTAACAATGTATTACTAGCATATAGGAAGAATATAATAGGTCTAACAGCTACTTATGATACACAAGAAGTTATAACTAAGATATATCCATATGCAACTAAAAAAAGTGGTAAAAATAAAAAAATTACACTACCAGAAAAATATATAGAAAGCAAGTATATAAACAATTATCCAACTCAAAGAATTGTTGCAGTTGATTTTTCAGATGATGATGTTAAGAATGAAGAAAGCTTAAGGAATAAATGTAAAGACTATTTTATTGAAAATAATGTTGATTTACCTAAAGTTACTTACAAAGTAGAATTTGTAGATTTATCAACTACAGAAGATTATAAAAGCTATAAGATGCTTGAAACTGTAAATATGGATGATGAGATAATAGTTAGAGACTATAACTTAGGTATAAATGCTACTGCAAGGGTAGTAAAAACTGAATATAATCCAGTATTAAAGAAATACAATTCGGTTGAAATTGGTGACTTAGTAAATCATTTTAAAGACGAAAGAATAGATGATTTAGAAGAAAAAATAGATAAGGTTCAAAATAATGTAGATAATATTGTAATTGAAAGTGATAATTTTCCGGATACACTTCCAGAGCCTTCAAATGTCACAGCATTAGGGTTATGGAGTATGATTCAGTTAGATTGGACTTTTGATAATAAACTATATTATAACTATGAAGTATATGCATCTCAAATAAAAGGATTTGAACCAGATACTGCAGGATATACAAATAGGATATATGTGGGTCAAGCAAGTTCATTACTTCATGAAGTTAAGCCTATGCAAACTTGGTATTATAGAGTAAGGGCAGGAAATACACATGATAACTATAATGAATTTTCAAATGAAGTTAGTGCAACAACTAGAAAGTTAAGTGATGCAGCTGAATATTTTGAAGAAGCAGCTATAGGTCATGCAGTTATAAGAGATTTAGATGCAGATAAAATAAATGTAGGAAAAGTAAAAGGGCAGTATATTGAGGCAAAGAATTTAGTTGTAGTTGATGGAAATAGCCAAACAACTTTAAATATAGATAGTTTTGGAAATGTTCATATAGGAGCAACTACCTTTACATTAAAAGGAAAGTCGTTGGAATCTATTATTGGCGGAGAAATAGACGATATAACTCAACTTGAGATATTTAATAAACTAACTAACAATGGTCTTGCAAAGGGACTCTATATGGTTGGGAATGAATTATATCTAAATGCTAGTTATATAAAAACAGGTACTTTAGAAGGACAATTTATAAATGGTAGAAATTTAACTGTCAGAGATAATGATGGATATACAACGTTACAAGTAGATAGTAATGGAAAAGTGAATATAAGAGCTAATGAGTTATCTATAGGTGATAAAAATAACTATGAAAGTGTTTTAACTAGTGACCAAAAAGCAGTATTTGATGCATTAACAGGGAATCGAAATTGTGGAATATATTTAAGTGGAAGTAGATTATATATAAATGCTGATTATATAGACACAGGTACTATTTTATGTGACAGAATAGGTGCTAGTTCATCAAACCCATTCATATTACTATTTGAGGGGAATGGTGCAAAATGTGCTTTAGATGCTACTGCTCAGTTTGGAGTAGGTATAGGAAAAGCAATGCGTATGAAATATAATGATTATTCATATATATATGTTTCTGATGATGTAATTAGTGGTTATCTTGATGGAGAGGAAGTATTTGAATTTGGTTATCAAGATGAAAAAAGTTACATTAATACAGGTTTGAATACACAACATATAAATCCTCAAATAGATAGTTGGTATAATTGTGGTTCAAATAGGAAAGCTTGGGATTATTTAGTATGTAATAATTTAAATCAACTAGCAAGAACAGCAGCTACATCTACTCGAATGATGAAGAGTACATTTAATGAAGAAATTTCAAATAACTGTATAGATTTTGTTAAAAGTAGCTTAGTACAATCTGATGTATTCACACCATATAATTTAAAATCTATGAAAAACAACAATATTGAACATAGACTACAGGTTGATATAGACTCCTCTTTAGATAATCCTATTTCTAAGTACATTTTCAAAGATGTATCAGATGAAGCTGGTGAAGGTGTCTATGCACAAGATATAACATCTCATTTAGCAGTATTACAATTATCTTTACAAAAAACTATATCAAATTTTGAAAATTATAAAGATATAACAAGTAAAAAAATAGAAGAACTTACAAAACGAATAGAAGTATTAGAAAGCAGGTGA

Tertiary structure

PDB ID
47ebfe6bf49bb6073cbb2cc7a8227f0307444ce2a5abfa48c927499d987b31a2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7027
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Whole Genome Sequence and Molecular Characterization of Siphoviridae / Myoviridae Phage Infecting Clostridium difficile Monot,M. 2011-08-01 GenBank