Genbank accession
YP_006990576.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MAQAQYYTLLTEIGKAAIANATALGTRVDFAKIKVGDGGGSAYIPTETQTELKNVVWESTLEHVQADEKNKSWVVIQKTITGDTGSFTIREVGVFDSKDQLLAISSYPETYKPAPDSGTVKEILIKIILAVSNTASINLKIDPTVVLATLKDIQDLDAKIDTTKTELTSNIETAKTELNNKIGDTTQLTTTDKTSLVGALNEVKTSVDSIETTAEKTSYNNSTSNLTATTVQGAIDEIVTEVRGNRTSIISSINNNLIPM
Physico‐chemical
properties
protein length:260 AA
molecular weight: 27948,08100 Da
isoelectric point:4,77410
aromaticity:0,04231
hydropathy:-0,20000

Domains

Domains [InterPro]
DC_1371
STR
1–257
IPR051934
Unmapped
3–159
IPR022225
ATT
4–150
YP_006990576.1
1 260
Architecture
ATT
STR
ATT 1-150 | STR 151-257 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage phiMMP04
[NCBI]
1204535 Uroviricota > Caudoviricetes > Sherbrookevirus >
Host Clostridioides difficile
[NCBI]
1496 cellular organisms > Bacteria > Bacillati > Bacillota > Clostridia > Peptostreptococcales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_006990576.1 [NCBI]
Genbank nucleotide accession
NC_019422.1 [NCBI]
CDS location
range 15056 -> 15838
strand +
CDS
TTGGCACAAGCGCAATATTATACATTACTTACAGAAATAGGCAAAGCGGCTATAGCAAATGCTACAGCACTTGGAACTAGAGTGGATTTTGCAAAAATAAAAGTTGGGGATGGTGGTGGAAGTGCATATATTCCAACAGAAACTCAAACAGAACTCAAAAATGTAGTGTGGGAAAGCACATTAGAGCATGTTCAAGCAGACGAAAAAAATAAAAGTTGGGTAGTTATACAAAAGACTATAACTGGAGATACTGGAAGCTTTACAATCAGAGAGGTTGGAGTATTTGACTCTAAAGACCAACTTCTCGCAATATCTAGCTATCCAGAAACTTATAAACCTGCTCCAGATTCGGGAACAGTAAAAGAAATATTAATTAAAATTATATTAGCTGTGTCTAATACAGCAAGTATAAATTTAAAAATAGACCCAACTGTTGTGCTAGCAACTTTAAAAGACATACAAGACCTAGACGCTAAAATTGATACAACTAAAACAGAATTAACAAGCAACATAGAAACTGCTAAAACAGAGTTAAACAATAAAATAGGTGATACAACACAACTTACTACAACAGATAAAACTAGTCTTGTTGGTGCATTAAATGAGGTAAAAACTAGCGTAGATAGTATAGAAACAACAGCAGAGAAAACAAGTTATAATAATTCAACAAGTAATCTTACTGCTACTACTGTGCAAGGGGCAATAGATGAAATAGTAACAGAGGTAAGAGGTAATAGAACTAGTATTATATCTAGTATAAATAATAATTTAATACCAATGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
b0f2d1ce681ebb3cb8c391b705d3630deb36a8bc483bfed50f05db13e78cfeac
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8844
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50