GenBank accession
AFY98445.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type Evidence Probability
TSP Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.94
Protein sequence
MVYNKHEWQDGELITSGKLNHMELGIQLADKQVSVKDYGAVGDGVADDTKSFINAFSENKGTKVIIPTGTYLITKQVSIFGDVEFQNAKVISTNNSQDYMFNVDGLDEINIIGFKYNNDKGRGAIKISNTKTVRLTSFDISGYSAETAYHKTDSALMLDNNTTIYLDDIKVHDHGFQYGAELEHLNRCISIQGDKTKTVIARGLQFSKANQGLVLSTPNGNVIVSDSSIDNTTDNSMYLLACLTFMATNVRFDDYYDESVIMGSGDYSFTNCWFSNVPNKVFGINNDTVGLSIIGCNIIQSDKHSGQIVSFRDVNYTLNTFIFESNRIVTAPDSTNNDLFYLGQINLFSIKNNTIKTFSISDGRNMFAFRNSDSNAPYTGVIKDNYVMPITKDVVIGTYYFARLYKEVGKVEISDNYISNGRMPMDNASVIYNGQKFFTSLGYVIDRTTRQSLYATTIPIKGRFNIGDVVYNTDPSNGVFAWVRVTTGLNNVSGVDWKTVSVN
Physico-chemical properties
protein length: 503 AA
molecular weight: 55979.04630 Da
isoelectric point: 5.35255
aromaticity: 0.11133
hydropathy: -0.24612
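
The aromaticity and hydropathy values can be recomputed from the protein sequence listed above. The sketch below assumes the common definitions: aromaticity as the fraction of Phe, Trp, and Tyr residues, and hydropathy as the GRAVY score (the mean Kyte-Doolittle value per residue). Whether the database uses exactly these definitions is an assumption, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W), or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle score per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Only the first 18 residues of the sequence above, to keep the sketch short;
    # paste the full 503-residue sequence to compare with the listed values.
    fragment = "MVYNKHEWQDGELITSGK"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Molecular weight and isoelectric point need residue mass and pKa tables, so they are left out of this sketch.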

Domains [InterPro]
DC_2249 ATT 4–170
IPR012334 STR 26–452
IPR011050 STR 28–290
IPR024535 ENZ 33–233
Architecture (AFY98445.1, residues 1–503)
ATT 4-170 | STR 171-503
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (AFY98445.1, residues 1–503)
Domain Start End Length (AA) Confidence
N-terminal 1 49 49 0.9766
Central domain 50 430 382 0.9897
C-terminal 431 503 72 0.9366
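
A segmentation like the one above can be sanity-checked by confirming that the segments tile the protein from residue 1 to 503 with no gaps or overlaps. The sketch below assumes the Start and End columns are 1-based and inclusive; the helper names are illustrative and not part of the database. Lengths counted this way may differ by one from a Length column that follows a different boundary convention.

```python
from typing import List, Tuple

# (name, start, end) with 1-based inclusive coordinates (an assumption).
Segment = Tuple[str, int, int]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


def tiles_protein(segments: List[Segment], protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


# Start/End values copied from the table above.
SEGMENTS: List[Segment] = [
    ("N-terminal", 1, 49),
    ("Central domain", 50, 430),
    ("C-terminal", 431, 503),
]

if __name__ == "__main__":
    print(tiles_protein(SEGMENTS, 503))
    for name, start, end in SEGMENTS:
        print(name, inclusive_length(start, end))
```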
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-49
Central 50-430
C-terminal 431-503

Taxonomy

  Name Taxonomy ID Lineage
Phage Leuconostoc phage LN34 [NCBI] 1262519 No lineage information
Host Leuconostoc mesenteroides [NCBI] 1245 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

GenBank protein accession
AFY98445.1 [NCBI]
GenBank nucleotide accession
KC013027.1 [NCBI]
CDS location
range 24658 -> 26169
strand -
CDS
ATGGTTTATAATAAACACGAGTGGCAAGATGGTGAATTGATAACTTCTGGTAAGTTAAACCACATGGAACTTGGAATACAATTAGCGGATAAACAAGTTTCGGTTAAGGATTACGGTGCAGTTGGAGATGGCGTTGCCGATGACACAAAGTCTTTTATTAATGCTTTCTCAGAAAATAAAGGAACTAAGGTGATTATTCCTACTGGTACGTATCTAATAACTAAGCAAGTATCTATATTTGGAGATGTCGAATTTCAAAATGCAAAAGTTATAAGCACTAACAATTCACAGGATTACATGTTTAATGTTGATGGTTTAGATGAGATTAATATTATTGGGTTCAAGTATAATAATGATAAGGGTAGGGGTGCTATTAAAATATCTAATACTAAAACAGTTAGATTGACAAGTTTTGATATTTCAGGGTATTCAGCAGAGACGGCTTATCATAAAACAGATAGCGCACTAATGCTGGATAATAACACAACTATTTATTTAGATGATATCAAAGTTCATGATCATGGTTTCCAATATGGAGCAGAGTTAGAACATTTGAACCGTTGTATATCTATTCAAGGAGATAAAACAAAAACTGTAATTGCTAGAGGACTTCAATTTAGTAAGGCAAATCAAGGTTTAGTATTATCAACACCAAATGGAAATGTTATTGTTTCTGATAGTTCAATTGATAACACTACGGATAACTCGATGTATTTACTAGCATGTTTAACTTTCATGGCTACTAATGTACGTTTTGATGATTATTATGATGAAAGTGTAATAATGGGTAGTGGGGATTATAGTTTTACAAATTGTTGGTTCTCAAATGTTCCCAACAAGGTGTTTGGTATAAATAATGATACTGTTGGATTGTCTATTATTGGTTGTAATATAATTCAAAGTGATAAGCATTCTGGTCAAATAGTTAGCTTTAGAGATGTAAACTATACGCTTAACACTTTTATTTTTGAAAGTAATAGAATAGTAACCGCACCAGATAGCACAAATAACGATCTATTCTACTTGGGTCAAATCAACTTGTTTTCAATAAAAAATAATACAATAAAGACGTTTAGCATATCAGACGGTAGGAACATGTTTGCGTTTAGAAATTCAGACAGTAATGCACCATATACTGGAGTTATAAAAGATAATTATGTAATGCCAATAACTAAAGACGTAGTTATTGGAACGTATTATTTTGCAAGATTGTACAAAGAGGTTGGAAAAGTCGAAATTTCAGATAATTACATATCAAACGGTAGAATGCCTATGGATAATGCTTCGGTAATATATAACGGACAAAAATTCTTTACTAGTTTAGGTTATGTTATCGATAGAACAACAAGACAATCATTATATGCCACAACCATTCCCATTAAAGGACGGTTTAACATTGGTGATGTTGTTTATAATACTGATCCATCAAACGGTGTATTTGCTTGGGTTAGAGTGACAACAGGATTAAATAATGTTTCTGGTGTTGATTGGAAAACAGTTAGTGTTAATTGA
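
The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range of 24658 to 26169 spans 1512 nucleotides, that is 504 codons, or 503 residues plus a stop codon. The sketch below also translates a CDS with the standard codon assignments (shared by NCBI translation tables 1 and 11). It assumes the sequence is already given in coding orientation, as the CDS above is (it begins with ATG and ends with TGA), so no reverse complement is taken; the function names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotides in a 1-based inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    nt = cds_length(24658, 26169)
    print(nt, nt // 3 - 1)                   # 1512 503
    print(translate("ATGGTTTATAATAAACAC"))   # MVYNKH (first six codons of the CDS)
```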

Tertiary structure

PDB ID
c74d4104799cf1335bf660184dedb2c0a3ab9b272c3e1cf0089d5b51a081c99e
Source ESMFold
Method ESMFold
Resolution 0.7887
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
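
The confidence bands above can be expressed as a small lookup on a per-residue pLDDT score. This is a minimal sketch: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 fall, so assigning them to the lower band is an assumption, and the function name is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) are assigned to the lower band;
    the legend leaves this unspecified, so that choice is an assumption.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```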