Genbank accession
CAH9016464.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,94
Protein sequence
MTTKVNNRMIDGASLSVLDFGAVGDGIADDYQAFQDCALECLLREKSMYIPEATYLIGQRVILPFANSQGDLSCKIIGNKARLISGVAQENVSGSTHGCFVTGRYDGSALIPVTSGEEEYLTGNLSIEDLVIENFGEALRMHNAVYGSSLKNLYFNKCFNPIYLSRCFYLEQSNILMRGFVPALPASVGYRSLSFSNSMPASGIKVIGYERGFVLGGFDGGTIRDSTAESCDVGVDLAQESNMVTLDTMYLENNTTNIRFAAVIRRLTISGSWLFGDGTNHFSSLLGNSDYTNLTLINNRIWGGIVNTPSIRNVFGEAIQCGDAGEPTNDMSMSGLPNVVRSFKSYGFTTTTSGYNSTVRCLDNAQESNLIPASWSGGRFRNGRDSDDRSPFQSVIDNGGSLEFTTSYDFDGNSMFHYSVNVAHSVGTFKVNALIMYDNETDSYRAMQPTVGGLIESTDLVVDNNDGKVRIKLPVFTTPSLDYSIVRIV
Physico‐chemical
properties
protein length:489 AA
molecular weight: 53287,10070 Da
isoelectric point:4,79513
aromaticity:0,09816
hydropathy:-0,11043

Domains

Domains [InterPro]
IPR011050
STR
14–317
IPR012334
STR
15–326
CAH9016464.1
1 489
Architecture
STR
RBD
STR 14-326 | RBD 336-489
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
CAH9016464.1
1 489
Domain Start End Length (AA) Confidence
N-terminal 1 30 30 0,9320
Central domain 31 357 328 0,9718
C-terminal 358 489 131 0,9377
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-30
Central
31-357
C-terminal
358-489

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage 468E53-1
[NCBI]
2963205 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAH9016464.1 [NCBI]
Genbank nucleotide accession
OX241559.1 [NCBI]
CDS location
range 47439 -> 48908
strand +
CDS
ATGACAACTAAAGTTAATAATCGAATGATTGATGGAGCTTCCCTCAGTGTTCTAGATTTTGGGGCAGTAGGAGATGGTATTGCAGATGACTATCAGGCATTCCAAGATTGTGCTCTAGAATGTTTGCTTCGAGAAAAATCTATGTATATCCCTGAAGCTACTTACTTAATTGGGCAAAGGGTTATCCTCCCATTTGCTAATTCGCAGGGAGATCTCTCTTGTAAAATCATAGGTAATAAGGCCAGACTGATTAGCGGAGTTGCTCAAGAAAACGTGTCAGGCTCAACACACGGATGTTTTGTTACAGGGCGCTATGACGGTTCTGCTTTAATCCCTGTCACATCTGGGGAGGAAGAGTATTTGACTGGCAACTTATCTATAGAAGATTTAGTTATAGAAAACTTCGGTGAAGCGTTAAGAATGCATAATGCTGTATACGGGTCTTCTTTAAAAAACCTATATTTTAACAAATGTTTCAATCCTATTTATCTAAGCCGATGCTTTTACTTAGAACAGTCTAATATTCTAATGAGAGGTTTCGTTCCAGCACTCCCTGCCTCAGTAGGATATCGTTCTCTAAGTTTCTCGAATTCAATGCCTGCTTCAGGGATAAAAGTGATAGGTTATGAACGAGGATTCGTGCTTGGAGGTTTCGATGGTGGTACAATCAGAGACTCTACTGCGGAGAGTTGTGATGTAGGCGTAGATCTTGCACAAGAATCTAACATGGTAACTTTAGATACTATGTATTTAGAGAATAATACTACTAACATACGATTTGCAGCTGTAATTAGACGTCTTACAATATCAGGGTCTTGGTTATTTGGTGACGGAACTAACCACTTCTCCTCTCTCTTGGGTAACAGTGATTACACGAATCTTACATTAATTAATAACCGTATATGGGGCGGGATCGTAAACACCCCTTCAATACGTAATGTATTTGGAGAAGCTATACAATGTGGCGATGCAGGCGAACCTACAAATGATATGTCAATGTCTGGCCTTCCAAATGTAGTGAGAAGTTTTAAGAGTTACGGGTTTACTACTACTACGTCCGGGTATAATAGCACGGTACGTTGCCTAGATAACGCTCAAGAATCAAACCTAATTCCAGCCTCATGGTCTGGTGGTAGATTTAGAAACGGAAGAGATTCCGATGACAGAAGCCCTTTCCAGTCTGTTATAGATAATGGGGGTAGCCTTGAGTTTACAACTTCATACGATTTCGATGGTAACTCTATGTTCCATTATAGTGTCAATGTAGCTCATTCAGTTGGAACATTTAAAGTCAATGCTTTAATCATGTATGACAATGAGACAGATTCGTATAGAGCAATGCAACCAACTGTAGGAGGTCTTATTGAAAGTACAGACTTGGTTGTAGACAATAACGATGGGAAAGTTCGGATTAAACTCCCTGTATTTACAACTCCAAGCTTAGACTATTCAATAGTTAGAATAGTTTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
77d658faef19ec69aca091b1eca0d36979d64129c7c08812ee9779fdcfca6ee3
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7040
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50