Genbank accession
CAB5230088.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MPDFGFVGPSYEAPSIYQDAQECINFRPEIDPLKQPGQNGVIALYPTPGLTTKVTLFNTAEVRGMRQVSGGQYMVVVCGQYVYALNSTFTPTIIGTLNSSTGMVGISDNSLNVYIVDGTNRYTWRISNPLVAQFVGSVSGTTLTVTLMNSGTITISQQLFGIGVNPETIITALGTGTGGIGTYTINLSQTEPSEVFNTAAVAAKITGSISGTVLTVTAVTSGILYPGQTIQGTGVTAGTIITALGGSAALSYAITTGGTGYAAGDTITVTGGIYSQQATYTVATVAVGVVTGLTTVSNGVYTVVPGTPSQTTTSGNGTGLTLTLTFGTGTGGTGSYVVSTSQTVSSTTLYALNFSIMPANDGPFTGATVVDVVDNYFVYNRPNTQQWGASSPLSPISPALSFSSKDGAPDNLVSMIVDHREVYLLGETSSEVWVDSGLFPFAFQRIPGTSTQHGIAAAFSIARLGNSFAYLSKNIRGDGQIMMMNGYMPTRISNHAVEYSIEGGFIADARAWTYLIEGHEVYVVSFPTLDLTWAYDLASGMWHKWLWVDNQNVFHRHRGNCHSHFQGINLVGDHSNGQIYMLDPNNYTDSGNEIRRVRRAPHLISDYQRQYFSEFQIHFQPGVGLPNGSAPQAMCRWSDDGGSTWSNEHWTSIGVQGAYKNRAIWRRLGQSRDRIFEVVVTDPINAVITAANLKAFAGNN
Physico‐chemical
properties
protein length:700 AA
molecular weight: 74683,72400 Da
isoelectric point:5,60435
aromaticity:0,10000
hydropathy:-0,00514

Domains

Domains [InterPro]
DC_0229
STR
1–137
IPR021098
STR
374–681
CAB5230088.1
1 700
Architecture
STR
STR
STR 1-137 | STR 274-700
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
CAB5230088.1
1 700
Domain Start End Length (AA) Confidence
N-terminal 1 174 174 0,8693
Central domain 175 373 200 0,4532
C-terminal 374 700 326 0,2745
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-174
Central
175-373
C-terminal
374-700

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 Uroviricota > Caudoviricetes > Peduoviridae > Maltschvirus maltsch >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAB5230088.1 [NCBI]
Genbank nucleotide accession
LR798406 [NCBI]
CDS location
range 14303 -> 16405
strand -
CDS
ATGCCGGATTTTGGATTTGTTGGACCATCATATGAAGCCCCGTCTATCTACCAAGATGCCCAGGAGTGTATTAATTTTCGGCCTGAAATTGACCCTTTAAAACAACCTGGTCAAAACGGCGTAATTGCCCTTTATCCAACGCCTGGTTTAACTACTAAAGTTACTTTGTTTAATACGGCAGAAGTTCGTGGTATGCGCCAAGTTTCCGGTGGTCAATACATGGTGGTAGTTTGCGGTCAATATGTTTATGCGCTTAATTCTACGTTTACACCTACGATTATTGGAACTTTAAATAGTTCTACCGGTATGGTAGGAATTTCTGATAATAGTTTAAATGTTTATATTGTTGATGGAACTAACCGTTATACATGGCGTATATCTAATCCATTAGTCGCACAATTTGTTGGTTCTGTATCAGGGACAACCCTTACAGTAACTTTAATGAACTCAGGGACTATAACAATTAGCCAGCAATTGTTTGGTATTGGTGTAAATCCTGAAACAATCATTACTGCTTTAGGTACAGGCACGGGCGGCATAGGAACATACACAATTAATCTTAGTCAAACTGAACCTTCGGAAGTATTTAATACTGCGGCGGTTGCGGCTAAAATAACTGGTTCTATTTCAGGCACCGTATTAACAGTAACTGCGGTAACTAGCGGAATCTTATATCCAGGACAAACTATTCAAGGTACAGGCGTAACTGCTGGAACCATTATTACGGCTTTAGGCGGTTCTGCGGCGTTATCTTATGCAATTACTACTGGCGGTACAGGATATGCCGCTGGTGACACAATTACGGTAACTGGCGGTATTTATAGTCAACAAGCTACATATACGGTTGCAACTGTAGCGGTCGGTGTCGTTACTGGATTAACAACTGTCAGCAATGGCGTTTATACAGTAGTGCCAGGAACACCATCCCAAACAACAACTAGCGGTAATGGAACAGGGTTAACCCTTACATTAACGTTTGGTACAGGTACAGGGGGCACAGGAAGCTATGTTGTCAGCACTTCACAAACCGTTTCATCAACTACTTTATATGCGCTTAATTTTAGTATTATGCCGGCTAATGATGGTCCTTTTACCGGGGCTACTGTTGTTGATGTGGTGGACAACTATTTTGTTTATAACCGACCAAACACCCAGCAATGGGGTGCTTCTTCACCTTTATCACCTATTTCCCCGGCGTTAAGCTTTAGTTCTAAAGATGGTGCCCCGGACAATTTGGTGTCTATGATTGTGGACCATCGTGAAGTTTATTTACTTGGTGAAACATCGTCAGAAGTATGGGTGGATAGCGGTTTATTCCCATTTGCGTTTCAACGTATTCCTGGAACATCAACCCAGCATGGTATAGCGGCCGCATTTTCAATAGCTAGACTAGGCAATTCTTTTGCTTATTTAAGTAAAAATATTCGTGGTGATGGCCAAATTATGATGATGAATGGCTATATGCCAACTCGAATTAGTAATCACGCCGTTGAATATAGTATTGAAGGTGGTTTTATTGCTGATGCTAGGGCTTGGACGTATTTAATTGAAGGCCATGAAGTTTATGTAGTGAGTTTTCCTACTCTTGATTTAACCTGGGCTTATGACCTTGCCAGCGGTATGTGGCATAAATGGCTTTGGGTAGATAATCAAAACGTATTCCATCGTCATCGTGGTAACTGCCATTCGCATTTTCAAGGTATAAACCTTGTTGGTGACCATTCAAATGGCCAAATTTATATGTTGGACCCTAATAATTACACCGATAGCGGTAATGAAATACGTAGGGTACGCCGGGCACCCCATTTAATTAGTGATTATCAACGTCAATATTTTTCAGAATTTCAAATTCATTTCCAGCCTGGCGTTGGTTTGCCAAACGGTTCTGCCCCCCAAGCAATGTGCCGTTGGTCAGATGATGGCGGTTCTACTTGGTCAAATGAACATTGGACAAGTATTGGCGTACAAGGCGCATATAAAAACCGTGCTATTTGGCGTAGATTAGGGCAATCACGTGACCGTATTTTTGAAGTAGTGGTTACAGACCCTATTAATGCCGTGATTACTGCGGCTAATCTTAAAGCTTTTGCTGGGAATAATTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
ede2b03eeb0cdcf9eed6e3e24e8eda86ad2eb29f975bf788a165c3d17942ae67
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7935
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50