Genbank accession
WJJ54298.1 [GenBank]
Protein name
tail appendage
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MARQQANLEYNQFQSGLITEGNILNLPIDSFREGENFILSKANAWERRKGLGLEDSGTLYPSYVDFSDQTLVSSVHVWQTHYSAIPEILVVQFGDKLHFFDTSVDPLSNGKLFINNQEFLTTEGTTEDIISGASVEGIFVFATQDADPISLQIMDIQSDSITARTKIVVDRKVLFLETRDVWGRSAPSKERPKTLSSDYLYELINQGWDTKKINSTYATIGAYPSGYDIWWLYKTTAGTDANAIGKFTPSRMKDSTTTGIGQERQNTPAPRGSTVASLQVLASGKPSCIQTFAGRVFYAGFQATPRKIDDVRPDFRNHVFFSQLVKSNAEINKCYQFADPTSEVDSALVDTDGGFIKINAARKIVAMEEVSSGLFIIAENGVWLLSGTSDGLFSATGYHVDKITDYGCVSPRSVVAYGDTVFYWAEEGIIVLSPDQTTGKHSAQNLTELKIQSLYNELTTSSKVKSVGTVERTDKYVRWLVSENANPNNFDLEIIFSLRYGAFFINRFKSDITVAERVTGYVPERNVIGNKVVSDYFIVAGIDRVKVESADVTVPLGRRIGDEEFRDTKYLTIKSDDDSFGFYSFNQDNFKDFGLVDAEAYLLSAPQKLDDTQRRKQITSLATHMLRTDTWFKDVDGEVTRENESSMLLRIRWDFTDSPTGNKWTDITRHSQCYRYRRPMNLKQGINVYPYEVITARERIRGSGTAFQFEFRTEPNKDCKLLGWAVTVSGGTKV
Physico‐chemical
properties
protein length:734 AA
molecular weight: 82492,45010 Da
isoelectric point:5,38006
aromaticity:0,11172
hydropathy:-0,35913

Domains

Domains [InterPro]
DC_1791
STR
194–734
WJJ54298.1
1 734
Architecture
STR
STR 194-734
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WJJ54298.1
1 734
Domain Start End Length (AA) Confidence
N-terminal 1 202 202 0,9024
Central domain 203 401 200 0,1197
C-terminal 402 734 332 0,2160
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-202
Central
203-401
C-terminal
402-734

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage JPW
[NCBI]
3046621 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Vibrio cholerae
[NCBI]
666 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WJJ54298.1 [NCBI]
Genbank nucleotide accession
OR039881 [NCBI]
CDS location
range 25711 -> 27915
strand +
CDS
ATGGCAAGACAACAGGCTAATCTAGAATACAATCAGTTTCAGTCTGGTCTAATTACAGAGGGGAACATTCTAAATCTGCCTATTGATTCTTTCAGAGAGGGTGAGAACTTCATCCTCTCAAAAGCTAACGCTTGGGAAAGACGCAAAGGTTTGGGTTTGGAAGATTCAGGTACTTTGTATCCTTCCTACGTAGACTTCTCAGACCAAACTTTAGTATCTAGTGTTCACGTATGGCAAACTCACTATAGTGCCATTCCAGAAATTCTAGTAGTACAGTTCGGTGATAAATTACATTTCTTTGATACTTCTGTTGACCCGTTATCTAACGGGAAATTGTTTATTAACAACCAAGAATTTTTAACCACAGAAGGTACAACAGAGGATATTATTAGTGGTGCTTCGGTAGAGGGTATCTTTGTATTTGCAACTCAGGATGCAGACCCTATATCACTCCAGATTATGGATATTCAATCAGACTCAATCACTGCCCGTACTAAAATCGTAGTAGATAGAAAGGTTCTGTTCCTTGAAACTCGTGATGTGTGGGGACGTTCGGCTCCAAGTAAAGAACGGCCAAAAACCCTATCGTCAGATTATCTGTACGAGCTAATTAACCAAGGTTGGGATACTAAAAAGATTAACTCAACTTATGCCACAATCGGTGCTTATCCTAGTGGCTACGATATTTGGTGGTTATATAAAACAACTGCTGGTACAGACGCTAACGCAATCGGAAAGTTTACTCCAAGTCGGATGAAAGACAGTACAACTACCGGTATCGGTCAGGAACGTCAAAACACTCCTGCACCTCGTGGTTCTACGGTAGCGTCATTACAAGTGTTAGCTAGTGGTAAACCTTCTTGCATTCAAACATTCGCAGGTCGAGTATTCTATGCAGGATTCCAAGCAACCCCAAGAAAAATTGATGATGTACGTCCTGATTTCCGTAATCACGTATTCTTCTCACAGTTAGTTAAATCTAACGCAGAGATTAACAAGTGTTATCAGTTTGCTGACCCCACAAGTGAAGTAGACAGTGCTCTTGTTGATACAGACGGTGGTTTCATAAAAATCAACGCAGCTCGTAAGATTGTTGCAATGGAAGAAGTTTCCAGTGGTCTGTTTATCATTGCTGAAAATGGAGTGTGGTTGTTGAGTGGTACGTCAGATGGTCTGTTCTCTGCTACTGGTTATCACGTAGACAAGATTACTGACTACGGTTGTGTGTCTCCACGTTCTGTTGTGGCATATGGTGATACCGTTTTCTATTGGGCAGAGGAAGGAATCATTGTCCTATCTCCAGACCAAACAACTGGAAAACACTCTGCACAGAATCTAACAGAACTAAAGATTCAATCCTTATATAATGAGTTAACTACTTCTAGTAAGGTTAAGTCGGTAGGTACTGTAGAACGTACAGATAAATATGTCCGCTGGCTAGTTAGTGAAAATGCCAATCCAAATAACTTTGACCTAGAGATTATCTTTAGCTTACGTTATGGTGCATTCTTCATTAACCGGTTCAAGAGTGACATTACTGTAGCAGAAAGAGTTACCGGATATGTTCCAGAGCGTAACGTTATTGGTAATAAGGTAGTTAGTGATTACTTTATCGTTGCTGGTATAGATAGAGTTAAAGTAGAGTCTGCTGATGTTACAGTCCCATTGGGACGTAGAATTGGTGATGAAGAGTTCAGAGATACTAAGTATCTAACAATCAAAAGTGATGATGATTCGTTTGGATTCTATTCGTTTAACCAAGATAACTTTAAAGACTTCGGCCTAGTTGATGCAGAGGCTTACCTGTTAAGTGCTCCACAGAAATTAGACGATACTCAACGCCGTAAACAAATTACAAGCCTAGCAACTCACATGCTGAGAACTGATACTTGGTTCAAAGATGTAGATGGAGAAGTTACCAGAGAAAACGAATCTAGCATGTTGCTACGTATTCGTTGGGACTTTACCGACTCTCCTACGGGCAACAAGTGGACTGATATTACTCGTCACAGTCAGTGCTACCGTTACCGTAGACCGATGAATCTCAAACAGGGTATCAATGTGTACCCTTATGAAGTAATCACGGCCAGAGAGCGAATTAGAGGCTCAGGAACGGCATTCCAGTTTGAGTTCAGAACAGAGCCAAACAAGGATTGTAAGTTGTTAGGTTGGGCAGTAACCGTTAGTGGAGGTACTAAGGTATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
9dac03e7b610b7a386b3fe63e56169f7ad8c2ceb9e5e066977f0c68447b43175
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6089
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50