GenBank accession
WCS68319.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1.00
Protein sequence
MSVIYSIGQGNFSITRDNFTSFPLFQEESNTILINGKTYNKNSLQEEKVARSPYFSYATPANANNINTRTRTHSINYDGRTAFVGGRLPYRKDETIFQNSISFNDGTEIRVSESSGNTGCAIYQYKYGKILNTYRFSNGLGKTSLVKLDESSFIVVRNMINYGANGINNSGTSGSPSNSEISKNDLYGKEYLELNKYASNQFLSPSFYSNNYYSAAALNQYSLAPTSTAYNNVNLNDSILLFDKNLNLTKSYTLSGEEILNILGKTSNGSLIILTQGYIFNNETVTKEDGTTYTMSGPKTKLIIKMISPTLVETVLFSKNIHAGATTGFSTFARQIPYFNTKESCLYYLQMEYATGSNSSLCKLPIDIPNGKVGTVTNLNINGLDLKTLYAYQDNGINLFSIQIGSISVNNTKYLYIGNVFSHLNEVFYSDWINYYSPASTSNARSGSMNAAYLATNSKTKEFPIHLLQFDDTGLNLTLRDSIPFKDFHLDGVKAIFPLGDQFITIIKNNGFWIYTADSTTKRFSLVENITDPIRSIGIDDLERVWFVRNKEGANLEMVSPFLSVDISVSFEDTDITYVDQNIESAILVEARDMTGQLKETEIKLMLEGNAIFRDSQDKSYNVSTSSTSPTRVPIIITGSGAISTYTSYEGS
Physico-chemical properties
protein length: 652 AA
molecular weight: 72520.19060 Da
isoelectric point: 5.53068
aromaticity: 0.11503
hydropathy: -0.29494
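The aromaticity and hydropathy values above can be approximated directly from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). This record does not state which definitions were used, so both definitions and the function names are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue; assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence above, used as a small example.
example = "MSVIYSIGQG"
print(len(example), round(aromaticity(example), 5), round(gravy(example), 5))
```

Passing the full 652-residue sequence instead of `example` gives values that can be compared with the ones listed above; small differences would suggest the record uses other definitions.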

Domains [InterPro]

No domain annotations available.


Tail Spike Domain Segmentation


This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (WCS68319.1, residues 1-652)

Domain          Start  End  Length (AA)  Confidence
N-terminal          1   41           41      0.9323
Central domain     42  299          259      0.9076
C-terminal        300  652          352      0.1401
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-41
Central: 42-299
C-terminal: 300-652

Taxonomy

       Name                                  Taxonomy ID  Lineage
Phage  Bacillus phage vB_BsuM-Goe21 [NCBI]   3026978      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Bacillus subtilis [NCBI]              1423         cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
WCS68319.1 [NCBI]
GenBank nucleotide accession
OM728297 [NCBI]
CDS location
range 159075 -> 161033
strand -
CDS
ATGAGTGTAATTTATTCAATTGGTCAAGGAAATTTCTCAATTACAAGAGATAATTTTACTTCTTTTCCTTTGTTTCAGGAAGAGAGTAATACAATTTTAATTAATGGAAAGACGTATAATAAAAATTCTTTACAAGAAGAGAAAGTTGCAAGATCACCATACTTTAGTTATGCAACACCAGCTAATGCAAATAATATTAATACAAGAACAAGAACTCATAGTATTAATTATGATGGTAGAACTGCTTTTGTAGGAGGAAGATTACCATATCGGAAAGATGAAACAATCTTTCAGAATTCAATTTCATTCAATGATGGTACAGAAATTAGAGTATCGGAAAGTTCTGGAAATACTGGTTGCGCTATTTATCAATATAAATATGGAAAAATCTTAAATACCTATAGATTTTCAAATGGATTGGGAAAAACTTCGTTAGTTAAGTTAGATGAAAGTTCATTCATCGTTGTTAGGAATATGATAAATTATGGAGCGAATGGAATAAATAATTCAGGTACTTCTGGATCACCTAGTAATAGTGAAATTTCTAAAAATGATTTATATGGAAAAGAATATCTTGAATTAAACAAATACGCCAGTAATCAATTTTTATCACCTTCATTTTATAGTAATAATTATTATAGTGCCGCTGCTTTAAATCAGTATTCTTTGGCTCCGACCTCAACAGCTTATAATAATGTTAACTTAAATGATAGTATATTACTATTTGATAAAAACTTAAATTTAACTAAAAGTTATACTCTAAGTGGAGAAGAAATTTTAAATATACTAGGTAAAACTTCCAATGGCAGTTTAATAATTTTAACACAAGGTTATATTTTTAATAATGAAACTGTGACCAAGGAAGATGGTACAACATATACAATGTCTGGACCAAAAACAAAGCTAATAATCAAGATGATTTCACCAACATTAGTTGAAACAGTTTTATTCAGTAAAAATATTCATGCAGGAGCAACAACAGGCTTTTCTACATTTGCTAGACAAATACCATATTTTAATACTAAAGAAAGCTGTTTGTATTATTTACAAATGGAATACGCAACTGGTTCAAATAGTTCTCTTTGTAAATTACCTATAGACATTCCTAATGGAAAAGTAGGAACTGTAACAAATCTTAATATTAATGGATTAGATTTAAAAACGCTTTATGCTTATCAAGACAATGGTATAAATCTTTTTTCAATTCAAATTGGTTCTATTAGTGTTAATAATACTAAATATTTATATATTGGAAATGTATTTTCACACTTAAATGAAGTCTTTTATAGTGATTGGATTAATTATTATTCACCTGCATCAACTTCAAATGCTAGATCTGGTTCAATGAATGCTGCTTACTTAGCAACTAATAGTAAAACAAAAGAATTCCCAATTCATCTTTTACAATTTGATGACACAGGGTTGAATCTAACTTTAAGAGATTCTATTCCTTTTAAAGATTTTCATTTAGATGGTGTTAAAGCAATTTTTCCATTAGGTGATCAATTTATTACAATTATTAAGAATAACGGTTTCTGGATTTATACTGCTGATTCTACTACAAAGAGATTTTCATTAGTAGAGAATATTACTGATCCTATTAGATCAATAGGTATTGATGATTTAGAAAGAGTTTGGTTTGTAAGGAATAAAGAAGGCGCTAATTTAGAAATGGTTAGTCCTTTCCTATCAGTAGATATCTCTGTTAGTTTTGAAGATACAGATATCACTTATGTAGATCAAAACATTGAAAGTGCAATACTAGTCGAGGCAAGAGATATGACAGGACAATTAAAAGAGACAGAAATCAAACTAATGCTTGAAGGAAATGCAATATTTAGAGATAGTCAAGATAAATCATATAATGTCTCAACAAGTTCAACTTCACCTACAAGAGTACCAATCATAATAACTGGTTCAGGAGCCATTTCTACTTATACTTCTTATGAAGGGAGTTAA
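
The CDS coordinates and strand can be checked against the protein length with a few lines of arithmetic. This is a minimal sketch, assuming the range is 1-based and inclusive (the GenBank convention) and that the CDS includes its stop codon, as the trailing TAA above suggests. The helper names and the toy genome string are hypothetical.

```python
# Complement table for DNA; other characters (e.g. N) pass through unchanged.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(cds_nt: int) -> int:
    """Residue count for a CDS that ends with one stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_nt // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive range; reverse-complement it on the minus strand."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


nt = cds_length(159075, 161033)
print(nt, expected_protein_length(nt))  # prints: 1959 652

# Toy genome (hypothetical) with a minus-strand feature at positions 4..9.
print(extract_cds("AAATTACAT", 4, 9, "-"))  # prints: ATGTAA
```

The 1959 nt span corresponds to 653 codons, that is 652 residues plus the stop codon, which agrees with the protein length listed for this entry.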

Genome Context


Tertiary structure

PDB ID
a03f58bf5be10b4298f782a3b8b72429cb7f096e475e5150238095a37f61b074
Source ESMFold
Method ESMFold
Resolution 0.5356
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
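
The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a sketch only: the legend does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: scores exactly equal to 90, 70 or 50 fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Example scores chosen for illustration; they are not taken from this model.
for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```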