Genbank accession
WPH68639.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MATTPTTGDIPSNAAVDLKFNSEQFDRVMNSDDLTYTDRFGKKRITMKGVQELANGFQDTFTNLLGSPDGFKLIGGVKSFDTLRSTPVRTEGQRIFLKSYHENGTTGGGIFVGHIGTKLDDGGTVAQGVGFYWERADCTEISVEIFGGTPNNNTIDNSDFIIAADLVSFSKGVRLTGKGLNYSVGRSFTLATPNIKDIKISPTSSFTGDAPTFTCDQNTGELVLEDVDISDFKGRGSNCTKGNYTLVPTITFKGTCKFNGNGGGPTRTTVLPKTTTISAAADLTTTSVISVADTSKFIAGDYVLVNGVRYRILSVDSSSQITFYNTSSIPTIYNDGVNSSVSAGDVLEKVFVVNTASEAVIDVADSTIFKVGDAVWIGDSKCLISVINNSTQITLVNVNGVPTLQYGGTGTGRYTSGQYVTKDKNGKNGFTINTSSSIGWNINLKGNVEFNNNGWFGLFQWCNASGGTVFGGAKANNNGYIGVGLGYVKGGELSGFVTNNNGNNGLDLFETYSETTIHDCTSNNNGVDGIFVASSKTAPKLYANTCVGNKRIGMLANGRTVTPVGLTVLDNFCVGNDLYSICLTGIGGGIVGDCSVGGGMQGIRIEGKNGIANPKSITVRDCKFSSVSSESDIFANIGGYTNGGDQGSLHLNNNSYFGRTPKFSITNIDVDKSTFIPAGYSKPNGTDLSASAGTAIAVGVTFFKPHAPTVIDNSAEDFTVQIYTNSGLTTVGTVTTATRTAGVELSNGATSNGRIIGSPAYGTFSYNFTLSSPGTRYLHIKSKYGSSVIKLTWS
Physico‐chemical
properties
protein length:794 AA
molecular weight: 83606,05080 Da
isoelectric point:6,15131
aromaticity:0,09194
hydropathy:-0,14547

Domains

Domains [InterPro]
DC_0282
STR
12–443
IPR012334
STR
353–613
WPH68639.1
1 794
Architecture
STR
RBD
STR 12-613 | RBD 646-794
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WPH68639.1
1 794
Domain Start End Length (AA) Confidence
N-terminal 1 157 157 0,9905
Central domain 158 706 550 0,9576
C-terminal 707 794 87 0,9394
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-157
Central
158-706
C-terminal
707-794

Taxonomy

  Name Taxonomy ID Lineage
Phage Stenotrophomonas phage BUCTxx100
[NCBI]
3092589 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Stenotrophomonas maltophilia
[NCBI]
40324 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Lysobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WPH68639.1 [NCBI]
Genbank nucleotide accession
OR529409.1 [NCBI]
CDS location
range 70019 -> 72403
strand +
CDS
GTGGCTACAACTCCAACTACGGGTGACATCCCCAGCAATGCTGCGGTGGACCTCAAGTTTAACTCGGAACAATTCGACCGAGTAATGAATTCTGACGATCTGACTTATACGGATCGTTTTGGTAAAAAACGCATTACAATGAAAGGGGTTCAGGAACTCGCTAATGGCTTCCAGGATACTTTCACTAATCTGTTGGGTTCTCCCGACGGATTTAAATTAATCGGCGGAGTGAAATCTTTTGATACCCTGAGATCGACCCCAGTGAGAACCGAAGGCCAACGTATTTTCCTCAAGTCCTATCATGAAAATGGAACTACTGGCGGAGGGATCTTTGTTGGTCACATCGGGACCAAGTTAGATGATGGCGGTACTGTGGCCCAGGGCGTAGGGTTCTATTGGGAAAGGGCTGATTGCACGGAGATATCCGTAGAAATCTTTGGCGGTACACCTAACAATAACACGATTGATAATTCAGATTTCATCATTGCGGCGGATTTAGTTTCTTTTTCCAAGGGGGTTCGATTAACCGGGAAAGGGTTAAACTATAGTGTAGGGAGATCCTTTACACTGGCTACCCCTAACATAAAGGATATTAAAATATCCCCAACCTCCAGCTTTACAGGCGACGCTCCAACATTTACCTGCGATCAAAATACCGGAGAGTTGGTCTTAGAAGACGTTGATATTTCCGACTTCAAAGGCCGAGGGAGTAATTGCACAAAAGGGAACTATACTCTCGTCCCGACGATCACCTTTAAAGGAACTTGCAAATTCAATGGCAATGGCGGGGGTCCTACAAGAACAACTGTTTTACCTAAAACAACTACTATATCTGCGGCTGCTGACCTGACCACCACGTCTGTTATTTCTGTGGCGGACACCTCTAAATTTATAGCGGGTGATTACGTCCTGGTAAATGGGGTCAGATATCGTATTCTCTCCGTCGATTCTTCTTCCCAAATTACTTTTTATAACACATCTTCAATCCCCACTATTTACAATGACGGTGTCAACAGTTCCGTTTCAGCGGGCGATGTGCTGGAAAAGGTTTTTGTTGTGAACACGGCTTCAGAAGCAGTCATCGATGTCGCGGACAGCACTATATTTAAAGTCGGGGATGCCGTTTGGATAGGGGATAGCAAATGTTTAATTTCCGTCATCAATAATTCAACCCAGATAACTTTAGTGAACGTAAACGGCGTCCCCACTTTGCAATATGGAGGGACTGGAACGGGAAGGTATACTTCCGGTCAGTACGTCACTAAGGATAAAAACGGTAAAAACGGATTTACGATTAATACGTCGTCTTCGATTGGTTGGAATATCAATCTAAAAGGCAACGTCGAATTTAATAACAACGGATGGTTTGGGTTATTTCAGTGGTGTAATGCCTCTGGAGGAACAGTTTTTGGGGGAGCCAAAGCCAATAATAACGGATATATCGGCGTGGGCTTGGGCTACGTGAAGGGTGGGGAACTGAGCGGATTTGTAACCAACAATAATGGTAACAACGGTTTAGATCTGTTTGAAACTTACTCGGAGACGACTATTCATGATTGCACGTCCAATAATAACGGAGTTGACGGTATTTTCGTTGCAAGTTCTAAAACAGCACCCAAACTTTACGCTAACACGTGCGTGGGTAATAAAAGAATAGGGATGCTGGCCAATGGTAGAACCGTGACGCCGGTAGGACTCACTGTACTGGATAATTTCTGTGTCGGTAACGATTTATATTCCATCTGTTTAACTGGAATAGGAGGGGGGATCGTAGGCGATTGTTCTGTTGGCGGTGGGATGCAAGGCATCCGTATCGAGGGCAAGAACGGCATAGCCAACCCCAAATCAATAACGGTAAGGGATTGTAAATTCAGTTCTGTTTCGTCCGAATCCGATATCTTTGCCAATATTGGTGGATACACCAATGGCGGTGATCAGGGATCCTTACATCTCAATAACAACAGTTATTTTGGAAGGACGCCAAAATTTTCTATCACTAATATCGATGTGGATAAATCAACCTTTATCCCGGCTGGCTACTCTAAGCCTAACGGAACGGATTTAAGCGCATCGGCCGGGACGGCGATAGCGGTCGGCGTTACATTTTTTAAACCCCACGCTCCGACTGTTATAGACAATTCTGCCGAAGACTTCACCGTTCAAATTTATACCAATTCCGGGCTGACCACAGTGGGTACGGTAACTACTGCGACAAGAACGGCTGGAGTGGAACTTTCTAATGGGGCGACTAGTAACGGGCGGATAATAGGTTCGCCGGCTTATGGAACGTTCTCTTATAACTTCACGCTCAGCTCACCCGGGACAAGATATTTGCATATCAAATCTAAATACGGGTCTTCGGTTATCAAATTAACCTGGAGCTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
bb6b7a5cb96a103c09216ed9a6deb64b61084bf5ae29de8f948ed25b9f4a7ff2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7299
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50