Genbank accession
WPH68639.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,97
Protein sequence
MATTPTTGDIPSNAAVDLKFNSEQFDRVMNSDDLTYTDRFGKKRITMKGVQELANGFQDTFTNLLGSPDGFKLIGGVKSFDTLRSTPVRTEGQRIFLKSYHENGTTGGGIFVGHIGTKLDDGGTVAQGVGFYWERADCTEISVEIFGGTPNNNTIDNSDFIIAADLVSFSKGVRLTGKGLNYSVGRSFTLATPNIKDIKISPTSSFTGDAPTFTCDQNTGELVLEDVDISDFKGRGSNCTKGNYTLVPTITFKGTCKFNGNGGGPTRTTVLPKTTTISAAADLTTTSVISVADTSKFIAGDYVLVNGVRYRILSVDSSSQITFYNTSSIPTIYNDGVNSSVSAGDVLEKVFVVNTASEAVIDVADSTIFKVGDAVWIGDSKCLISVINNSTQITLVNVNGVPTLQYGGTGTGRYTSGQYVTKDKNGKNGFTINTSSSIGWNINLKGNVEFNNNGWFGLFQWCNASGGTVFGGAKANNNGYIGVGLGYVKGGELSGFVTNNNGNNGLDLFETYSETTIHDCTSNNNGVDGIFVASSKTAPKLYANTCVGNKRIGMLANGRTVTPVGLTVLDNFCVGNDLYSICLTGIGGGIVGDCSVGGGMQGIRIEGKNGIANPKSITVRDCKFSSVSSESDIFANIGGYTNGGDQGSLHLNNNSYFGRTPKFSITNIDVDKSTFIPAGYSKPNGTDLSASAGTAIAVGVTFFKPHAPTVIDNSAEDFTVQIYTNSGLTTVGTVTTATRTAGVELSNGATSNGRIIGSPAYGTFSYNFTLSSPGTRYLHIKSKYGSSVIKLTWS
Physico‐chemical
properties
protein length:794 AA
molecular weight: 83606,05080 Da
isoelectric point:6,15131
aromaticity:0,09194
hydropathy:-0,14547

Domains

View on InterPro
WPH68639.1
1 794 aa
STR 353–613 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

WPH68639.1
1 794 aa
Domain Start End Length (AA) Confidence
N-terminal 1 157 157 0,9905
Central domain 158 706 550 0,9576
C-terminal 707 794 87 0,9394
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Stenotrophomonas phage BUCTxx100 [NCBI] · taxon 3092589

Coding sequence (CDS)

Genbank protein accession
WPH68639.1 [NCBI]
Genbank nucleotide accession
OR529409.1 [NCBI]
CDS location
range 70019 -> 72403
strand +
CDS
GTGGCTACAACTCCAACTACGGGTGACATCCCCAGCAATGCTGCGGTGGACCTCAAGTTTAACTCGGAACAATTCGACCGAGTAATGAATTCTGACGATCTGACTTATACGGATCGTTTTGGTAAAAAACGCATTACAATGAAAGGGGTTCAGGAACTCGCTAATGGCTTCCAGGATACTTTCACTAATCTGTTGGGTTCTCCCGACGGATTTAAATTAATCGGCGGAGTGAAATCTTTTGATACCCTGAGATCGACCCCAGTGAGAACCGAAGGCCAACGTATTTTCCTCAAGTCCTATCATGAAAATGGAACTACTGGCGGAGGGATCTTTGTTGGTCACATCGGGACCAAGTTAGATGATGGCGGTACTGTGGCCCAGGGCGTAGGGTTCTATTGGGAAAGGGCTGATTGCACGGAGATATCCGTAGAAATCTTTGGCGGTACACCTAACAATAACACGATTGATAATTCAGATTTCATCATTGCGGCGGATTTAGTTTCTTTTTCCAAGGGGGTTCGATTAACCGGGAAAGGGTTAAACTATAGTGTAGGGAGATCCTTTACACTGGCTACCCCTAACATAAAGGATATTAAAATATCCCCAACCTCCAGCTTTACAGGCGACGCTCCAACATTTACCTGCGATCAAAATACCGGAGAGTTGGTCTTAGAAGACGTTGATATTTCCGACTTCAAAGGCCGAGGGAGTAATTGCACAAAAGGGAACTATACTCTCGTCCCGACGATCACCTTTAAAGGAACTTGCAAATTCAATGGCAATGGCGGGGGTCCTACAAGAACAACTGTTTTACCTAAAACAACTACTATATCTGCGGCTGCTGACCTGACCACCACGTCTGTTATTTCTGTGGCGGACACCTCTAAATTTATAGCGGGTGATTACGTCCTGGTAAATGGGGTCAGATATCGTATTCTCTCCGTCGATTCTTCTTCCCAAATTACTTTTTATAACACATCTTCAATCCCCACTATTTACAATGACGGTGTCAACAGTTCCGTTTCAGCGGGCGATGTGCTGGAAAAGGTTTTTGTTGTGAACACGGCTTCAGAAGCAGTCATCGATGTCGCGGACAGCACTATATTTAAAGTCGGGGATGCCGTTTGGATAGGGGATAGCAAATGTTTAATTTCCGTCATCAATAATTCAACCCAGATAACTTTAGTGAACGTAAACGGCGTCCCCACTTTGCAATATGGAGGGACTGGAACGGGAAGGTATACTTCCGGTCAGTACGTCACTAAGGATAAAAACGGTAAAAACGGATTTACGATTAATACGTCGTCTTCGATTGGTTGGAATATCAATCTAAAAGGCAACGTCGAATTTAATAACAACGGATGGTTTGGGTTATTTCAGTGGTGTAATGCCTCTGGAGGAACAGTTTTTGGGGGAGCCAAAGCCAATAATAACGGATATATCGGCGTGGGCTTGGGCTACGTGAAGGGTGGGGAACTGAGCGGATTTGTAACCAACAATAATGGTAACAACGGTTTAGATCTGTTTGAAACTTACTCGGAGACGACTATTCATGATTGCACGTCCAATAATAACGGAGTTGACGGTATTTTCGTTGCAAGTTCTAAAACAGCACCCAAACTTTACGCTAACACGTGCGTGGGTAATAAAAGAATAGGGATGCTGGCCAATGGTAGAACCGTGACGCCGGTAGGACTCACTGTACTGGATAATTTCTGTGTCGGTAACGATTTATATTCCATCTGTTTAACTGGAATAGGAGGGGGGATCGTAGGCGATTGTTCTGTTGGCGGTGGGATGCAAGGCATCCGTATCGAGGGCAAGAACGGCATAGCCAACCCCAAATCAATAACGGTAAGGGATTGTAAATTCAGTTCTGTTTCGTCCGAATCCGATATCTTTGCCAATATTGGTGGATACACCAATGGCGGTGATCAGGGATCCTTACATCTCAATAACAACAGTTATTTTGGAAGGACGCCAAAATTTTCTATCACTAATATCGATGTGGATAAATCAACCTTTATCCCGGCTGGCTACTCTAAGCCTAACGGAACGGATTTAAGCGCATCGGCCGGGACGGCGATAGCGGTCGGCGTTACATTTTTTAAACCCCACGCTCCGACTGTTATAGACAATTCTGCCGAAGACTTCACCGTTCAAATTTATACCAATTCCGGGCTGACCACAGTGGGTACGGTAACTACTGCGACAAGAACGGCTGGAGTGGAACTTTCTAATGGGGCGACTAGTAACGGGCGGATAATAGGTTCGCCGGCTTATGGAACGTTCTCTTATAACTTCACGCTCAGCTCACCCGGGACAAGATATTTGCATATCAAATCTAAATACGGGTCTTCGGTTATCAAATTAACCTGGAGCTAG

Genome Context

Tertiary structure

WPH68639.1
ESMFold structure
Source ESMFold
pLDDT 73.0
Oligomeric state monomer