Genbank accession
WPH66363.1 [GenBank]
Protein name
tail spike with depolymerase domain
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,93
Protein sequence
MALVKLTRVAEWLGTYIHSLSSVQRLLGSKLDDTLCVLDFGAVADYDASTKTGTDNTAAFMAAIDAAISKGIRNVYAPGGAYLITGELNLGGTSFTSGGGTRDYWRGITQGVHLYGDGPYSTILVFDAPDEYTPCVSARGGWGTHSPRALSGIAIEPKVWVDYSSTAKGTGVLLQGCCFVPVTDVHIGRFHRGLHLWNKLQGPNDTANTFTRGDFTEFNRMTRVRFFNSDIDIDYQVSLGNNSFHGNSFTDCMAQINSYGGIGMRMWDDGSRNAIRPSSMPYEYIANVYNNKHEINWFGSDARTCYLMHIDKAQGRGCNGDMTVEAAVTLRVVGQYWYQSFGSLHSISAINTVVDGTTDTATRPVAFMWLNSAYPQANFDGGDALLSSALYPRQFDLNNSGNTGMELLNVRGTSTGAIWSIQNGSALGWILGRRAQSASRPGTRSVWQFSYNGEVIKSVAANDVGMQNQNGSGVGMLGDVLFRPYTTGTVSLGSPTYSFTRLRTTDWTIDTFGIVPVQDGVKNAGSASNRLGTIFAATGTINTSDARLKTDVRPMSAAEIAAARALSSEIGFFRWVDSVDNKGEDAREHCGTTVQRAIEIMQEHGLDPFNYGFICYDSWDEQVELNDETGEVISTIPAGDRYSFRMDQLALFLARGVDARLQALEGA
Physico‐chemical
properties
protein length:667 AA
molecular weight: 72523,07680 Da
isoelectric point:5,36847
aromaticity:0,10345
hydropathy:-0,20810

Domains

Domains [InterPro]
DC_1637
STR
7–657
IPR012334
STR
9–366
IPR011050
STR
31–271
IPR030392
CHP
544–667
IPR030392
CHP
544–606
WPH66363.1
1 667
Architecture
STR
RBD
STR 7-657 | RBD 658-667
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WPH66363.1
1 667
Domain Start End Length (AA) Confidence
N-terminal 1 44 44 0,9606
Central domain 45 372 329 0,9788
C-terminal 373 667 294 0,9285
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-44
Central
45-372
C-terminal
373-667

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage NNA-G4
[NCBI]
3079523 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Klebsiella pneumoniae
[NCBI]
573 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WPH66363.1 [NCBI]
Genbank nucleotide accession
OR287810.1 [NCBI]
CDS location
range 42093 -> 44096
strand +
CDS
ATGGCTTTAGTAAAACTTACACGGGTAGCGGAGTGGCTCGGAACGTATATCCATAGCTTGAGCTCCGTGCAGCGGTTGCTGGGTAGTAAGCTAGACGACACCCTGTGCGTTCTGGATTTCGGCGCTGTCGCTGACTACGATGCCAGTACAAAAACTGGTACGGATAACACAGCCGCATTCATGGCAGCTATTGATGCTGCTATATCTAAAGGTATTAGGAATGTCTACGCCCCTGGTGGGGCGTACCTTATTACCGGCGAACTCAACCTCGGCGGCACTAGTTTTACCTCTGGTGGGGGTACCAGGGATTACTGGAGGGGCATCACTCAGGGTGTGCACTTATACGGCGACGGGCCATACAGCACTATCCTAGTATTTGATGCCCCGGATGAGTACACGCCGTGTGTATCTGCCCGTGGCGGTTGGGGTACTCACTCTCCGCGAGCACTATCAGGTATCGCCATAGAGCCCAAAGTGTGGGTAGATTACAGCAGCACAGCCAAAGGTACTGGTGTACTGTTACAGGGCTGCTGCTTCGTACCGGTAACGGACGTTCATATTGGGCGATTCCACCGCGGGCTGCACTTATGGAATAAGCTGCAGGGCCCTAACGACACCGCCAATACTTTTACACGGGGTGACTTTACCGAGTTTAACCGCATGACTCGCGTGCGTTTCTTTAACTCGGATATCGATATCGATTATCAGGTTAGCTTAGGCAACAACTCGTTTCATGGTAACTCGTTCACTGACTGCATGGCACAGATTAATTCCTACGGGGGTATCGGTATGCGCATGTGGGATGACGGTTCACGAAACGCCATTCGTCCAAGTTCGATGCCTTACGAGTACATTGCCAACGTATATAACAACAAGCACGAGATTAACTGGTTCGGTAGTGATGCAAGAACCTGCTATCTCATGCACATCGATAAGGCCCAAGGTCGTGGCTGCAACGGGGACATGACTGTTGAGGCCGCTGTTACGCTGCGGGTAGTGGGTCAGTACTGGTACCAGAGTTTCGGGAGTCTGCACAGTATTTCCGCCATCAACACCGTGGTAGATGGTACTACCGACACTGCTACACGCCCAGTGGCGTTCATGTGGCTCAATAGCGCGTATCCACAAGCCAACTTTGACGGTGGTGATGCGCTGTTAAGCTCTGCTTTGTACCCTCGGCAGTTTGACCTGAACAACTCTGGGAACACCGGCATGGAGTTATTGAACGTGCGCGGAACCAGTACCGGTGCTATCTGGTCGATACAGAATGGCTCTGCACTGGGATGGATTCTGGGAAGGCGCGCGCAGTCGGCGAGCAGGCCTGGTACCCGGAGTGTGTGGCAGTTCTCCTACAACGGTGAGGTGATTAAGTCGGTAGCAGCCAACGACGTGGGCATGCAGAACCAAAACGGGTCTGGCGTAGGTATGCTCGGTGATGTACTGTTTCGTCCATACACCACTGGTACAGTGTCTTTGGGTAGTCCTACTTACTCCTTTACCAGGCTGCGTACTACTGATTGGACTATTGACACCTTCGGTATTGTACCAGTACAGGATGGTGTTAAGAATGCTGGTTCTGCCAGTAATCGGCTGGGGACTATCTTTGCTGCTACTGGTACTATTAACACCTCGGATGCGCGTCTGAAAACGGATGTCCGGCCTATGAGTGCAGCTGAGATTGCAGCTGCTCGTGCTCTTAGTTCGGAGATTGGTTTCTTCCGGTGGGTTGATAGTGTAGACAATAAGGGTGAGGACGCTAGGGAGCACTGCGGTACCACAGTACAGAGGGCCATTGAGATTATGCAGGAGCACGGACTGGACCCATTCAATTACGGGTTTATCTGTTATGATTCCTGGGATGAGCAGGTAGAGCTCAACGACGAGACCGGTGAGGTCATTAGCACTATCCCCGCCGGGGACCGATACAGCTTCCGCATGGACCAGTTAGCGCTGTTCTTGGCCCGTGGCGTGGATGCTCGGCTGCAGGCACTGGAGGGTGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
1ca52fc41c59b597eb01e3bf6a49d7e46505cc3d8219d82c907a5add223744b0
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6719
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50