Genbank accession
WCF59163.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MALYREGKAAMAADGTVTGTGTKWQSSLSLIRPGATIMFLSSPIQIAVVNKVVSDTEIKAITTNGAVVASTDYAILLSDSLTVDGLAQDVAETLRYYQSQETVIADAVEFFKDFDFESLQNLANQIKADSEAAESSATAAAASENAAKSSENAAKNSEVAAENARDQVQQIINDAGEQSTLVALAQPDGAKNIGKCSSIAQLRNVEPNSAGQRILLASYKADGTADGGGEFYYDPTDSTTADDGASCIVTSGGKRWKAIINPAVKSSTFASSVNINAYLAKKGVNLQFDNALTPTGTINVQSNTRIEFSGNGEIDAPSTLIQGITIAGAAPTTFYNLSADALSGAYQVAIATDQFAVGDWIEIRSEDLVKGPNAKGVKQAQLRRVVKKETSGGQYVYSLDRVLEYDFLVASTARCGKATVIENVVLDSPRLNNINYLNQFGIGINCNYVVNLRIINPILIGSKDKFFIENDAGTGVAGRSAIKLNNCRDVTIDAPVCHHQGWYGVEVLGCSEDIKINDGDFNDCRHGVSVNWSMPYGEPRTVIFNRCVSSNATKAAFDTHDVGVDIKFIDCRAIKSQGDGFQYRARNVKYIRCYAAYCLSNGFDGAQGATGSEFKDCVAEFNKAAGFNIAFEPGTVRNCRAYGNMVGVGAMGGKILGGELEGNSLAAIDYGTGLTGVAAQSALEVTGVKMPFSDGTTTKAQPRAIYFRGAKAVDPSLATIRDCDINGYGNNWALLSSYSSQPSLPVMSGNKLDATGIVGTVTLVAGTATVPTASARKRETTNVNELSTVSKIKLTRLTYQSATPLGDLYVSQINNGVSFTIMSTSNSDVSKVMWEISL
Physico‐chemical
properties
protein length:838 AA
molecular weight: 88841,50880 Da
isoelectric point:5,15327
aromaticity:0,07518
hydropathy:-0,11265

Domains

Domains [InterPro]
Coil
Unmapped
147–174
IPR057095
ENZ
334–419
DC_1484
STR
369–838
WCF59163.1
1 838
Architecture
STR
STR 1-838
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WCF59163.1
1 838
Domain Start End Length (AA) Confidence
N-terminal 1 271 271 0,9953
Central domain 272 755 485 0,9947
C-terminal 756 838 82 0,9843
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-271
Central
272-755
C-terminal
756-838

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage vB_LZ2044
[NCBI]
2996108 No lineage information
Host Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044
[NCBI]
484021 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Klebsiella

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WCF59163.1 [NCBI]
Genbank nucleotide accession
OP785155 [NCBI]
CDS location
range 10889 -> 13405
strand -
CDS
ATGGCACTATACAGAGAAGGCAAGGCGGCTATGGCCGCAGACGGAACAGTTACCGGAACTGGAACAAAATGGCAATCATCGCTTTCGCTGATTCGCCCTGGCGCGACGATTATGTTTTTGTCGTCACCAATTCAAATAGCTGTAGTAAACAAGGTGGTCAGTGACACTGAAATTAAAGCAATTACCACAAACGGCGCTGTCGTAGCGTCTACTGACTACGCGATCCTGTTAAGCGACTCGCTGACCGTTGACGGCCTGGCGCAAGATGTTGCTGAAACTCTGCGCTACTATCAGTCGCAGGAAACCGTGATTGCGGATGCAGTTGAGTTCTTTAAGGATTTTGATTTCGAATCTCTTCAAAATCTTGCCAACCAAATTAAGGCAGACTCTGAAGCTGCGGAATCAAGTGCTACGGCGGCTGCCGCTTCTGAAAACGCGGCAAAATCTTCTGAAAATGCGGCAAAAAATTCAGAGGTAGCAGCGGAGAATGCAAGAGACCAAGTACAGCAGATCATCAACGACGCTGGCGAACAGTCAACGCTGGTGGCGCTGGCGCAGCCTGATGGGGCAAAAAACATAGGCAAGTGCTCAAGCATAGCACAGTTACGAAATGTTGAACCAAACAGCGCGGGACAGAGAATCTTGCTGGCATCATACAAAGCAGACGGAACTGCTGATGGAGGCGGCGAATTTTACTATGATCCGACAGATTCCACCACGGCTGACGATGGTGCATCATGCATTGTTACCAGTGGCGGCAAAAGGTGGAAGGCAATTATAAATCCGGCAGTCAAAAGTTCGACGTTTGCATCTAGCGTAAATATTAACGCCTATCTCGCAAAGAAAGGCGTCAACTTGCAATTCGACAATGCACTAACGCCGACCGGAACAATCAACGTTCAGAGCAACACACGTATTGAATTCTCCGGCAATGGCGAAATTGACGCTCCGAGCACGCTGATTCAAGGAATTACAATCGCAGGTGCCGCCCCGACGACCTTCTATAACCTGTCAGCTGATGCCCTATCAGGCGCTTACCAGGTAGCCATAGCTACCGACCAGTTCGCTGTCGGTGACTGGATCGAAATTCGCTCTGAAGATCTTGTTAAGGGGCCAAACGCGAAGGGCGTTAAGCAGGCGCAGTTACGGCGCGTGGTGAAAAAGGAAACGTCTGGCGGTCAGTATGTGTACTCACTAGACAGAGTTCTTGAATACGATTTTCTCGTCGCTAGCACGGCGAGGTGCGGCAAGGCTACCGTGATAGAGAACGTGGTTCTTGATAGCCCGCGACTGAACAATATCAACTATCTGAATCAGTTCGGAATTGGGATAAACTGCAACTATGTGGTCAATCTTAGAATCATTAATCCGATTCTTATTGGCTCAAAGGATAAGTTCTTCATTGAAAACGACGCAGGAACTGGTGTTGCAGGACGAAGCGCGATCAAACTGAACAATTGCCGTGACGTGACCATAGACGCCCCGGTGTGCCACCATCAGGGTTGGTATGGTGTTGAAGTTCTTGGATGCAGCGAAGATATTAAAATAAACGATGGGGATTTCAATGATTGCCGACACGGCGTTTCGGTTAACTGGTCAATGCCATACGGTGAACCGAGAACAGTAATATTCAATAGATGTGTTTCAAGCAATGCAACAAAGGCGGCATTTGACACGCACGATGTTGGGGTTGATATTAAATTCATAGATTGCCGTGCAATAAAATCCCAGGGCGATGGTTTCCAGTATCGCGCCCGGAACGTGAAATACATCCGTTGTTATGCTGCATACTGCCTGTCCAACGGTTTCGACGGCGCACAAGGTGCAACAGGTTCTGAATTTAAAGACTGCGTTGCAGAATTCAACAAAGCGGCCGGTTTCAATATCGCATTCGAACCTGGAACCGTGCGGAACTGCCGAGCCTACGGAAACATGGTTGGTGTCGGTGCAATGGGAGGAAAGATACTTGGTGGCGAACTGGAAGGAAACTCACTGGCGGCCATCGACTACGGAACCGGACTAACCGGTGTTGCCGCGCAGTCAGCGCTTGAAGTTACTGGTGTCAAGATGCCGTTCAGTGACGGCACGACAACGAAAGCGCAGCCAAGAGCAATCTATTTCCGTGGCGCAAAAGCGGTTGACCCATCCCTAGCCACGATCCGCGATTGTGACATCAACGGGTATGGCAACAATTGGGCATTGCTATCGTCTTATTCTTCTCAGCCATCACTCCCGGTGATGTCTGGAAATAAGCTGGATGCGACTGGCATTGTCGGCACGGTAACGCTTGTTGCAGGTACAGCTACGGTGCCAACAGCGAGCGCGAGAAAACGCGAAACAACGAACGTCAATGAGCTTTCAACGGTTAGTAAGATCAAGCTGACCAGGCTTACTTACCAGTCGGCCACGCCGCTTGGTGACCTTTACGTGTCACAAATAAACAACGGCGTTTCGTTTACTATTATGTCTACATCAAATTCGGATGTTTCGAAAGTTATGTGGGAAATATCCCTGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
5ddd41016dcbb0fba9cd604e6ee8016663b254ed8b0f903319e75ff746dce8e4
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7707
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50