GenBank accession
WCS67833.1 [GenBank]
Protein name
tail fiber protein
RBP type Evidence Probability
TF Phold 1.00
TSP RBPdetect 0.90
TF RBPdetect2 0.94
Protein sequence
MTVSTQVSRNEYTGNGATTQYDFTFRILDKSHVLVQTLDTSENILTLTLGTDYTVTGVNRYNGGKVVLTSALPAGYKISIERSTPVTQEASIRNQGGFFPEIHEDALDKLTMLVQQAYGWWSGLSLRKPSWLANYYDALNNRIRNLRDPSQDQDAATKKYTDTLNAAAIGHSDDLFKRTMRVPESAISILPAKDFRKNKIVAMDNNGDPLMVLPESGSAADVLIELAKPTGSIYIGHHEGNTVSEYLDLTKALPHVAPLWQKVRSAQADAYIIILGDSTGNSDFEWVYKWSTWVASKYPTHSVRYRLFVDGSGWDPEIVMSTGTTGRSIYIDNVSVPGSTERYYQGAMRSQIYNSGRTYDLVILNYGHNEGTTVPELTIQAGFTEGVLAVKQDNPGAPVIVTAQNPRRDFPDHSARAVSAWAKVAAVQGLGIIDVYSKFIELGVPESLYTDFIHPNAAGMDVWAGVAIEALNDNPAFQFDKVIETYTGPLRPNLAPNPAFNSWLSSLPVSWAVNSVAVSRDLSRRDSFAFSVKAEVLNTSSPLFYCDLSDYLFAFKGQWVTFAARIWKNSGLSTNAGRLQISGTGMTAVTSRTKANEAENGWMWVVCHALLPKTITTLQIRLIPGSTVGEYVHIDRCWFGVGLMPSDIDFINQSQVQLDDYYSPLNVGIPSGYDGTLTVDGRHITVTPATTKARVYINIEYLTVGNNYKVTWVRNNSSTGDVYIRGAASGLGYIITQGELETATSLTFTATNKTNSVLIETDGLTPIDVNISSIVKV
Physico-chemical properties
protein length: 777 AA
molecular weight: 85375.02300 Da
isoelectric point: 5.62629
aromaticity: 0.09910
hydropathy: -0.16680
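The aromaticity and hydropathy values can be approximated directly from the protein sequence. The following is a minimal sketch, assuming that aromaticity is the relative frequency of F, W and Y and that hydropathy is the mean Kyte-Doolittle value (GRAVY). The record does not state which definitions it uses, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 30 residues of WCS67833.1; paste the full sequence to
    # compare against the values listed in the record.
    fragment = "MTVSTQVSRNEYTGNGATTQYDFTFRILDK"
    print(f"aromaticity: {aromaticity(fragment):.5f}")
    print(f"hydropathy:  {gravy(fragment):.5f}")
```

Molecular weight and isoelectric point need residue masses and pKa tables, so they are left out of this sketch.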

Domains

Domains [InterPro]
DC_0055 ATT 8–187
DC_0332 STR 265–470
SSF52266 STR 271–472
cd00229 ENZ 305–464
IPR013830 ENZ 330–461
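Several of the InterPro hits cover nearly the same region. The sketch below lists which hits overlap and by how many residues, assuming the coordinates are 1-based and inclusive; the record does not state its convention, and the helper names are illustrative.

```python
# InterPro hits from the record: (identifier, type, start, end).
HITS = [
    ("DC_0055", "ATT", 8, 187),
    ("DC_0332", "STR", 265, 470),
    ("SSF52266", "STR", 271, 472),
    ("cd00229", "ENZ", 305, 464),
    ("IPR013830", "ENZ", 330, 461),
]


def overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def overlapping_pairs(hits):
    """Return (id_a, id_b, shared_residues) for every pair of overlapping hits."""
    pairs = []
    for i, (id_a, _, s_a, e_a) in enumerate(hits):
        for id_b, _, s_b, e_b in hits[i + 1:]:
            shared = overlap(s_a, e_a, s_b, e_b)
            if shared:
                pairs.append((id_a, id_b, shared))
    return pairs


if __name__ == "__main__":
    for id_a, id_b, shared in overlapping_pairs(HITS):
        print(f"{id_a} and {id_b} share {shared} residues")
```

Under this assumption the ATT hit is disjoint from all others, while the two STR and two ENZ hits all overlap one another.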
Architecture
ATT 8-187 | STR 257-767

Tail Spike Domain Segmentation

The protein is segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 242 242 0.9891
Central domain 243 446 205 0.6001
C-terminal 447 777 330 0.7086
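The segmentation table can be checked for internal consistency. The sketch below assumes 1-based inclusive coordinates, so that a span's length is end - start + 1; the record does not state its convention. Under that assumption the three segments tile residues 1 to 777 without gaps, but the listed lengths of the central and C-terminal rows each differ by one from the computed span. Names in the code are illustrative.

```python
# Rows of the segmentation table: (domain, start, end, listed_length).
SEGMENTS = [
    ("N-terminal", 1, 242, 242),
    ("Central domain", 243, 446, 205),
    ("C-terminal", 447, 777, 330),
]
PROTEIN_LENGTH = 777


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def tiles_protein(segments, protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end, _ in segments:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


def length_mismatches(segments):
    """Return (domain, listed, computed) for rows whose listed length disagrees."""
    return [
        (name, listed, span_length(start, end))
        for name, start, end, listed in segments
        if listed != span_length(start, end)
    ]


if __name__ == "__main__":
    print("contiguous tiling:", tiles_protein(SEGMENTS, PROTEIN_LENGTH))
    for name, listed, computed in length_mismatches(SEGMENTS):
        print(f"{name}: listed {listed}, computed {computed}")
```

The listed lengths also sum to 777, so the discrepancy may reflect a different boundary convention in the source rather than an error in the total.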
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-242
Central 243-446
C-terminal 447-777

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage vB_KpnP_BUCT711 [NCBI] 3012326 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Klebsiella pneumoniae [NCBI] 573 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
WCS67833.1 [NCBI]
GenBank nucleotide accession
OM816836.1 [NCBI]
CDS location
range 14533 -> 16866
strand -
CDS
ATGACAGTTTCAACGCAGGTAAGCCGTAACGAGTACACCGGTAACGGCGCCACTACCCAATACGATTTCACGTTCCGCATCTTGGATAAAAGCCACGTGCTGGTGCAGACGCTGGATACCTCCGAAAACATCTTGACGCTGACGCTCGGCACCGATTACACGGTTACCGGCGTGAACCGCTACAACGGGGGTAAGGTGGTGCTGACATCGGCGCTGCCAGCTGGTTACAAAATCTCTATCGAGCGCAGCACTCCGGTTACGCAGGAAGCCAGCATCAGGAACCAGGGTGGCTTTTTCCCGGAGATCCATGAAGATGCTCTCGATAAACTAACCATGCTTGTTCAGCAGGCATACGGGTGGTGGTCTGGGCTATCTCTCAGGAAGCCGTCATGGCTCGCTAACTATTACGACGCACTCAACAATCGTATTCGTAACCTACGTGACCCATCTCAGGATCAGGATGCTGCAACAAAAAAATATACAGATACTCTAAACGCAGCTGCGATAGGCCATTCAGATGATTTGTTTAAAAGAACAATGAGAGTGCCTGAAAGCGCAATATCAATACTTCCTGCTAAGGATTTTCGAAAAAACAAAATAGTTGCCATGGATAATAATGGCGATCCACTTATGGTTTTGCCTGAAAGTGGCAGTGCTGCGGATGTGCTTATTGAATTGGCTAAACCAACTGGTTCTATTTATATCGGTCATCATGAGGGTAATACAGTGTCAGAATATTTAGACTTAACGAAAGCATTGCCTCATGTAGCTCCATTGTGGCAGAAGGTGCGAAGCGCGCAGGCAGATGCTTATATTATTATTCTTGGTGATTCGACGGGGAACTCAGATTTCGAGTGGGTTTATAAATGGTCAACATGGGTTGCATCTAAATATCCCACTCATAGTGTACGCTATCGCTTATTTGTTGATGGCTCTGGATGGGATCCCGAAATCGTAATGAGTACAGGTACAACAGGCAGATCAATTTATATTGATAACGTGTCTGTTCCTGGTTCTACAGAGCGTTATTACCAGGGAGCGATGCGCTCGCAAATCTATAACTCCGGTCGAACTTATGACCTGGTCATTTTAAACTATGGTCACAATGAAGGGACAACCGTTCCTGAGTTGACAATTCAGGCAGGATTTACAGAGGGTGTTCTTGCTGTGAAGCAAGATAATCCAGGTGCTCCTGTTATTGTTACGGCACAGAACCCGCGGCGTGATTTTCCAGACCATTCAGCGCGAGCTGTTAGCGCCTGGGCTAAGGTCGCCGCGGTGCAGGGACTTGGTATCATTGATGTCTATTCAAAGTTCATTGAGTTGGGTGTCCCAGAAAGCCTGTATACCGACTTTATCCATCCTAATGCAGCTGGCATGGACGTGTGGGCAGGGGTGGCAATAGAGGCGCTGAACGACAATCCTGCATTCCAGTTTGATAAAGTTATTGAGACGTATACAGGACCGTTGCGCCCGAACCTTGCGCCAAACCCGGCATTTAACTCCTGGCTTTCGTCTTTACCAGTCTCATGGGCAGTGAACTCAGTAGCTGTTTCTCGAGATCTTTCCCGACGAGATTCTTTTGCGTTCAGTGTAAAAGCTGAAGTTTTGAACACATCTTCTCCTCTTTTTTATTGTGATCTCAGTGATTATCTTTTTGCCTTTAAAGGGCAATGGGTGACCTTTGCTGCTCGTATCTGGAAAAACTCTGGTCTTTCTACAAACGCAGGTCGTCTTCAGATATCTGGGACAGGAATGACTGCTGTTACATCGAGGACTAAAGCCAATGAGGCAGAAAACGGCTGGATGTGGGTTGTCTGCCACGCATTATTACCTAAAACCATTACCACCCTGCAGATTCGTCTTATCCCAGGCAGCACTGTAGGCGAGTATGTTCACATTGACAGGTGCTGGTTTGGGGTTGGTCTTATGCCTTCTGATATCGATTTTATAAATCAGTCACAGGTGCAACTTGATGATTATTACAGCCCGCTAAACGT
CGGTATCCCATCTGGCTACGATGGAACATTAACTGTCGACGGAAGACATATCACTGTTACTCCAGCAACTACAAAAGCGCGTGTGTATATAAACATAGAATATTTGACTGTTGGTAATAATTACAAGGTTACATGGGTTAGAAATAACTCATCAACAGGCGATGTTTATATCAGAGGTGCTGCATCTGGCTTAGGCTACATTATTACGCAGGGAGAACTTGAGACTGCTACCAGCCTCACTTTTACTGCTACCAATAAAACAAACTCAGTACTTATTGAAACCGATGGGTTAACACCTATCGATGTAAATATTTCAAGCATAGTTAAAGTGTGA
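The CDS coordinates and sequence can be cross-checked against the protein. The listed sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the feature lies on the minus strand. The sketch below assumes that reading and uses the standard codon assignments; the helper names are illustrative. It strips whitespace first, so a sequence wrapped across lines is handled.

```python
# Standard codon assignments, built from the compact NCBI-style layout
# (bases ordered T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based inclusive genomic range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding-orientation CDS; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    n = cds_length(14533, 16866)
    print(n, "nt =", n // 3 - 1, "codons plus stop")
    # First eight codons of the CDS listed in the record.
    print(translate("ATGACAGTTTCAACGCAGGTAAGC"))
```

The range 14533 to 16866 spans 2334 nt, which equals 3 × (777 + 1) and so matches a 777-residue protein followed by one stop codon.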

Genome Context

Tertiary structure

PDB ID
2f954a91d8a393cd7975451f218d8afff65badf33c18be1a7a9cc172ce2aac4c
Source ESMFold
Method ESMFold
Resolution 0.7568
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
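The confidence bands can be applied to per-residue pLDDT scores as follows. This is a minimal sketch: the legend leaves the boundary values 90, 70 and 50 unassigned, so the code assumes each boundary falls into the lower band, and the function name is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend.

    Assumption: scores exactly equal to 90, 70 or 50 go to the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    for score in (95.2, 81.0, 63.4, 42.7):
        print(score, plddt_band(score))
```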