GenBank accession
WJJ58412.1 [GenBank]
Protein name
tail spike protein
RBP type
TF (Evidence: GenBank, Probability: 1.00)
TF (Evidence: RBPdetect, Probability: 0.91)
Protein sequence
MAFDTIKIAELPSATQVVGDDFLVVEQPDKTKKATAFQVISDLDLANKSSLTGPGGAAIIGTEDGQGVQKSLDTDRANTRELWRRALHDLGLTLVDGSFEEGAELTYTTDAIWHMAGAQCYTWSGTFPKSVPAGSTPSNTGSEWLTVGGLSLLDEVTEIKNEAVEAKDEAVEAKNQAEAYVLQASEFGNLYPNTAAGLAATTSGQYFQVPQGSGSSVAFKVYKNNAGVAQEVAAAPGTGAIISTIREFPTLAAAQADADAGNILSGATAFYRSPDDSVLAIEVINNAGTLTATGRKMPSYDSLKRSNILFDAFNEQSAEQLNFASWDWYKGATPAFSTTDVNLPLPTPVIQASGVTSFDKYYDVSKLQVKPGDTLAFSVLVWFENAGGKLQIYWLDSAGATITTGEASPLVAGISSPVVVIAVPSGASLIGTHSGATVQQVLDSLSNSWQNLYYFSTTGNEVGTHVDTIALSPVQRTYGVQISTPLASFSPKVDNVLSNGTTSLRWSQVYAVNSVISTSNKKKKTNLRQITPTEAKAFYEIGKLDSVWQWLSKYSSEKGAARLHSGPTVQDAIKVMLKHGLDWTKYSAFCYDSWEANGDTPAGEEFAFRKEELLFWILRATIAVQEDLDKRLSALEKSLPGN
Physico-chemical properties
protein length: 642 AA
molecular weight: 68720.86610 Da
isoelectric point: 4.82173
aromaticity: 0.09190
hydropathy: -0.15794
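The aromaticity and hydropathy values above can be approximated from the sequence alone. The record does not say which tool produced them, so the sketch below assumes two common definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative, and only the first 16 residues of the sequence are used to keep the example short.

```python
# Kyte-Doolittle hydropathy scale; using it here is an assumption,
# since the record does not name the scale behind its "hydropathy" value.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First 16 residues of the protein sequence listed in this record.
prefix = "MAFDTIKIAELPSATQ"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 5))
```

Passing the full 642-residue sequence instead of `prefix` would allow a direct comparison with the reported values, provided the record's tool uses the same definitions.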

Domains [InterPro]

G3DSA:2.10.10.80 | ATT | 95–157
IPR040775 | RBD | 97–141
Coil | Unmapped | 156–183
IPR030392 | CHP | 519–580
cd10144 | CHP | 519–620
WJJ58412.1 | 1–642
Architecture
ATT 3-40 | ATT 92-157 | STR 158-537 | RBD 538-600 | CHP 601-639 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
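The pipe-separated architecture line lends itself to simple parsing. A minimal sketch, assuming each segment has the form `LABEL start-end` with 1-based inclusive residue positions; `parse_architecture` is a hypothetical helper, not part of any tool named in this record.

```python
def parse_architecture(line: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 3-40 | ATT 92-157 |' into (label, start, end) tuples."""
    segments = []
    for chunk in line.split("|"):
        chunk = chunk.strip()
        if not chunk:
            continue  # skip the empty piece after the trailing '|'
        label, span = chunk.split()
        start, end = span.split("-")
        segments.append((label, int(start), int(end)))
    return segments


arch = "ATT 3-40 | ATT 92-157 | STR 158-537 | RBD 538-600 | CHP 601-639 |"
segments = parse_architecture(arch)

# Residues covered by annotated segments, assuming inclusive coordinates.
covered = sum(end - start + 1 for _, start, end in segments)
print(segments)
print(covered)
```

Under the inclusive-coordinate assumption, the difference between `covered` and the 642-residue protein length gives the number of residues left unassigned by the architecture.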

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Klebsiella phage vB_KpnM_NDO71 [NCBI] | 3041490 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: Klebsiella pneumoniae [NCBI] | 573 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
WJJ58412.1 [NCBI]
GenBank nucleotide accession
OP558001.1 [NCBI]
CDS location
range 29985 -> 31913
strand -
CDS
ATGGCCTTTGATACAATTAAAATTGCGGAGTTACCCTCCGCAACCCAAGTGGTCGGAGACGACTTCCTTGTTGTAGAACAACCAGATAAGACCAAAAAGGCCACTGCCTTTCAGGTTATCTCAGACCTTGACTTGGCAAATAAATCATCCTTGACAGGACCTGGTGGGGCTGCTATTATCGGCACAGAGGATGGACAGGGAGTTCAGAAGAGTCTTGATACCGACCGCGCAAACACAAGAGAGCTTTGGAGAAGAGCGCTGCACGACCTTGGTTTGACACTTGTGGATGGCTCTTTTGAAGAAGGTGCCGAGCTTACCTATACAACCGATGCTATTTGGCATATGGCTGGGGCACAATGTTACACTTGGTCGGGTACTTTTCCAAAATCTGTACCTGCTGGATCAACACCGTCCAATACAGGATCTGAATGGCTTACCGTTGGAGGGTTAAGCCTATTAGATGAAGTTACCGAGATAAAAAATGAGGCGGTAGAAGCAAAAGATGAGGCTGTTGAGGCCAAAAATCAAGCCGAGGCTTACGTACTTCAGGCTTCAGAATTTGGAAATCTATATCCAAACACTGCGGCTGGCCTTGCGGCAACAACTTCTGGACAATATTTCCAAGTCCCTCAAGGAAGTGGAAGCTCTGTAGCGTTCAAAGTTTACAAGAATAATGCAGGGGTTGCTCAAGAAGTCGCGGCTGCACCTGGTACTGGAGCTATAATCAGCACTATCCGCGAATTCCCCACGCTGGCGGCTGCACAGGCCGATGCAGACGCTGGAAATATTCTGTCTGGCGCTACTGCATTTTACCGTAGCCCTGATGATAGCGTGCTAGCAATTGAGGTAATCAACAACGCCGGTACGCTGACAGCAACCGGAAGAAAAATGCCATCTTATGACTCATTGAAGAGATCGAATATCCTTTTTGATGCTTTCAATGAGCAATCTGCAGAACAGTTAAATTTTGCCAGCTGGGACTGGTATAAAGGCGCTACGCCCGCCTTTTCTACTACTGATGTTAATTTGCCGCTTCCAACACCTGTTATTCAAGCTTCTGGTGTAACATCGTTTGATAAATATTACGATGTGTCAAAACTACAGGTAAAACCAGGAGACACTTTAGCTTTCTCTGTTTTGGTATGGTTCGAGAACGCTGGAGGCAAGTTACAGATTTATTGGCTTGATTCAGCAGGAGCAACCATCACTACAGGGGAGGCCTCTCCTCTGGTTGCTGGCATATCCTCTCCGGTTGTGGTCATTGCTGTGCCATCTGGCGCATCCCTTATCGGAACCCACTCTGGGGCTACTGTCCAGCAAGTTTTGGACAGCCTTTCTAACAGTTGGCAGAATTTGTACTATTTCTCCACTACTGGGAATGAGGTCGGAACACATGTTGACACCATTGCACTTAGCCCTGTGCAAAGAACTTATGGTGTTCAAATATCAACACCTTTAGCATCATTTTCTCCTAAGGTAGACAATGTATTGTCCAACGGTACAACATCCCTGCGTTGGAGTCAAGTGTATGCAGTTAACAGTGTGATCTCCACCTCAAATAAAAAGAAGAAAACAAATCTTCGACAAATCACACCTACAGAAGCAAAAGCTTTTTATGAAATTGGGAAACTAGATTCTGTATGGCAATGGTTGTCTAAATACTCTTCTGAAAAGGGTGCTGCTAGACTTCACTCAGGACCTACTGTGCAGGATGCTATTAAGGTCATGCTCAAGCACGGTCTTGATTGGACCAAGTATTCAGCTTTCTGCTACGATAGCTGGGAAGCTAATGGGGATACTCCCGCAGGTGAGGAATTCGCTTTTAGAAAAGAAGAACTCTTATTTTGGATACTTAGAGCAACAATTGCGGTGCAGGAAGATCTTGACAAGCGCCTTTCAGCCTTGGAAAAATCTCTGCCCGGTAACTAA
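As a consistency check, the CDS range can be compared with the reported protein length. A minimal sketch, assuming the range is 1-based and inclusive (the GenBank feature convention) and that the CDS includes the terminal stop codon, as the trailing TAA in the sequence above suggests; the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Amino-acid count for a CDS that includes its stop codon."""
    nt = cds_length(start, end)
    if nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt // 3 - 1  # subtract the stop codon


# Range taken from the "CDS location" entry of this record.
print(cds_length(29985, 31913))
print(protein_length(29985, 31913))
```

The strand does not affect the length calculation; it only determines that the listed CDS is the reverse complement of the genomic coordinates.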

Tertiary structure

PDB ID
6730a42526b75d0bc10de488cf7d8c862ff64ef28d4c2a42428db39341c8922e
Source ESMFold
Method ESMFold
Resolution 0.6860
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
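The confidence bands above translate directly into a small classifier for per-residue pLDDT scores. A minimal sketch: `plddt_band` is a hypothetical helper, and because the legend leaves scores of exactly 90, 70, and 50 unassigned, placing them in the lower band is an assumption made here.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values fall into the lower band.
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```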