UniProt accession
A0A2P1CCL2 [UniProt]
Protein name
Tail spike protein
RBP type
Type Evidence Probability
TSP UniProt/TrEMBL 1,00
TF GenBank 1,00
TSP DepoScope 1,00
TSP RBPdetect 0,91
TSP RBPdetect2 0,89
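The record lists one RBP type call per evidence source but does not say how, or whether, these calls are combined. As a minimal sketch only, the calls could be tallied per type as below. The names `ROWS`, `parse_decimal_comma` and `tally_by_type` are hypothetical, and the majority vote is an illustrative choice, not the database's method.

```python
from collections import defaultdict

# (RBP type, evidence source, probability) as listed in this record.
# Probabilities are kept as strings because the page uses a decimal comma.
ROWS = [
    ("TSP", "UniProt/TrEMBL", "1,00"),
    ("TF", "GenBank", "1,00"),
    ("TSP", "DepoScope", "1,00"),
    ("TSP", "RBPdetect", "0,91"),
    ("TSP", "RBPdetect2", "0,89"),
]


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '0,91' to a float."""
    return float(text.replace(",", "."))


def tally_by_type(rows):
    """Return (summed probability per type, number of sources per type)."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for rbp_type, _source, probability in rows:
        totals[rbp_type] += parse_decimal_comma(probability)
        counts[rbp_type] += 1
    return dict(totals), dict(counts)


totals, counts = tally_by_type(ROWS)
# Assumption: a simple majority vote over sources picks the consensus type.
consensus = max(counts, key=counts.get)
```

A probability-weighted vote would use `totals` instead of `counts`. For this record both choices give the same type.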
Protein sequence
MNKMFTQPSGPVAKQVNKQTIARIFGLKVSEITYLAPGTSIGGYTIAYYPANQKCYWVSGNSITVGSASVDSLGRLVITTSTGSTVTLNEAKLVAGASNKIWVEDFSSHNTSPNDWGPAFRAAVAYASITGINDVWFQGSYTISSTGSTWVLPFDDGTVSPDRLADSSEVTLPPEEAISVPVFIDLPYGVNIYSTSIWSNRLTFTWDAATVSTSQPIAFVGRVSNWDGTYVAATNGSNRYSSKTVKNQLSGFAIYNAFIGYVADGVAQWLDWEAMRFNTVGMCFLSAGMDHCEFGYIHMSNTIAGFLSGGWWQTRNAGNYPQSKLPPYPAPDVQAAGWNDFVYFRSIMDEGKKEEWTATSVYPKVDAYFDQYIYKTRHSVKVEDGGTGRMTKLTNAGADAYAATETDPFRGVAGRTFYYMSRGMHYNKCIFIDHLKVHGRHRIPICGTQFQTITWAGELNSCYIERAPFTNTGLTSSAGNDFYSSPLNTYNVTWPYGNASRIALFTSAPASAVQGKMGVVLFCPSNALQSKASSDGAVITASQSRQTVFSNTLTAARQTPLHQVGIQTPSAYSTMESLWYGMKYTQPLSFKDTNYLTRDTEEVLFETIAYKNSKTPISTVANGLTDGTANVALGGNSFLSYIRTRNQVEAKYFLQLPAAVDAYNSSLYIPMVGLPVPLTSISSGVLGNTVPEVTVSRASYKDVANALRTVSFTLGADGNYYLSLNKDYYGSTKGTLAELAAGTYLVFTVRYTTVG
Physico-chemical properties
protein length: 755 AA
molecular weight: 82240,23260 Da
isoelectric point: 7,54267
aromaticity: 0,12583
hydropathy: -0,14185
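The record does not state which tool or definitions produced these values. The sketch below assumes two common definitions: aromaticity as the fraction of Phe, Trp and Tyr residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names are hypothetical, and the results are not guaranteed to reproduce the listed numbers.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(sequence: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(residue in AROMATIC for residue in sequence) / len(sequence)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


# Toy input for illustration; paste the protein sequence from this record instead.
toy = "FWYA"
toy_aromaticity = aromaticity(toy)
toy_gravy = gravy(toy)
```

Both functions expect an upper-case sequence containing only the 20 standard residues. `gravy` raises `KeyError` on any other character.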

Domains

Domains [InterPro]
G3DSA:3.30.2020.50 ATT 1-94
Architecture (A0A2P1CCL2, residues 1-755)
ATT 1-94
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (A0A2P1CCL2, residues 1-755)
Domain Start End Length (AA) Confidence
N-terminal 1 106 106 0,9443
Central domain 107 521 415 0,9778
C-terminal 522 755 234 0,9511
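The segmentation can be sanity-checked from its Start and End columns. The sketch below assumes the coordinates are 1-based and inclusive. It computes each domain's length and checks that the three domains cover residues 1 to 755 without gaps or overlaps. The names `DOMAINS`, `inclusive_length` and `tiles_protein` are hypothetical.

```python
# (domain name, start, end) taken from the Start and End columns of this record.
DOMAINS = [
    ("N-terminal", 1, 106),
    ("Central domain", 107, 521),
    ("C-terminal", 522, 755),
]
PROTEIN_LENGTH = 755


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _name, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


lengths = {name: inclusive_length(start, end) for name, start, end in DOMAINS}
covers_whole_protein = tiles_protein(DOMAINS, PROTEIN_LENGTH)
```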
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-106
Central 107-521
C-terminal 522-755

Taxonomy

Role Name Taxonomy ID Lineage
Phage Klebsiella phage KP8 [NCBI] 2099850 Uroviricota > Caudoviricetes > Schitoviridae > Kaypoctavirus > Kaypoctavirus KP8
Host Klebsiella pneumoniae [NCBI] 573 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
AVJ48992.1 [NCBI]
GenBank nucleotide accession
MG922974 [NCBI]
CDS location
range 62259 -> 64526
strand -
CDS
ATGAATAAGATGTTTACCCAGCCATCCGGCCCGGTAGCAAAGCAAGTTAATAAACAAACTATTGCACGAATATTTGGGCTAAAGGTAAGTGAGATTACATACCTAGCTCCAGGAACCTCTATTGGGGGCTACACAATTGCATACTATCCTGCTAACCAAAAATGTTACTGGGTGTCAGGTAATTCAATAACGGTTGGTTCTGCTAGTGTGGATTCACTGGGACGCTTAGTTATAACTACCTCCACTGGCTCAACAGTTACACTTAATGAGGCAAAGCTGGTAGCAGGTGCTTCCAACAAAATTTGGGTTGAAGACTTCTCTTCCCACAATACTTCACCTAATGATTGGGGTCCTGCATTCCGTGCAGCAGTAGCCTATGCTTCTATTACCGGTATTAATGACGTGTGGTTCCAAGGTAGCTATACCATTAGCTCTACTGGCTCTACATGGGTACTCCCCTTTGACGATGGTACTGTATCTCCAGATAGACTTGCAGACTCGTCCGAAGTTACCCTACCACCTGAAGAGGCCATTTCTGTTCCTGTATTTATTGACCTACCTTACGGGGTTAACATCTATTCCACCTCTATTTGGAGTAATCGATTAACCTTTACGTGGGATGCTGCAACTGTTAGCACCTCTCAACCTATTGCTTTTGTAGGTCGTGTTAGTAACTGGGATGGTACCTATGTAGCTGCTACTAACGGTTCTAACCGATACTCGTCTAAGACGGTAAAGAACCAGTTAAGTGGGTTTGCCATTTACAATGCATTTATTGGATACGTTGCAGACGGTGTTGCTCAGTGGTTAGATTGGGAAGCAATGCGATTCAATACAGTAGGTATGTGCTTCCTGTCTGCTGGTATGGACCACTGCGAATTTGGTTACATTCACATGTCCAACACAATTGCCGGGTTCCTGTCTGGTGGTTGGTGGCAAACACGAAATGCAGGCAACTACCCGCAAAGTAAGTTACCTCCCTATCCGGCACCGGATGTACAGGCGGCAGGTTGGAATGACTTTGTCTACTTCCGTTCCATTATGGATGAGGGGAAAAAAGAAGAGTGGACTGCAACATCTGTGTATCCTAAGGTAGATGCATATTTTGACCAGTATATTTACAAGACTCGTCATTCAGTCAAAGTAGAAGATGGTGGTACTGGTCGAATGACCAAACTAACCAATGCTGGAGCAGATGCATATGCCGCAACGGAAACAGACCCATTCCGTGGGGTAGCAGGGCGAACCTTCTATTACATGTCCAGGGGTATGCACTATAACAAGTGTATTTTTATCGACCACCTTAAAGTGCATGGAAGACACCGTATCCCAATCTGTGGTACTCAGTTCCAAACTATTACTTGGGCTGGTGAACTCAATTCTTGTTATATAGAAAGGGCACCATTTACTAACACAGGTCTTACTTCCAGTGCTGGTAATGATTTTTATTCTTCACCACTAAATACGTATAATGTGACTTGGCCTTACGGTAATGCCAGCAGGATTGCTTTGTTTACATCTGCTCCTGCGTCCGCAGTCCAGGGTAAGATGGGAGTTGTACTGTTTTGCCCATCTAATGCTTTACAGAGCAAAGCTTCGTCAGATGGGGCGGTTATTACTGCGTCTCAGTCTAGACAGACTGTATTTTCTAATACCCTTACAGCAGCAAGACAAACGCCGCTACACCAAGTTGGTATACAGACACCATCTGCATACTCTACTATGGAATCATTGTGGTACGGCATGAAGTACACACAACCCCTGTCTTTCAAAGATACAAATTATCTTACTAGGGATACGGAAGAAGTACTGTTTGAAACCATTGCGTACAAGAATAGTAAAACACCTATTTCAACCGTAGCCAACGGGTTGACCGATGGTACGGCTAATGTGGCATTAGGTGGCAACAGCTTCCTATCCTATATTCGTACAAGAAACCAGGTCGAAGCTAAGTACTTTTTACAACTACCTGCTGCCGTTGATGCATACAACTCTAGTTT
GTATATTCCTATGGTTGGGTTGCCTGTGCCATTAACAAGCATTTCGTCCGGGGTACTTGGCAATACCGTACCTGAGGTTACTGTTTCTAGGGCATCCTACAAGGATGTGGCTAATGCTCTGAGAACGGTATCTTTTACTTTAGGGGCAGATGGTAATTATTACCTTTCCTTAAATAAAGACTACTATGGTTCCACTAAGGGTACTCTTGCAGAGTTAGCAGCAGGAACATACTTAGTATTCACAGTACGTTACACTACGGTAGGTTGA
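The CDS location is consistent with the protein length. Assuming the range 62259 -> 64526 is 1-based and inclusive, it spans 2268 nucleotides, which is 756 codons, or 755 residues plus a stop codon. The listed sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation already. The sketch below shows this arithmetic, plus the reverse complement that would be needed if you started from the genomic (plus-strand) slice of a minus-strand gene. All function names are hypothetical.

```python
# Translation table mapping each base to its complement (case preserved).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return sequence.translate(COMPLEMENT)[::-1]


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop: bool = True) -> int:
    """Residues encoded by the range, not counting the stop codon if present."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length // 3 - (1 if has_stop else 0)


nucleotides = cds_length(62259, 64526)
residues = encoded_residues(62259, 64526)
```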

Genome Context

Tertiary structure

PDB ID
61bf7318121e192296c0b1a454379fb6291206786fdfd579721c462072086072
Source ESMFold
Method ESMFold
Resolution 0,4605
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
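The confidence bands can be applied to per-residue pLDDT values as in the sketch below. The legend uses strict inequalities and leaves the values 90, 70 and 50 unassigned. Placing each boundary value in the lower band is an assumption made here, as is the 0-100 scale. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (assumed 0-100 scale) to its confidence band.

    Assumption: boundary values (90, 70, 50) fall into the lower band,
    since the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative values only; these are not taken from the model of this protein.
example_bands = [plddt_band(score) for score in (95.0, 82.3, 60.0, 12.5)]
```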