UniProt accession
A0A0S2MXS2 [UniProt]
Protein name
Uncharacterized protein
RBP type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.95
Protein sequence
MALIRLVAPERVFSDLASMVAYPNFQVQDKITLLGSAGGDFTFTTTASVVDNGTVFAVPGGYLLRKFVGPAYSSWFSNWAGIVTFMSAPNRHLVVDTVLQAMSVLNIKSNSTLEFTDTGRILPDAAVARQVLNIIGSAPSAFVPLAADAAAGSKVITVAAGALSVVKGTYLYLRSNKLCDGGPNTYGVKISQIRKVVGVSTSGGVTSIRLDKALHYNYYLSDAAEVGIPTMVENVTLVSPYINEFGYDDLNRFFTIGISANFAADLHIQDGVIIGNKRPGASDIEGRSAIKFNNCVDSTVKGTCFYNIGWYGVEVLGCSEDTEVHDIHAMDVRHAISLNWQSTADGDKWGEPIEFLGVNCEAYNTTQAGFDTHDIGKRVKFVRCVSYDSADDGFQARTNGVEYLNCRAYRAAMDGFASNTGVAFPIYRECLAYDNVRSGFNCSYGGGYVYDCEAHGSRNGVRINGGRVKGGRYTRNSSSHIFVTKDVAETAQTSLEIDGVSMRYDGTGRAVYFHGTMGIDPTLVSMSNNDMTGHGLFWALLSGYTVQPTPPRMSRNLLDDTGIRGVATLVAGESTVNARVRGNFGSVANSFKWVSEVKLTRLTFPSSAGALTVTSVAQNQDVPTPNPDLNSFVIRSSNAADVSQVAWEVYL
Physico-chemical properties
protein length: 651 AA
molecular weight: 70068.98250 Da
isoelectric point: 5.91810
aromaticity: 0.09985
hydropathy: -0.01828
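The aromaticity and hydropathy fields can be recomputed from the protein sequence. The sketch below assumes the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY). The record does not state which tool produced its values, so the function names and the choice of scale are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are aromatic (F, W or Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the sequence listed in this record,
# used only as a small input; paste the full sequence to compare
# against the values reported above.
fragment = "MALIRLVAPE"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Both functions expect an upper-case sequence made only of the 20 standard amino acids; any other character raises a `KeyError` in `gravy`.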

Domains

Domains [InterPro]
IPR011050 STR 359–533
Protein track: A0A0S2MXS2, residues 1–651
Architecture
ENZ 1-64 | ATT 65-140 | ENZ 142-179 | STR 180-651
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
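The architecture string can be parsed into labelled segments and checked for residues that no segment covers. This is a minimal sketch; the function names are hypothetical, and the `LABEL start-end | ...` layout is assumed from the single line shown in this record.

```python
def parse_architecture(text):
    """Parse 'LABEL start-end | LABEL start-end | ...' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        label, span = part.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


def uncovered(segments, length):
    """Return the 1-based positions not covered by any segment."""
    covered = set()
    for _, start, end in segments:
        covered.update(range(start, end + 1))  # coordinates are inclusive
    return [pos for pos in range(1, length + 1) if pos not in covered]


ARCHITECTURE = "ENZ 1-64 | ATT 65-140 | ENZ 142-179 | STR 180-651"
segments = parse_architecture(ARCHITECTURE)
gaps = uncovered(segments, 651)
print(gaps)  # [141]
```

Residue 141 is not assigned to any segment in the architecture line; whether that is intentional or an off-by-one in the upstream annotation is not stated in the record.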

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (A0A0S2MXS2, residues 1–651)
Domain Start End Length (AA) Confidence
N-terminal 1 78 78 0.9855
Central domain 79 566 488 0.9942
C-terminal 567 651 85 0.9354
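With inclusive 1-based coordinates, each domain length is `end - start + 1`, and the three domains should tile the protein without gaps or overlaps. A small sketch of that check follows; the helper names are hypothetical.

```python
DOMAINS = [
    ("N-terminal", 1, 78),
    ("Central domain", 79, 566),
    ("C-terminal", 567, 651),
]


def span_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def tiles_exactly(domains, protein_length):
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


lengths = {name: span_length(start, end) for name, start, end in DOMAINS}
print(lengths, tiles_exactly(DOMAINS, 651))
```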
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-78
Central 79-566
C-terminal 567-651

Taxonomy

Role Name Taxonomy ID Lineage
Phage Klebsiella phage KpV41 [NCBI] 1747282 Uroviricota > Caudoviricetes > Autographivirales > Slopekvirinae > Drulisvirus
Host Klebsiella pneumoniae [NCBI] 573 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
ALO80745.1 [NCBI]
GenBank nucleotide accession
KT964103 [NCBI]
CDS location
range 41928 -> 43883
strand +
CDS
ATGGCATTAATTAGATTAGTAGCTCCCGAGCGGGTGTTCTCCGACTTGGCGAGCATGGTAGCATACCCAAACTTTCAGGTGCAGGACAAGATTACCCTGCTGGGCAGTGCCGGTGGGGATTTCACCTTTACTACTACTGCGTCGGTAGTGGATAACGGAACCGTGTTTGCTGTACCTGGTGGGTACCTGCTCCGTAAATTCGTGGGTCCAGCATACAGCTCCTGGTTCAGCAACTGGGCGGGCATAGTCACGTTCATGAGCGCGCCTAATAGGCACCTGGTTGTGGACACGGTTCTGCAGGCCATGAGTGTGCTAAACATCAAAAGCAACTCTACTCTAGAGTTTACCGATACCGGAAGAATACTACCGGACGCCGCGGTTGCACGTCAAGTGCTTAATATTATCGGTTCTGCGCCCTCGGCGTTCGTACCGTTAGCAGCGGATGCTGCGGCGGGCAGCAAAGTCATTACGGTGGCTGCTGGGGCTTTGTCTGTGGTAAAAGGTACGTACTTGTATCTTCGATCTAACAAGCTGTGTGACGGCGGGCCTAACACGTACGGGGTAAAGATTTCCCAGATTAGGAAAGTGGTGGGAGTCAGCACCTCCGGTGGCGTCACCAGTATTCGACTGGATAAAGCGCTGCACTATAACTACTACCTGTCTGATGCCGCGGAAGTAGGTATCCCGACAATGGTGGAGAACGTAACCCTAGTATCTCCGTACATCAACGAGTTCGGCTACGACGACTTGAACCGGTTCTTTACTATCGGCATCTCTGCCAACTTTGCTGCGGACTTGCACATTCAGGACGGAGTTATTATTGGCAACAAACGCCCGGGGGCTTCTGATATAGAAGGGCGTAGTGCTATCAAGTTCAATAACTGCGTAGACAGCACCGTTAAGGGTACGTGCTTCTACAACATCGGATGGTACGGGGTAGAGGTGCTCGGCTGCTCAGAGGACACGGAAGTACACGATATCCACGCCATGGACGTACGCCACGCAATCTCTCTGAACTGGCAGAGCACTGCAGACGGGGACAAGTGGGGAGAGCCTATCGAGTTCTTAGGCGTTAACTGCGAAGCCTACAACACAACCCAGGCTGGGTTTGATACTCACGACATTGGTAAGCGTGTGAAGTTCGTTCGCTGCGTTTCGTACGACAGTGCTGACGATGGGTTCCAAGCGCGCACTAACGGGGTGGAGTACTTGAACTGTCGAGCTTACCGCGCAGCCATGGATGGATTCGCCTCCAACACCGGTGTAGCCTTCCCTATCTATAGGGAGTGCCTAGCTTATGACAACGTACGCTCTGGCTTTAACTGTTCGTACGGTGGTGGCTACGTTTATGACTGCGAGGCTCACGGTAGCCGGAATGGTGTGCGAATCAACGGGGGCCGTGTTAAGGGTGGTCGGTACACCCGCAACTCCAGCTCTCACATCTTCGTCACTAAGGACGTAGCTGAGACAGCGCAGACATCCCTGGAGATTGATGGCGTGAGCATGCGCTACGACGGTACCGGGAGAGCTGTGTACTTCCACGGGACTATGGGCATTGACCCTACCCTGGTATCTATGTCCAATAACGACATGACCGGTCACGGTTTGTTCTGGGCATTGCTGAGCGGCTACACTGTTCAACCTACGCCTCCGCGCATGTCCAGGAACTTACTGGATGACACAGGTATCCGTGGAGTGGCAACGCTGGTTGCTGGCGAGTCTACAGTCAACGCCCGTGTACGCGGAAACTTTGGCAGCGTAGCTAACTCCTTCAAGTGGGTGTCTGAGGTTAAGTTGACGCGGTTGACCTTTCCCTCTAGTGCTGGGGCCTTGACGGTTACTAGCGTAGCTCAGAACCAGGATGTGCCTACACCTAATCCGGACCTAAACAGCTTCGTGATTAGGAGCAGTAACGCAGCGGACGTATCCCAGGTAGCCTGGGAGGTGTATCTCTAA
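The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range of 41928 to 43883 spans 1956 nt, which is 652 codons, or 651 residues plus the stop codon. The sketch below also translates the first and last codons of the CDS shown above. It assumes the standard codon assignments (which NCBI translation table 11 for bacteria and phages shares for elongation); the function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in standard NCBI order (first, second, third base each cycling through TCAG).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def cds_length(start, end):
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


n_nt = cds_length(41928, 43883)
n_residues = n_nt // 3 - 1  # subtract the stop codon

print(n_nt, n_residues)
print(translate("ATGGCATTAATTAGATTA"))  # first six codons of the CDS
print(translate("GTGTATCTCTAA"))        # last four codons, including the stop
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed in this record if the assumptions above hold.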

Genome Context

Gene Ontology

GO ID Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0098994 symbiont entry into host cell via disruption of host cell envelope Biological Process IEA:UniProtKB-KW (UniProt)
GO:0098996 symbiont entry into host cell via disruption of host cell glycocalyx Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
0822c4dc1a5e78b8bb675ac32b3b2533445f6cee52d67031439ba14314c4e2ae
Source ESMFold
Method ESMFold
Resolution 0.8640
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
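The confidence bands above can be applied to per-residue pLDDT values with a small classifier. This is a sketch only: the legend does not say which band the exact boundary values (90, 70, 50) belong to, so assigning them to the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_band(plddt):
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: values exactly on a boundary (90, 70, 50) fall into the lower band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 80.0, 60.5, 30.0):
    print(score, plddt_band(score))
```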