UniProt accession
A0AAV1MC20 [UniProt]
Protein name
Tail spike protein
RBP type
Type  Evidence        Probability
TSP   UniProt/TrEMBL  1.00
TF    GenBank         1.00
TSP   DepoScope       1.00
TSP   RBPdetect       0.91
TSP   RBPdetect2      0.98
Protein sequence
MLKNDFNQPKGSTIGVLRDGRTIQQAFDKMRYADTLQELRTMEPVGTRDVATVRRATEDSVLVNAPVYYDKNDTTSPDDGISVFVTAGGARWKFNTFKGYCAGLAGLKEDGSNVTTVVNGIVNRIVAKMVAAGRVNNQQRTVNIPVTGDAPEWKLTGPLVWPTVISLVFHGSVLLDGTSLTAGKILNCNNVPFMDRLTVAIMAAGTNLTDRPGRVNQAVGRAALTCIGGEVTVRLPGTQMDGSNNPTIHTVGAMIGNDAACLLDARDIYFEKFNIIGAKTGIEFGCYNTFMCGVEHFNVSRCYDGFSSPTLGSNYGERMYLRNGTIGNMDRHGAYLVGGGDFTLENVSVDFLGGDLAHFGPLSPAEYKHLSGHIEGVKGSLAAKEMPASYSKALVILGKAVRRDDRVVDQNDYRGVRQLFACPQNPYGRMLKVINESYAPGREGQVPNNPYPCETGWPGNSGVELILPKDTFVDTPYVNSYSPAVRNSVNQTISFTTASTGPLGGNILSTDYAFAAEIIGDATCSYGTPAEATSDGYMPFIINLTSPSDVVYLFCTNRFRPGSGQLVLWGNCSVTLVGTTGSIILAPVIASYLGTTWTANTTTGAVTATPIRRGIQEGGTVDMSALAAAGGIPNGTYQAMPSRPVQGFYLGCDHAVAGFKITGGTGQVRLKLPVWWFR
Physico-chemical properties
protein length: 678 AA
molecular weight: 72529.37340 Da
isoelectric point: 7.04664
aromaticity: 0.08702
hydropathy: -0.08805
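
The page does not say how aromaticity and hydropathy were computed. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names are illustrative, and the toy input is only the first 20 residues of the protein sequence above.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumed definition: Phe, Trp, Tyr).
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


# First 20 residues of the protein sequence, used as a small worked input.
toy = "MLKNDFNQPKGSTIGVLRDG"
print(len(toy), aromaticity(toy), gravy(toy))
```

Under these assumptions, the reported aromaticity of 0.08702 over 678 residues corresponds to about 59 aromatic residues.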

Domains

A0AAV1MC20, 1–678 aa
RBD 34–93

Domain legend: ATT Attachment Domain · STR Structural Domain · RBD Receptor-Binding Domain · CBM Carbohydrate-Binding Module · LEC Lectin-like Domain · ENZ Enzymatic Domain · CHP Intramolecular Chaperone · LNK Linker/Spacer Domain · TAS Tail-Associated Structural · TTP Tail Tubular Protein · UNK Uncharacterized Domain · Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

A0AAV1MC20, 1–678 aa
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      114  114          0.9904
Central domain  115    483  370          0.9930
C-terminal      484    678  194          0.9934
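
As a rough cross-check of the segmentation, the sketch below derives each domain's length from its Start and End values, assuming 1-based inclusive coordinates (the page does not state its convention). The helper names are illustrative. Under that assumption the lengths come out as 114, 369 and 195, which differs by one from the listed 370 and 194 for the central and C-terminal domains. The total is 678 either way, so the difference may only reflect how the boundary residue is assigned.

```python
# Domain boundaries as listed in the segmentation table (Start, End).
DOMAINS = [
    ("N-terminal", 1, 114),
    ("Central domain", 115, 483),
    ("C-terminal", 484, 678),
]


def span_length(start: int, end: int) -> int:
    """Length of a span under 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def slice_domain(seq: str, start: int, end: int) -> str:
    """Extract a domain from a sequence using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def tiles_sequence(domains, total_length: int) -> bool:
    """True if the domains cover 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = {name: span_length(start, end) for name, start, end in DOMAINS}
print(lengths)
print(sum(lengths.values()), tiles_sequence(DOMAINS, 678))
```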


Taxonomy

Phage
Klebsiella phage vB_Kpl_K56PH164C1 [NCBI] · taxon 3071653
Host No host information

Coding sequence (CDS)

Genbank protein accession
CAK6589236.1 [NCBI]
Genbank nucleotide accession
OY977469 [NCBI]
CDS location
range 38934 -> 40970
strand +
CDS
ATGCTTAAGAACGATTTTAACCAGCCGAAGGGCTCAACCATTGGTGTGCTCAGGGATGGGCGCACTATCCAACAGGCATTCGATAAGATGCGATATGCTGACACACTCCAAGAGCTTCGCACTATGGAGCCAGTAGGTACTCGTGATGTGGCAACAGTACGCAGGGCCACTGAGGACTCTGTACTGGTCAACGCCCCTGTGTATTACGATAAGAACGACACAACGTCACCGGATGATGGCATCTCCGTGTTCGTCACTGCTGGCGGTGCTCGCTGGAAGTTCAACACGTTCAAGGGCTATTGCGCAGGACTCGCTGGGCTGAAGGAGGATGGCTCTAACGTCACTACTGTGGTGAATGGTATCGTAAACCGCATAGTCGCCAAAATGGTGGCAGCGGGTCGCGTGAATAACCAACAGCGGACCGTTAACATCCCAGTGACCGGGGACGCTCCTGAGTGGAAACTGACTGGTCCTCTTGTGTGGCCAACTGTCATCTCTCTCGTTTTCCATGGCTCTGTACTGCTGGATGGCACATCGCTCACGGCTGGCAAGATTCTCAACTGTAACAACGTACCGTTCATGGACCGACTCACTGTCGCCATTATGGCTGCTGGGACCAACCTCACAGACAGGCCGGGTCGAGTTAACCAAGCTGTGGGACGTGCAGCGCTCACCTGTATCGGTGGTGAGGTTACGGTACGTCTCCCCGGAACCCAGATGGATGGGAGCAACAACCCGACTATTCACACTGTCGGTGCTATGATTGGGAACGATGCGGCGTGTCTGTTGGATGCTCGTGATATCTACTTTGAGAAGTTCAACATCATCGGTGCCAAGACTGGTATCGAGTTCGGGTGCTACAATACCTTCATGTGTGGGGTCGAGCACTTCAACGTGTCTCGCTGCTACGATGGGTTCAGTAGTCCGACTCTCGGTTCCAACTATGGGGAGCGTATGTATCTCCGAAACGGGACGATTGGTAACATGGACCGACACGGGGCATACTTAGTAGGCGGCGGTGACTTCACCCTTGAGAACGTCTCTGTGGACTTCCTTGGTGGTGACTTGGCTCACTTTGGCCCACTGTCCCCGGCTGAGTATAAGCACCTGTCAGGTCACATTGAGGGTGTCAAAGGCTCCCTCGCAGCCAAAGAGATGCCTGCAAGCTACTCTAAGGCGCTGGTAATTCTCGGTAAGGCTGTTCGTCGAGATGACCGTGTTGTTGACCAGAATGACTACCGTGGTGTTCGCCAGCTGTTCGCCTGCCCGCAGAACCCGTACGGTCGAATGCTCAAAGTCATCAACGAGAGCTACGCTCCGGGACGTGAGGGTCAGGTGCCTAACAACCCGTACCCATGCGAGACTGGCTGGCCGGGAAACTCTGGGGTCGAACTGATTCTACCTAAAGACACCTTTGTTGACACACCGTACGTGAACTCGTATTCGCCAGCGGTGCGAAACAGTGTCAACCAGACAATCAGCTTCACCACAGCGAGTACCGGACCACTTGGCGGTAATATATTGTCTACCGACTACGCGTTCGCTGCTGAGATTATCGGGGATGCAACGTGCTCCTACGGTACCCCTGCTGAGGCCACGAGCGATGGATATATGCCGTTCATCATCAACCTGACGTCCCCGTCCGATGTGGTCTACTTGTTCTGTACCAACAGATTCCGTCCCGGCTCCGGTCAGTTGGTACTGTGGGGCAACTGCTCGGTAACCTTGGTGGGCACCACTGGGTCTATCATCTTGGCCCCAGTAATCGCGTCTTACCTCGGTACCACTTGGACTGCGAACACCACAACTGGAGCCGTTACAGCCACTCCAATCCGACGAGGCATTCAGGAGGGCGGTACGGTCGATATGTCCGCTCTCGCAGCGGCTGGTGGTATTCCAAACGGTACCTACCAAGCGATGCCGTCCCGGCCTGTCCAAGGGTTCTACTTAGGGTGTGACCACGCTGTGGCTGGATTCAAGATAACTGGCGGTACTGGCCA
AGTGCGCCTTAAGCTGCCAGTGTGGTGGTTCCGCTAA
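
The CDS coordinates can be checked against the protein length with a little arithmetic. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with a stop codon (the sequence above ends in TAA). The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a CDS given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(nt_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given nucleotide length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    # The stop codon is counted in the CDS but does not encode a residue.
    return codons - 1 if has_stop else codons


nt_length = cds_length(38934, 40970)
aa_length = expected_protein_length(nt_length)
print(nt_length, aa_length)
```

With the range given above this yields 2037 nucleotides, that is 679 codons including the stop, which agrees with the reported protein length of 678 AA.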


Tertiary structure

A0AAV1MC20
ESMFold structure
Source ESMFold
pLDDT 69.4
Oligomeric state monomer