Genbank accession
YP_009032516.1 [GenBank]
Protein name
minor tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MVDTPNPNPNNGPGAELEKWLGTGAFELGGGDWNYGQDFTEDAIRSLFELPAITLINAIELLEEQLLKMPIEALRVFKPLIPDAIEDDFEDVATAVAKIIDTLTDGPAALLRGEFDEWLSETFGPLAATVQQVLEILAGLVVDPVNETVGAIKDWWDLITGKTQGLNSSGQLDASKLTNLENIGEIPNGLEKMPDLQSLVDAATNALSGASQVGEEVVGAGLDIAKSTMENLFSTLSKVTRDVQALQSEQEASQVGGRRFNVDFTQYPDGPFPSGLFNVTYSGAGSSTLGISGGKAIWNTVNDGYRRATLIFPEPTLTPFQVVRGTLSSPPEQGTNVRIWSVARANASGTDFVFARGYCNGFLSYRGDIGCYKDGVEYVWASNVPLTWSLDIRIICGVGNDPRHHIVLSGDKMIIDLYEPADKQSRVDEDHCYWGAISETNGVQVPGNVAGASVVDNAPPAVVGTTLRVSKRSGGDITIPSGGAKLPNNFYETIDYQSPDLIYEPSKNCRVTATKAGTYLVEYRVYHGQYATNTGGHAQIYRNGSVYAKGQWGSCPFVAGWGIMVDPTDSTHGSFLVPLNPGDYIEPGFWFSANMSNTGDARLMAGGAQSYMSVARLGTN
Physico‐chemical
properties
protein length:620 AA
molecular weight: 66626,68150 Da
isoelectric point:4,51065
aromaticity:0,09032
hydropathy:-0,17565

Domains

Domains [InterPro]
DC_0105
STR
1–620
Coil
Unmapped
229–249
YP_009032516.1
1 620
Architecture
STR
STR 1-620
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009032516.1
1 620
Domain Start End Length (AA) Confidence
N-terminal 1 256 256 0,9978
Central domain 257 455 200 0,0277
C-terminal 456 620 164 0,9981
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-256
Central
257-455
C-terminal
456-620

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage Phantastic
[NCBI]
1486426 Uroviricota > Caudoviricetes > Veracruzvirus >
Host Mycobacterium smegmatis str. MC2 155
[NCBI]
246196 Bacteria > Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009032516.1 [NCBI]
Genbank nucleotide accession
NC_024148 [NCBI]
CDS location
range 22796 -> 24658
strand +
CDS
ATGGTAGACACACCGAATCCGAACCCCAACAACGGCCCCGGTGCAGAACTTGAGAAGTGGCTCGGCACAGGGGCATTCGAGCTAGGTGGCGGTGACTGGAACTACGGACAGGACTTCACCGAGGACGCCATTCGGTCGCTGTTCGAGCTGCCAGCGATCACCCTGATCAACGCCATCGAGCTACTCGAAGAGCAACTGCTGAAGATGCCCATCGAGGCGCTGCGGGTCTTCAAGCCGTTGATCCCGGACGCCATCGAGGACGACTTCGAGGATGTCGCAACGGCAGTCGCCAAGATCATCGACACTCTGACCGATGGGCCGGCGGCGCTGCTGCGCGGCGAGTTTGATGAGTGGCTGTCGGAGACCTTCGGTCCCTTGGCGGCTACGGTCCAGCAGGTCTTGGAGATTCTGGCTGGGTTGGTCGTTGACCCCGTCAACGAGACCGTAGGGGCCATCAAGGACTGGTGGGACCTGATCACCGGTAAGACCCAGGGCCTCAACTCATCCGGCCAGCTCGATGCATCCAAGCTGACGAACCTGGAGAACATCGGGGAGATTCCGAACGGCCTGGAGAAGATGCCCGACCTCCAGAGCCTGGTCGACGCAGCCACGAACGCGCTCTCTGGCGCGTCCCAGGTTGGTGAGGAGGTCGTCGGAGCGGGGCTCGACATCGCGAAGAGCACGATGGAGAACCTCTTCTCCACGTTGTCGAAGGTCACCCGAGACGTGCAGGCGCTGCAGTCGGAGCAGGAGGCCAGCCAGGTTGGTGGTCGACGGTTCAACGTCGACTTCACCCAGTACCCGGACGGCCCGTTCCCATCAGGGCTGTTCAACGTCACGTACTCCGGGGCGGGGTCCAGCACCCTGGGCATCAGCGGTGGTAAGGCGATCTGGAACACGGTCAACGACGGCTACCGCCGTGCGACCCTGATCTTCCCAGAGCCGACCCTGACCCCGTTCCAAGTCGTGCGAGGCACGCTGTCCTCTCCCCCGGAGCAGGGCACGAACGTCCGCATCTGGTCGGTGGCCCGTGCCAACGCATCCGGCACCGACTTCGTGTTCGCACGCGGCTACTGCAACGGCTTCCTGAGCTATCGAGGCGACATCGGCTGCTACAAGGACGGTGTCGAGTACGTCTGGGCGTCCAACGTCCCGCTGACGTGGTCCCTGGACATCCGGATCATCTGTGGCGTCGGTAACGATCCACGCCACCACATCGTCCTGTCGGGCGACAAGATGATCATCGACCTCTACGAGCCAGCCGACAAGCAGTCGCGAGTGGACGAGGACCACTGCTACTGGGGCGCGATCTCAGAGACCAACGGCGTCCAGGTCCCCGGCAACGTGGCCGGTGCGTCGGTGGTCGACAACGCTCCCCCGGCCGTCGTCGGTACGACGCTGCGTGTCTCGAAGCGGTCAGGTGGCGACATCACCATCCCCAGCGGTGGTGCGAAGCTCCCGAACAACTTCTACGAGACCATCGACTACCAGTCGCCTGACCTGATCTATGAGCCCAGCAAGAACTGCCGGGTGACCGCCACCAAGGCCGGCACCTACCTGGTCGAGTACCGGGTGTATCACGGCCAGTACGCCACCAACACAGGCGGTCACGCGCAGATTTACCGCAACGGCTCGGTGTACGCCAAGGGCCAGTGGGGCTCCTGCCCGTTCGTCGCTGGCTGGGGCATCATGGTGGACCCCACGGACTCGACCCACGGGTCGTTCCTGGTGCCACTGAACCCAGGCGACTACATCGAACCGGGCTTCTGGTTCTCCGCGAACATGTCCAACACGGGCGACGCCCGCCTGATGGCTGGTGGCGCACAGTCCTACATGTCCGTAGCCCGACTGGGCACCAACTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
22bdcf0293b0e057dfcb6761e0bc9f3ba8737bb03466170c9db180bacbbf3d98
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6903
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50