Genbank accession
QYI86596.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence RBPdetect
Probability 0,70
Protein sequence
MTCSNKPNYTPFKLQGLAPTKQWGLGRSASYERFNPNEFHALEDEVKITFPVDFRGKVKGNSNPNPSRGFSHASQIYRENVLRGSQFLSKPSLLFNGAVYNEKTLYNGAMVASTQEVNQGLMFYWEDFINPDPRIKEGDMVTFSVDVRSTGEDIPTGAVAFKGTSNFEEYYKVLEQPITKEFTRISFTTRFLSKEWEWDEAFGMFWGVENVPVTDYPMFQYDKDKLVGFIQAQESVQLEFSRPMVSTIGKTAYIESGDDMFSNNKWDWSKAVSKGRLNEFDGKYNTNGIKQDWAMPEFIPIFKGSERIHFYKKTTLTETTDAGHGYGKILFYSAANEDSYMNVYKKFPHEAGVYGGYAAEVEVPIGATHYRVHITSERGKATTEAYVQSSANDWSEFSQEWYDGLSGKLDGKTARAETFYRKEMVEMEFELHFAEAVQKALPNIFKGLTTDAEKQQRLRDIAYQFDSTMIARGRGQGNNLADWIFYRWAPDGSIEEQTMKQFRGEGLTSVTHPSSYQEWINPQGRIVSSLRSNRIIPNSMYMGTDTLSGKYDPKLTTLQLEVNGELVDKKATLDKTSEVFTFTGLKGFLKPEDKVRIAGWINRNNQWSITYSNVLMGNTPENVAEFNERAFVEADYLVTNMSIIVTQSQMDEWGFNPNLPDYRIESLERPASRTEEYMINSMVQVTHAYDIYGFVESNYPEFFGDCYTFDDRIRKINERIKQFNIVATTYSEEPWEDGNPTIFISAEARNPDRDYDVKAEINQTVELQINNSKGHFIHPNGYIYVGFARMPRSDRETGISVEAELSFDFMMDRQYDSLPRVFRYNYQKQPWFLFVRNINRSVLAPKVNTLTPINGGTRRYNFGATEDARYISMDCFIKAPSEEDMPKLMEELADFLDVGETVIQFSDNKDRYYKVILDGSTDLSQTLHVGTLTLTFVLLENTAIGEEVVESFDIDSTTGSVPFIELENMGTADAYPTYQLTFEEPVGYVDLIGTDTSANVSIGRRPKDTEDEVKVDLRPRKFYSKFTSTDGAGWTAMNDTQLPQIEGYSTKLQGTVQRVNGMANQDKWNYGDTSHRGWHGAGIVANLQKNLDDFYMEASIVATGKPVKTSANAIFLIFYDEDNNPYAYTKVGTRPQEGNLDSYVAYAGDWGKRKVVNNGAKWKDFWGKVSVQRRDNRWRLIVGQYKDRRYSPSPEASFNFGQNMLKDTRDTGWFDLPPETWGKKFARVGIFFGQYSHRPKLGHLSVRRLIVWENLEEYGETPLEGTPIMFHEGDTVVIDSSKAQTYLNGELTPSLVDPMTDWFPITKGDNYIGVNNFKGKIDIVYNERFK
Physico‐chemical
properties
protein length:1330 AA
molecular weight: 152244,35370 Da
isoelectric point:5,17896
aromaticity:0,13083
hydropathy:-0,58902

Domains

Domains [InterPro]
DC_0077
STR
691–1330
G3DSA:2.40.30.200
ATT
820–940
IPR008841
STR
862–937
QYI86596.1
1 1330
Architecture
STR
ATT
STR
STR 1-819 | ATT 820-940 | STR 941-1330
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QYI86596.1
1 1330
Domain Start End Length (AA) Confidence
N-terminal 1 16 16 0,8448
Central domain 17 246 231 0,5749
C-terminal 247 1330 1083 0,1986
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-16
Central
17-246
C-terminal
247-1330

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterococcus phage SSsP-1
[NCBI]
2859527 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Enterococcus sp.
[NCBI]
35783 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QYI86596.1 [NCBI]
Genbank nucleotide accession
MZ333457 [NCBI]
CDS location
range 30965 -> 34957
strand -
CDS
ATGACATGTTCTAATAAACCTAATTACACTCCGTTTAAATTACAAGGTCTAGCCCCTACTAAACAGTGGGGCTTAGGCCGTTCTGCTAGTTATGAACGGTTTAACCCAAATGAGTTTCACGCCTTAGAGGACGAGGTTAAGATTACTTTCCCAGTGGACTTTAGAGGAAAGGTTAAAGGAAACTCTAACCCAAACCCTTCACGAGGATTTAGCCACGCCAGTCAAATCTACCGTGAAAATGTTTTAAGAGGAAGTCAATTCCTTTCTAAGCCTTCATTACTATTTAATGGAGCAGTATATAATGAGAAGACACTTTACAATGGAGCTATGGTAGCCTCTACTCAAGAAGTTAACCAAGGATTAATGTTCTATTGGGAAGACTTTATCAATCCTGACCCAAGAATTAAAGAAGGTGACATGGTAACTTTCTCAGTAGATGTCCGAAGCACTGGAGAAGATATACCAACAGGAGCTGTGGCCTTCAAAGGAACTTCTAACTTTGAGGAATACTATAAAGTCCTTGAACAACCTATCACAAAAGAGTTTACAAGAATCTCATTCACAACTCGTTTCCTATCGAAGGAATGGGAATGGGACGAAGCCTTTGGGATGTTCTGGGGAGTAGAGAATGTTCCTGTTACAGATTACCCAATGTTCCAATATGACAAGGATAAGTTAGTTGGATTTATTCAGGCTCAAGAAAGTGTTCAACTAGAGTTCTCACGGCCTATGGTTTCAACTATCGGAAAGACTGCTTACATTGAGTCTGGTGACGACATGTTCTCTAACAACAAATGGGACTGGTCTAAAGCCGTATCTAAAGGTCGTTTAAATGAATTTGATGGTAAGTATAACACAAACGGAATCAAACAAGACTGGGCAATGCCTGAGTTCATTCCTATTTTCAAAGGCTCTGAGAGAATCCACTTCTACAAAAAGACTACCCTAACGGAAACAACGGACGCTGGACATGGCTATGGTAAAATCCTATTCTACAGCGCTGCTAACGAAGATTCTTACATGAATGTATATAAGAAGTTCCCTCATGAGGCTGGCGTATACGGTGGATACGCTGCTGAAGTAGAAGTGCCTATTGGAGCTACTCACTACCGAGTTCATATAACTTCAGAAAGAGGAAAGGCCACTACAGAAGCCTATGTACAATCTTCTGCTAACGACTGGTCAGAGTTTAGCCAAGAGTGGTACGACGGACTATCAGGAAAGCTTGACGGAAAGACTGCCAGAGCTGAAACGTTCTACCGTAAAGAAATGGTTGAGATGGAGTTTGAGCTACACTTTGCTGAAGCTGTTCAGAAAGCTTTGCCAAACATCTTCAAAGGCCTTACTACTGACGCTGAGAAACAACAAAGACTTCGTGACATCGCATATCAATTCGATTCAACAATGATTGCACGAGGCCGTGGCCAAGGTAATAACCTAGCTGATTGGATTTTCTATAGATGGGCTCCTGACGGCTCTATTGAAGAGCAGACAATGAAGCAGTTTAGAGGCGAGGGATTAACCTCTGTTACCCATCCGTCAAGCTATCAAGAGTGGATTAATCCACAAGGTAGAATCGTATCATCTCTACGCTCTAACCGAATCATTCCTAACTCAATGTACATGGGAACTGATACGTTAAGTGGTAAGTACGACCCTAAATTAACAACGCTTCAGCTAGAGGTTAATGGAGAGCTTGTTGATAAGAAGGCTACCTTAGATAAGACCTCAGAGGTGTTTACATTCACTGGTCTTAAAGGATTCTTAAAACCTGAAGATAAAGTTAGGATTGCTGGATGGATTAATCGTAACAACCAATGGTCAATCACCTACTCTAATGTTCTTATGGGAAACACTCCCGAAAACGTTGCTGAGTTTAACGAACGAGCTTTCGTCGAAGCTGATTACCTAGTTACAAACATGTCTATTATCGTAACTCAATCACAGATGGATGAATGGGGATTTAACCCTAATCTACCAGACTACCGAATTGAATCATTAGAACGTCCAGCAAGTCGTACTGAGGAGTATATGATTAACTCAATGGTACAGGTAACACATGCTTATGACATCTACGGATTTGTTGAAAGCAACTACCCAGAGTTTTTCGGTGACTGTTACACGTTCGATGATAGAATAAGAAAGATTAATGAGCGCATTAAACAGTTTAACATTGTAGCTACTACTTATTCAGAAGAGCCTTGGGAAGATGGTAACCCGACAATCTTTATCAGTGCTGAGGCTAGAAACCCTGACAGAGACTACGATGTTAAAGCTGAGATTAACCAAACAGTTGAGCTTCAAATTAACAATTCTAAGGGACACTTTATTCATCCTAACGGTTATATATACGTGGGATTCGCTCGTATGCCAAGAAGCGACCGTGAGACAGGAATCTCTGTAGAAGCTGAATTAAGCTTTGACTTCATGATGGACAGACAGTACGACAGCCTTCCAAGAGTCTTCCGATACAACTATCAGAAGCAGCCTTGGTTCTTGTTTGTAAGAAACATTAACAGAAGCGTTCTTGCCCCTAAAGTAAACACTCTAACTCCTATTAACGGAGGCACTAGACGTTATAACTTTGGAGCTACAGAAGACGCTCGGTATATCTCAATGGACTGCTTTATCAAAGCTCCTTCTGAAGAGGATATGCCTAAGTTGATGGAAGAGCTTGCTGACTTCCTAGACGTTGGAGAAACGGTAATCCAATTCTCTGATAACAAAGACCGCTACTACAAGGTTATTCTGGATGGTTCAACAGACCTTTCCCAAACGCTTCACGTAGGTACGTTGACCCTAACCTTCGTACTGCTTGAGAATACAGCCATTGGCGAAGAGGTTGTAGAAAGCTTTGACATAGACAGTACGACAGGTTCAGTACCATTCATTGAGCTTGAGAACATGGGTACGGCGGACGCTTACCCTACCTACCAACTGACCTTTGAAGAGCCTGTAGGATATGTTGACCTAATCGGTACAGATACCTCAGCCAACGTGTCAATTGGAAGACGTCCTAAAGATACTGAAGACGAGGTTAAAGTAGACCTTCGTCCTCGTAAGTTCTACAGTAAGTTTACATCTACTGATGGAGCTGGATGGACGGCTATGAATGATACTCAACTTCCTCAGATTGAAGGGTACTCAACTAAGCTTCAAGGAACTGTCCAAAGAGTTAATGGAATGGCCAATCAAGACAAGTGGAATTACGGAGACACTAGCCACAGAGGATGGCATGGAGCTGGTATCGTAGCAAACCTTCAAAAGAACCTTGACGACTTCTATATGGAGGCTTCTATTGTAGCTACAGGAAAACCAGTTAAAACTTCTGCTAACGCTATTTTCTTGATATTCTACGACGAAGACAATAACCCATATGCTTACACTAAAGTCGGTACACGTCCTCAAGAGGGTAACTTAGACAGCTACGTAGCTTACGCAGGTGACTGGGGCAAACGTAAAGTGGTTAACAACGGAGCTAAGTGGAAAGACTTCTGGGGCAAGGTTTCAGTACAGCGTAGAGACAATCGCTGGAGACTTATCGTAGGCCAGTACAAAGACCGTAGATACAGCCCTTCACCTGAAGCTTCATTTAACTTTGGTCAAAACATGTTGAAAGATACTAGAGACACAGGTTGGTTCGACTTACCTCCAGAAACTTGGGGTAAGAAGTTTGCAAGGGTTGGAATCTTCTTCGGACAGTATTCACATCGACCTAAGCTAGGCCACTTATCAGTACGTAGACTAATTGTGTGGGAAAACCTTGAGGAGTATGGAGAAACTCCTTTAGAAGGAACTCCAATCATGTTCCATGAAGGTGACACAGTTGTAATTGACTCAAGCAAAGCTCAAACATACCTTAACGGAGAGCTTACTCCATCACTTGTAGACCCTATGACCGACTGGTTCCCAATTACCAAAGGTGATAACTACATTGGAGTAAACAACTTCAAAGGCAAGATTGACATAGTATATAACGAACGATTTAAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e703e8c2c0ed557d73524b7f25e13241bb302328f622c9d8540e010970d6e339
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,6961
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Two Novel Lytic Bacteriophages Infecting Enterococcus spp. Are Promising Candidates for Targeted Antibacterial Therapy Tkachev,P.V., Pchelin,I.M., Azarov,D.V., Gorshkov,A.N., Shamova,O.V., Dmitriev,A.V. and Goncharov,A.E. 2022 35458561 GenBank