GenBank accession
BAO47061.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence RBPdetect
Probability 0.85
Protein sequence
MSKKLKIYNKNNELLKESPYIEGSIGEITIDNLTPDTEYKEGDFKVCWDINGKESKHVNVPGFSTLKNSNEKILIVSYNVESPTTITAQNIEGLENSIDNYFYGSLYYDVVGIVNDNTEGKINGLKDEIRDKRISPLENKIGLFIGDSITEVNGRTQKNYHGFIANRTGLEVRNLGISGTGYQDRKNVAYDITEQPDFICIMLGTNDYGVVGGNNRELGNSKEHKYGTVAGSIYYTYLQLSREFPTTPIVVLTPTPRIESNPYKENTNSKGYTLGELVQVIKDIAKLFSFPVLDLYHDSNIRVWDSNVNNIFFSPGGSPADGLHPNTEGHEWLANLIQPFLELKAVLGSRIEKPVINHEIKDLGNDVFTKVIRPTGVFHKEDQSFIINLNKEDIDLSKNKVLKVLYNGHQINDPNGVLENSPYWYTLPNYKDGNEFNKTSIVSEFKKGLETIDIVDDGRVEYLRDYMTLYFTDISNNGIKGSDINIEDNSTPTEGTTPTPSKPSIPVITPVDNEDGTFTVTLTPIKMSWKQDQSFMVNINERQLNLTGKKVQNIEVNGTQLAKPNETASNYFYWFSVPTSEEGTEYNRTNQISNFVSVLELSDTEYDGRLVYKPTPIKITYK
Physico-chemical properties
protein length: 622 AA
molecular weight: 70174.65470 Da
isoelectric point: 5.04096
aromaticity: 0.09807
hydropathy: -0.54421
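
Two of the properties above can be recomputed directly from the protein sequence. The sketch below assumes the common definitions: aromaticity as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The record does not state which tool produced its values, so the function names and the choice of scale are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy scale (assumption: the scale behind "hydropathy").
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KD_SCALE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # Demo on the first ten residues of BAO47061.1 only;
    # paste the full protein sequence to compare with the listed values.
    prefix = "MSKKLKIYNK"
    print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Running the same functions on the full 622-residue sequence should give values close to those listed, provided the record used these definitions.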

Domains

Domains [InterPro] (BAO47061.1, residues 1–622)
Accession Type Range
SSF52266 STR 135–341
IPR036514 STR 135–347
IPR036514 STR 137–347
IPR051532 Unmapped 143–341
cd00229 ENZ 143–340
IPR013830 ENZ 144–331
Architecture
STR 135-347 | RBD 348-622
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (BAO47061.1, residues 1–622)
Domain Start End Length (AA) Confidence
N-terminal 1 408 408 0.5218
Central domain 409 611 204 0.3759
C-terminal 612 622 10 0.1339
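
The Start and End columns can be used to look up which segment a given residue falls in. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive; the names `SEGMENTS` and `domain_at` are hypothetical helpers, not part of the database.

```python
# Segment boundaries copied from the Start/End columns of the table
# (assumption: 1-based, inclusive residue positions).
SEGMENTS = [
    ("N-terminal", 1, 408),
    ("Central domain", 409, 611),
    ("C-terminal", 612, 622),
]


def domain_at(position: int) -> str:
    """Return the name of the segment containing the given residue position."""
    for name, start, end in SEGMENTS:
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside residues 1-622")


if __name__ == "__main__":
    for pos in (1, 408, 409, 622):
        print(pos, domain_at(pos))
```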
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).


Taxonomy

Role  Name  Taxonomy ID  Lineage
Phage  Staphylococcus phage phiSA12 [NCBI]  1450142  Uroviricota > Caudoviricetes > Herelleviridae > Kayvirus > Kayvirus SA12
Host  Staphylococcus aureus [NCBI]  1280  cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

GenBank protein accession
BAO47061.1 [NCBI]
GenBank nucleotide accession
AB903967 [NCBI]
CDS location
range 5684 -> 7552
strand -
CDS
ATGTCAAAAAAACTTAAAATTTATAATAAAAATAATGAGCTACTAAAAGAAAGTCCATATATTGAAGGTTCTATAGGAGAGATAACTATAGACAATCTAACACCTGATACTGAATATAAAGAAGGTGATTTTAAAGTATGTTGGGATATAAACGGTAAAGAATCAAAACATGTCAATGTTCCGGGTTTTTCTACTTTAAAAAACTCTAATGAGAAAATACTTATAGTAAGCTATAATGTAGAAAGTCCGACAACAATAACTGCTCAAAATATAGAAGGGTTAGAAAATAGTATAGATAACTATTTTTATGGTAGTCTTTATTATGATGTTGTAGGAATTGTAAATGATAATACTGAAGGGAAAATAAATGGTCTTAAAGATGAGATTAGAGATAAACGTATTTCTCCTTTGGAAAATAAAATAGGATTATTTATCGGTGATAGTATTACTGAAGTAAATGGTAGAACCCAAAAGAATTATCATGGTTTCATTGCAAATAGGACTGGTTTAGAAGTCCGTAACCTAGGTATCAGCGGTACAGGATACCAAGATAGAAAAAATGTAGCATATGACATTACAGAGCAACCTGACTTTATTTGTATAATGTTAGGTACAAATGATTATGGGGTTGTTGGAGGTAATAATAGGGAGTTAGGTAATTCTAAGGAGCATAAATATGGTACTGTAGCAGGTTCCATTTATTATACCTATCTACAGCTATCAAGAGAATTTCCTACTACACCAATAGTAGTTTTAACTCCTACACCTAGGATAGAAAGTAATCCGTACAAAGAAAATACAAATAGTAAAGGGTATACACTAGGAGAACTTGTTCAAGTTATCAAAGATATAGCTAAATTATTTTCTTTCCCTGTTTTAGATTTATATCATGATAGCAATATAAGAGTTTGGGATTCTAATGTCAATAATATATTTTTCTCTCCTGGGGGTAGTCCTGCCGATGGGTTACATCCTAATACAGAAGGACATGAATGGCTTGCTAATCTTATACAACCGTTTTTAGAATTAAAAGCAGTATTAGGTTCTAGAATAGAAAAACCTGTAATAAATCATGAAATAAAAGATTTAGGTAATGATGTATTCACAAAGGTAATAAGACCAACAGGTGTATTTCATAAAGAAGACCAAAGTTTCATTATTAATCTAAATAAGGAAGATATAGACCTATCAAAAAATAAAGTATTAAAAGTATTATATAATGGTCATCAAATCAATGACCCTAATGGTGTATTAGAAAACTCCCCTTATTGGTATACATTACCTAACTATAAAGATGGAAATGAATTCAATAAAACTAGTATAGTATCAGAGTTTAAAAAAGGTTTAGAGACTATAGACATTGTTGATGATGGTAGGGTAGAGTATCTAAGAGACTATATGACTTTATATTTTACAGATATATCTAATAATGGAATAAAAGGCTCAGATATAAATATAGAAGATAATTCTACACCAACAGAAGGAACAACACCTACACCATCAAAACCAAGTATACCAGTAATTACACCTGTAGATAATGAAGATGGGACATTTACAGTTACACTAACACCTATTAAGATGTCTTGGAAACAAGACCAAAGTTTCATGGTTAATATTAACGAAAGACAATTAAACTTAACGGGTAAAAAGGTACAGAATATTGAAGTTAATGGAACACAATTAGCAAAACCTAATGAAACAGCAAGCAATTACTTTTACTGGTTCTCAGTTCCTACAAGTGAAGAAGGAACTGAATATAATAGAACAAACCAGATATCCAATTTTGTTAGCGTATTAGAGTTAAGTGATACCGAATATGATGGTAGACTTGTATATAAACCAACCCCTATTAAAATAACATATAAGTAA
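
The CDS coordinates can be checked against the protein length, and a minus-strand CDS is read as the reverse complement of the genomic range. The sketch below assumes 1-based inclusive coordinates on the nucleotide record and a CDS that ends in a single stop codon; the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based, inclusive range."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Residues encoded by the range, assuming one terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1


def reverse_complement(dna: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice the range from the genome; reverse-complement it on the minus strand."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


if __name__ == "__main__":
    print(cds_length(5684, 7552), protein_length(5684, 7552))  # 1869 622
    # Toy genome: positions 3..8 hold "TTACAT", i.e. "ATGTAA" on the minus strand.
    print(extract_cds("GGTTACATCC", 3, 8, "-"))  # ATGTAA
```

The range 5684 -> 7552 spans 1869 nucleotides, which is 623 codons: 622 residues plus the stop codon, matching the listed protein length.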

Genome Context

Tertiary structure

PDB ID 3ba8c9ee3df6f66d1e4749a4367e6cacde307a78850f96de44c154fd398a5112
Source ESMFold
Method ESMFold
Resolution 0.6525
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
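
The confidence bands above can be applied to per-residue pLDDT scores as a simple classifier. This is a minimal sketch: the legend does not say which band the exact boundary values 90, 70, and 50 belong to, so assigning them to the lower band is an assumption, and `plddt_class` is a hypothetical helper name.

```python
def plddt_class(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend uses strict inequalities on both sides.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_class(score))
```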

Literature

Title Genomic analysis and determination of host recognition proteins in staphylococcal Twort-like phages
Authors Takeuchi, I., Osada, K., Asakawa, H., Miyanaga, K. and Tanji, Y.
Date 2016-10
Source GenBank