UniProt accession
A0A3G8F9T8 [UniProt]
Protein name
Tail spike domain-containing protein
RBP type
TSP
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,98
Protein sequence
MLLTIHDANLQKVGFIDNEKQETLNFYDDTWNRNLETASSVFEFTVSKKELLSDTGNKHLYNQLNERSFVSFKYKGKTYLFNIMKTEENERWLRCYCENLNLELINEYANPYKADRPLSFAEYLDVFEIPQFAMVTLGINEISDQKRTLEWEGQETKLARLLSLANKFDAEVEFVTRLNDDSSINQLILNVYHKADDSHTGVGRIRGDIRLTFEKNIKSMTRKVDKTEIYTLVVPYGKSKETHEGEQEVRVYIDSLPPWEEKNDEGIVIFKQEGVNLYAPHAANLYPSTFGAATQDNKWIRKDLEVDSDDPKVIRAAGIANLRKHAYPAITYEVDGFLDVEVGDTVTIHDKGFTPSIDVKARVIEQKISFSNPSNNKTVFGNFKELEDRTSSDLRSVFERMVENSRPYTIMVSTDNGVLFKNNTGRSTISPTLKRGNQVVDATYRFVIDGSIVSSGLTYTVKASDITKPTVITISAWVDNKEVASEEVTFVNVSDGKQGPKGPQGPQGPKGDRGNDGLPGKDGVGLKTTTITYGISDSDTVMPNNWSSQVPTLVKGKYLWTKTTWTYTDNSSETGYQKTYIAKDGNDGNDGLPGKDGVGLVDTTIEYLKHTDGQIAPNPKYYSSFNWKGLTLSQHLEHDWVNDLTLIKSGKPVKYTDLKVGELVIDRTGEIFPIKEIFGTEGDSSNPGYINLKPSIGKWSKDIPTVNPGEYLWTRTTWFYSDGTSEQGFSVAKMGEQGPKGDRGEQGPKGDQGIPGPKGADGRTQYTHIAYADTVSGSGFSQTDVNKAYIGMYQDFNAEDSKNPQDYRWSKWKGSDGRDGIPGKAGADGRTPYVHFAYADSADGRTGFSLTQTGNKRFLGVLTNFIKEDSTNPEDYTWNDTAGSVSVGGENLIRDSAFPKNLDNWGFWETGLPNENLHIATHEFYFNGARNLFRLDNNGKVGVPAASRRFSVKRNTDYSFNIQTFATGNIKGLTIYFLGRKANETDKAFTEVVNLKTHTGSPSVTQTVKWHLTFNSGDCDEGFIRIDNSGTTDGKTSMLFFAELDCYEGTTDRAWQASSKDLEDQIDTKADDVLTQAQLNKLNEMNSIIKAELDAKASLDTVNQWIKAYQDFVNANDAEREQAQKALADASARVVKLENNLNDMSERWNFIDSYMSASENGLVVGKTDNSSSILFSPSGRISMFSSGHEVMYISQGVIHIENGIFSKTIQIGRYREEQDVINPDRNVIRYVGGA
Physico‐chemical
properties
protein length:1234 AA
molecular weight: 137914,42710 Da
isoelectric point:5,22750
aromaticity:0,10130
hydropathy:-0,61669

Domains

View on InterPro
A0A3G8F9T8
1 1234 aa
ENZ 145–385 · STR 495–542 · STR 735–787 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

A0A3G8F9T8
1 1234 aa
Domain Start End Length (AA) Confidence
N-terminal 1 949 949 0,9166
Central domain 950 1195 247 0,1368
C-terminal 1196 1234 38 0,9072
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Streptococcus phage CHPC1042 [NCBI] · taxon 2365016
Host
Streptococcus thermophilus [NCBI] · taxon 1308

Coding sequence (CDS)

Genbank protein accession
AZF91528.1 [NCBI]
Genbank nucleotide accession
MH937494 [NCBI]
CDS location
range 15734 -> 19438
strand +
CDS
ATGTTGCTAACAATTCACGACGCTAATTTACAAAAAGTTGGCTTCATCGATAACGAAAAGCAAGAAACACTAAATTTCTACGATGACACTTGGAACCGAAACCTTGAGACTGCCTCAAGCGTATTCGAATTTACCGTTTCAAAAAAGGAACTGCTTAGCGATACAGGAAATAAACACCTTTATAACCAACTAAACGAGCGCTCTTTTGTTTCCTTCAAATATAAAGGCAAGACATATCTTTTTAACATCATGAAGACGGAAGAAAATGAGCGATGGTTACGATGTTATTGCGAGAACTTAAATCTCGAGCTGATAAACGAGTACGCGAACCCTTACAAAGCTGATAGACCTTTGTCATTTGCGGAGTATCTTGATGTTTTTGAAATTCCTCAGTTTGCAATGGTAACGTTGGGCATTAATGAGATCTCAGATCAAAAAAGAACGCTCGAATGGGAGGGGCAAGAGACAAAATTAGCAAGGCTCTTGAGTTTGGCCAATAAATTCGATGCTGAGGTTGAATTTGTAACAAGGCTAAATGACGACAGCTCTATCAACCAGCTCATTTTGAACGTTTACCATAAAGCGGACGACTCGCACACAGGTGTGGGGCGGATTCGGGGCGACATTCGTCTGACATTTGAAAAAAATATCAAGTCAATGACGAGGAAAGTCGACAAGACTGAGATCTATACGCTGGTAGTGCCTTATGGCAAATCGAAAGAGACCCATGAAGGCGAGCAAGAAGTACGTGTCTACATTGACAGCCTTCCGCCTTGGGAGGAAAAGAACGATGAAGGTATTGTTATCTTCAAACAAGAAGGTGTCAACCTCTATGCACCTCATGCAGCCAACCTATACCCGTCTACTTTCGGCGCTGCCACTCAAGATAATAAGTGGATTCGAAAAGACTTAGAGGTTGATAGCGATGATCCAAAAGTCATCCGTGCTGCAGGCATTGCAAACTTACGCAAACACGCATATCCAGCTATCACTTACGAGGTTGACGGGTTCCTCGATGTTGAGGTCGGGGATACCGTTACCATCCATGATAAGGGGTTCACTCCTTCTATCGACGTAAAGGCACGGGTTATTGAGCAGAAGATTAGTTTCAGCAACCCATCAAATAACAAAACCGTTTTTGGAAACTTCAAAGAGCTTGAAGATAGAACATCCTCAGACTTGAGAAGTGTCTTTGAGCGAATGGTTGAGAACAGTAGGCCGTACACTATCATGGTTTCAACCGACAACGGAGTTCTTTTTAAGAACAACACAGGGCGGTCAACCATAAGTCCAACGTTGAAAAGGGGTAATCAGGTTGTTGATGCAACCTATCGATTTGTGATCGATGGCTCTATTGTTAGCTCTGGTCTGACATATACCGTCAAAGCAAGCGATATCACAAAACCAACTGTGATAACAATTTCCGCTTGGGTAGATAACAAAGAAGTAGCTTCAGAAGAAGTTACTTTTGTAAATGTATCAGATGGGAAACAAGGGCCTAAGGGCCCACAAGGACCACAAGGTCCAAAAGGCGATAGAGGTAACGACGGACTGCCTGGTAAAGACGGGGTAGGATTAAAAACCACAACTATCACTTATGGGATTAGCGATAGTGACACTGTGATGCCTAATAACTGGTCTAGTCAGGTGCCTACTTTAGTTAAAGGTAAATACCTTTGGACTAAAACCACTTGGACATACACTGATAATTCATCTGAAACAGGCTATCAAAAAACTTACATTGCCAAAGATGGTAACGATGGAAATGATGGCTTACCAGGTAAGGATGGAGTCGGTCTAGTTGATACTACAATCGAGTATCTAAAACACACAGATGGTCAGATAGCACCTAATCCAAAATACTACTCAAGTTTTAATTGGAAAGGTTTAACATTATCTCAACATTTAGAGCATGATTGGGTTAATGACCTGACACTTATCAAGAGTGGTAAACCTGTCAAATACACTGACTTAAAAGTCGGTGAGCTTGTAATTGATAGAACCGGTGAAATTTTCCCTATCAAAGAAATCTTTGGGACTGAAGGAGATAGCAGTAACCCAGGATACATTAATCTAAAACCTTCAATTGGAAAATGGAGTAAAGATATTCCAACAGTTAATCCCGGGGAATATCTCTGGACAAGGACTACGTGGTTCTATTCAGACGGTACGAGTGAGCAAGGTTTTTCGGTCGCTAAAATGGGCGAGCAAGGGCCAAAGGGTGACCGTGGGGAACAAGGGCCAAAGGGAGATCAAGGTATTCCAGGTCCTAAAGGTGCAGATGGAAGAACACAGTATACACATATAGCTTACGCTGATACTGTTTCAGGTAGTGGCTTTAGTCAAACAGATGTCAATAAAGCCTATATTGGGATGTACCAAGACTTCAATGCCGAAGATAGCAAAAACCCACAAGATTATCGATGGTCTAAGTGGAAAGGTAGCGATGGTCGTGATGGAATTCCTGGTAAGGCTGGGGCAGACGGACGTACGCCTTACGTACATTTCGCTTATGCAGACAGTGCCGATGGTAGAACTGGTTTCAGTTTGACCCAAACTGGTAATAAACGCTTTTTAGGTGTGCTAACCAACTTCATAAAAGAAGACAGCACTAATCCAGAAGATTATACATGGAATGACACGGCTGGCAGTGTATCAGTTGGTGGTGAGAATCTCATTCGCGACTCAGCGTTTCCAAAAAATCTTGATAATTGGGGATTTTGGGAAACAGGATTGCCTAATGAAAATCTTCATATAGCAACACATGAATTTTATTTCAATGGTGCAAGAAATCTTTTTAGACTAGATAATAACGGCAAGGTGGGAGTTCCTGCTGCATCAAGACGTTTCTCAGTAAAACGAAATACAGATTATTCGTTTAATATTCAGACTTTCGCTACTGGTAATATCAAGGGCTTAACTATCTATTTTTTGGGTCGGAAGGCGAATGAAACTGACAAGGCATTTACTGAAGTGGTCAACTTAAAAACACATACAGGTTCACCATCGGTCACACAAACGGTTAAATGGCACTTGACTTTCAACTCTGGAGATTGCGACGAAGGCTTCATTCGTATAGACAATAGTGGTACTACTGACGGTAAAACATCTATGCTATTCTTCGCAGAATTGGACTGCTATGAAGGTACGACTGATAGAGCTTGGCAAGCATCCTCTAAAGATTTGGAAGATCAGATAGATACCAAAGCCGATGATGTCCTTACGCAAGCACAACTCAACAAGCTGAATGAAATGAATTCTATCATTAAAGCTGAATTAGACGCTAAAGCATCCCTTGACACTGTTAATCAATGGATTAAGGCTTATCAAGATTTTGTTAATGCAAACGACGCAGAACGAGAACAAGCTCAAAAGGCTTTGGCCGATGCTAGTGCACGAGTAGTGAAGCTAGAAAATAACTTAAACGATATGTCAGAGCGTTGGAATTTCATTGATAGCTACATGTCAGCATCGGAAAATGGGCTTGTTGTTGGTAAAACAGATAATTCTAGCTCTATACTTTTCAGTCCTAGTGGGCGTATCTCAATGTTCTCATCTGGGCACGAGGTAATGTATATCTCGCAAGGTGTCATCCATATCGAAAACGGGATTTTCTCGAAAACTATCCAAATCGGAAGGTATCGTGAAGAGCAAGACGTTATTAACCCTGACCGTAATGTCATTAGATACGTAGGAGGTGCATAA

Genome Context

Tertiary structure

A0A3G8F9T8
ESMFold structure
Source ESMFold
pLDDT 72.4
Oligomeric state monomer