Genbank accession
ARU14483.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect2
Probability 1,00
Protein sequence
MQIWIHDKSMRKVCALNNNVPDMLPYSSSQWHTYLEYSTSTFDFTIPKIVNGKLHDDIKYINDQMFVSFYFDNSYHVFYVSQLVENDFSFQVTCNNTNLELAAEMSRPLASVDGPKTLEWYLRNLELLGFAGLEIGVNEISDRTRTLTFESQSGTKLEQLHSLMNQFDAEFIFRTELNRDGTMKRFIIDIYQEADKNHHGIGKVRGDVILYYQSGLKGVQVTSDKTQLFNAGVFTGANGVNLDSVEFEEKNELGQVEFYSRKGTSFVFAPLSRERYPSTMNPDSADNWTRKDFQTEYKDVESLKAYALRTIKQYAYPLMTYTVSVQSSFIENYKDINLGDTVKIIDNNFRGGLALEARVSEMIISFDNPANNSVVFTNFKKLDNKPSDALQQRIDEIVSKSLPYHVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYVGMTYDMVASEINQPTTLTISAWVDNKEVASEEVTFVNVSDGKQGPKGDDGKTTYFHTAWSYSADGTDGFTTVYPNLNLLDGTRDFSGNWENGGEWTSDGTYKGLVVKKRVWKWAGIYKTFTAPKDGKYTFSAYVKSSGNAANIIRFVALNGVDLYSSIRYFGNNFDWLRDSFTITLKAKDTIVAKYEIAGSGTDSILWTAGHKWEEGSVATPWMPSASEVKTADYPSYIGHYTDFTKADSTNPSDYTWSLIRGNDGKDGANGGENLIVNSAFPEDIDGWGFWDESTPNNNLHIATHGFYYNGTKPLFRLDNNTNGVVPASTKRFPVKRNTDYSLNIQIFATGNLKSVDIYFLGRKANETDKESTKVIHLKTHTGSPSTTQTVKWHLTFNSGDCDEGFIRINNNGTTDGKTSTLFFAELDCYEGTGDRAWQASSKDFEEEIDTKADDVLTQAQLNRLNETNSIIKAELDAKASLDTLNQWVEAYQNFVNANNANRAQAEKDLADASARVTKLENDLNDMSERWNFIDSYMAASNEGLVVGKKDKSSSIMFNPNGRISMFSAGNEVMYISKGVINIENGIFSKTIQIGRYREEQDLLNPDRNVIRYVGGA
Physico‐chemical
properties
protein length:1059 AA
molecular weight: 119194,18920 Da
isoelectric point:5,22176
aromaticity:0,11709
hydropathy:-0,50085

Domains

View on InterPro
ARU14483.1
1 1059 aa
ENZ 142–381 · RBD 569–635 · STR 714–867 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Taxonomy

Phage
Streptococcus phage P8922 [NCBI] · taxon 1971442
Host
Streptococcus thermophilus [NCBI] · taxon 1308

Coding sequence (CDS)

Genbank protein accession
ARU14483.1 [NCBI]
Genbank nucleotide accession
KY705283.1 [NCBI]
CDS location
range 16112 -> 19291
strand +
CDS
ATGCAAATTTGGATTCATGATAAAAGCATGCGCAAGGTGTGTGCTTTAAATAACAACGTTCCTGACATGCTTCCATACTCGAGCAGTCAGTGGCACACCTACCTTGAATACTCAACCAGTACATTTGACTTTACGATTCCTAAAATTGTAAATGGAAAACTTCACGATGATATTAAATACATCAATGATCAGATGTTTGTATCATTCTATTTTGACAATTCCTATCACGTTTTCTATGTTTCTCAACTAGTTGAAAATGATTTTAGTTTTCAAGTCACTTGTAATAATACTAACTTAGAATTAGCAGCAGAAATGTCGCGTCCTTTAGCCAGTGTTGATGGTCCTAAAACCCTTGAGTGGTATCTTCGGAATCTTGAATTGCTTGGTTTTGCAGGACTTGAAATAGGTGTCAATGAAATTTCTGATAGAACAAGAACGCTTACTTTTGAATCGCAAAGTGGCACAAAGCTAGAACAACTTCATAGCTTGATGAATCAATTTGATGCTGAATTTATTTTTCGTACCGAATTAAACCGAGACGGAACTATGAAACGTTTCATCATCGACATCTACCAAGAAGCAGATAAAAACCATCACGGAATTGGAAAGGTTCGAGGAGATGTCATTCTCTACTATCAAAGCGGTCTGAAAGGCGTTCAAGTAACTAGTGATAAGACACAACTGTTTAATGCTGGTGTTTTCACTGGTGCAAACGGGGTTAATCTTGATAGCGTTGAGTTTGAAGAAAAAAATGAGTTAGGACAAGTAGAGTTCTATTCTCGAAAGGGCACTAGCTTCGTTTTCGCCCCACTATCAAGGGAACGCTACCCATCTACCATGAATCCAGACAGCGCTGATAACTGGACACGTAAGGATTTTCAAACAGAATACAAGGACGTTGAATCCTTAAAAGCTTACGCCTTGCGTACTATCAAGCAGTATGCTTATCCACTAATGACATATACCGTAAGTGTTCAATCTAGTTTCATTGAAAACTACAAGGATATTAATCTAGGTGACACTGTTAAAATCATCGATAATAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCTGCGAATAATTCAGTAGTTTTCACCAACTTTAAAAAGTTGGATAATAAACCATCGGATGCCTTGCAACAACGTATCGATGAGATTGTTTCTAAGTCACTGCCATATCATGTTGAGATAAGGACCACGAATGGAACAGTATTTAAGAACGGCATTGGTCGTTCTACTGTTAAACCAATTTTGAAACAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGTACTATTAAATACGTAGGTATGACCTATGACATGGTAGCGTCAGAGATTAACCAACCAACCACGCTTACTATCTCAGCGTGGGTAGATAACAAAGAAGTAGCTTCAGAAGAAGTTACTTTTGTAAATGTATCAGATGGTAAACAAGGACCTAAGGGCGATGATGGGAAGACTACATATTTTCACACAGCATGGTCTTACAGCGCAGACGGCACTGATGGTTTCACGACTGTTTATCCTAATTTGAATTTGTTGGATGGTACTAGAGATTTTAGCGGTAATTGGGAAAATGGTGGAGAATGGACGAGCGACGGAACCTACAAAGGCTTAGTTGTTAAAAAAAGAGTGTGGAAATGGGCAGGAATTTATAAAACATTCACAGCACCTAAAGACGGAAAATACACTTTCTCAGCTTATGTTAAAAGTTCAGGAAATGCAGCAAATATAATTAGATTTGTGGCCCTTAACGGTGTGGATTTATACTCTTCAATAAGGTACTTTGGTAATAACTTTGATTGGCTTAGAGACAGTTTTACTATAACTCTGAAAGCCAAGGATACCATTGTGGCCAAATATGAAATAGCTGGTTCTGGAACAGATTCAATTCTATGGACGGCTGGGCATAAGTGGGAAGAGGGTTCAGTCGCTACCCCTTGGATGCCCTCGGCTAGCGAAGTCAAAACTGCTGATTATCCAAGTTATATCGGTCATTACACAGACTTTACGAAAGCTGACAGTACTAATCCATCCGACTACACTTGGAGTCTGATACGAGGAAACGACGGGAAAGATGGAGCAAATGGTGGAGAGAATCTAATTGTTAATTCAGCATTCCCAGAAGATATTGACGGATGGGGTTTTTGGGACGAAAGTACACCTAATAACAATCTTCATATAGCTACACATGGATTTTACTATAACGGAACAAAACCCCTTTTTAGATTAGACAATAACACCAATGGTGTGGTTCCTGCATCAACAAAACGTTTTCCAGTCAAACGCAACACTGATTATTCTCTAAATATTCAGATATTTGCAACCGGAAACCTCAAGAGCGTTGATATCTATTTTCTTGGCAGGAAGGCGAATGAAACTGACAAGGAATCGACTAAAGTGATCCATTTAAAAACACATACAGGTTCACCATCAACCACACAGACGGTTAAATGGCATCTAACATTTAACTCTGGAGATTGCGACGAAGGGTTCATTCGTATTAATAACAATGGCACTACTGACGGTAAAACTTCTACGCTATTTTTTGCAGAACTAGACTGCTATGAGGGAACCGGTGACCGAGCGTGGCAAGCGTCGTCGAAAGATTTTGAAGAGGAAATAGACACCAAGGCAGATGATGTCCTAACACAAGCACAACTCAACAGACTGAACGAAACGAACTCTATTATTAAAGCTGAATTAGACGCTAAAGCGTCGCTTGATACACTCAATCAGTGGGTGGAAGCCTATCAAAATTTTGTTAACGCAAACAATGCCAATCGTGCACAAGCTGAAAAAGATTTAGCTGATGCAAGTGCTCGTGTAACTAAACTAGAAAACGACTTAAATGATATGTCAGAACGTTGGAATTTTATCGATAGCTACATGGCAGCATCAAATGAAGGTCTTGTTGTTGGTAAAAAAGATAAATCAAGCTCTATCATGTTCAATCCAAACGGGCGTATCTCAATGTTCTCAGCTGGGAACGAGGTAATGTACATTTCAAAAGGTGTCATCAATATCGAAAACGGTATTTTCTCTAAAACTATCCAAATCGGACGATATCGAGAGGAACAAGATTTATTGAATCCAGACCGTAATGTCATTAGATACGTAGGAGGTGCATAA

Genome Context

Tertiary structure

ARU14483.1
ESMFold structure
Source ESMFold
pLDDT 76.2
Oligomeric state monomer