Genbank accession
AZF91778.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
Protein sequence
MQIWIHDKSMRKVCALNNEIPGMLPYTNSQWHPYLEYSTSTFDFTIPKIVNRKLHDDIKYINDQMFVSFYFDNSYHVFYVSKLVENDFSFQVTCNNTNLELAMEVARPLADSGGPKTIEWYLQNLELLGFAGLEIGVNEISDRTRTLTFESQSGTKLEQLHSLMNQFDAEFIFRTELNRDGTMKRFIIDIYQEADENHHGIGKARGDVILYYQSGLKGVQVTSDKTQLFNAGNFIGQDGVNLNDVEFEEKNELGQVEFYSRKGTSFVFAPLSRERYPSTMNPDSADNWTRRDFQTEYKDVESLKAYALRTIKQYAYPLLTYTVDVQSSFLDNYKDINLGDTVKIIDNNFRGGLALEARVSEMIISFDNPTNNSVVFTNFRKLDNKPSSELQQRIDEIVSKSLPYHVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYVGMTYDMVASEITQPTTLTIAAWVDNKEVASEEITFLNVSDGKPGADGRTPYVHFAYADSADGQKGFSLTQTGSKRYLGVLTNFIKEDSTNPSDYTWNDTAGSVSVGGENLIINSAFPKNLNNWGFWEPVLPNENLHIATHVFYYNAARNLFRLDDNSNSGVPAASRRFPVKRNTDYSFNIQTFATGNIKGLTIYFLGRKANETDKAFTKVVNLKTHTGSPSVTQAVKWHLTFNSGDCDEGYIRIDNSGTTDGKTSMLFFAELDCYEGTTDRAWQASSKDLEEEIDTKADDVLTQAQLNRLNETNSIIKAELNAKASLDTLNQWVEAYQNFVNANNANRAQAEKDLADASARVTKLENDLNDMSERWNFIDSYMAASNEGLVIGKKDESSSIMFNPNGRISMFSAGNEVMYISKGVIHIENGIFSKTIQIGRYREEQDLLNPDRNVIRYVGGA
Physico‐chemical
properties
protein length:903 AA
molecular weight: 102082,64220 Da
isoelectric point:5,12065
aromaticity:0,10963
hydropathy:-0,47752

Domains

Domains [InterPro]
DC_0002
STR
1–903
IPR010572
ENZ
142–381
AZF91778.1
1 903
Architecture
STR
STR 1-903
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage CHPC1067
[NCBI]
2365022 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus thermophilus
[NCBI]
1308 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AZF91778.1 [NCBI]
Genbank nucleotide accession
MH937500.1 [NCBI]
CDS location
range 14794 -> 17505
strand +
CDS
ATGCAAATCTGGATTCATGATAAAAGTATGCGTAAAGTGTGTGCTTTGAATAATGAAATTCCCGGAATGTTGCCATATACGAACAGTCAATGGCATCCATACCTTGAATACTCAACAAGTACGTTTGATTTTACAATTCCTAAAATTGTGAACAGGAAACTGCACGATGATATCAAATATATCAATGACCAGATGTTTGTATCATTCTATTTCGATAATTCCTATCATGTTTTTTATGTATCAAAACTCGTTGAGAATGATTTTAGTTTTCAAGTCACTTGTAATAACACCAACCTTGAATTGGCAATGGAAGTTGCACGACCACTTGCAGACAGTGGCGGTCCCAAAACTATTGAATGGTATCTTCAAAATCTTGAGTTGCTTGGTTTTGCAGGTCTGGAAATAGGTGTCAATGAAATTTCTGATAGAACAAGAACGCTTACTTTTGAATCTCAAAGTGGAACTAAACTAGAGCAACTTCATAGCTTGATGAATCAATTTGATGCAGAATTTATTTTCCGTACCGAATTAAACCGAGACGGAACTATGAAACGTTTCATCATCGACATCTACCAAGAAGCAGATGAAAACCATCACGGTATAGGTAAGGCAAGAGGAGATGTTATTCTCTACTACCAAAGCGGATTGAAAGGCGTTCAAGTTACTAGTGATAAAACGCAACTTTTCAACGCTGGTAATTTCATTGGACAAGATGGCGTTAACCTAAACGATGTCGAATTTGAGGAAAAGAACGAGCTAGGACAAGTAGAGTTCTATTCTCGAAAGGGCACTAGCTTCGTTTTCGCCCCACTGTCAAGGGAACGCTACCCATCTACCATGAATCCAGACAGCGCTGATAACTGGACACGTAGGGATTTTCAGACAGAATACAAGGACGTTGAATCCTTAAAAGCTTACGCCTTGCGTACTATCAAGCAGTATGCTTATCCACTATTGACTTACACAGTAGATGTTCAGTCTAGCTTTCTGGATAACTATAAAGACATCAATCTAGGTGACACTGTTAAAATCATCGATAATAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCCACAAACAACTCGGTTGTTTTTACTAATTTCAGAAAATTGGATAATAAACCGTCTAGCGAATTACAACAACGTATCGATGAGATTGTTTCTAAGTCATTGCCATATCATGTTGAGATAAGGACCACAAACGGTACAGTATTTAAAAACGGTATTGGTCGTTCTACTGTTAAACCCATCTTGAAACAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGAACTATTAAATACGTAGGTATGACTTACGATATGGTAGCGTCAGAGATAACTCAACCAACAACGTTGACGATTGCTGCGTGGGTAGATAATAAAGAAGTAGCTTCGGAAGAGATTACTTTCTTAAATGTATCAGATGGTAAACCTGGAGCAGACGGACGTACGCCTTACGTCCATTTTGCTTATGCCGATAGTGCCGATGGTCAAAAGGGTTTCAGTTTGACACAGACTGGAAGTAAACGCTATTTAGGTGTGCTAACCAACTTCATAAAAGAAGACAGTACTAATCCATCGGACTATACGTGGAATGACACGGCTGGCAGTGTTTCGGTTGGCGGTGAGAATCTAATCATTAACTCGGCTTTTCCAAAGAATCTTAACAATTGGGGATTTTGGGAACCGGTATTGCCTAATGAGAATCTTCATATAGCAACACATGTATTTTATTATAATGCTGCAAGAAACCTGTTTAGGCTAGATGATAATAGCAATAGTGGGGTTCCTGCTGCATCAAGACGTTTTCCAGTCAAACGCAACACAGACTACTCGTTCAATATTCAGACATTCGCTACTGGTAATATCAAGGGCTTAACTATCTATTTTTTGGGTCGGAAGGCAAATGAAACTGACAAGGCATTTACTAAAGTCGTGAATCTCAAAACACATACAGGTTCACCGTCAGTAACACAAGCTGTTAAATGGCACTTAACGTTTAATTCTGGTGATTGTGATGAAGGTTACATCCGTATAGACAACAGTGGAACGACTGACGGTAAAACCTCTATGCTATTCTTCGCTGAGTTAGACTGCTATGAGGGAACCACTGACCGAGCGTGGCAAGCGTCGTCGAAAGATTTGGAAGAGGAAATAGACACCAAAGCCGATGATGTCCTAACACAAGCACAACTCAACAGACTGAACGAAACGAACTCTATTATTAAAGCTGAATTAAACGCTAAAGCATCACTTGATACACTCAATCAGTGGGTGGAAGCCTATCAAAATTTTGTTAACGCAAACAATGCCAATCGTGCACAAGCTGAAAAAGATTTAGCTGATGCAAGTGCTCGTGTAACTAAACTAGAAAACGACTTAAATGATATGTCAGAACGTTGGAATTTTATCGATAGCTACATGGCAGCATCAAATGAAGGTCTTGTTATTGGTAAAAAAGATGAATCAAGCTCTATCATGTTCAATCCAAACGGGCGTATCTCAATGTTCTCAGCTGGAAACGAGGTAATGTACATTTCAAAAGGTGTCATCCATATCGAAAACGGTATTTTCTCTAAAACTATCCAAATCGGACGATATCGAGAGGAACAAGATTTATTGAATCCAGACCGTAATGTCATTAGATACGTAGGAGGTGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b55665a92ce3d024542e0656f3517e075779162fae8f991e0093a8eec0a5bdf9
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7712
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
A comparative genomics approach for identifying host-range determinants of bacteriophages infecting Streptococcus thermophilus Szymczak,P., Rau,M.H., Monteiro,J.M., de Pinho,M.G., Filipe,S.R., Vogensen,F.K., Zeidan,A. and Janzen,T. 2019-05-29 GenBank