Genbank accession
AAZ82459.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,52
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MLLTIHDANLQKVAFIDNEKQGTLNYYDDTWTRSLATGSSTFEFTVFKKAVKSDLPLAKAYHHLNEHAFVSFKYKGKSFVFNIIIVEENEQTIKCYCENLNLELINELANPYKSNKAMTFKEYCEAMDLLNYTHLSIGINEISDYKRTLEWEGQETKLARLLSLAKRFDAEIEFDTQLNADSTIKKFSVNVYHENDDNHQGVGRVRNDVIVKYGKNIHSITRKVDKTGIFNTIRPTGKMPTVEEELSGDKGSKSETVKNADGSTTKTTISTASDGTKSKTIVHTKVTKLADKTRITTTTTTRSDGSIEQTVTTSKKGGASTSETKVLKKPNPKEKTNTTEDVLTIEGLDEWEVKNEKGIVEFYQRGQALYAPISMQLYPSTFTHSTGELDQWTRKDFHFETDEPNELRRLGYLKLKKYCYPAITYEVDGFVDADIGDTVKVHDDGFAPLLMIQARVTDQKISFTNPVRNKTIFDNFKALENKLSADIQSAFERLFEAAKPYTIKLSTDNGVIFKNQIGQSLVTPTLYKGGKPVVVGVTWRWALDGEVTTGMTYLVRGSNVTDTVTLTVAAYIGNKEVAVDEISLVNVADGKLGTPGTPGRDGRTPYVHTAWANNATGTDGFSLDSSINKLYIGIYTDFEPNDSTDPKKYKWAKVKGEKGEKGDKGEPGQRGLDGLQGARGEQGLPGRNGADGRTQYTHIAYSNSADGTKDFSVSASDRAYIGMYVDFNRADSNTPSDYNWTLVKGSDGANGVAGKAGTDGRTPYLHIAYATSNNGSQGFSTTDSTNKTYIGTYTDYTQADSTDYRVYKWTLIKGADGTGISNVTNYYLATTVSTGITRTSAGWTTTPQPITSDKRYLWNYRVELYTNGTSKTTEPTVIGVHGEKGERGLQGLQGLQGARGEQGIPGPRGADGRTQYTHMAYADNATGGGFSQTNTDKAFVGVYIDFNPTDSRNPADYRWTRWKGRDGANGVAGRAGADGRTPYLHIAYATSNNGSQGFSTTDSTNKTYIGTYTDYTQADSTDPKKYKWAKVKGDKGEKGDKGERGLQGLQGLQGARGEQGIPGPRGADGRTQYTHMAYADNATGGGFSQTNTDKAFVGVYIDFNPTDSRNPADYRWTRWKGRDGANGVAGRAGADGRTPYVHFAYSENADGSGLTMTDNGQRYFGHYSDYEKPDSSDKTKYKWADRWAKVDGGYVNIYALSKNRSIGKSYHVSEFNMDVLSGNITLKAIGSDPYIGAVSSHPGIFIKQQGMKIPVIQGRSICITITNPLFRKNYISFFNSLGKTVKTYKHYNTNKFLISSADLVGVEFIALRYGAGSSNIQIGTVLETKVKVEYGTVHSDWSPAPEDIESNINSKADQGLTQEQLNALNEKSQILEAEMKAKASMEAFSELEKAYNAFVKSNADSRKKSESDLVEAGRRIDLLTTQFGGLAELKTFIDTYMKSTNEGLIIGKNDASSTIKVSSDRISMFSAGKEVMYISQGVINIDNGIFTASIQIGRFRTEQYHLNKDVNVIRYIGG
Physico‐chemical
properties
protein length:1518 AA
molecular weight: 167235,88400 Da
isoelectric point:8,13825
aromaticity:0,09881
hydropathy:-0,62246

Domains

Domains [InterPro]
DC_0002
STR
1–674
IPR050149
Unmapped
590–1136
AAZ82459.1
1 1518
Architecture
STR
STR 1-1517 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage MM1 1998
[NCBI]
341698 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus pneumoniae
[NCBI]
1313 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AAZ82459.1 [NCBI]
Genbank nucleotide accession
DQ113772.1 [NCBI]
CDS location
range 29999 -> 34555
strand +
CDS
ATGCTTTTGACAATCCATGATGCAAATTTACAAAAGGTAGCATTTATTGATAACGAAAAACAAGGTACGTTAAATTATTACGATGATACTTGGACAAGAAGTCTTGCAACAGGTTCGTCAACGTTTGAGTTTACGGTATTTAAAAAGGCTGTAAAGTCTGATTTACCTCTTGCTAAAGCCTATCATCATTTGAATGAGCATGCATTTGTCTCATTTAAGTACAAGGGTAAAAGCTTTGTGTTTAATATCATTATTGTTGAAGAAAATGAGCAGACAATCAAATGTTATTGTGAAAATCTCAATCTTGAGTTAATCAATGAGCTTGCGAACCCTTATAAATCTAACAAAGCGATGACTTTCAAAGAGTATTGTGAGGCGATGGATCTTTTAAATTATACTCACCTTTCTATTGGTATCAATGAAATATCAGATTATAAGCGTACTCTGGAATGGGAGGGGCAAGAAACCAAACTAGCCCGTCTATTAAGCCTAGCCAAACGATTTGATGCTGAGATTGAATTTGATACACAGTTAAATGCTGATAGCACTATCAAAAAGTTTAGTGTTAATGTTTATCATGAAAACGATGACAACCATCAAGGGGTGGGACGTGTAAGAAATGATGTCATTGTTAAATACGGAAAAAACATCCACTCTATTACAAGAAAAGTGGATAAGACTGGTATTTTCAATACAATCAGACCGACTGGTAAAATGCCAACGGTTGAAGAAGAACTGAGCGGAGATAAGGGCTCCAAAAGCGAAACTGTAAAAAATGCAGATGGTTCAACGACGAAAACCACAATCTCTACAGCCTCAGATGGGACTAAGAGCAAAACTATTGTCCACACTAAAGTTACAAAGTTAGCGGACAAGACACGGATCACAACGACCACAACGACTCGTTCTGATGGTTCCATAGAACAAACTGTTACAACCAGCAAAAAAGGCGGAGCATCAACGTCTGAAACAAAAGTCTTGAAAAAACCAAATCCAAAAGAAAAAACAAATACAACTGAGGATGTTTTGACGATTGAGGGATTGGATGAATGGGAAGTAAAGAACGAGAAAGGGATAGTTGAATTTTATCAAAGAGGGCAAGCACTGTATGCGCCTATTTCAATGCAACTATATCCCTCAACCTTTACTCATTCAACAGGGGAGCTTGACCAGTGGACAAGAAAAGATTTTCATTTTGAAACAGATGAGCCAAACGAGTTAAGACGTTTAGGTTATCTCAAATTGAAAAAGTATTGTTATCCAGCTATCACTTATGAAGTTGATGGCTTTGTCGATGCTGATATTGGAGATACTGTTAAAGTCCATGATGACGGTTTTGCCCCTCTATTGATGATTCAAGCACGGGTTACTGATCAAAAAATCAGTTTCACAAATCCAGTGAGAAATAAGACAATATTTGACAATTTCAAGGCACTTGAAAACAAACTATCAGCTGATATCCAGTCAGCCTTTGAGAGATTGTTTGAAGCTGCTAAACCATATACTATCAAATTATCAACGGACAATGGTGTTATCTTTAAAAATCAGATCGGCCAGAGTCTAGTAACCCCAACCTTATACAAGGGAGGAAAACCAGTCGTTGTTGGTGTTACTTGGCGATGGGCACTTGATGGAGAAGTAACAACAGGGATGACTTACTTAGTTAGAGGCTCAAATGTAACTGATACAGTTACTCTGACAGTTGCAGCTTACATTGGAAATAAAGAGGTTGCTGTTGATGAGATATCGCTTGTTAATGTTGCTGATGGAAAACTTGGTACACCTGGAACTCCAGGGCGAGATGGCCGTACTCCTTATGTCCATACAGCATGGGCTAATAATGCAACAGGAACAGATGGATTTAGTCTTGATAGCTCAATCAATAAACTCTATATTGGTATTTATACAGACTTTGAACCAAACGATAGCACCGACCCTAAAAAATACAAGTGGGCTAAAGTAAAAGGAGAAAAGGGAGAAAAAGGAGATAAAGGAGAACCGGGACAACGTGGTTTAGATGGCTTGCAAGGTGCAAGAGGTGAACAAGGATTACCTGGTCGCAATGGTGCAGATGGCCGTACTCAATACACTCACATAGCTTACAGCAATAGCGCTGATGGAACTAAGGATTTTTCTGTAAGCGCCTCTGATAGAGCTTATATCGGTATGTATGTTGATTTTAATAGAGCTGATAGCAATACTCCATCTGATTACAATTGGACACTTGTAAAAGGATCTGATGGCGCAAACGGTGTGGCAGGTAAGGCTGGTACAGATGGTAGGACACCATACTTACACATAGCTTACGCCACATCAAATAATGGTTCACAAGGTTTTTCAACTACTGACAGTACAAATAAAACGTATATCGGAACATACACAGATTACACTCAGGCAGATAGTACAGATTACAGAGTGTATAAGTGGACGTTGATAAAAGGGGCAGATGGTACTGGTATTTCTAATGTAACTAATTATTATTTAGCTACTACAGTCTCAACAGGTATCACAAGAACAAGCGCAGGGTGGACAACTACGCCACAGCCTATCACATCAGACAAGCGTTATTTATGGAATTATCGAGTTGAGCTATACACAAACGGTACAAGTAAGACAACAGAGCCTACTGTTATTGGTGTGCACGGGGAAAAAGGAGAACGTGGATTACAAGGTTTACAAGGCTTGCAAGGTGCACGAGGTGAACAAGGTATTCCTGGACCTAGAGGGGCAGATGGTCGTACACAATATACTCACATGGCCTATGCCGATAACGCAACAGGTGGTGGATTCAGTCAAACAAACACTGACAAAGCCTTTGTTGGGGTGTACATTGACTTTAATCCAACAGACAGCAGAAATCCTGCTGATTATCGCTGGACAAGATGGAAAGGTCGTGATGGCGCAAATGGCGTGGCAGGTAGGGCTGGTGCAGATGGTCGTACACCATACTTACACATAGCTTACGCCACATCAAATAACGGCTCACAAGGCTTCTCAACTACTGACAGTACAAATAAAACGTATATCGGAACATACACAGATTACACTCAGGCAGATAGCACAGACCCTAAAAAATACAAGTGGGCTAAAGTAAAAGGGGACAAGGGAGAAAAAGGCGATAAAGGAGAACGTGGATTACAAGGTTTACAAGGCTTGCAAGGCGCACGAGGTGAACAAGGTATTCCTGGACCTAGAGGGGCAGATGGTCGTACACAATATACTCACATGGCCTATGCCGATAACGCAACAGGTGGTGGATTCAGTCAAACAAACACTGACAAAGCCTTTGTTGGGGTGTACATTGACTTTAATCCAACAGACAGCAGAAATCCTGCTGATTATCGCTGGACAAGATGGAAAGGTCGTGATGGCGCAAATGGCGTGGCAGGTAGGGCTGGTGCAGATGGTAGAACGCCTTATGTTCACTTTGCGTATTCTGAAAATGCAGATGGATCAGGTTTGACAATGACAGATAACGGACAGCGTTATTTTGGTCATTATTCAGATTATGAGAAACCTGATAGTTCGGATAAAACTAAGTACAAATGGGCTGATCGTTGGGCTAAAGTTGATGGAGGTTATGTAAATATCTATGCATTGTCTAAGAATAGAAGTATTGGGAAATCTTATCATGTTTCTGAATTTAATATGGATGTACTTTCAGGAAATATCACTCTAAAAGCAATTGGGTCAGATCCATACATTGGCGCCGTTTCTTCTCATCCAGGTATTTTTATAAAGCAACAGGGAATGAAAATCCCAGTTATACAAGGGAGATCTATTTGTATAACAATAACAAATCCTTTATTTAGAAAAAATTACATCTCGTTTTTTAATTCATTAGGTAAAACTGTTAAAACATATAAGCATTACAACACAAATAAATTTTTAATATCTTCAGCTGATTTGGTAGGTGTTGAATTTATCGCTTTACGATATGGTGCAGGAAGCTCGAATATACAAATTGGAACCGTACTAGAGACAAAAGTAAAGGTAGAATATGGAACCGTGCATAGCGACTGGTCGCCCGCTCCAGAAGATATTGAAAGTAATATTAACTCTAAAGCTGATCAGGGTCTGACTCAAGAGCAGTTAAATGCCCTCAATGAGAAGTCACAAATTTTAGAGGCTGAAATGAAAGCGAAAGCATCGATGGAGGCCTTTAGTGAATTAGAAAAAGCATATAATGCCTTTGTGAAATCAAATGCAGATAGTCGAAAAAAATCTGAGTCTGATTTAGTTGAAGCAGGTAGAAGAATTGATTTGCTGACGACACAATTTGGAGGATTAGCAGAGCTTAAAACATTCATTGATACTTACATGAAAAGCACAAATGAGGGCTTGATTATAGGTAAGAATGATGCAAGCTCTACTATTAAGGTATCAAGTGATAGAATATCCATGTTTTCTGCAGGTAAGGAAGTTATGTACATTTCGCAAGGTGTAATAAATATTGATAATGGTATTTTTACTGCATCAATTCAAATTGGACGTTTTAGAACAGAACAGTATCATCTTAACAAAGATGTGAATGTCATACGATATATAGGAGGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
c4ec2bb2915ad64235f6511f3970d3f03303a31af602c32cb34284064e483199
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7746
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Lysogeny of Streptococcus pneumoniae with MM1 Phage: Improved Adherence and Other Phenotypic Changes Loeffler,J.M. and Fischetti,V.A. 2006 16861634 GenBank