GenBank accession
YP_010644875.1 [GenBank]
Protein name
tail fiber protein and host specificity
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MQIWIHDKSMRKVCALNNEIPGMLPYTNSQWHPYLEYSTSTFDFTIPKIVNRKLHDDIKYINDQMFVSFYFDNSYHVFYVSKLVENDFSFQVTCNNTNLELAMEVARPLADSGGPKTIEWYLQNLELLGFAGLEIGVNEISDRTRTLTFESQSGTKLEQLHSLMNQFDAEFIFRTELNRDGTMKRFIIDIYQEADENHHGIGKARGDVVLYYQSGLKGVQVTSDKTQLFNAGNFIGQDGVNLNDVEFEEKNELGQVEFYSRKGTSFVFAPLSRERYPSTMNPDSADNWTRRDFQTEYKDVESLKAYALRTIKQYAYPLLTYTVDVQSSFLDNYKDINLGDTVKIIDNNFRGGLALEARVSEMIISFDNPTNNSVVFTNFRKLDNKPSSELQQRIDEIVSKSLPYHVEIRTTNGTVFKNGIGRSTVKPILKQGDKIVDATYRFVIDGTIKYVGMTYDMVASEITQPTTLTIAAWVDNKEVASEEVTFVNVSDGKDGRTPYVHFAYADSADGQKGFSLTQTGSKRYLGVLTNFIKEDSTNPADYTWSDTAGSISVGGRNLLKGSKGPFKPDLKPTNFDNYVYYENETSVYLEQGEQYIISAKTDGNFTKWHNGEIESDNITLWLFRFWDVVDIVSDSNTGTTGTQFTWNHPTGTYHLRVNTYHKTAIKSVWEVKIEKGTIKTDWTPAIEDVQDEIDSKADDAMTIEQINALNERAEIIKAEMEAKASAEILNNWIKNYQDFVKANETERAAAEKALISSSQRVSIIAKELGELSDRWSFIDSYMTASNDGLVIGKNDGSSGIMINPNGRISMYSAGEEVMYISQGVIHIENGIFSKTIQVGRYREEQYHLNPDMNVIRYVGGF
Physico-chemical properties
protein length: 861 AA
molecular weight: 97779.06560 Da
isoelectric point: 5.01629
aromaticity: 0.11266
hydropathy: -0.46887
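The aromaticity and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes the usual definitions, which the record does not state explicitly: aromaticity as the fraction of Phe, Trp and Tyr residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def _clean(seq: str) -> str:
    """Strip whitespace, upper-case, and reject empty input."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return seq


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = _clean(seq)
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy (assumed definition).

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    seq = _clean(seq)
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Passing the 861-residue sequence from this record to both functions should give values close to those listed, provided the record uses the same definitions.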

Domains [InterPro]
YP_010644875.1, residues 1-861 (domain architecture figure; legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other)

Taxonomy

Phage
Name: Streptococcus phage 53 [NCBI]
Taxonomy ID: 1718280
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
YP_010644875.1 [NCBI]
GenBank nucleotide accession
NC_070652.1 [NCBI]
CDS location
range 16300 -> 18885
strand +
CDS
ATGCAAATCTGGATTCATGATAAAAGTATGCGTAAAGTGTGTGCTTTGAATAATGAAATTCCCGGAATGTTGCCATATACGAACAGTCAATGGCATCCATACCTTGAATACTCAACAAGTACGTTTGATTTTACAATTCCTAAAATTGTGAACAGGAAACTGCACGATGATATCAAATATATCAATGACCAGATGTTTGTATCATTCTATTTCGATAATTCCTATCATGTTTTTTATGTATCAAAACTCGTTGAGAATGATTTTAGTTTTCAAGTCACTTGTAATAACACCAACCTTGAATTGGCAATGGAAGTTGCACGACCACTTGCAGACAGTGGCGGTCCCAAAACTATTGAATGGTATCTTCAAAATCTTGAGTTGCTTGGTTTTGCAGGTCTGGAAATAGGTGTCAATGAAATTTCTGATAGAACAAGAACGCTTACTTTTGAATCTCAAAGTGGAACTAAACTAGAGCAACTTCATAGCTTGATGAATCAATTTGATGCAGAATTTATTTTCCGTACCGAATTAAACCGAGACGGAACTATGAAACGTTTCATCATCGACATCTACCAAGAAGCAGATGAAAACCATCACGGTATAGGTAAGGCAAGAGGAGATGTTGTTCTCTACTACCAAAGCGGATTGAAAGGCGTTCAAGTTACTAGTGATAAAACGCAACTTTTCAACGCTGGTAATTTCATTGGACAAGATGGCGTTAACCTAAACGATGTCGAATTTGAGGAAAAGAACGAGCTAGGACAAGTAGAGTTCTATTCTCGAAAGGGCACTAGCTTCGTTTTCGCCCCACTGTCAAGGGAACGCTACCCATCTACCATGAATCCAGACAGCGCTGATAACTGGACACGTAGGGATTTTCAGACAGAATACAAGGACGTTGAATCCTTAAAAGCTTACGCCTTGCGTACTATCAAGCAGTATGCTTATCCACTATTGACTTACACAGTAGATGTTCAGTCTAGCTTTCTGGATAACTATAAAGACATCAATCTAGGTGACACTGTTAAAATCATCGATAATAATTTTAGAGGTGGTTTAGCCCTCGAAGCGCGTGTATCTGAAATGATTATCAGCTTTGACAATCCCACAAACAACTCGGTTGTTTTTACTAATTTCAGAAAATTGGATAATAAACCGTCTAGCGAATTACAACAACGTATCGATGAGATTGTTTCTAAGTCATTGCCATATCATGTTGAGATAAGGACCACGAATGGTACAGTATTTAAGAATGGTATTGGTCGTTCTACTGTTAAACCAATTTTGAAACAAGGCGATAAAATTGTTGATGCAACTTATCGATTTGTGATTGACGGAACTATTAAATACGTAGGTATGACTTACGATATGGTAGCGTCAGAGATAACTCAACCAACAACGTTGACGATTGCTGCGTGGGTAGATAACAAAGAAGTAGCTTCAGAAGAAGTTACTTTTGTAAATGTATCAGATGGTAAGGACGGACGTACGCCTTATGTACATTTTGCCTATGCCGATAGTGCCGATGGTCAAAAGGGTTTCAGTTTGACACAGACTGGAAGTAAACGCTATTTAGGTGTGCTAACCAACTTCATAAAAGAAGACAGCACAAACCCAGCAGATTATACATGGAGTGACACTGCTGGTAGCATTTCGGTTGGTGGCAGGAATCTATTGAAAGGTTCGAAAGGACCTTTTAAACCTGACCTTAAACCTACAAATTTTGATAATTACGTTTATTATGAAAATGAAACTTCTGTCTATTTAGAACAAGGAGAACAATATATCATCAGTGCCAAAACAGATGGTAATTTTACTAAATGGCACAACGGAGAAATCGAAAGTGACAATATCACACTCTGGTTATTTCGTTTCTGGGATGTGGTTGACATTGTTTCTGATTCAAATACAGGTACTACAGGAACGCAGTTTACTTGGAATCATCCGACAGGTACATATCATCTACGTGTAAATACATATCATAAAACAGCAATCAAATC
TGTTTGGGAAGTGAAGATTGAAAAAGGGACAATCAAAACTGATTGGACACCCGCCATCGAGGATGTACAAGATGAAATTGATTCCAAAGCCGATGATGCTATGACGATTGAACAGATTAATGCGCTTAATGAAAGGGCTGAAATCATTAAAGCAGAGATGGAAGCCAAAGCAAGCGCTGAAATTTTGAATAACTGGATTAAAAATTACCAAGATTTCGTTAAGGCAAACGAGACCGAGAGAGCTGCAGCCGAGAAAGCTTTGATTAGTTCAAGTCAGCGGGTATCAATCATTGCTAAGGAATTAGGTGAACTGTCTGATCGTTGGAGTTTCATCGATAGCTACATGACTGCATCAAACGATGGGCTTGTGATTGGAAAGAATGACGGTAGCTCTGGCATAATGATCAACCCTAACGGTCGGATTTCAATGTATTCAGCAGGGGAGGAGGTCATGTATATTTCGCAAGGTGTAATACACATCGAGAACGGGATCTTCTCAAAAACTATCCAAGTTGGTCGATATCGTGAGGAACAGTACCATCTTAACCCAGACATGAATGTCATTCGTTATGTAGGAGGTTTTTAA
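The CDS location can be checked against the reported protein length with simple coordinate arithmetic. This is a minimal sketch assuming the range is 1-based and inclusive (the GenBank feature convention) and that the CDS ends with a stop codon. The function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the range.

    Assumes a single uninterrupted reading frame; the terminal stop
    codon, if present, is not counted as a residue.
    """
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


print(cds_length(16300, 18885))               # 2586
print(expected_protein_length(16300, 18885))  # 861
```

The range 16300 -> 18885 spans 2586 nt, which is 862 codons: 861 residues plus the stop codon, matching the protein length given in this record.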

Tertiary structure

PDB ID
966891b7c9059c64590058072c01d3f9656103a88ae51cd13fa42433f8928688
Source ESMFold
Method ESMFold
Resolution 0.7503
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
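The confidence bands in this legend can be applied to per-residue pLDDT scores programmatically. In the sketch below, the function name is hypothetical, and the handling of scores exactly at 90, 70 or 50 is an assumption, since the legend leaves those boundaries undefined.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band here;
    that choice is an assumption, not something the legend specifies.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Example: summarise a short list of per-residue scores.
scores = [95.2, 81.0, 64.5, 32.1]
print([plddt_band(s) for s in scores])
```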