Genbank accession
XMN69188.1 [GenBank]
Protein name
tail fiber and host specificity protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,92
Protein sequence
MDLNKKGYKVEITAYSLGLELNQEERGAHKPANAMSFAEYLAYYDPEHALEVGVNEVADKRIKLEWTGTDTILARLFSIANSFDAELEFTVELNQDYSLKRQVLNIYKKGNLGSNRAASPIRVGRGLKVINYSDNLKELRTAVRATGKDGLTIDGLNKKVYDDDGNLLYYSNANTVYAPQSRDKYPSVGKKSNDNWIIKELGETEYSTKEALWGYMLGELKKICVPEITYDIEGAVDGDVGDTRTLIDDVHYDPPLYVQGRISELTEDLITGKVTKSTLTNFERKYSQVASELLKQVEQLANDAAPYIVRLSTDNGYNFKNGKGSSTVTASLEKYGKIVNANWKWLINNSIVSDKNSVTINASQVIGTLNVVAVATVDGKEVAREYITFTNSDDGVGIKSIKRYYTTNDQAEGVTAGGQNWSTKPATVTADNKYMWSYDVITYTNDTSLVTEPAVIGARGDDGLDADTTGVTEALDKAKQELTALSANIEKVRDDSLAAVEEAKQQLTTVADDLSKVKTDLQTQASQLTAQANAQSELTKRVSSVEETANGTTTAVSELSKTVDSNTKNISSVTARTKTVEDDLTSTKTTLSQVQTTANSASQKTATLETGLDGVKADLAATTATADTTKTNLASYQASNNQAVANLQSSLQTTNGYVSSLQTQVAAVPGQITSAVSAVEGKIPTEIGGRNLYALSKNDGIYSPGTNDFRQNISSGEISFEVTNTSAAGFGAYSSRIGTSYNKLYGVKIPVVQGKDILVNLTDDKLGRIYVHFWDENNALVKPTLKYTSNKIKVLASLLIDVSSITLQCAVKTDVEIGTLIKTKIKVEYGNVYTGWSPAPEDTAIQISSLSSQIRQTADGMTLLATKTELNSAKTDLQAGISTATSKADSAQATANSNAQTISTHTTQISALNTGLQSKVSQSDFDSLSGDVDDLSSKLTQTASSITASVSSVETKADNAQTTANSAVSKADAAQAGVNTLDSTTVKSASLNLDNNGFVTKVGKTIDGNTFATMIAQNANNVKIIADEMQVTADMIVDGAVTAEKLDVNNLSAVTANLGDMTSGSITNTFTSGTRSGSVKIGNGVEITTVDTSGYLPEKAKTYSRFSDDALSFSSSTSNDEPTHSMMIMPEMISYTKHNYDNSSGGTGGWKLRHNGHYSMLEVDMVWQNVRLSNTSELPYGVRADYVRIGNLVTISVNRQITSIADVTEDKLANETIPEGFRPISQAHLTLTGNTGSTIDATCIVHLNPDGTIRFTNNKSGNRVWTGTVTYTCVEAMPYSTSNNNISTI
Physico‐chemical
properties
protein length:1289 AA
molecular weight: 138532,08630 Da
isoelectric point:5,03420
aromaticity:0,06129
hydropathy:-0,36276

Domains

Domains [InterPro]
DC_0690
STR
1–633
G3DSA:1.10.287.1490
STR
461–682
SSF57997
STR
473–649
PTHR43941
Unmapped
474–980
Coil
Unmapped
475–495
XMN69188.1
1 1289
Architecture
STR
STR
RBD
STR 1-682 | STR 734-1116 | RBD 1117-1280 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage PY4
[NCBI]
3390255 Viruses >
Host Streptococcus equinus
[NCBI]
1335 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XMN69188.1 [NCBI]
Genbank nucleotide accession
PQ621715.1 [NCBI]
CDS location
range 15080 -> 18949
strand +
CDS
ATGGACTTGAACAAGAAAGGCTATAAGGTTGAAATCACAGCTTATTCGCTCGGTCTAGAGCTTAACCAAGAAGAGCGAGGTGCACACAAGCCAGCTAATGCAATGAGTTTCGCTGAATATTTGGCTTATTACGACCCTGAACACGCTTTAGAGGTTGGTGTTAACGAAGTAGCAGACAAACGTATCAAACTTGAATGGACGGGCACAGACACGATTCTGGCGCGTCTTTTTTCAATTGCCAATAGTTTCGACGCAGAGCTTGAATTTACTGTCGAGTTGAATCAAGACTACTCGTTAAAACGTCAAGTTTTAAATATCTACAAAAAAGGTAATCTTGGTTCTAATCGAGCAGCAAGTCCAATTCGTGTTGGCCGTGGCTTAAAAGTCATTAATTACAGCGATAATTTGAAAGAATTGCGTACAGCGGTACGTGCTACTGGTAAAGACGGATTGACCATTGACGGTCTGAACAAAAAAGTCTACGACGATGACGGCAATCTGCTTTACTACTCAAATGCTAACACTGTTTATGCGCCTCAAAGCCGTGATAAATACCCGTCAGTCGGCAAGAAGTCAAATGATAACTGGATTATTAAAGAACTCGGGGAAACCGAATATAGCACGAAAGAGGCGCTCTGGGGTTACATGCTCGGTGAACTCAAAAAGATTTGCGTACCAGAGATTACATACGACATCGAAGGCGCTGTTGACGGTGACGTCGGTGACACGCGCACATTGATTGATGACGTGCATTATGACCCACCGCTATACGTTCAAGGGCGTATTTCGGAGCTTACAGAGGATTTAATCACTGGCAAGGTTACGAAGTCAACGCTTACGAATTTTGAACGTAAGTACTCGCAAGTCGCTAGCGAATTGCTAAAACAAGTTGAACAGCTAGCAAACGACGCAGCACCTTACATCGTCCGTTTATCAACCGACAACGGCTACAATTTTAAAAACGGCAAAGGTTCTAGCACAGTTACTGCTAGCCTTGAAAAATACGGCAAAATTGTAAATGCAAATTGGAAGTGGCTAATTAATAACAGCATTGTTAGCGATAAGAACAGTGTCACAATCAATGCTAGTCAAGTTATTGGCACACTAAACGTCGTAGCTGTTGCAACCGTTGACGGGAAAGAAGTAGCTCGTGAATACATCACATTCACCAATTCTGATGACGGTGTTGGTATTAAGTCAATCAAACGCTATTACACGACTAACGACCAAGCAGAGGGCGTCACAGCAGGCGGTCAAAACTGGTCTACTAAACCAGCGACTGTCACAGCAGACAACAAGTACATGTGGTCTTACGATGTTATCACGTACACAAATGACACAAGTTTAGTTACTGAACCAGCCGTTATTGGTGCTCGAGGGGATGACGGTTTGGACGCAGATACGACAGGTGTCACAGAAGCACTTGATAAAGCTAAGCAAGAATTGACTGCTTTATCAGCGAATATCGAAAAAGTGCGAGACGATTCGCTTGCAGCAGTCGAAGAAGCCAAACAGCAACTCACTACTGTAGCTGACGACTTGTCTAAAGTCAAGACAGACTTGCAAACACAGGCTAGTCAGTTGACTGCACAAGCCAATGCACAGTCAGAATTGACTAAACGTGTAAGTAGCGTTGAAGAAACTGCTAATGGCACAACGACGGCTGTTAGCGAGTTAAGCAAAACAGTAGATAGTAATACCAAAAATATTAGTAGTGTTACTGCACGAACTAAGACAGTTGAAGATGACCTGACAAGTACTAAAACAACATTGTCACAAGTTCAAACGACTGCTAACAGTGCTAGTCAAAAAACAGCTACACTTGAAACTGGCTTGGATGGTGTCAAGGCTGATTTAGCTGCAACTACAGCAACTGCCGACACGACCAAAACCAATCTTGCTAGCTATCAAGCGTCAAACAACCAAGCTGTCGCAAACTTGCAATCTAGCTTGCAGACCACGAACGGCTATGTAAGCAGTCTGCAAACACAGGTTGCTGCAGTCCCGGGACAGATTACAAGTGCTGTCAGCGCAGTTGAGGGGAAGATACCTACTGAAATTGGTGGACGAAATTTATACGCATTGTCTAAAAATGACGGAATATATTCGCCAGGCACTAACGATTTTAGGCAAAACATAAGTAGCGGAGAAATCAGCTTTGAGGTAACTAACACATCCGCAGCTGGTTTCGGTGCTTATAGTAGCCGCATTGGAACAAGTTATAACAAACTGTACGGTGTCAAAATCCCAGTCGTGCAAGGTAAAGACATACTTGTTAATTTGACAGATGACAAATTAGGTCGAATATATGTACATTTTTGGGACGAAAACAATGCGTTAGTAAAGCCAACGCTAAAGTACACATCTAACAAAATTAAAGTTTTAGCCAGCTTGTTGATTGATGTCAGCTCCATCACTTTGCAGTGCGCAGTCAAGACAGACGTTGAAATTGGTACTTTGATTAAGACCAAAATAAAAGTCGAATATGGCAATGTGTACACAGGCTGGTCTCCAGCCCCCGAAGACACAGCTATACAAATCAGCAGCTTGTCTAGCCAGATTCGGCAAACTGCTGACGGCATGACGTTGTTAGCGACTAAAACAGAGCTAAACAGTGCTAAAACCGACTTGCAAGCTGGCATTTCGACAGCGACAAGCAAGGCTGACAGTGCACAAGCTACCGCTAACAGCAACGCGCAAACAATCAGCACACACACGACTCAAATCAGCGCGTTGAACACTGGTTTGCAAAGTAAAGTTTCTCAAAGTGATTTTGATTCATTGAGCGGTGATGTCGACGATTTATCAAGCAAGCTTACTCAGACAGCAAGTTCAATCACCGCAAGTGTGTCAAGTGTTGAGACTAAAGCAGATAATGCTCAAACGACTGCTAACAGTGCGGTCTCAAAAGCAGACGCGGCGCAAGCTGGTGTCAATACGCTAGACAGCACGACTGTTAAGAGTGCTAGCTTGAACCTTGATAACAATGGATTCGTGACAAAGGTTGGAAAAACTATCGACGGCAACACGTTTGCGACCATGATTGCGCAAAACGCTAACAACGTCAAAATCATCGCTGATGAAATGCAGGTCACTGCTGACATGATTGTCGACGGTGCAGTCACAGCCGAGAAACTAGACGTCAATAATTTGTCTGCAGTTACTGCAAATCTTGGTGACATGACATCTGGTTCGATTACTAACACATTCACTTCTGGAACACGAAGCGGAAGCGTCAAAATAGGTAATGGCGTTGAAATAACAACGGTTGACACATCTGGTTATTTGCCAGAAAAAGCCAAAACATACTCACGTTTCTCTGACGACGCGCTTTCGTTTAGCTCTAGCACTTCAAACGATGAACCGACACATTCGATGATGATAATGCCGGAAATGATTAGCTACACCAAACACAATTACGACAATTCAAGTGGCGGAACAGGAGGCTGGAAACTGCGCCACAATGGTCACTATTCAATGCTTGAGGTCGACATGGTTTGGCAGAACGTTCGACTTAGCAATACGTCTGAATTACCTTATGGAGTTCGAGCAGATTATGTTAGAATCGGAAATTTGGTAACTATTTCCGTCAATCGTCAAATTACAAGCATAGCAGATGTCACTGAAGATAAATTAGCAAATGAAACAATCCCAGAGGGTTTCAGACCAATTTCACAAGCACATTTAACGTTGACTGGGAATACTGGTTCGACCATCGATGCGACTTGTATTGTACACTTAAACCCTGACGGAACCATTCGCTTTACTAACAACAAATCAGGAAACCGCGTCTGGACGGGCACAGTGACATACACATGTGTTGAAGCTATGCCTTACAGCACATCAAACAACAATATTTCAACAATTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
27fda1647485a95a90e8be38b03ac8890a881b3f374b46586295d7773ab7a484
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6315
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50