Protein
- Genbank accession
- XPP60043.1 [GenBank]
- Protein name
- adhesin
- RBP type
- TF
- Protein sequence
-
MKKIMLAVALMVSVMTVNMANAAVSLKSSGDQWGPAVQDLMNAKENGNWNDARFISTFNNRFNGTAEEAKALLDSLKTGNVPSSLDDFNNSISTGGGNTSPTKGDKGDKGDKGDKGDPGKDGADGKDADTSNLVNKDTQAARDSAQDTAIDSKADKTALNDVSVKTDHAFDIAVEGRATANNALSTVYGLIPQVLDNGKKITDVDNKVDANKAETDASLNTKVDKDTQATRDSGQDEHINAVQDAAQTANNRATDLEHRADSTEGAIRETNKQLEVTDSRSINNAERLDGVEAKNAEQDTAIAGKADQSALDKEVTDRQNADTALKNNIDQNKAGQAVTDSKQDKAINGKVDKSTYAVDKAVQGIHDALQDGAIIGLEVSKADKSDLNKETADRVKGDKALDDKINKAVSDQASTDNAQNAAIGKEVTDRQKADTALKNNIDQNKAAQAKTDAKQDKALSDEAKTRSDADKQLQANIDKNKSDQAKTDAKQNAAIDSKVDKSTYAVDKAKQAVHDAVQDAAIVGLAVTKADKADLNKEVAARKDADKVLQSNIDSEAQTRATADTALKNNIDQNKADQAVTDATQDKALSAETDNRIAGDTALKANIAKNKADQAVTDSDQDKVIATKASKKELASETVARQNADKNLQANIDTESSTRANADNALKANIDKNKSDQAVTDSNQDKVIATKASKKELASETASRKSADAKLSSRIDSNDATLVQHDERITSNTNRIGSVEKRVSNFENQTNRRFSDIDKRIGDNRKVASAGIAGAGAMANIPQVTQNGNVSVGAGIGGYDGEQAVAVGFSARVSESVTTKVSVSTNTQSEVLWGAGVGVEW
- Physico-chemical properties
- protein length: 841 AA
- molecular weight: 88930.84890 Da
- isoelectric point: 5.06415
- aromaticity: 0.01902
- hydropathy: -0.80975
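The record does not say how these properties were computed. As an illustration only, aromaticity is commonly defined as the relative frequency of Phe, Trp and Tyr residues; assuming that definition applies here, the listed value for an 841-residue protein would correspond to about 16 aromatic residues. The function name and the toy peptide below are hypothetical and are not taken from the record.

```python
def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y).

    Assumption: this is the definition behind the 'aromaticity' field;
    the record itself does not state the method.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


# Toy peptide (not the record's sequence): 2 aromatic residues out of 10.
print(aromaticity("MKFAVALWGS"))  # 0.2
```

Passing the full protein sequence from the entry to the same function gives a value that can be compared with the listed figure.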
Domains
Domains [InterPro]
- DC_1037: STR, 50–180
- G3DSA:3.30.1300.30: RBD, 742–841
- IPR045584: RBD, 748–841
- Sequence axis: 1–841
Architecture
STR 50-180 | STR 672-770 | RBD 771-781 | ATT 782-841
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
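The architecture string can be read as a list of labelled residue ranges, and anything it does not cover falls under the "Unmapped" legend entry. The following is a minimal sketch of that reading, assuming 1-based inclusive coordinates and the `LABEL start-end | ...` layout shown above; `parse_architecture` and `unmapped` are hypothetical helper names, not part of the database.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split 'STR 50-180 | RBD 771-781' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        m = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if m is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return segments


def unmapped(segments, length: int) -> list[tuple[int, int]]:
    """Return residue ranges (1-based, inclusive) not covered by any segment."""
    gaps, pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


segs = parse_architecture("STR 50-180 | STR 672-770 | RBD 771-781 | ATT 782-841")
print(unmapped(segs, 841))  # [(1, 49), (181, 671)]
```

Under these assumptions, residues 1–49 and 181–671 carry no architecture label in this entry.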
Taxonomy
Coding sequence (CDS)
Genbank protein accession
XPP60043.1
[NCBI]
Genbank nucleotide accession
PQ464595
[NCBI]
CDS location
range 80007 -> 82532
strand -
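The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with a stop codon (the listed sequence ends in TAA); `cds_protein_length` is a hypothetical helper name.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS spanning start..end (inclusive)."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


print(cds_protein_length(80007, 82532))  # 841
```

The span covers 2526 nucleotides, or 842 codons including the stop, which agrees with the 841 AA protein length given in the entry.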
CDS
ATGAAAAAGATTATGCTGGCAGTAGCCTTAATGGTTTCTGTGATGACGGTTAATATGGCTAACGCCGCCGTATCGTTGAAATCATCTGGCGATCAATGGGGTCCGGCTGTTCAGGACTTGATGAACGCCAAAGAAAACGGAAATTGGAACGACGCACGTTTCATTTCCACGTTCAACAACCGTTTCAATGGTACAGCAGAAGAAGCAAAAGCTTTGCTAGATTCGTTAAAAACTGGTAATGTTCCTTCGTCCTTGGACGACTTTAATAACTCTATCAGTACTGGTGGTGGCAATACTTCACCAACCAAAGGTGATAAGGGTGATAAAGGTGACAAAGGCGATAAAGGTGATCCGGGTAAAGATGGGGCAGATGGTAAAGATGCCGATACCTCTAACCTGGTCAACAAAGATACTCAGGCCGCACGTGACTCTGCCCAGGATACAGCTATTGACTCTAAAGCTGATAAAACTGCGTTAAATGATGTTTCAGTGAAAACTGATCATGCGTTTGATATTGCGGTTGAAGGTCGTGCGACTGCCAACAATGCGTTAAGTACTGTTTATGGGTTAATCCCACAAGTACTTGATAATGGCAAGAAGATCACTGATGTTGACAATAAAGTTGATGCTAATAAAGCTGAAACTGATGCCAGCCTGAATACCAAAGTTGACAAAGATACCCAGGCCACTCGTGATTCAGGTCAAGATGAACATATCAATGCGGTACAAGACGCAGCCCAAACTGCCAATAACCGTGCTACTGATCTGGAACATCGTGCCGATTCAACTGAAGGTGCTATTCGTGAAACCAACAAACAGTTGGAAGTAACTGATTCTCGCAGTATTAACAATGCTGAACGTTTAGATGGTGTAGAAGCCAAAAACGCTGAGCAAGATACCGCTATCGCTGGTAAAGCTGACCAATCAGCATTGGATAAAGAAGTAACTGATCGTCAGAATGCCGATACTGCGTTGAAGAACAATATTGATCAGAACAAAGCGGGACAAGCTGTAACTGATTCCAAACAAGATAAGGCGATCAATGGTAAAGTAGACAAGTCTACCTATGCTGTTGATAAGGCAGTTCAAGGTATCCATGACGCATTACAAGATGGTGCGATTATTGGCCTTGAAGTATCAAAAGCTGATAAGTCCGATCTGAACAAAGAAACTGCGGATCGTGTCAAAGGTGATAAAGCACTGGATGACAAGATCAATAAAGCAGTATCAGACCAGGCCAGTACTGATAACGCACAAAACGCTGCGATTGGTAAAGAAGTAACTGATCGTCAAAAAGCTGATACTGCGTTGAAGAACAATATCGATCAGAACAAAGCGGCTCAAGCCAAAACTGATGCTAAGCAAGACAAAGCATTGAGTGATGAAGCAAAGACTCGTTCTGATGCTGATAAACAGTTACAGGCTAATATCGACAAAAACAAATCTGACCAAGCAAAAACCGACGCGAAGCAAAATGCCGCAATTGACAGTAAAGTAGATAAATCTACCTACGCTGTTGATAAAGCTAAGCAAGCTGTTCACGATGCCGTTCAGGATGCTGCGATTGTTGGTCTTGCTGTAACTAAAGCTGACAAAGCTGATCTGAATAAAGAAGTAGCAGCCCGTAAAGATGCTGACAAAGTACTTCAATCCAATATTGATTCTGAAGCCCAAACTCGTGCTACCGCCGATACTGCGTTGAAGAACAATATTGATCAGAACAAAGCGGATCAAGCTGTAACTGATGCTACACAAGATAAGGCTCTCAGTGCTGAAACTGATAACCGTATCGCTGGTGACACTGCCCTGAAAGCAAACATCGCCAAGAACAAAGCCGACCAAGCAGTAACCGATTCCGATCAGGATAAGGTTATCGCAACTAAGGCAAGTAAGAAAGAACTGGCAAGTGAAACCGTGGCACGTCAGAACGCCGATAAAAACCTTCAGGCGAACATTGATACTGAGTCTAGTACTCGTGCCAATGCGGATAATGCCCTGAA
AGCTAATATCGACAAAAACAAATCTGACCAGGCTGTAACTGATTCCAATCAGGATAAGGTTATCGCAACTAAGGCGAGTAAGAAAGAACTGGCAAGTGAAACTGCGTCACGCAAATCTGCTGATGCTAAGTTGTCCAGCCGGATCGATTCTAACGATGCTACGTTAGTACAACACGATGAACGTATCACCAGCAATACTAACCGTATTGGTTCTGTTGAAAAGCGTGTATCTAACTTCGAAAACCAAACGAACCGCCGTTTCTCAGACATTGATAAACGTATTGGTGATAATCGTAAGGTAGCAAGTGCTGGTATCGCTGGTGCTGGTGCGATGGCAAACATCCCACAGGTAACTCAGAACGGCAACGTATCTGTAGGTGCGGGTATTGGTGGTTATGATGGTGAACAAGCAGTAGCTGTGGGCTTCAGTGCTCGTGTAAGCGAAAGTGTCACCACTAAGGTATCTGTCAGTACTAACACTCAGTCTGAAGTACTGTGGGGTGCTGGTGTAGGCGTAGAGTGGTAA
Genome Context
Tertiary structure
PDB ID
95766e71fdb9a29a4fd787b5e8ba3b179f217b75fcbae3be40f5d183e5daf791
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
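The confidence bands above can be expressed as a small lookup, for example to label per-residue scores from a model file. This is a sketch only: the legend uses strict inequalities, so the handling of scores exactly equal to 90, 70 or 50 is an assumption (here they fall into the lower band), and `plddt_band` is a hypothetical name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT out of range: {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) go to the lower band.
    return "Very low"


print([plddt_band(s) for s in (95.2, 80.0, 61.5, 34.0)])
# ['Very high', 'High', 'Low', 'Very low']
```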