Protein
- Genbank accession: XAO35239.1
- Protein name: minor tail protein
- RBP type: TSP
- Protein sequence:
MSRILLRDLEPGRLYHIQARATNGDQASQWSQIWDLQTTSDIMPPAAPTALSWTVEGTAFKAVWTGPTTNQDGTPLSDFRDFQVKVYSPAAPATIITYYTTSARFDFTFEANLNSFGTPRATVNIEVRARDNTGNLSAAATASATNPPPANVAGFTATGITDAVALKWTANSDTDLKFYKVWQGTAAGSENTLVYTGLATSFVFDTISTNPQYFKIVAVDVFNTESATPATANATAKSSLAVDVTPPASPTGVTVTSSVDSSDASGGRAYIDVSWTGVADTDLQNYSVRYSTGTTWEYINVPEGVTTARINGLRPNTAYNVAVAAVDFQGNSSPYVNAGTYPITTAKDTTAPAAPTGVTVGAGVTTMSVYWNENSENDVKGGVGYYEIQLDTANTFNTGNLLTKQTSGTITSFSNLTSNTTYYTRVRAVDATGNVGSYSSIVSGVPRYIANADIQAGTINGDRITAATINGDRIVANSLDANTIKANTTFSQNLNVGATFTMAASGIMKSSNYVATTSGWQLTNNTLEINQGVIRAAALVLQNGHNMLHPAYADFEFVPSWYTSNLITFNDGGVTTWAISDATDVVGKYNSQCIKTSWTGVGTFSRVYLGPTYTSYNTQLETNTDYIISGWVYVKTGAGAKTAALGIKLADATFPGPVGNTSIPATSTWTRIWGTFNSGNNTSAEFYLSQYTSGDMYWDGLQLEKKVTADTTPSQWKPPGTTSIDGGIIRTGQIQSTASANGLAGQPAWSINMQGNAQLGDVNVRGRVVVGNPSNPSADGANSRIHSANYVAGSTGWIIRNDGYAEFRQIAVNSIKVTAFDAPFQNTLNTNMYDYTEDTSLWNTYGSVAMVIDPGAYTAEALFQFTGSGLVFRNGTGVEPIAYDPTILYRVSARVRAFSVATLNSNPGFETDTSGWFAYGSNSITRDTTKKFAGTASVRWDQVGSSNGVYGMGTVVSVKPGYTYTFSARVLPNSTVIRDNLKMNIVWKNASNVTISTTFNDMPPPVDTNGVAIPIDGTTWIQFSSTGTAPAGAVTASFEVQAGISGSPVAGIQGWFDDVTITTPPRIKVGLFGTDKSGYIIDYDFVDVATTPTKRHDMPTSDFSVLSGYAANQYMMVADNAEIPIATGASNTTADWITLTGFIKGRGGAGATGKFGKQGVYTDEYNPSAFNQEVSYMIPYVEFDVASGSIAQLDQFSVEAYESGAVSKIDTTGTTSSVKGVSIENIQDGTEFDHALRFFTGEADEKKPGMIGHITDGEMNDAGHLRIVPPLLNSLSNYNAGPFIGIWDQNPNYLYDASFQKGITGWSGMANTTLSWNQTTGREDSYSLQIQATGTISSPATTELLGKYQVSTLVNSELIGQQVTVSGYAMMGTATGRNVRLVVKFLDEAGAMITGYFIEKAVTNTAWTYFAFVTPVVVPDTCYNIEFSFSWFNGATGDIVLVEDVQLEANNQKTDFRSASASKIELNAEVVKARGPVIIGQTDIEYPTNIIGTGKADVPIYNGTYAVSDAGLAAWRLVNYTDSAGTRASSQMSFFSAQGAEEAAVRIYGMNDANYPGRTVITNANGNFALSTYASTGVDMTDASSVNVRVYGHLDVSGDPPWTAATLTNGSNYSSAYNVAYYKSNGNVYLRGALQTYTKATTLFTLPSGYRPAQTVYLNAIQWSSTTSLGAVLLQVNTTGTVQVTSSASTLSQLSLDGLWFSVSPEAVAPPTGGDTTAPSAPTGFSINAYSSGSSTGTYQLKWTNPSASDTAGVKVIWRTDRYPTVTIAGSGTKTLTTDGTIVTVTGTASQAKTYNHTGLPVNKTIYYRLVSYDTSGNHSTYISASRYLLASPVTIYASSSASYRLGYGGMWRNDGDEVYQGDWTGNDNHRGLYFYGTGIYDKLATGGVVRTPTKMTVYLKRLSTSHGNNSGVGINLRGHTYSSKPSGDPVGAMTNEGSDGDDIVFLDRGEAATVTVPSSWYNNFVDATSSSRLKGLGVYGSTTSDYAIMYGRGDSSSYG
KLTIYHKG
- Physico-chemical properties:
  - protein length: 2008 AA
  - molecular weight: 213887.73010 Da
  - isoelectric point: 4.99105
  - aromaticity: 0.10657
  - hydropathy: -0.19861
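The record does not state how these values were computed. As a rough sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), both could be recomputed from the protein sequence. The function names below are illustrative, not part of the database.

```python
# Kyte-Doolittle hydropathy value for each of the 20 standard residues.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

AROMATIC = "FWY"


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the sequence above, used only as a short demo input.
demo = "MSRILLRDLEPGRLYHIQAR"
print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Running the same functions on the full 2008-residue sequence would give values to compare against the listed aromaticity and hydropathy, provided the assumptions about the definitions hold.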
Domains
Domains [InterPro]
| Domain ID | Type | Range (AA) |
|---|---|---|
| IPR003961 | STR | 1–41 |
| DC_2195 | ATT | 2–44 |
| IPR036116 | STR | 4–70 |
| IPR003961 | STR | 246–348 |
| IPR003961 | STR | 247–334 |
| DC_0960 | STR | 261–456 |
| IPR003961 | STR | 270–332 |
Architecture
ATT 1-243 | STR 244-1289 | STR 1291-1449 | RBD 1450-1718 | STR 1719-1830 | RBD 1831-2007 |
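The architecture string can be split into typed segments and checked for residues that no segment covers. This is a minimal sketch, assuming 1-based inclusive coordinates and the protein length of 2008 AA listed above; `parse_architecture` and `unmapped_positions` are hypothetical helper names.

```python
import re

ARCHITECTURE = (
    "ATT 1-243 | STR 244-1289 | STR 1291-1449 | "
    "RBD 1450-1718 | STR 1719-1830 | RBD 1831-2007 |"
)


def parse_architecture(text):
    """Return (domain_type, start, end) tuples, 1-based and inclusive."""
    pattern = re.compile(r"([A-Z]{3})\s+(\d+)-(\d+)")
    return [(m.group(1), int(m.group(2)), int(m.group(3)))
            for m in pattern.finditer(text)]


def unmapped_positions(segments, protein_length):
    """List residue positions that are not covered by any segment."""
    covered = set()
    for _, start, end in segments:
        covered.update(range(start, end + 1))
    return [pos for pos in range(1, protein_length + 1) if pos not in covered]


segments = parse_architecture(ARCHITECTURE)
gaps = unmapped_positions(segments, 2008)
print(segments)
print(gaps)  # [1290, 2008]
```

Under these assumptions, residues 1290 and 2008 fall outside every segment, which is consistent with the "Unmapped" entry in the legend.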
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 248 | 248 | 0.5828 |
| Central domain | 249 | 548 | 301 | 0.5496 |
| C-terminal | 549 | 2008 | 1459 | 0.3006 |
Note: Constraints were applied during segmentation.
Fixed 8 C-terminal predictions appearing before Central domain
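The note does not say how the out-of-order predictions were corrected. Purely as an illustration of what such an ordering constraint could look like, the sketch below assumes per-residue labels and relabels any C-terminal call that occurs before the last Central-domain call, copying the label of the preceding residue. The function name and the relabelling rule are assumptions, not the database's actual method.

```python
def fix_early_c_terminal(labels):
    """Relabel C-terminal calls that occur before the last Central-domain call.

    Each offending label is replaced by the (already corrected) label of the
    preceding residue, or by "N-terminal" at position 0.
    Returns the corrected list and the number of labels changed.
    """
    fixed = list(labels)
    if "Central domain" not in fixed:
        return fixed, 0

    # Index of the last residue predicted as Central domain.
    last_central = len(fixed) - 1 - fixed[::-1].index("Central domain")

    changed = 0
    for i in range(last_central):
        if fixed[i] == "C-terminal":
            fixed[i] = fixed[i - 1] if i > 0 else "N-terminal"
            changed += 1
    return fixed, changed


# Toy input (not taken from the record): two early C-terminal calls.
toy = ["N-terminal", "N-terminal", "C-terminal", "N-terminal",
       "Central domain", "Central domain", "C-terminal", "Central domain",
       "C-terminal", "C-terminal"]
fixed, changed = fix_early_c_terminal(toy)
print(changed, fixed)
```

On the toy input, the two early C-terminal calls are absorbed into the neighbouring N-terminal and Central runs, leaving a single N-terminal, Central, C-terminal order.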
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-248
- Central: 249-548
- C-terminal: 549-2008
Taxonomy
Coding sequence (CDS)
- Genbank protein accession: XAO35239.1
- Genbank nucleotide accession: PP537961
- CDS location: range 34704 -> 40730, strand +
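The CDS coordinates can be cross-checked against the protein length. This is a small sketch, assuming the range is 1-based and inclusive (the GenBank convention) and that it includes the terminal stop codon; the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of encoded residues, assuming the range ends with a stop codon."""
    n_nt = cds_length(start, end)
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_nt // 3 - 1  # drop the stop codon


print(cds_length(34704, 40730))               # 6027
print(expected_protein_length(34704, 40730))  # 2008
```

The result of 2008 residues agrees with the protein length reported under the physico-chemical properties.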
CDS
ATGAGCAGAATTCTTCTAAGAGACCTAGAGCCAGGAAGGCTCTACCACATTCAGGCACGCGCCACAAATGGCGACCAGGCTTCTCAGTGGTCCCAAATTTGGGACCTTCAGACGACAAGCGACATTATGCCACCAGCCGCACCTACAGCCCTTTCATGGACTGTTGAGGGTACGGCTTTTAAGGCTGTCTGGACTGGCCCAACGACAAATCAGGATGGTACTCCACTAAGCGATTTCCGTGATTTCCAGGTTAAGGTGTATTCACCAGCGGCCCCAGCAACCATCATCACATATTACACGACTTCGGCAAGATTCGATTTCACGTTTGAAGCCAACCTCAATTCATTTGGAACGCCTCGCGCTACAGTAAATATTGAGGTACGTGCAAGAGACAACACGGGAAATCTTTCTGCTGCCGCTACTGCATCTGCTACAAATCCGCCACCAGCTAATGTTGCTGGATTTACTGCCACTGGAATTACTGATGCAGTGGCTCTTAAGTGGACTGCTAATTCAGATACAGACCTAAAGTTTTACAAGGTTTGGCAGGGGACTGCGGCGGGTTCTGAGAATACTCTAGTATACACAGGACTTGCTACCTCTTTCGTATTCGATACGATTTCCACCAATCCGCAGTATTTCAAGATTGTGGCAGTTGACGTCTTTAATACAGAGTCAGCAACACCGGCAACAGCCAACGCAACTGCAAAGTCTTCCCTTGCCGTAGATGTAACACCACCAGCAAGCCCAACCGGAGTAACAGTAACATCATCCGTAGATTCATCTGATGCCTCTGGTGGAAGGGCTTACATTGATGTGTCTTGGACTGGAGTAGCTGACACAGACCTACAGAATTACAGTGTTCGATACAGTACAGGAACAACATGGGAATACATCAATGTTCCAGAAGGTGTAACCACTGCTCGAATCAATGGTCTACGTCCAAACACAGCCTATAACGTTGCCGTTGCTGCCGTTGATTTCCAGGGTAATTCCAGTCCATATGTGAATGCTGGAACATATCCAATCACCACAGCCAAGGACACAACCGCCCCTGCTGCCCCAACCGGAGTAACAGTTGGCGCGGGTGTCACAACAATGTCTGTCTACTGGAACGAGAATTCAGAAAACGATGTTAAGGGTGGAGTTGGATACTACGAAATTCAGCTTGACACCGCCAATACATTCAATACTGGAAATCTCCTCACCAAGCAGACTAGTGGAACTATTACTTCGTTTTCAAATCTGACATCTAACACGACTTACTACACAAGAGTTCGTGCAGTTGATGCTACTGGAAATGTCGGTTCATATTCTTCAATCGTCTCTGGTGTTCCTCGCTACATTGCCAATGCTGATATTCAGGCGGGTACCATCAATGGTGACCGTATTACTGCTGCAACAATCAATGGTGACAGAATTGTAGCCAACAGCCTTGATGCAAATACCATTAAGGCCAATACGACGTTCTCTCAGAACCTCAACGTTGGAGCAACCTTCACCATGGCAGCCAGCGGTATTATGAAGAGCAGCAATTATGTTGCAACCACCTCAGGATGGCAGCTAACAAACAACACTCTTGAAATTAACCAGGGTGTTATTCGTGCTGCCGCTCTCGTCTTGCAGAACGGTCACAACATGCTTCACCCTGCATACGCTGACTTTGAGTTTGTTCCTTCATGGTATACATCAAATCTCATCACATTCAACGATGGTGGTGTAACAACATGGGCAATTTCAGACGCTACAGATGTTGTTGGTAAGTATAATAGTCAGTGTATCAAGACTTCATGGACTGGCGTTGGTACATTCTCAAGAGTCTACCTTGGTCCAACTTACACATCATACAATACACAGCTTGAAACAAATACAGATTACATCATTTCCGGGTGGGTTTACGTAAAGACAGGTGCAGGTGCCAAGACAGCAGCCCTAGGAATCAAGCTAGCAGATGCTACATTCCCAGGTCCAGTTGGAAATACTTCAATTCCAGCAACATC
AACGTGGACTCGTATTTGGGGTACCTTCAATTCTGGTAACAACACTAGCGCAGAATTCTACTTGTCACAATACACGTCAGGAGACATGTACTGGGATGGCCTACAGCTTGAAAAGAAGGTTACCGCAGATACTACACCTTCACAGTGGAAGCCACCAGGCACTACATCAATCGATGGTGGAATCATTCGTACTGGTCAGATTCAGTCTACAGCTTCCGCAAATGGTCTGGCTGGTCAGCCTGCGTGGTCAATTAACATGCAGGGTAACGCCCAGCTTGGTGACGTTAATGTTCGTGGTCGAGTAGTTGTCGGTAATCCTAGCAACCCATCAGCAGATGGAGCAAACAGTCGAATCCATTCAGCAAATTATGTGGCGGGTTCTACAGGATGGATTATCCGCAATGATGGATACGCAGAATTCCGTCAGATTGCCGTTAACTCAATCAAGGTTACAGCATTTGATGCTCCATTCCAGAATACACTTAATACAAACATGTATGACTACACCGAGGATACCTCTTTGTGGAATACATACGGTTCTGTTGCTATGGTTATTGACCCAGGAGCTTATACCGCTGAGGCATTGTTCCAGTTTACTGGTTCTGGTCTTGTATTCCGTAACGGTACTGGTGTTGAGCCTATCGCTTATGACCCTACCATTCTTTATCGCGTGTCCGCGCGTGTTCGTGCATTCTCTGTTGCAACGTTGAATTCAAACCCAGGATTCGAAACTGATACTTCAGGATGGTTTGCTTACGGAAGCAACAGCATCACTCGTGACACAACAAAGAAGTTTGCCGGTACTGCATCTGTTCGTTGGGACCAGGTTGGTTCTTCTAATGGTGTGTACGGAATGGGTACTGTTGTCAGCGTAAAGCCAGGATATACATACACGTTCTCTGCACGTGTTCTTCCTAACAGCACTGTCATTCGAGACAATCTAAAGATGAACATTGTTTGGAAGAATGCTTCTAACGTTACTATCTCTACAACGTTCAATGATATGCCACCTCCTGTTGATACAAATGGTGTTGCAATTCCTATTGACGGAACCACATGGATTCAGTTCTCATCAACTGGAACAGCCCCAGCAGGAGCCGTAACAGCATCGTTTGAGGTTCAGGCGGGTATTTCTGGAAGTCCTGTCGCTGGTATTCAGGGATGGTTTGATGATGTAACTATCACCACCCCACCAAGAATTAAGGTTGGTCTATTCGGAACTGATAAGTCTGGATACATCATTGATTACGACTTTGTTGATGTTGCAACAACGCCAACAAAGCGTCACGACATGCCTACATCAGACTTCAGCGTCTTGTCTGGATATGCAGCAAATCAGTACATGATGGTTGCTGATAATGCTGAAATTCCAATTGCCACTGGAGCTTCAAATACCACAGCGGACTGGATTACCCTTACAGGATTCATCAAGGGTCGTGGTGGTGCTGGTGCTACTGGTAAGTTTGGTAAGCAAGGCGTCTACACAGACGAATACAACCCTTCAGCATTCAACCAGGAAGTCAGTTACATGATTCCTTATGTTGAATTCGATGTTGCTTCTGGCTCTATTGCTCAGCTTGACCAGTTCTCTGTTGAGGCTTATGAGTCTGGCGCGGTTTCCAAGATTGACACCACTGGAACGACAAGCAGTGTAAAGGGTGTATCTATCGAGAACATTCAGGATGGTACAGAATTCGACCATGCCCTTCGTTTCTTTACGGGTGAAGCCGATGAAAAGAAGCCGGGTATGATTGGACATATCACCGATGGTGAAATGAATGATGCTGGACATCTAAGAATTGTTCCTCCATTGCTGAATAGTCTAAGCAATTACAATGCTGGACCATTCATTGGTATTTGGGACCAGAATCCAAACTATCTATATGACGCTTCTTTCCAGAAGGGAATTACTGGATGGAGCGGAATGGCAAATACAACTTTGTCATGGAATCAGACTACTGGACGAGAAGATTCATACAGTCTTCAGATTCAGGCTACTG
GAACGATTTCCAGCCCGGCGACAACTGAGCTTCTAGGTAAGTATCAGGTATCCACCCTTGTCAATAGTGAGCTTATTGGCCAGCAAGTAACTGTTTCTGGTTATGCAATGATGGGAACGGCAACGGGTAGAAACGTTCGACTTGTTGTAAAGTTCCTTGATGAAGCAGGAGCAATGATTACTGGATACTTCATTGAGAAGGCTGTCACAAATACTGCATGGACATACTTTGCGTTCGTGACACCAGTAGTTGTTCCAGATACATGTTACAACATTGAATTCTCATTCTCTTGGTTCAATGGCGCAACGGGAGACATTGTTCTTGTTGAAGATGTTCAGCTTGAGGCGAATAATCAGAAGACTGATTTCCGCTCGGCTTCTGCAAGTAAGATTGAGCTTAATGCAGAGGTCGTAAAGGCACGTGGACCTGTTATTATTGGTCAGACTGATATTGAATACCCAACCAATATCATTGGAACTGGAAAGGCAGATGTTCCTATTTACAATGGAACTTATGCCGTGTCTGATGCTGGTCTAGCAGCGTGGAGATTGGTTAACTACACTGACTCTGCTGGTACAAGAGCTTCATCACAGATGTCATTCTTCAGTGCTCAGGGTGCTGAAGAAGCAGCAGTTCGTATCTATGGTATGAACGATGCAAACTACCCAGGACGTACCGTTATCACCAATGCCAATGGTAACTTCGCATTGTCTACTTATGCAAGCACTGGTGTTGATATGACAGATGCTTCATCCGTCAATGTTAGGGTATATGGACATCTAGACGTTTCTGGTGACCCGCCATGGACCGCAGCAACTCTGACGAACGGTTCCAACTACTCGTCAGCATACAATGTGGCTTATTACAAGAGCAACGGAAATGTTTACCTACGTGGTGCCCTTCAGACATATACGAAGGCTACGACTTTGTTCACACTGCCTTCTGGATACCGTCCTGCACAGACTGTCTACCTGAATGCAATTCAGTGGTCAAGCACGACGTCTCTAGGAGCTGTGTTGCTCCAGGTCAACACGACTGGTACAGTTCAGGTTACGTCTTCAGCTAGCACATTGTCTCAGCTATCGCTTGATGGTTTGTGGTTTAGCGTTTCTCCAGAAGCGGTTGCTCCGCCTACTGGTGGAGATACAACGGCTCCGTCTGCACCTACAGGATTTAGCATTAATGCTTATTCTTCAGGAAGCAGCACCGGAACATATCAGCTAAAGTGGACAAATCCAAGCGCATCTGACACTGCTGGTGTAAAGGTAATTTGGAGAACTGACCGTTATCCAACCGTTACCATTGCCGGGTCTGGAACTAAGACATTGACTACTGACGGTACAATTGTTACGGTTACTGGTACTGCCTCACAGGCAAAGACCTATAACCACACTGGTCTTCCAGTCAACAAGACAATTTACTACCGCCTGGTTTCTTATGACACCAGTGGTAACCACTCAACATATATCAGTGCTTCTAGATATTTGCTAGCAAGCCCTGTGACCATCTATGCATCTAGCTCTGCGTCATACCGACTCGGATATGGTGGTATGTGGCGTAACGATGGTGACGAGGTTTACCAGGGTGACTGGACCGGAAATGACAATCACCGTGGTTTGTACTTCTACGGTACAGGAATCTACGACAAGTTGGCAACAGGTGGTGTTGTTCGCACTCCAACCAAGATGACTGTTTATCTAAAGCGTCTAAGCACTTCTCACGGAAATAACTCCGGGGTTGGAATCAATCTACGTGGACACACATATTCAAGCAAGCCATCTGGTGACCCTGTTGGAGCAATGACAAACGAAGGCTCAGACGGAGATGACATTGTATTCTTGGACAGAGGTGAAGCGGCAACTGTAACTGTTCCGTCTTCATGGTACAATAACTTTGTTGATGCAACATCATCAAGTCGTCTAAAGGGTCTCGGTGTTTATGGTAGCACAACCTCAGATTACGCAATCATGTATGGTAGAGGAGATAGCTCAAGTTACGGA
AAGCTAACTATCTATCACAAGGGTTGA
Genome Context
Tertiary structure
PDB ID
8f3dbd33544165068bda54d94c7c73a831476d876a55f939449cb7bc791363bc
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
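The confidence bands can be expressed as a small lookup. The legend uses strict inequalities, so scores of exactly 90, 70 or 50 are not assigned to any band; the sketch below assumes they fall into the lower band, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 80.0, 61.5, 30.1):
    print(value, plddt_band(value))
```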