Protein
- UniProt accession
- A0A385DVT1 [UniProt]
- Protein name
- Polygalacturonase/tailspike protein
- RBP type
- TSP (tailspike protein); evidence: UniProt/TrEMBL; probability: 1.0000
- Protein sequence
-
MFFTQEDYRKIEKWLLANSRKDTDFAGAATPLKGNETVVLVQNGKNVKASVKDVVEQLFLLGVSDFVNITDKYGESYISLSQAIELIPYRSRKIGQVVTFLDDTGKWAMFQFQGTRKNQWGTLSLWVDLIDLMTGLTITDSEDIVTETNSANQVALKFADKTYNEADYSGLGRVYLRKNIVNVEDPVTGNIVKMNYLTQSMISKENTIYIVQYSYNLNGQTITIPSGCVLKFEGGSISNGSIKGTDTNIIAPQIRIFNTILLSGTWKVRDIFDDWFDFNATTNFDNINNFYNISILQSDDLENNVILKGNYYSSLKDGIVLSLSSNTNLVLNGSISLLPNNLSSYSIIKGVDKENIKISGGGRLIGDLQNHLGDTGEWGFGISFTGCKTVSITNIDSSYMWGDGLYIGASDDTKEETLSQNIQVNNCKFEYNRRQGISITGAVDVFVNNCYFFNTGKINGTSPKAGLDIEPSGLYNSNVTISNCIADSNVSTGFLVYGDNRNIVIDNCASKNQLISITIAQRSATTDNDDVFIRHGNIGGSLQITRGNIRVEDCEVDSVYFTADTSGIGANVTISNSVIGAKRWEGSTYFNSVFLIDSNSQINNLYIFDSKIDYDPSILTQGLFNIGGNSIRDNILFENCDISQKNTVNSLNTKVGSYRNCRFYNMTRIYLANEPNKTVEFTDNYCAMTRESSSTNIFSFLNSSSTQDVILFVVRNNTFSTKGSINVGSIGLIDVTGVNLGNKMIFENNHFLTQYPLTQEQIIKALSNRITVDTNYNFTSRFPYRATLESLPAYNTFDAGALIYGDDNLLYFWNGTNLTNSEGTDARKVVIV
- Physico-chemical properties
- protein length: 832 AA; molecular weight: 92331.17820 Da; isoelectric point: 4.85123; aromaticity: 0.10457; hydropathy: -0.20072
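The aromaticity and hydropathy figures can be approximated directly from the protein sequence. The sketch below assumes common definitions that this record does not state: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the sequence in this record, used only as a toy input.
fragment = "MFFTQEDYRK"
print(len(fragment), round(aromaticity(fragment), 3), round(gravy(fragment), 3))
```

Under this definition, the reported aromaticity of 0.10457 would correspond to 87 aromatic residues out of 832 (87 / 832 ≈ 0.10457).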
Domains
Domains [InterPro]
Taxonomy
Role | Name | Taxonomy ID | Lineage |
---|---|---|---|
Phage | Bacteroides phage crAss001 [NCBI] | 2301731 | Uroviricota > Caudoviricetes > Crassvirales > Asinivirinae > Kehishuvirus |
Host | No host information | | |
Coding sequence (CDS)
GenBank protein accession
AXQ62665.1 [NCBI]
GenBank nucleotide accession
MH675552 [NCBI]
CDS location
range 14369 -> 16867
strand +
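As a consistency check, the CDS range can be related to the protein length reported above. The sketch assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon; the function name is illustrative.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for an inclusive CDS range.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_len, nt_len // 3 - 1


nt_len, aa_len = cds_lengths(14369, 16867)
print(nt_len, aa_len)  # 2499 832
```

The result, 2499 nt and 832 residues, agrees with the protein length of 832 AA listed under the physico-chemical properties.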
CDS
ATGTTTTTTACACAAGAAGATTATAGAAAAATAGAGAAGTGGCTCCTTGCAAACAGTAGAAAGGATACTGATTTTGCTGGAGCTGCAACTCCTCTTAAAGGAAATGAAACTGTAGTTCTTGTACAGAATGGTAAGAATGTAAAGGCATCAGTGAAAGATGTAGTTGAGCAACTATTTCTTCTTGGTGTATCTGACTTTGTGAACATTACTGATAAGTATGGTGAGAGTTATATTTCACTATCTCAAGCTATAGAATTAATTCCTTATAGGAGTAGAAAGATTGGTCAGGTAGTTACTTTCTTGGATGATACTGGAAAATGGGCTATGTTCCAATTCCAAGGTACAAGGAAGAATCAGTGGGGTACTTTATCTCTTTGGGTTGATTTGATAGACCTTATGACAGGTCTTACTATCACAGATAGTGAAGATATTGTTACTGAGACTAATAGTGCAAATCAAGTAGCTCTTAAGTTTGCAGATAAAACTTATAATGAAGCTGACTACTCAGGTTTAGGCAGAGTATATCTTAGAAAGAATATTGTAAATGTTGAAGACCCTGTAACAGGTAATATAGTTAAGATGAATTACCTTACACAGTCAATGATTTCTAAAGAGAATACTATCTATATAGTGCAATATAGCTATAACTTGAATGGTCAAACTATCACTATCCCAAGTGGGTGTGTACTCAAGTTTGAAGGAGGGAGTATAAGTAATGGTTCTATAAAGGGCACAGATACTAATATAATAGCTCCTCAAATTAGAATATTTAATACTATCTTATTATCTGGTACTTGGAAAGTTAGAGATATATTTGATGATTGGTTTGACTTTAATGCTACTACCAATTTTGATAATATCAATAATTTCTATAATATATCCATATTACAATCAGATGATTTGGAGAATAATGTAATTCTTAAAGGGAACTATTATAGTTCTCTTAAAGATGGAATAGTTTTGTCCTTATCTTCTAATACCAATTTAGTACTTAATGGTAGTATTAGCTTGCTTCCTAATAACCTATCTTCATATTCTATAATAAAAGGAGTTGATAAGGAAAATATTAAAATAAGTGGAGGGGGAAGATTAATTGGAGATTTGCAAAATCACTTAGGTGATACTGGAGAATGGGGATTTGGTATTAGTTTCACAGGTTGTAAAACTGTATCAATAACTAATATTGATAGTTCTTATATGTGGGGAGATGGCTTATATATTGGGGCATCAGATGATACAAAAGAGGAGACACTATCTCAAAACATACAAGTAAATAATTGTAAATTTGAATATAATAGAAGACAAGGTATATCTATTACAGGAGCTGTTGATGTATTTGTTAACAATTGTTATTTCTTTAATACTGGTAAAATAAATGGAACAAGTCCAAAAGCTGGATTAGATATAGAACCAAGTGGATTATATAATAGTAATGTTACAATAAGTAATTGTATAGCAGATTCAAATGTATCTACTGGATTTTTGGTTTATGGGGATAATAGAAATATTGTAATTGATAATTGTGCATCTAAAAACCAATTAATATCTATAACTATAGCCCAAAGAAGTGCAACAACTGACAATGATGATGTATTTATAAGGCATGGAAATATAGGAGGTTCTCTACAAATCACAAGAGGTAATATAAGGGTAGAGGACTGTGAAGTAGATTCAGTTTATTTTACTGCTGATACATCAGGGATAGGAGCAAATGTAACTATTAGTAATTCAGTGATAGGAGCAAAAAGATGGGAAGGAAGTACATATTTCAATTCAGTTTTTTTAATTGATTCTAATAGCCAAATTAACAACTTATATATATTTGATTCCAAAATAGATTACGACCCATCAATTCTTACTCAAGGATTGTTCAATATAGGTGGGAATTCTATAAGAGATAATATCCTATTTGAAAATTGTGATATTTCACAGAAAAATACAGTAAATTCTCTAAATACAAAAGTAGGCTCTTATAGAAATTGTAGATTCTATAATATGAC
AAGAATATATTTGGCTAATGAACCAAATAAAACAGTTGAGTTTACAGATAATTACTGTGCCATGACAAGAGAAAGTTCAAGTACCAACATATTTAGTTTTCTAAATTCAAGTAGTACTCAAGATGTTATACTCTTTGTTGTAAGAAATAACACCTTTAGTACTAAGGGGTCTATAAATGTTGGAAGTATAGGACTTATAGATGTTACTGGAGTTAATCTTGGTAATAAAATGATATTTGAGAATAATCATTTCTTAACTCAATATCCTCTTACACAAGAGCAGATAATAAAAGCCCTAAGCAATAGGATAACAGTTGATACCAATTATAATTTTACCTCCAGATTTCCTTATAGAGCTACTTTAGAATCTTTACCTGCTTATAATACTTTTGATGCAGGGGCTTTAATATATGGGGATGATAATTTGTTATACTTTTGGAATGGTACAAATTTAACTAATTCAGAAGGTACAGATGCCAGAAAGGTAGTAATAGTCTAA
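The relationship between the CDS and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch that assumes the standard genetic code, which the record does not state explicitly; `translate` and the table names are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this phage).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 nucleotides of the CDS in this record.
print(translate("ATGTTTTTTACACAAGAAGATTAT"))  # MFFTQEDY
print(translate("GTAGTAATAGTCTAA"))           # VVIV
```

Both fragments match the corresponding ends of the protein sequence listed above, which starts with MFFTQEDY and ends with VVIV.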
Gene Ontology
GO ID | Description | Category | Evidence (source) |
---|---|---|---|
GO:0044423 | virion component | Cellular Component | IEA:UniProtKB-KW (UniProt) |
GO:0051701 | biological process involved in interaction with host | Biological Process | IEA:UniProtKB-ARBA (UniProt) |
GO:0019058 | viral life cycle | Biological Process | IEA:UniProtKB-ARBA (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Literature
Title | Authors | Date | PMID / DOI | Source |
---|---|---|---|---|
Structural atlas of a human gut crassvirus. | Bayfield OW, Shkoporov AN, Yutin N, Khokhlova EV, Smith JLR, Hawkins DEDP, Koonin EV, Hill C, Antson AA | 2023-05 | 37138077 | PubMed |
Structural atlas of a human gut crassvirus | Oliver W. Bayfield, Andrey N. Shkoporov, Natalya Yutin, Ekaterina V. Khokhlova, Jake L. R. Smith, Dorothy E. D. P. Hawkins, Eugene V. Koonin, Colin Hill, Alfred A. Antson | 2023-05-03 | 10.1038/s41586-023-06019-2 | DOI |