UniProt accession
A0A0A0RVE3 [UniProt]
Protein name
Tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,75
TSP
Evidence RBPdetect2
Probability 0,90
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MPLIDVPIRQIGNEVVGEFVSSVSSVQEPFWFVENKTRNFLEITNLTTGILYVSVGTKSNVIVNAFESVRIENEYYAEFYVRAALGYGSFKARFAYFEYDEEDEKRLQDEIDKLEGRLDDLIKMTDYGAKGDGITDDSSAFALVESKYTNKIIDLQGFTYKVNALPFKNKYTNGKFNVGGTFTDASYSNVSRVNHGIVAFGLGAAGSAPYYPTYSGNDKFYKNIAIGGYALKNSFGSYNNIAIGWEAMPVAETGEYYNIAIGNESLWSLKRSATPSGFEATRNIAIGINGLRYLVKGHHNLAIGRNAAQCIINGTYNTIVGVNAHAGVAPLDLTGKIISYSDSDATENTALGNAALLNNVANENTAVGSYAATNVTKATRLTAIGRNALANLQKYITANGKDRTLWSKTGTYVWTGTKITVTMAAHGMLNGHLISLKLDTGDNLKTSEENQYVIQNVTTDTFDIIAPLTNNTNGTCSSTWYSNQNDNLSAYDNNTAVGHSAMENALKGQNNTVVGVWAGRALSGDSNVIVGVLAGTNLGASGTNTAIGYGALRYMQDGSNATVLQNATGIGYDSRVSGSNQIQLGNASTTTYAYGAVQARSDRRDKIDIEDSTLGLEFIEKIPVRQFRYNYRELYDNNDNSSREHAGKRIHEGVIAQEVKEVMDEMGIDFSGYQDHSLNGGSDVKTVGYQQFIMPLMNAVKELSAENKARKQKEEQQDAIISKLIDRIEKLERAVI
Physico‐chemical
properties
protein length:736 AA
molecular weight: 80272,38090 Da
isoelectric point:5,44503
aromaticity:0,09375
hydropathy:-0,33220

Domains

Domains [InterPro]
DC_1988
STR
122–345
IPR024429
ENZ
129–176
A0A0A0RVE3
1 736
Architecture
ATT
STR
ATT 1-181 | STR 182-736
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A0A0RVE3
1 736
Domain Start End Length (AA) Confidence
N-terminal 1 137 137 0,8947
Central domain 138 444 308 0,9730
C-terminal 445 736 291 0,5859
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-137
Central
138-444
C-terminal
445-736

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Pascal
[NCBI]
1540092 Uroviricota > Caudoviricetes > Pagevirus >
Host Bacillus megaterium
[NCBI]
1404 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AIW03648.1 [NCBI]
Genbank nucleotide accession
KM236247 [NCBI]
CDS location
range 10918 -> 13128
strand +
CDS
ATGCCATTAATTGATGTGCCTATTAGACAGATAGGGAATGAAGTTGTAGGGGAATTTGTTAGCAGTGTTTCATCAGTCCAGGAGCCGTTTTGGTTTGTTGAAAATAAAACCCGCAACTTTTTAGAGATTACAAACTTAACAACTGGAATCCTGTATGTTTCGGTGGGTACAAAATCAAACGTTATTGTTAATGCGTTTGAATCGGTGCGAATCGAAAATGAATACTATGCAGAGTTCTATGTAAGAGCAGCATTAGGCTATGGTTCTTTTAAAGCCCGCTTTGCTTATTTTGAATATGATGAAGAGGACGAAAAAAGGCTGCAGGATGAAATTGATAAGTTAGAAGGGCGCTTAGATGACTTAATTAAAATGACTGACTACGGCGCTAAGGGTGACGGTATAACGGATGATTCAAGCGCTTTTGCTCTTGTTGAATCGAAGTATACGAACAAGATTATTGATTTACAGGGATTCACTTATAAAGTCAATGCGTTGCCGTTTAAGAACAAATATACAAACGGAAAGTTCAATGTGGGCGGTACGTTTACAGATGCTTCATACAGCAATGTTTCGCGTGTCAATCATGGTATTGTGGCGTTCGGTCTTGGGGCGGCTGGTTCAGCTCCTTATTACCCAACGTATAGCGGCAATGATAAGTTTTATAAAAATATTGCTATCGGTGGTTATGCTTTAAAAAATAGCTTTGGTTCTTACAACAATATTGCTATCGGTTGGGAAGCTATGCCAGTAGCAGAAACAGGCGAATACTACAACATTGCCATTGGTAATGAATCTCTATGGAGCTTGAAACGTTCCGCTACTCCTTCGGGTTTTGAAGCAACACGTAATATTGCCATTGGTATTAATGGCTTGCGCTACCTAGTGAAAGGACATCATAACCTAGCGATTGGACGCAACGCCGCTCAGTGTATCATCAATGGAACATATAACACTATCGTAGGGGTTAATGCTCATGCTGGTGTAGCTCCATTGGATCTCACAGGAAAAATTATTAGTTACTCAGATTCAGACGCGACAGAAAATACAGCGCTAGGTAATGCTGCTCTTTTAAATAACGTAGCTAATGAAAATACGGCAGTTGGTTCTTATGCAGCTACAAACGTTACAAAGGCTACACGCTTAACAGCAATCGGCAGAAACGCGCTTGCTAATCTACAAAAATATATAACTGCTAATGGTAAAGACCGTACATTGTGGAGTAAAACAGGTACTTATGTTTGGACAGGTACAAAAATCACTGTAACAATGGCAGCTCATGGTATGTTAAACGGTCATTTGATTTCCTTGAAATTAGATACAGGTGATAACTTAAAAACATCAGAGGAAAATCAGTATGTGATTCAGAATGTAACAACAGATACATTTGACATTATCGCGCCATTAACGAACAACACAAACGGCACTTGCTCATCTACTTGGTACAGCAATCAGAATGACAACCTAAGCGCGTATGATAATAATACAGCAGTAGGTCACAGCGCTATGGAAAACGCGTTAAAAGGTCAAAATAATACAGTTGTCGGCGTGTGGGCTGGTCGTGCTCTATCAGGAGATTCAAACGTAATTGTTGGGGTTCTTGCTGGTACAAACTTAGGAGCAAGCGGAACAAATACAGCTATCGGTTATGGGGCTTTGCGTTACATGCAAGACGGCTCTAATGCTACGGTTCTACAGAACGCAACAGGCATTGGCTATGATTCCCGCGTAAGTGGCAGCAATCAGATACAACTAGGGAACGCTTCAACTACAACTTACGCTTATGGAGCTGTACAAGCCCGCTCAGATAGACGAGATAAGATTGATATTGAAGATAGTACACTAGGCTTAGAGTTCATTGAAAAGATTCCTGTTCGTCAATTCCGTTACAACTACCGCGAGCTGTACGATAACAACGACAATTCTAGCCGCGAACATGCAGGCAAACGAATCCATGAGGGTGTTATTGCTCAGGAAGTAAAAGAAGTTATGGACGAAATGGGAATTGATTTCTCAGGCTATCAAGACCATTCTTTAAACGGTGGTTCTGATGTTAAGACAGTGGGCTATCAGCAATTCATTATGCCGCTTATGAATGCAGTAAAAGAACTTTCTGCAGAGAATAAAGCGCGTAAACAAAAAGAAGAACAGCAGGACGCTATCATTTCAAAATTAATTGATCGCATTGAAAAACTAGAGAGAGCGGTGATTTAA

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
87a102314b31d0b932e04cf36261e41208da36df49b9f47d80529e2124b9ddc1
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7698
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
25635028 PubMed