UniProt accession: M1IEC9
Protein name: Putative tailspike, beta-helical glycoside
RBP type
Type   Evidence     Probability
TSP    GenBank      1.00
TF     Phold        1.00
TSP    DepoScope    1.00
TSP    RBPdetect    0.91
TSP    RBPdetect2   0.95
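The per-source calls can be condensed into a majority label. The sketch below is illustrative only: it assumes each type label belongs to the evidence line that follows it, and the helper name `summarize` is hypothetical, not part of any of the listed tools.

```python
from collections import Counter

# (source, predicted type, probability) as listed in this entry.
# Pairing each type with the evidence line after it is an assumption.
calls = [
    ("GenBank", "TSP", 1.00),
    ("Phold", "TF", 1.00),
    ("DepoScope", "TSP", 1.00),
    ("RBPdetect", "TSP", 0.91),
    ("RBPdetect2", "TSP", 0.95),
]


def summarize(calls):
    """Return (majority label, vote count, mean probability of those votes)."""
    votes = Counter(label for _, label, _ in calls)
    label, n = votes.most_common(1)[0]
    probs = [p for _, lab, p in calls if lab == label]
    return label, n, sum(probs) / len(probs)


label, n, mean_p = summarize(calls)
print(f"{label}: {n}/{len(calls)} sources, mean probability {mean_p:.3f}")
```

A plain majority vote ignores how differently the tools are calibrated, so it is a summary of this table, not a classification method.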
Protein sequence
MGAKTSNYGFSLWDTGDRIEVTANEQTANWNALEDWIVKRNGYLVNAQGGYRLGSNGKPLVGDATNATLDDQPAIQAALTKCKDDGGGIVFLPAGNYALKNTCVIYSNTTFIMHPQARIYRNKANIGAFFRNGETGASYTGYNGHGNIHVIGGYLDGNIENYDYRFNFFSFGHGYNITFENIYMKDTQTYHAIEINSTNKAVFKNLTCDGYSLDSEFQQTTPNRKTEAIQIDGMYGQDVFGGFGAYDNTPCNDITIESCTFRNWNRGVGSHSSATGAHHRNIRVVNNHFENIEDIAVVSCMWDNAVIDSNTFQTVGGGVWLRVKDSTDATFGYVISNNTFRTVTLGTSARHAIRVSGQDDADTAFKLRTVSITGNTIENTTDTSIYVDRVWRTTITGNTINNATTSGIYVTGCEFGSISANTIISCGSYGIGLSGSNWFAINGNMVSNTDLSGIYLSNSKECAITGNTTRRNGIAGGDNQGIRLVSNSDGCTVTGNSHVSGSGEADRAILCSGSTKKNVTVGNNGGGKAITNNSTDGVSSGNL
Physico-chemical properties
protein length: 543 AA
molecular weight: 58382.41170 Da
isoelectric point: 5.77827
aromaticity: 0.09576
hydropathy: -0.36832
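The page does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), could look like this. The function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Toy input: the first ten residues of the protein sequence in this entry.
toy = "MGAKTSNYGF"
print(round(aromaticity(toy), 5), round(gravy(toy), 5))
```

Under this definition, the listed aromaticity is what 52 aromatic residues out of 543 would give (52 / 543 ≈ 0.09576). Passing the full protein sequence to both functions would test whether the assumed definitions match the listed values.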

Domains

Domains [InterPro]
InterPro ID   Class      Range
IPR012334     STR        61–414
IPR007742     CBM        368–498
IPR011050     STR        368–542
IPR022441     Unmapped   385–427
Architecture
ATT 1-70 | STR 71-542

Tail Spike Domain Segmentation


This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain           Start   End   Length (AA)   Confidence
N-terminal       1       56    56            0.9517
Central domain   57      532   477           0.9881
C-terminal       533     543   10            0.2541
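The Length column can be cross-checked against the coordinates. The sketch below assumes Start and End are 1-based and inclusive, which the page does not state. The helper name `check_lengths` is illustrative.

```python
# (domain, start, end, listed length) copied from the segmentation table.
segments = [
    ("N-terminal", 1, 56, 56),
    ("Central domain", 57, 532, 477),
    ("C-terminal", 533, 543, 10),
]


def check_lengths(segments):
    """Return (name, listed length, length computed as end - start + 1)."""
    return [(name, listed, end - start + 1) for name, start, end, listed in segments]


for name, listed, computed in check_lengths(segments):
    flag = "ok" if listed == computed else "differs"
    print(f"{name}: listed {listed}, computed {computed} ({flag})")
```

Under the inclusive-coordinate assumption, the Central and C-terminal rows each differ from their listed length by one residue, while the listed lengths still sum to 543. This may mean the boundary between those two domains is reported under a different convention.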
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).


Taxonomy

        Name                   Taxonomy ID   Lineage
Phage   Bacillus phage Finn    2884419       Uroviricota > Caudoviricetes > Ehrlichviridae > Andromedavirus finn
Host    Bacillus pumilus BL8   1189615       Bacillota > Bacilli > Caryophanales > Bacillaceae > Bacillus > Bacillus pumilus

Coding sequence (CDS)

GenBank protein accession: AGE61019.1
GenBank nucleotide accession: KC330683
CDS location: range 19760 -> 21391, strand +
CDS
ATGGGAGCAAAAACAAGTAATTATGGGTTTAGTTTATGGGATACGGGTGATAGAATCGAAGTAACAGCCAATGAGCAAACAGCCAATTGGAATGCCTTAGAAGATTGGATCGTAAAAAGAAATGGTTATTTAGTAAACGCACAAGGCGGTTACCGACTAGGTTCAAATGGAAAGCCCCTTGTAGGTGACGCTACTAACGCAACCCTTGATGATCAGCCAGCAATCCAAGCGGCACTAACTAAGTGTAAAGATGATGGAGGCGGCATCGTGTTCCTCCCAGCTGGTAATTATGCTTTGAAGAATACTTGTGTGATTTACTCAAACACAACATTCATCATGCACCCCCAAGCAAGGATATACAGAAACAAAGCAAACATCGGAGCATTCTTCCGTAATGGTGAAACAGGAGCTAGTTATACAGGTTATAACGGTCACGGAAACATTCATGTAATTGGTGGCTACCTCGATGGGAACATTGAGAACTATGATTATAGATTCAATTTCTTCTCGTTTGGTCATGGATATAACATTACATTCGAGAACATCTATATGAAGGACACACAGACTTATCATGCTATTGAGATTAACTCAACAAACAAAGCTGTATTTAAAAACCTTACATGCGATGGTTATTCCTTAGATAGCGAGTTCCAACAAACTACACCTAACAGAAAAACAGAAGCAATTCAAATTGATGGTATGTATGGTCAAGATGTATTCGGAGGTTTTGGAGCATATGACAATACGCCTTGCAATGATATCACTATTGAGTCTTGTACTTTTAGAAATTGGAACCGTGGAGTAGGCTCTCACTCATCCGCAACAGGAGCGCATCACAGAAATATCAGGGTTGTCAATAACCACTTTGAAAACATCGAGGATATCGCTGTAGTTAGCTGTATGTGGGATAACGCTGTAATTGACTCCAATACCTTCCAGACTGTTGGCGGTGGTGTATGGCTTCGAGTGAAGGACTCAACAGATGCAACCTTTGGATATGTCATCTCGAACAACACTTTCAGAACTGTCACGCTAGGAACGTCAGCAAGACACGCCATTCGAGTAAGTGGTCAAGATGATGCAGACACGGCATTTAAACTCCGCACAGTTTCCATCACAGGCAACACAATTGAAAACACAACGGACACATCTATTTATGTAGACCGAGTGTGGCGCACAACCATCACAGGAAACACAATCAACAATGCCACTACAAGTGGAATATATGTTACTGGTTGTGAGTTTGGGTCTATTTCAGCGAATACTATTATCTCTTGTGGCAGTTACGGAATCGGGCTCAGTGGCTCTAATTGGTTTGCCATCAATGGAAACATGGTGAGTAATACAGACCTTTCAGGAATCTACTTGTCAAACTCTAAGGAATGTGCAATCACAGGCAACACAACACGGAGAAATGGCATTGCTGGTGGAGACAATCAGGGGATTAGACTTGTATCAAATTCTGATGGTTGTACTGTCACGGGTAATAGTCATGTATCTGGTAGTGGTGAAGCTGATAGGGCTATACTATGTAGTGGCTCCACAAAGAAAAACGTCACAGTAGGAAACAACGGCGGTGGCAAGGCAATAACAAACAATAGCACTGACGGTGTATCTTCTGGTAACTTATAA
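As a sanity check, the CDS range should span three nucleotides per residue plus one stop codon. The sketch below assumes the range is 1-based and inclusive, as in GenBank feature locations. The function name is illustrative.

```python
def cds_span_matches(start: int, end: int, protein_length: int) -> bool:
    """True if an inclusive nucleotide range holds protein_length codons plus a stop codon."""
    span = end - start + 1
    return span == 3 * (protein_length + 1)


span = 21391 - 19760 + 1
print(span, cds_span_matches(19760, 21391, 543))
```

The range spans 1632 nucleotides, which is 544 codons: 543 residues plus the terminal TAA stop codon seen at the end of the CDS.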

Genome Context


Gene Ontology

GO ID        Description                                            Category             Evidence (source)
GO:0044423   virion component                                       Cellular Component   IEA:UniProtKB-KW (UniProt)
GO:0051701   biological process involved in interaction with host   Biological Process   IEA:UniProtKB-ARBA (UniProt)
GO:0019058   viral life cycle                                       Biological Process   IEA:UniProtKB-ARBA (UniProt)

Tertiary structure

PDB ID: 8faf1cc88e086be637729e9fa68e66741ac26e23b4284a9856c4772cb295e089
Source: ESMFold
Method: ESMFold
Resolution: 0.8472
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
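The confidence bands can be expressed as a small lookup. This is a sketch only: the legend does not say which band the exact boundary values 90, 70 and 50 fall into, so placing them in the lower band is an assumption, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the confidence band in the legend.

    Boundary values (exactly 90, 70 or 50) go to the lower band; this is an
    assumption, since the legend uses strict inequalities throughout.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```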