Genbank accession
WGJ78563.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MTTTYAPTNVITENWGRLQIVIDNKDRTFWRGVPTIVQGWTSSEPFDDKTLTLRFPGITSFEDRDALPFTDFNNIEIWLIDENNQRVKSLWDGFIASETDDLVADDNGLTVECLGALYQADLFLKVPSFQLGNKDSAFAIAEELNLRAKYYGLRLNQMNWLAISSTPTRSTGSWNPLLTGWAQELLANSYSPAYMQDGERAVALQVDKQFGGYEILGDYFSVLTFGPRMPNYGSGTWWNATFIHLFEDDGFYCSDIYMNPTTRRQFTLTRGGTIDIRDDFNDISAQWQLWKGDATAGLGKKTSVPIDRQWMAIHSIDDDTGYRVVNASGEVRCFGNATHHGHYPALGQRRFADGGLLTVDFIVDMVRTKSGNGYWLLSWGGRVFAYGDATPFANFPLIDTLYTAIEVDKNGTGIWALDAKGRVQTRGTAVNYGSIGPNIPSPATGNPYTIHPDEFAMDISRSKDGNGYVILGHWGGIFTKGDAQFYRSGVFVESQNSGGNVTQWTLMKGRGRTPIARTKNTWTTHHTCTVGTPGITHSLTRDRTQMPNVFYGEGVDPTGCKWRNTKYPNFNVGNSVPPLWPGYFITVGLNDRSPGVATWQSRMRQNGWPITVDGIYDNYDMDVCKFMQNAAGITVDGIIGPQTWATTFEPGANAGNLNSAYIAPLTIDNRVEPFRYNANGATIGTNPSFDKKIMRIESYTNYGDKSSKREATISAVNDLVRNRDPGYYGSMTFSVDPETTSRFEIKAGENIEYKGYRGEDILLHVTEASIDFDNMTTTVTVDTKARDNETVAAMIERDRTVGEPVGVPTRPSNSSSRQTEDKVIFDCESAAGFVPFFYVAGGLWSVVRIPMGEVGSISKTEFIAQTPDAKFSMAVFDRIVHPSFLVRVGGGNPGVDENYWRDFDEDRGLMQAWGSSGDMCGYYPAQEGDDDATWTGRLVDGSEWKFWSSEPPWLWVAFWAEKTTIIGGRFWPGGDSGFNFAGSDAIANPMEIHPTGRPSSAYFL
Physico‐chemical
properties
protein length:1004 AA
molecular weight: 111613,99610 Da
isoelectric point:4,96366
aromaticity:0,12450
hydropathy:-0,39751

Domains

Domains [InterPro]
IPR036365
RBD
582–645
IPR036366
ATT
585–650
IPR036366
ATT
586–648
IPR002477
CBM
593–645
WGJ78563.1
1 1004
Architecture
ATT
ATT 585-650 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WGJ78563.1
1 1004
Domain Start End Length (AA) Confidence
N-terminal 1 406 406 0,4290
Central domain 407 605 200 0,6662
C-terminal 606 1004 398 0,2329
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-406
Central
407-605
C-terminal
606-1004

Taxonomy

  Name Taxonomy ID Lineage
Phage Microcystis phage Mel-Yong916-1
[NCBI]
3038322 Viruses >
Host Microcystis elabens
[NCBI]
44824 cellular organisms > Bacteria > Bacillati > Cyanobacteriota/Melainabacteria group > Cyanobacteriota > Cyanophyceae

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WGJ78563.1 [NCBI]
Genbank nucleotide accession
OQ560327.1 [NCBI]
CDS location
range 137622 -> 140636
strand +
CDS
ATGACAACCACCTACGCACCGACCAACGTCATCACTGAAAACTGGGGACGACTCCAGATCGTGATCGACAACAAGGACCGCACCTTCTGGCGCGGTGTTCCAACGATTGTGCAGGGGTGGACCTCGTCCGAGCCGTTTGACGACAAGACTCTCACGCTTCGGTTCCCAGGCATTACGTCTTTCGAAGATCGCGACGCCCTGCCGTTCACTGACTTCAACAACATTGAGATCTGGTTGATTGATGAGAACAACCAGAGGGTTAAGTCTCTTTGGGACGGCTTCATCGCATCCGAGACTGACGACCTAGTGGCGGATGACAATGGACTGACGGTCGAGTGCCTAGGTGCACTCTACCAAGCTGACCTATTTCTCAAGGTGCCAAGCTTTCAGCTTGGCAACAAGGATTCCGCCTTTGCCATCGCTGAAGAGCTGAACCTGCGAGCCAAGTACTACGGCTTGAGGCTGAACCAGATGAACTGGCTGGCTATCTCGTCTACCCCAACGCGCTCCACTGGGTCCTGGAACCCATTGCTCACGGGTTGGGCGCAGGAGCTGCTTGCCAACTCTTACTCACCCGCCTACATGCAGGATGGCGAGCGAGCAGTGGCCCTCCAGGTAGACAAGCAGTTTGGTGGATACGAGATCCTGGGTGACTACTTCTCCGTCCTTACGTTCGGCCCAAGGATGCCGAACTATGGATCCGGAACTTGGTGGAACGCCACATTCATCCACCTTTTCGAGGACGATGGCTTCTACTGCTCCGACATCTACATGAACCCCACCACTCGCAGGCAGTTTACTCTGACTCGCGGCGGGACCATCGACATTCGCGACGACTTCAACGACATCTCCGCCCAGTGGCAACTCTGGAAGGGTGACGCTACCGCTGGCCTCGGCAAGAAGACTAGCGTGCCGATCGACCGTCAGTGGATGGCCATTCACTCGATTGACGACGACACTGGTTATCGAGTTGTCAATGCGTCGGGTGAGGTTAGATGCTTCGGTAACGCTACACACCACGGGCATTACCCGGCGCTTGGCCAGCGTCGCTTTGCTGACGGTGGTCTCCTCACGGTTGACTTCATCGTTGACATGGTTCGCACCAAGTCCGGAAACGGCTACTGGCTATTGAGTTGGGGCGGAAGAGTCTTCGCGTACGGGGACGCGACACCATTCGCCAACTTCCCATTGATCGACACGCTCTACACGGCAATCGAGGTAGACAAGAACGGAACTGGAATCTGGGCGCTCGACGCAAAGGGGCGCGTCCAGACCCGTGGAACTGCCGTGAACTACGGTTCCATTGGACCCAACATCCCATCTCCAGCGACCGGCAACCCCTACACGATCCACCCCGACGAGTTTGCCATGGACATCTCTCGCTCCAAGGACGGCAATGGCTACGTGATCCTTGGTCACTGGGGTGGCATCTTCACGAAGGGTGACGCTCAGTTCTACCGTTCCGGCGTGTTCGTTGAGTCTCAGAACTCTGGCGGCAACGTGACCCAGTGGACCCTAATGAAGGGACGTGGTCGTACTCCGATCGCGAGGACCAAGAACACTTGGACAACACACCACACCTGCACCGTTGGCACCCCAGGCATCACCCACTCCCTTACCAGGGATAGGACGCAGATGCCGAACGTCTTCTACGGCGAGGGCGTTGATCCAACCGGCTGCAAGTGGAGAAACACGAAGTACCCAAACTTCAATGTAGGCAACTCCGTGCCACCACTATGGCCCGGCTACTTCATCACGGTTGGCCTCAACGACCGGTCTCCCGGCGTCGCAACTTGGCAGTCCAGGATGCGCCAGAATGGTTGGCCGATCACGGTCGATGGCATCTACGACAACTATGACATGGACGTCTGCAAGTTCATGCAGAACGCCGCTGGCATTACCGTCGATGGCATCATTGGCCCTCAGACCTGGGCGACCACGTTCGAACCTGGCGCCAACGCTGGAAACTTGAACAGCGCATACATCGCCCCCTTGACCATTGACAATCGCGTTGAGCCGTTCAGGTACAACGCCAATGGAGCCACAATTGGGACCAACCCCTCCTTCGACAAGAAGATCATGCGCATCGAGTCGTACACCAACTACGGTGACAAGTCTTCGAAGCGAGAGGCGACCATCTCGGCAGTCAATGATCTTGTCAGAAATCGTGATCCCGGCTACTACGGATCGATGACATTCTCGGTTGATCCGGAGACAACCTCCCGCTTCGAGATCAAGGCTGGGGAGAACATTGAGTACAAGGGGTACCGCGGCGAGGACATCCTTCTCCACGTAACCGAGGCATCGATCGACTTCGACAACATGACGACCACGGTCACCGTGGACACCAAGGCACGCGACAATGAGACCGTGGCGGCAATGATCGAGCGCGATCGAACTGTGGGTGAGCCGGTCGGAGTCCCGACGCGGCCGTCCAACTCGTCCAGTAGGCAGACTGAGGACAAGGTCATCTTCGACTGTGAGTCGGCTGCCGGGTTCGTTCCATTCTTCTACGTTGCCGGTGGCCTCTGGAGCGTCGTCAGAATCCCCATGGGTGAGGTTGGATCAATCTCCAAGACGGAGTTCATTGCTCAGACACCTGATGCCAAGTTCTCCATGGCGGTCTTCGACAGGATTGTCCACCCAAGCTTCCTGGTGCGCGTTGGTGGCGGTAATCCTGGCGTAGACGAGAACTACTGGAGGGACTTCGACGAAGATCGAGGCCTCATGCAAGCCTGGGGATCCAGTGGCGACATGTGTGGTTACTACCCAGCCCAGGAAGGGGACGACGATGCCACTTGGACCGGACGTCTCGTCGATGGTAGCGAGTGGAAGTTCTGGTCAAGCGAGCCACCATGGCTCTGGGTAGCGTTCTGGGCAGAGAAGACGACCATCATTGGTGGACGCTTCTGGCCAGGAGGCGACTCTGGCTTCAACTTCGCTGGCTCTGACGCCATCGCGAACCCAATGGAGATCCACCCAACCGGCAGGCCTAGCAGCGCCTATTTCTTGTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
91241005790492ac6ab820c34882ae78f462966d745ac998a2bdede449314943
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,2937
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50