Genbank accession
XLE98178.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence RBPdetect
Probability 0,75
Protein sequence
MSILQRPEYPENYTLDSNWVKVLPEDGQPLVAQDLLEMQSIVQGQFQTGMDTLYRDGTILSGVELVVVGDDGTTMDILITEGRVYAAGVVVKTKPARFQVSREGETTFYLEVTTTVLEDESMRAGNQYGPRGASRLVVNSSIVLTGRGYPLYSIRNGIPINRARDLPGGIEETLAERVFERHGNFCVRGLNLALLDRPRTLADTSLVTLNDNYSTLEARATESRNLYLESKSRLDGLLLRLEDARDISSVSPTPANLTILADLETRVEEEEALVATLEATYRERQTAARGGLAELEEARRRSESSLGMSLAPGVAYVVGRRVVIDTPINLALQRTTDSQVVTAATFTYGGVTAIANRTISLQGASTWANVVTQDTKVSISFAPINNSTITVTANTSAATSVETFIDYIITRIATGTDSNSTITGTGITTDAARNLLRDAIAFRRNGPTLVMESIGIGTTSNLVNITIGITITGGAPSSLLGVDVPSGPIGGAASTNEFKLTRRPVKEVTRLVATLQENTAAIVRGPTPGTSDYLGRDTVSSVKRVFQGSINYTEGRDFQVLDGGRLEWAPNGPGALEPAPGTTYFVMYTYSSQLTLGVDFNLLTATDTIVFTGTRSPAPNTTFQVDYSYFLSRIAIVTLDKEGQPAVIYGEVGANPQPPAVSGSVLPLARVLISNNSAIIEPIDCRPVNYDSIRQLASAVSSLSDDIDRLRLTTRAEGLAFTNTGAVPNYTSIDALVDSSGINLTESTGMLSPLTNSLTSNRVYSDVRATAPSAKPNNAGDPYIVVPTYTESIFLEQTKLTKERSIHQTIAPRLYCRRVIMANRDLGRINPCDELAVRGATLFSSTSNSLYRFINEGNRAEFTRLSRRVREAISAGEAIPSIGANLMGSEEINKARAQNIRYTIRGEGLPQASYQLLIADTLMTTAVSINNTPISGTLPFAFRPRSNGILEVELFLPALPPGVHAVVLQSDTLSVSNTLSIFNNNLTHVALGGAASWGLPSSSIDTQPLPLMPRVGFDPLMQTFQAPSDMYLSGLDIKIASAPASGALIISLRDGTATTPGQILLGEALVNGAVLPDIQGRLWTKYVFPTPIYLKEDQYYTLGFRSTEGDWSVFTSEIGEVDILDAGLLIGQQLGINGNLWTSDGTIISNHEREDISMRLYRAVFPTTPISIDLGTYSSSMTAFAFNVRDIVPTGCTIDYQYKVGVNPNWISIAPNTPVCLDRVESILYLRAVSTGTAALAPILEVGTVSIYRNLSPTQHISNWQPILQTSTRFTVAITVLMPPSSTLQVRIQFSTGGWHVLSNPTTVILDAGLGLSRLTYEYVSPGPRSGELRWSIDAFGISTTDVPSIMEVVVYGTN
Physico‐chemical
properties
protein length:1359 AA
molecular weight: 146672,08300 Da
isoelectric point:5,04602
aromaticity:0,06843
hydropathy:-0,02553

Domains

Domains [InterPro]
XLE98178.1
1 1359
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Microcystis phage MaMV-CH02
[NCBI]
3378290 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XLE98178.1 [NCBI]
Genbank nucleotide accession
PP681322 [NCBI]
CDS location
range 109017 -> 113096
strand +
CDS
ATGTCTATATTACAAAGACCAGAATACCCTGAGAATTATACCCTCGATAGTAATTGGGTGAAGGTACTACCAGAGGACGGCCAACCCTTAGTGGCGCAGGATCTCCTAGAGATGCAATCAATAGTGCAGGGCCAGTTCCAGACCGGTATGGATACGCTCTATAGAGATGGTACTATACTCTCGGGTGTAGAGTTAGTCGTCGTAGGTGATGACGGTACGACCATGGATATCCTTATAACTGAAGGCCGTGTATATGCTGCGGGTGTAGTAGTCAAGACTAAGCCCGCGCGCTTTCAGGTATCACGAGAAGGAGAGACTACATTTTATCTCGAGGTCACTACTACCGTACTCGAGGATGAGAGTATGCGGGCCGGCAATCAATACGGGCCCCGGGGGGCATCTCGATTAGTAGTTAATAGTAGTATAGTCCTTACGGGGCGAGGGTATCCCCTCTACTCTATACGCAATGGAATACCTATTAATAGAGCGAGGGACCTACCTGGTGGTATCGAGGAGACACTAGCAGAGAGAGTATTTGAGAGACACGGTAACTTCTGCGTGCGCGGGCTGAACCTAGCGCTCCTGGACCGCCCCCGCACTCTAGCTGATACTAGTCTAGTGACGCTCAATGATAACTATTCTACACTAGAGGCGCGGGCTACTGAGTCACGTAACCTATACCTAGAGAGTAAGAGTAGACTAGATGGATTACTACTGCGACTAGAGGACGCACGGGATATAAGTAGTGTCAGCCCCACACCCGCTAACCTGACTATACTCGCTGATCTTGAGACACGTGTAGAGGAAGAGGAGGCCCTGGTAGCTACACTAGAGGCTACCTACAGAGAGAGACAGACAGCAGCACGGGGCGGGCTGGCAGAATTAGAGGAGGCCCGACGTAGATCAGAGTCCTCACTTGGTATGTCACTGGCACCTGGAGTAGCCTACGTCGTGGGGCGTAGAGTAGTTATTGATACACCAATAAACCTAGCGCTACAGCGTACCACAGATAGTCAGGTAGTAACGGCGGCTACATTTACTTATGGAGGTGTAACTGCTATAGCCAACCGCACTATATCGCTACAGGGTGCCTCCACATGGGCTAATGTAGTAACACAGGATACCAAAGTAAGCATAAGCTTTGCCCCTATCAATAACTCGACTATAACTGTGACGGCTAATACTTCAGCAGCCACCTCAGTAGAAACCTTCATAGACTACATAATAACTCGCATAGCTACTGGTACTGATAGTAATAGTACTATTACCGGTACCGGCATTACTACGGATGCGGCTCGTAATCTACTCCGAGATGCCATAGCCTTCCGCCGTAATGGGCCAACACTCGTCATGGAGTCTATAGGCATAGGTACCACGTCTAACCTAGTGAACATTACCATAGGTATCACTATCACAGGAGGGGCCCCCAGTAGTTTACTAGGCGTTGACGTGCCCAGCGGTCCCATTGGAGGTGCGGCCAGTACTAATGAGTTTAAGTTAACTCGCCGACCCGTCAAGGAGGTAACGCGACTAGTAGCTACGCTCCAGGAGAATACGGCTGCCATAGTGCGCGGGCCTACACCTGGCACTAGTGACTACCTAGGGCGGGATACTGTATCTAGTGTGAAACGGGTATTCCAGGGCTCTATTAATTACACAGAGGGCCGGGACTTCCAGGTTCTCGATGGTGGTAGACTGGAGTGGGCTCCTAATGGCCCGGGGGCTCTAGAGCCGGCACCTGGTACTACCTACTTTGTGATGTATACGTACTCTAGTCAATTAACACTAGGAGTAGATTTCAATCTCCTGACGGCTACTGACACTATAGTATTTACTGGTACACGGAGCCCCGCACCTAATACCACATTCCAGGTAGACTATAGCTATTTCCTCAGCCGCATAGCTATAGTAACGCTAGATAAGGAGGGGCAGCCCGCTGTCATATATGGTGAGGTAGGAGCTAATCCACAGCCTCCTGCCGTTAGTGGTTCAGTATTACCACTAGCACGTGTGCTTATAAGTAATAATAGCGCCATCATTGAGCCTATCGATTGTCGCCCTGTCAATTACGATAGTATACGGCAGCTAGCCAGCGCCGTATCATCGTTATCAGATGATATAGATAGACTACGACTCACAACTAGAGCAGAGGGGCTGGCTTTCACTAATACAGGGGCCGTACCCAACTATACATCTATAGATGCCCTAGTGGATAGTAGCGGCATTAACCTAACAGAGAGCACAGGTATGCTCTCGCCTCTAACTAATAGTCTGACATCCAATAGAGTATATAGCGATGTCAGAGCAACGGCCCCCTCGGCTAAACCTAATAACGCAGGAGACCCCTACATTGTAGTACCTACATATACAGAGAGTATCTTCCTCGAGCAAACTAAGTTGACGAAAGAAAGGAGTATACATCAGACTATAGCCCCGCGGTTATACTGCCGTAGAGTCATCATGGCTAATCGCGATCTAGGGCGGATTAACCCCTGCGACGAGCTGGCCGTCCGAGGGGCCACTCTATTCAGCTCCACTAGTAATTCCCTCTATCGATTTATCAATGAGGGGAATAGAGCAGAGTTCACGCGGCTATCTCGTCGTGTACGAGAGGCCATATCAGCAGGTGAGGCTATACCGTCTATAGGCGCTAATCTAATGGGGAGCGAGGAGATAAATAAGGCGCGGGCCCAGAACATACGATACACTATACGAGGAGAGGGGCTCCCCCAGGCCAGCTATCAACTACTGATAGCTGACACTCTCATGACTACTGCAGTATCAATTAATAACACACCAATATCGGGCACCCTACCATTCGCGTTCCGGCCCCGCTCTAATGGCATACTCGAGGTCGAGCTATTCCTACCCGCACTACCCCCCGGTGTCCATGCCGTAGTGCTACAGTCTGATACACTGAGTGTAAGTAATACGCTATCGATATTCAATAATAACCTGACCCACGTGGCGCTAGGCGGGGCGGCCTCCTGGGGCCTACCATCATCCTCAATAGATACTCAGCCCCTCCCCCTGATGCCGAGGGTAGGCTTCGACCCACTGATGCAGACATTTCAGGCACCCTCTGATATGTACCTGAGCGGCCTTGATATAAAAATAGCATCGGCGCCGGCATCAGGAGCCCTAATTATATCATTGCGGGATGGTACAGCTACTACACCAGGGCAGATACTACTAGGAGAGGCTCTTGTCAATGGAGCTGTATTGCCTGACATACAGGGGCGGCTCTGGACTAAATACGTATTCCCTACTCCTATCTATCTCAAGGAGGATCAGTATTATACACTAGGCTTCCGCAGTACAGAGGGAGACTGGAGCGTATTCACTAGTGAGATAGGAGAGGTCGATATACTAGATGCGGGGCTACTAATAGGACAGCAGCTAGGCATAAATGGTAACCTCTGGACTAGTGATGGTACTATCATCTCTAATCACGAGAGAGAGGATATCAGTATGCGGCTCTACCGCGCTGTATTCCCGACGACTCCGATTAGTATAGATCTAGGTACCTATAGCTCTAGTATGACGGCCTTCGCCTTCAATGTACGAGACATAGTACCAACAGGCTGCACAATAGACTACCAGTACAAGGTGGGCGTGAATCCTAACTGGATATCCATAGCGCCTAATACGCCTGTATGTCTAGATAGAGTAGAGTCTATACTCTATCTACGTGCAGTATCTACAGGTACGGCAGCCCTAGCTCCCATCTTAGAAGTAGGGACAGTATCCATCTATCGTAACCTGTCACCCACGCAGCATATATCTAACTGGCAGCCTATCCTGCAGACGAGTACGAGATTTACAGTAGCCATAACAGTGCTAATGCCGCCATCGAGTACGCTACAAGTAAGGATACAGTTCAGCACGGGGGGCTGGCACGTTCTAAGCAACCCAACCACAGTTATACTAGATGCAGGACTAGGACTATCTCGCCTAACTTATGAATATGTATCTCCCGGCCCGCGTAGTGGAGAGCTGAGGTGGTCTATAGACGCATTTGGAATCTCGACAACAGACGTGCCATCTATAATGGAGGTAGTGGTATATGGAACGAACTAA

Tertiary structure

PDB ID
de01ce9388a4ed8fb6bfc1ffa3a089a44b0d370016c976c7154b3ea52d112ad4
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6166
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Sequence analysis of four Microcystic aeruginosa myovirus strains Ke,F., Zhang,Q., Liu,A. and Wang,R. 2022-08-07 GenBank