Protein
View in Explore- Genbank accession
- XLE97970.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
-
TSP
- Protein sequence
-
MSILQRPEYPENYTLDSNWVKVLPEDGQPLVAQDLLEMQSIVQGQFQTGMDTLYRDGTILSGVELVVVGNDDTTMDILITEGRVYAAGVVVKTKPARFQVSREGETTFYLEVTTTVLEDESMRAGNQYGPRGASRLVVNSSIVLTGRGYPLYSIRNGIPINRARDLPGGIEETLAERVFERHGNFCVRGLNLALLDRPRTLADTSLVTLNENYSTLEARATESRNLYLESKSRLDGLLLRLEDARDISSVSPTPANLTILADLETRVEEEEALVATLEATYRERQTAARGGLAELEEARRRSESSLGMSLAPGVAYVVGRRVVIDTPINLALQRTTDSQVVTAATFTYGGVTAIANRTISLQGASTWANVVTQDTKVSISFAPINNSTITVTANTSAATSVETFIDYIITRIATGTDSNSTITGTGITTDAARNLLRDAIAFRRNGPTLVMESIGIGTTSNLVNITIGITITGGAPSNRLGVDVPSGPIGGAASTNEFKLTRRPVKEVTRLVATLQENTAAIVRGTTPGTSDYLGRDTVSSVKRVFQGSINYTEGRDFQVLDGGRLEWAPNGTGALEPAPGTTYFVTYTYSSQLTLGVDFNLLTATDTIVFTGTRSPAPNTTFQVDYSYFLSRIAIVTLDKEGQPAVIYGEVGANPQPPAVSGSVLPLARVLISNNSAIIEPIDCRPVNYDSIRQLASAVSSLSDDIDRLRLTTRAEGLAFTNTGAVPNYTSIDALVDSSGINLTESTGMLSPLTNSLTSNRVYSDVRATAPSAKPNNAGDPYIVVPTYTESIFLEQTKLTKERSIHQTIAPRLYCRRVIMANRDLGRINPCDELAVRGATLFSSTSNSLYRFINEGNRAEFTRLSRRVREAISAGEAIPAIGANLMGSEEINKARAQNIRYTIRGEGLPQASYQLLIADTLMTTAVSINNTPISGTLPFAFRPRSNGILEVELFLPALPPGVHAVVLQSDTLSVSNTLSIFNNNLTHVALGGAASWGLPSSSIDTQPLPLMPRVGFDPLMQTFQAPSDMYLSGLDIKIASAPASGALIISLRDGTATTPGQILLGEALVNGAVLPDIQGRLWTKYVFPTPIYLKEDQYYTLGFRSTEGDWSVFTSEIGEVDILDAGLLIGQQLGINGNLWTSDGTIISNHEREDISMRLYRAVFPTTPISIDLGTYSSSMTAFAFNVRDIVPTGCTIDYQYKVGVNPNWISIAPNTPVCLDRVESILYLRAVSTGTAALAPILEVGTVSIYRNLSPTQHISNWQPILQTSTRFTVAITVLMPPSSTLQVRIQFSTGGWHVLSNPTTVILDAGLGLSRLTYEYVSPGPRSGELRWSIDAFGISTTDVPSIMEVVVYGTN
- Physico‐chemical
properties -
protein length: 1359 AA molecular weight: 146775,10020 Da isoelectric point: 5,07466 aromaticity: 0,06843 hydropathy: -0,03458
Domains
Domains [InterPro]
IPR032096
9–199
9–199
1
1359
Legend:
Pfam
SMART
CDD
TIGRFAM
HAMAP
SUPFAM
PRINTS
Gene3D
PANTHER
Other
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Microcystis phage MaMV-CH01 [NCBI] |
3378289 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
XLE97970.1
[NCBI]
Genbank nucleotide accession
PP681321
[NCBI]
CDS location
range 107017 -> 111096
strand +
strand +
CDS
ATGTCTATATTACAAAGACCAGAATACCCTGAGAATTATACCCTCGATAGTAATTGGGTGAAGGTACTACCAGAGGACGGCCAACCCTTAGTGGCACAGGACCTCCTAGAGATGCAATCAATAGTACAGGGCCAGTTCCAGACCGGTATGGATACGCTCTATAGAGATGGTACTATACTCTCGGGTGTAGAATTAGTCGTCGTAGGTAATGACGATACGACTATGGATATCCTTATAACTGAAGGCCGTGTATATGCTGCGGGCGTAGTAGTCAAGACTAAGCCCGCGCGCTTTCAGGTATCACGAGAAGGAGAGACTACATTTTATCTCGAGGTCACTACTACCGTACTCGAGGATGAGAGTATGCGGGCCGGCAATCAATACGGGCCCCGGGGGGCATCTAGATTAGTAGTTAATAGTAGTATAGTCCTTACGGGGCGAGGGTATCCCCTCTACTCTATACGCAATGGGATACCTATTAATAGAGCGAGGGACCTACCTGGTGGTATCGAGGAGACACTAGCAGAGAGAGTATTTGAGAGACACGGTAACTTCTGCGTGCGCGGGCTGAACCTAGCACTCCTGGACCGCCCCCGCACTCTAGCTGATACTAGTCTAGTGACGCTCAATGAGAACTATTCTACACTAGAGGCGCGGGCTACTGAGTCACGTAACCTATACCTAGAGAGTAAGAGTAGACTAGATGGATTACTACTGCGACTAGAGGACGCACGGGATATAAGTAGTGTCAGCCCCACACCCGCTAACCTGACTATACTCGCTGATCTTGAGACACGTGTAGAGGAAGAGGAGGCCCTGGTAGCTACACTAGAGGCTACCTACAGAGAGAGACAGACAGCAGCACGGGGCGGGCTGGCAGAATTAGAGGAGGCCCGACGTAGATCAGAGTCCTCACTTGGTATGTCACTGGCACCTGGAGTAGCCTACGTCGTGGGGCGTAGAGTAGTTATTGATACACCAATAAACCTAGCGCTACAGCGTACCACAGATAGTCAGGTAGTAACGGCGGCTACATTTACTTATGGAGGTGTAACTGCTATAGCCAACCGCACTATATCGCTACAGGGTGCCTCCACATGGGCTAATGTAGTAACACAGGATACCAAAGTAAGCATAAGCTTTGCCCCTATCAATAACTCGACTATAACTGTGACGGCTAATACTTCAGCAGCCACCTCAGTAGAGACCTTCATAGACTACATAATAACCCGTATAGCTACTGGCACTGATAGTAATAGTACCATTACTGGTACCGGCATTACTACGGATGCGGCTCGTAATCTACTCCGAGATGCCATAGCCTTCCGCCGTAATGGGCCAACACTCGTCATGGAGTCTATAGGCATAGGTACCACGTCTAACCTAGTGAACATTACCATAGGTATCACTATCACAGGAGGGGCCCCCAGTAATAGACTAGGCGTCGACGTGCCCAGCGGTCCCATTGGAGGTGCAGCCAGTACTAATGAGTTTAAGTTAACTCGCCGACCTGTCAAGGAGGTAACGCGACTAGTAGCTACGCTCCAGGAGAATACGGCTGCCATAGTGCGCGGGACTACACCTGGCACTAGTGACTACCTAGGGCGAGATACTGTATCTAGTGTGAAACGGGTATTCCAGGGCTCTATTAATTACACAGAGGGCCGGGACTTCCAGGTTCTCGATGGTGGTAGACTGGAGTGGGCTCCTAATGGTACAGGAGCTCTAGAACCGGCACCTGGTACTACCTACTTTGTGACGTATACGTACTCTAGTCAATTAACACTAGGAGTAGATTTCAATCTCCTGACGGCTACTGACACTATAGTATTTACTGGTACGCGGAGCCCCGCACCTAATACCACATTCCAGGTAGACTACAGCTATTTCCTCAGCCGCATAGCTATAGTAACGCTAGATAAGGAGGGGCAGCCCGCTGTCATATATGGTGAGGTAGGAGCTAATCCACAGCCTCCTGCCGTTAGTGGTTCAGTATTACCACTAGCACGTGTGCTTATAAGTAATAATAGCGCCATCATTGAGCCTATCGATTGTCGCCCTGTCAATTACGATAGTATACGGCAGCTAGCCAGCGCCGTATCATCGTTATCAGATGATATAGATAGACTACGACTCACAACTAGAGCAGAGGGGCTGGCTTTCACTAATACAGGGGCCGTACCCAACTATACATCTATAGATGCCCTAGTGGATAGTAGCGGCATTAACCTAACAGAGAGCACAGGTATGCTCTCGCCTCTAACTAATAGTCTGACATCCAATAGAGTATATAGCGATGTCAGAGCAACGGCCCCCTCGGCTAAACCTAATAACGCAGGAGACCCCTACATTGTAGTACCTACATATACAGAGAGTATCTTCCTCGAGCAAACTAAGTTGACGAAAGAGAGGAGTATACATCAGACTATAGCCCCGCGGTTATACTGCCGTAGAGTCATCATGGCTAATCGCGATCTAGGGCGGATTAACCCCTGCGACGAGCTGGCCGTCCGAGGAGCCACTCTATTCAGCTCCACTAGTAATTCCCTCTATCGATTTATCAATGAGGGGAATAGAGCAGAGTTCACGCGGCTATCTCGTCGTGTACGAGAGGCTATATCAGCAGGTGAGGCTATACCAGCTATAGGCGCTAATCTAATGGGGAGCGAGGAGATAAATAAGGCGCGGGCCCAGAACATACGATACACTATACGAGGAGAGGGGCTCCCCCAGGCCAGCTATCAACTACTGATAGCTGACACTCTCATGACTACTGCAGTATCAATTAATAACACACCAATATCGGGCACCCTACCATTCGCGTTCCGGCCCCGCTCTAATGGCATACTCGAGGTCGAGCTATTCCTACCCGCACTACCCCCCGGTGTCCATGCCGTAGTGCTACAGTCTGATACACTGAGTGTAAGTAATACGCTATCGATATTCAATAATAACCTGACCCACGTGGCGCTAGGCGGGGCGGCCTCCTGGGGCCTACCATCATCCTCAATAGATACTCAGCCCCTTCCCCTAATGCCGAGGGTAGGCTTCGACCCACTGATGCAGACATTTCAGGCACCCTCTGATATGTACCTGAGCGGCCTTGATATAAAAATAGCATCGGCGCCGGCATCAGGAGCCCTAATTATATCATTGCGGGATGGTACAGCTACTACACCAGGGCAGATACTACTAGGAGAGGCTCTTGTCAATGGAGCTGTATTGCCTGACATACAGGGGCGGCTCTGGACTAAATACGTATTCCCTACTCCTATCTATCTCAAGGAGGATCAGTATTATACACTAGGCTTCCGCAGTACAGAGGGAGACTGGAGCGTATTCACTAGTGAGATAGGAGAGGTCGATATACTAGATGCGGGGCTACTAATAGGACAGCAGCTAGGCATAAATGGTAACCTCTGGACTAGTGATGGTACTATCATCTCTAATCACGAGAGAGAGGATATCAGTATGCGGCTCTACCGCGCTGTATTCCCGACGACTCCGATTAGTATAGATCTAGGTACCTATAGCTCTAGTATGACGGCCTTCGCCTTCAATGTACGAGACATAGTACCAACAGGCTGCACAATAGACTACCAGTACAAGGTGGGCGTGAATCCTAACTGGATATCCATAGCGCCTAATACGCCTGTATGTCTAGATAGAGTAGAGTCTATACTCTATCTACGTGCAGTATCTACAGGTACGGCAGCCCTAGCTCCCATCTTAGAAGTAGGGACAGTATCCATCTATCGTAACCTGTCACCCACGCAGCATATATCTAACTGGCAGCCTATCCTGCAGACGAGTACGAGATTTACAGTAGCCATAACAGTGCTAATGCCGCCATCGAGTACGCTACAAGTAAGGATACAGTTCAGCACGGGGGGCTGGCACGTTCTAAGCAACCCAACCACAGTTATACTAGATGCAGGACTAGGACTATCTCGCCTAACTTATGAATATGTATCTCCCGGCCCGCGTAGTGGAGAGCTGAGGTGGTCTATAGACGCATTTGGAATCTCGACAACAGACGTGCCATCTATAATGGAGGTAGTGGTATATGGAACGAACTAA
Tertiary structure
PDB ID
58da6425a0e2601f32fc1ac82f52210dcce0bc8b497075858ae6587a118f738f
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Sequence analysis of four Microcystic aeruginosa myovirus strains | Ke,F., Zhang,Q., Liu,A. and Wang,R. | 2022-08-07 | — | GenBank |