Genbank accession
XLE97970.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence RBPdetect
Probability 0,74
Protein sequence
MSILQRPEYPENYTLDSNWVKVLPEDGQPLVAQDLLEMQSIVQGQFQTGMDTLYRDGTILSGVELVVVGNDDTTMDILITEGRVYAAGVVVKTKPARFQVSREGETTFYLEVTTTVLEDESMRAGNQYGPRGASRLVVNSSIVLTGRGYPLYSIRNGIPINRARDLPGGIEETLAERVFERHGNFCVRGLNLALLDRPRTLADTSLVTLNENYSTLEARATESRNLYLESKSRLDGLLLRLEDARDISSVSPTPANLTILADLETRVEEEEALVATLEATYRERQTAARGGLAELEEARRRSESSLGMSLAPGVAYVVGRRVVIDTPINLALQRTTDSQVVTAATFTYGGVTAIANRTISLQGASTWANVVTQDTKVSISFAPINNSTITVTANTSAATSVETFIDYIITRIATGTDSNSTITGTGITTDAARNLLRDAIAFRRNGPTLVMESIGIGTTSNLVNITIGITITGGAPSNRLGVDVPSGPIGGAASTNEFKLTRRPVKEVTRLVATLQENTAAIVRGTTPGTSDYLGRDTVSSVKRVFQGSINYTEGRDFQVLDGGRLEWAPNGTGALEPAPGTTYFVTYTYSSQLTLGVDFNLLTATDTIVFTGTRSPAPNTTFQVDYSYFLSRIAIVTLDKEGQPAVIYGEVGANPQPPAVSGSVLPLARVLISNNSAIIEPIDCRPVNYDSIRQLASAVSSLSDDIDRLRLTTRAEGLAFTNTGAVPNYTSIDALVDSSGINLTESTGMLSPLTNSLTSNRVYSDVRATAPSAKPNNAGDPYIVVPTYTESIFLEQTKLTKERSIHQTIAPRLYCRRVIMANRDLGRINPCDELAVRGATLFSSTSNSLYRFINEGNRAEFTRLSRRVREAISAGEAIPAIGANLMGSEEINKARAQNIRYTIRGEGLPQASYQLLIADTLMTTAVSINNTPISGTLPFAFRPRSNGILEVELFLPALPPGVHAVVLQSDTLSVSNTLSIFNNNLTHVALGGAASWGLPSSSIDTQPLPLMPRVGFDPLMQTFQAPSDMYLSGLDIKIASAPASGALIISLRDGTATTPGQILLGEALVNGAVLPDIQGRLWTKYVFPTPIYLKEDQYYTLGFRSTEGDWSVFTSEIGEVDILDAGLLIGQQLGINGNLWTSDGTIISNHEREDISMRLYRAVFPTTPISIDLGTYSSSMTAFAFNVRDIVPTGCTIDYQYKVGVNPNWISIAPNTPVCLDRVESILYLRAVSTGTAALAPILEVGTVSIYRNLSPTQHISNWQPILQTSTRFTVAITVLMPPSSTLQVRIQFSTGGWHVLSNPTTVILDAGLGLSRLTYEYVSPGPRSGELRWSIDAFGISTTDVPSIMEVVVYGTN
Physico‐chemical
properties
protein length:1359 AA
molecular weight: 146775,10020 Da
isoelectric point:5,07466
aromaticity:0,06843
hydropathy:-0,03458

Domains

View on InterPro
XLE97970.1
1 1359 aa
STR 9–199 · STR 493–722 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

XLE97970.1
1 1359 aa
Domain Start End Length (AA) Confidence
N-terminal 1 363 363 0,9933
Central domain 364 570 208 0,4555
C-terminal 571 1359 788 0,0129
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
XLE97970.1 [NCBI]
Genbank nucleotide accession
PP681321 [NCBI]
CDS location
range 107017 -> 111096
strand +
CDS
ATGTCTATATTACAAAGACCAGAATACCCTGAGAATTATACCCTCGATAGTAATTGGGTGAAGGTACTACCAGAGGACGGCCAACCCTTAGTGGCACAGGACCTCCTAGAGATGCAATCAATAGTACAGGGCCAGTTCCAGACCGGTATGGATACGCTCTATAGAGATGGTACTATACTCTCGGGTGTAGAATTAGTCGTCGTAGGTAATGACGATACGACTATGGATATCCTTATAACTGAAGGCCGTGTATATGCTGCGGGCGTAGTAGTCAAGACTAAGCCCGCGCGCTTTCAGGTATCACGAGAAGGAGAGACTACATTTTATCTCGAGGTCACTACTACCGTACTCGAGGATGAGAGTATGCGGGCCGGCAATCAATACGGGCCCCGGGGGGCATCTAGATTAGTAGTTAATAGTAGTATAGTCCTTACGGGGCGAGGGTATCCCCTCTACTCTATACGCAATGGGATACCTATTAATAGAGCGAGGGACCTACCTGGTGGTATCGAGGAGACACTAGCAGAGAGAGTATTTGAGAGACACGGTAACTTCTGCGTGCGCGGGCTGAACCTAGCACTCCTGGACCGCCCCCGCACTCTAGCTGATACTAGTCTAGTGACGCTCAATGAGAACTATTCTACACTAGAGGCGCGGGCTACTGAGTCACGTAACCTATACCTAGAGAGTAAGAGTAGACTAGATGGATTACTACTGCGACTAGAGGACGCACGGGATATAAGTAGTGTCAGCCCCACACCCGCTAACCTGACTATACTCGCTGATCTTGAGACACGTGTAGAGGAAGAGGAGGCCCTGGTAGCTACACTAGAGGCTACCTACAGAGAGAGACAGACAGCAGCACGGGGCGGGCTGGCAGAATTAGAGGAGGCCCGACGTAGATCAGAGTCCTCACTTGGTATGTCACTGGCACCTGGAGTAGCCTACGTCGTGGGGCGTAGAGTAGTTATTGATACACCAATAAACCTAGCGCTACAGCGTACCACAGATAGTCAGGTAGTAACGGCGGCTACATTTACTTATGGAGGTGTAACTGCTATAGCCAACCGCACTATATCGCTACAGGGTGCCTCCACATGGGCTAATGTAGTAACACAGGATACCAAAGTAAGCATAAGCTTTGCCCCTATCAATAACTCGACTATAACTGTGACGGCTAATACTTCAGCAGCCACCTCAGTAGAGACCTTCATAGACTACATAATAACCCGTATAGCTACTGGCACTGATAGTAATAGTACCATTACTGGTACCGGCATTACTACGGATGCGGCTCGTAATCTACTCCGAGATGCCATAGCCTTCCGCCGTAATGGGCCAACACTCGTCATGGAGTCTATAGGCATAGGTACCACGTCTAACCTAGTGAACATTACCATAGGTATCACTATCACAGGAGGGGCCCCCAGTAATAGACTAGGCGTCGACGTGCCCAGCGGTCCCATTGGAGGTGCAGCCAGTACTAATGAGTTTAAGTTAACTCGCCGACCTGTCAAGGAGGTAACGCGACTAGTAGCTACGCTCCAGGAGAATACGGCTGCCATAGTGCGCGGGACTACACCTGGCACTAGTGACTACCTAGGGCGAGATACTGTATCTAGTGTGAAACGGGTATTCCAGGGCTCTATTAATTACACAGAGGGCCGGGACTTCCAGGTTCTCGATGGTGGTAGACTGGAGTGGGCTCCTAATGGTACAGGAGCTCTAGAACCGGCACCTGGTACTACCTACTTTGTGACGTATACGTACTCTAGTCAATTAACACTAGGAGTAGATTTCAATCTCCTGACGGCTACTGACACTATAGTATTTACTGGTACGCGGAGCCCCGCACCTAATACCACATTCCAGGTAGACTACAGCTATTTCCTCAGCCGCATAGCTATAGTAACGCTAGATAAGGAGGGGCAGCCCGCTGTCATATATGGTGAGGTAGGAGCTAATCCACAGCCTCCTGCCGTTAGTGGTTCAGTATTACCACTAGCACGTGTGCTTATAAGTAATAATAGCGCCATCATTGAGCCTATCGATTGTCGCCCTGTCAATTACGATAGTATACGGCAGCTAGCCAGCGCCGTATCATCGTTATCAGATGATATAGATAGACTACGACTCACAACTAGAGCAGAGGGGCTGGCTTTCACTAATACAGGGGCCGTACCCAACTATACATCTATAGATGCCCTAGTGGATAGTAGCGGCATTAACCTAACAGAGAGCACAGGTATGCTCTCGCCTCTAACTAATAGTCTGACATCCAATAGAGTATATAGCGATGTCAGAGCAACGGCCCCCTCGGCTAAACCTAATAACGCAGGAGACCCCTACATTGTAGTACCTACATATACAGAGAGTATCTTCCTCGAGCAAACTAAGTTGACGAAAGAGAGGAGTATACATCAGACTATAGCCCCGCGGTTATACTGCCGTAGAGTCATCATGGCTAATCGCGATCTAGGGCGGATTAACCCCTGCGACGAGCTGGCCGTCCGAGGAGCCACTCTATTCAGCTCCACTAGTAATTCCCTCTATCGATTTATCAATGAGGGGAATAGAGCAGAGTTCACGCGGCTATCTCGTCGTGTACGAGAGGCTATATCAGCAGGTGAGGCTATACCAGCTATAGGCGCTAATCTAATGGGGAGCGAGGAGATAAATAAGGCGCGGGCCCAGAACATACGATACACTATACGAGGAGAGGGGCTCCCCCAGGCCAGCTATCAACTACTGATAGCTGACACTCTCATGACTACTGCAGTATCAATTAATAACACACCAATATCGGGCACCCTACCATTCGCGTTCCGGCCCCGCTCTAATGGCATACTCGAGGTCGAGCTATTCCTACCCGCACTACCCCCCGGTGTCCATGCCGTAGTGCTACAGTCTGATACACTGAGTGTAAGTAATACGCTATCGATATTCAATAATAACCTGACCCACGTGGCGCTAGGCGGGGCGGCCTCCTGGGGCCTACCATCATCCTCAATAGATACTCAGCCCCTTCCCCTAATGCCGAGGGTAGGCTTCGACCCACTGATGCAGACATTTCAGGCACCCTCTGATATGTACCTGAGCGGCCTTGATATAAAAATAGCATCGGCGCCGGCATCAGGAGCCCTAATTATATCATTGCGGGATGGTACAGCTACTACACCAGGGCAGATACTACTAGGAGAGGCTCTTGTCAATGGAGCTGTATTGCCTGACATACAGGGGCGGCTCTGGACTAAATACGTATTCCCTACTCCTATCTATCTCAAGGAGGATCAGTATTATACACTAGGCTTCCGCAGTACAGAGGGAGACTGGAGCGTATTCACTAGTGAGATAGGAGAGGTCGATATACTAGATGCGGGGCTACTAATAGGACAGCAGCTAGGCATAAATGGTAACCTCTGGACTAGTGATGGTACTATCATCTCTAATCACGAGAGAGAGGATATCAGTATGCGGCTCTACCGCGCTGTATTCCCGACGACTCCGATTAGTATAGATCTAGGTACCTATAGCTCTAGTATGACGGCCTTCGCCTTCAATGTACGAGACATAGTACCAACAGGCTGCACAATAGACTACCAGTACAAGGTGGGCGTGAATCCTAACTGGATATCCATAGCGCCTAATACGCCTGTATGTCTAGATAGAGTAGAGTCTATACTCTATCTACGTGCAGTATCTACAGGTACGGCAGCCCTAGCTCCCATCTTAGAAGTAGGGACAGTATCCATCTATCGTAACCTGTCACCCACGCAGCATATATCTAACTGGCAGCCTATCCTGCAGACGAGTACGAGATTTACAGTAGCCATAACAGTGCTAATGCCGCCATCGAGTACGCTACAAGTAAGGATACAGTTCAGCACGGGGGGCTGGCACGTTCTAAGCAACCCAACCACAGTTATACTAGATGCAGGACTAGGACTATCTCGCCTAACTTATGAATATGTATCTCCCGGCCCGCGTAGTGGAGAGCTGAGGTGGTCTATAGACGCATTTGGAATCTCGACAACAGACGTGCCATCTATAATGGAGGTAGTGGTATATGGAACGAACTAA

Genome Context

Tertiary structure

XLE97970.1
ESMFold structure
Source ESMFold
pLDDT 60.5
Oligomeric state monomer

Literature

Title Authors Date PMID Source
Sequence analysis of four Microcystic aeruginosa myovirus strains Ke,F., Zhang,Q., Liu,A. and Wang,R. 2022-08-07 GenBank