Genbank accession
AON97603.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MAREVPGSGAVIKPIFDEVFGVRAVELINSGSGYDPADPPRLTIDGCGTPEREALLYPIIDADSGKIIHVRVLERGRGYDPLRLQIVPTSETPSVLDSFDVNRIWQSHPNSLTRGTFQTAGVPPVKNDRLRIESDNHPKPTWTSAEAVPGGGPLVDRSFDQVFIYRGGKDVPNPGTRTGQNNKSLGILANGGLLHTPEWGTVGNAPINFSIDTVKYDYVKDTNANDVIVDNNIHYYQTSKLINEFKDTNGVFEWGKFEQFVWNIKVEFDNVMLTVNNIDETLSEVEVGRTVTEIGGSARGEIAKIVRNGQNVITRVYLRDVTGTFEDGDLLLGSTGFGMRVNADPITFPNGIFYIDFGADATEFGPFVPGQYYFAPENVRVKRNYLIVWNQDDSTNQPSAMHTLGHPMQFSTTQDGLLTGGTLYYNSTGSSAAPATDYENEFRPIFMMNADETNRIYYYCKVHRYMSGYDGDEGYMILDPTIEDEDIVNNYYVENYYQSDSNDPATIDRSRHVNGHSKVLGMSFDGYPIYGPYGYTTGRTVGKMISAYRLRTTEELPGTREEVVTASTVTYTITVSNNKFYVGGQEEQLLTLKRGKTYAFNQDDSSNDSHYLFFSLSNDGWHGTGDPANIGSEAYLYTGEDSVTYHLEGSAVTKLEYLQGFNTATAREIRITIPVNAPRVAYVFSYLDSGHGIRLVNEGYILGDLTQDYIYDATIGGSSLDPEYNNGPIVNVVGDGSDFFKREVTSNGVRILGAGTVGGQTAVPDAWLEKVARMVELFTDVNGAGINETSQRNLIKTLSGDAGTYHAGLPTIQRVARGAGADYTPNFLTDAGIIAWNLTDLFDTTVQNDMVWYLNSTGDGYGDGDIDAQEVIEHVFHTLHMHGLPADDIKLYPYISSDWNTGDLYAAMEEAYDAGKWDPSGYQSPSNAWKTDADAFEVAAKEYLFLLNFAMFEYTELWDGGSLAPEWTDDMRTQAGIQANNPLGYAFHNTYIAPVISKPSLATIRSIFQDGNTPAQDDPSLAGASGYVVDISSGESLDEFNGKFGPTPEYPNGTYAYFMTEDQSGIPQYPYAIGPKYYSVPLFEGDTVPDLVSSFPTQASGDIILNNDGTISYIKMTKKGDSFFGSAKAVILGGEGTGAKATPVTQTVTGLSLLNQGRSYATPPNLIFEGGGGQGAEGAAQIDTLGKVTSISVVDPGEFYQEPPFILITGGGGIGAKAEAVISQGEITGINVIDPGEGYTSPPNIVFTKLVNLKRKTRARQAFNSTDIYLTGLTKTLGSQDTQIFVTSTDAYAGSGQLIVNKETITYTAKSRGRFTGLTRGVNFKYDQRIILDTGQNDGDGVSTYGFNVGDRVIRRVENAGNKVAKVYDWDPATRELLVTFEVDELAFIDAGIPSTEDAIVQFDAGVANSSGTGVLPHTVIEETDSTIITLTYPIGTIQNRTFEDDDENDGAGDGIPDLLNAGTTFADQINLDGGIYNSLYGIEETQGGQNTTLFQVGDNIKDASIPFKFATVIAAGGLSDGVEHVAQVVLTLGSGSGTFQVNELVTGDVSGVRGTVVSWDNNTKKLTLKDIVPYNLNNVALGVNGFLYEFSHNSTIVDFVITDNGTNYTVAPTITVENTGDIQATGTVNLTAAGDQVASISITNGGYGIPQTVDAGYSLHPTITFTNDVSDTTGSGASAQAVLGGELAVGNGGGSYRIKSIEYVTLVRSDSA
Physico‐chemical
properties
protein length:1713 AA
molecular weight: 185276,48580 Da
isoelectric point:4,46939
aromaticity:0,10158
hydropathy:-0,28313

Domains

View on InterPro
AON97603.1
1 1713 aa
STR 1025–1066 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

AON97603.1
1 1713 aa
Domain Start End Length (AA) Confidence
N-terminal 1 129 129 0,9558
Central domain 130 902 774 0,5247
C-terminal 903 1713 810 0,0280
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
AON97603.1 [NCBI]
Genbank nucleotide accession
KX349226 [NCBI]
CDS location
range 75540 -> 80681
strand +
CDS
ATGGCAAGAGAAGTTCCTGGATCTGGCGCAGTAATCAAGCCAATCTTCGATGAAGTTTTTGGTGTACGCGCAGTTGAGTTAATCAACAGTGGTAGTGGGTATGATCCTGCAGATCCACCTAGACTAACTATCGATGGTTGCGGGACGCCTGAACGAGAGGCGTTATTGTATCCTATTATCGACGCAGACTCGGGTAAAATCATTCATGTCCGTGTTCTTGAAAGGGGTAGAGGATACGACCCCTTAAGACTTCAGATTGTTCCTACTTCCGAAACACCCAGTGTTTTAGATTCTTTTGATGTCAATAGAATCTGGCAGAGCCATCCTAACTCATTGACTAGAGGCACTTTTCAAACTGCTGGTGTACCTCCTGTTAAGAATGATAGACTTCGTATTGAATCAGATAACCATCCCAAACCTACCTGGACAAGTGCAGAGGCGGTTCCTGGTGGTGGTCCTCTTGTAGATCGTTCTTTTGATCAGGTATTCATTTACAGGGGTGGTAAAGACGTTCCTAATCCTGGAACTAGAACAGGACAAAATAATAAGTCGCTTGGTATTTTAGCGAACGGTGGTCTTCTTCACACTCCCGAATGGGGAACGGTTGGCAATGCTCCAATCAACTTTTCTATTGATACTGTAAAGTATGATTATGTAAAGGATACGAATGCTAATGATGTTATTGTTGATAACAATATTCATTATTATCAAACAAGTAAACTGATAAATGAGTTTAAAGATACCAATGGTGTATTTGAATGGGGTAAGTTTGAGCAGTTTGTCTGGAATATTAAGGTAGAGTTTGATAATGTGATGTTGACAGTCAATAATATTGACGAAACATTATCTGAAGTTGAAGTTGGTAGAACTGTAACTGAAATTGGTGGAAGTGCTCGTGGTGAGATTGCAAAGATTGTTAGGAACGGTCAAAATGTAATCACTAGAGTTTACCTCAGAGACGTTACAGGAACATTTGAAGATGGCGACCTTCTTCTTGGTTCTACAGGATTCGGTATGCGAGTCAATGCAGATCCAATAACGTTCCCTAACGGTATTTTCTATATTGATTTTGGTGCCGATGCTACCGAATTCGGTCCTTTTGTTCCTGGTCAGTATTATTTTGCTCCAGAAAATGTCAGGGTCAAGAGAAACTATCTGATTGTTTGGAATCAGGATGATTCTACAAATCAACCTAGTGCAATGCATACCTTAGGTCATCCAATGCAGTTCAGCACGACTCAAGACGGTCTGCTAACTGGTGGCACTTTGTATTACAACAGCACTGGTTCATCTGCTGCTCCTGCTACTGATTACGAGAACGAGTTCAGACCTATATTCATGATGAATGCGGATGAAACTAACCGCATCTATTACTATTGCAAAGTCCATCGTTACATGTCTGGATATGATGGTGACGAGGGTTATATGATTCTCGATCCCACTATTGAAGATGAGGACATTGTAAATAACTATTATGTTGAGAACTATTATCAATCTGATTCTAACGATCCAGCAACCATTGACCGTTCTAGGCACGTAAACGGTCACTCTAAGGTTCTGGGTATGTCCTTTGATGGATACCCCATCTACGGTCCATATGGATACACTACAGGTAGAACTGTAGGCAAAATGATTAGTGCTTACAGATTAAGAACTACAGAAGAACTTCCTGGTACTAGAGAAGAAGTTGTCACTGCAAGTACGGTAACGTATACTATTACTGTTTCTAATAATAAGTTCTATGTTGGTGGGCAAGAAGAACAACTTTTGACTTTGAAGAGAGGTAAGACATATGCCTTTAATCAAGACGATTCTTCAAATGATAGTCATTATCTGTTCTTCTCTCTGTCGAATGACGGATGGCATGGTACAGGAGATCCTGCGAACATTGGCAGTGAGGCATACTTATATACTGGTGAAGATTCAGTAACTTACCACCTAGAGGGATCTGCAGTTACAAAACTAGAATATCTGCAAGGATTTAATACTGCTACAGCAAGAGAAATCAGAATCACTATTCCTGTGAATGCACCTCGTGTTGCATATGTATTCTCATATCTCGATTCTGGTCATGGTATCCGTCTTGTTAACGAGGGATATATTCTTGGTGATTTGACTCAAGACTACATCTATGATGCTACTATTGGTGGTAGTAGTTTAGATCCAGAATATAACAATGGTCCAATTGTTAATGTTGTTGGTGATGGTAGTGATTTCTTCAAGCGTGAAGTTACATCTAACGGTGTAAGAATACTGGGTGCTGGCACAGTAGGTGGTCAAACAGCAGTTCCCGATGCATGGTTAGAAAAAGTTGCTCGTATGGTTGAACTGTTTACAGATGTAAATGGTGCAGGTATTAACGAAACATCCCAAAGAAACTTGATCAAAACATTGAGTGGTGATGCAGGAACTTATCACGCAGGATTACCGACTATACAAAGAGTAGCAAGAGGTGCAGGAGCAGATTACACTCCAAACTTCTTAACTGACGCAGGTATTATTGCTTGGAACTTAACTGACTTATTTGACACCACTGTTCAAAATGACATGGTGTGGTACTTGAACTCAACTGGTGATGGGTATGGCGATGGTGACATTGATGCACAAGAAGTTATTGAACACGTATTCCACACACTTCATATGCACGGGTTACCTGCAGATGATATAAAACTATATCCATATATAAGTTCCGATTGGAACACTGGTGATTTGTATGCCGCAATGGAAGAAGCATATGATGCAGGCAAATGGGATCCATCAGGTTATCAAAGTCCATCAAATGCTTGGAAAACAGATGCAGATGCATTTGAAGTAGCCGCAAAAGAATACTTGTTCCTACTAAACTTTGCTATGTTTGAATACACAGAATTATGGGACGGTGGAAGTCTTGCTCCAGAATGGACAGACGACATGCGTACCCAAGCAGGTATTCAAGCAAATAACCCATTAGGTTATGCATTCCATAACACATACATTGCTCCAGTTATTAGTAAACCATCACTTGCTACTATTAGAAGCATATTCCAAGATGGTAATACACCAGCACAAGACGATCCAAGTTTAGCAGGTGCGTCAGGATATGTTGTTGACATTTCTTCTGGGGAATCTCTTGATGAGTTCAATGGTAAGTTTGGACCCACTCCTGAGTATCCTAATGGTACATATGCATACTTCATGACTGAGGATCAATCTGGTATTCCTCAGTATCCATATGCGATTGGTCCTAAGTATTACAGCGTTCCTTTATTTGAAGGTGATACTGTTCCCGATCTTGTATCTTCTTTCCCAACACAGGCATCTGGTGACATCATTCTGAATAATGATGGAACGATTTCATACATTAAGATGACGAAGAAGGGTGACAGTTTCTTCGGTTCTGCAAAAGCAGTTATTCTTGGTGGTGAAGGAACAGGGGCAAAAGCAACTCCCGTAACTCAAACTGTCACTGGTCTGTCTCTGCTAAATCAGGGTAGAAGTTATGCTACACCACCTAACCTCATCTTTGAAGGTGGTGGTGGACAAGGTGCTGAGGGTGCTGCTCAAATCGATACTCTTGGTAAGGTTACCTCTATCAGTGTGGTAGATCCTGGGGAGTTCTATCAAGAACCTCCTTTTATTCTCATCACTGGTGGTGGTGGCATTGGTGCTAAGGCAGAAGCAGTCATTAGTCAAGGTGAGATTACTGGGATTAATGTTATTGACCCTGGTGAAGGATACACCTCTCCTCCAAATATTGTCTTCACAAAACTCGTTAATCTCAAACGTAAGACAAGAGCACGTCAAGCATTTAACTCAACTGACATCTATCTTACTGGTCTTACCAAAACTCTTGGATCTCAAGACACGCAGATTTTTGTCACATCTACTGATGCATATGCTGGTTCTGGTCAACTTATCGTTAATAAAGAGACCATTACATATACTGCGAAGAGCAGGGGTAGATTTACAGGACTAACTAGAGGTGTAAACTTCAAATATGATCAGAGAATCATTCTTGACACAGGACAGAACGATGGTGATGGAGTTTCCACTTACGGATTTAATGTTGGTGACCGAGTAATCCGTAGGGTTGAGAATGCGGGCAACAAAGTTGCTAAAGTCTATGACTGGGATCCTGCGACAAGAGAACTGTTAGTTACGTTTGAAGTCGATGAACTGGCATTTATTGATGCTGGTATTCCTTCTACTGAAGATGCAATTGTTCAGTTTGATGCTGGCGTTGCTAATAGTTCTGGTACTGGTGTTCTTCCTCATACAGTTATCGAAGAAACCGATTCCACTATTATTACTTTGACATATCCTATCGGAACTATTCAAAATAGAACCTTTGAGGATGATGATGAAAATGATGGTGCTGGTGATGGTATTCCCGATCTTCTCAATGCTGGAACTACCTTTGCCGATCAAATCAATCTTGATGGTGGTATCTATAACTCTCTATATGGTATTGAAGAGACACAAGGTGGTCAGAATACTACATTGTTCCAAGTTGGCGATAATATCAAAGACGCATCTATACCATTCAAGTTTGCAACCGTCATTGCGGCAGGTGGACTGAGTGATGGCGTTGAGCACGTTGCACAAGTTGTTCTAACTCTTGGTAGTGGTAGTGGTACTTTCCAAGTAAATGAACTTGTCACTGGTGATGTTTCTGGTGTTAGAGGAACAGTTGTTTCTTGGGATAATAACACTAAGAAACTTACACTTAAGGATATTGTTCCTTATAACTTGAATAACGTTGCACTTGGTGTCAACGGATTCCTGTATGAATTCTCTCATAATAGCACTATCGTTGATTTTGTTATTACAGATAATGGAACAAACTATACTGTTGCTCCAACTATTACAGTTGAAAATACTGGAGATATTCAGGCAACTGGAACAGTAAATCTTACTGCTGCAGGTGACCAGGTTGCGTCGATTTCAATCACAAATGGAGGATACGGAATCCCTCAAACAGTTGATGCTGGTTACAGTCTCCATCCTACGATCACCTTTACTAATGATGTATCCGATACAACTGGATCTGGTGCATCTGCTCAGGCAGTTCTTGGCGGAGAACTGGCAGTTGGTAACGGTGGCGGTTCGTATAGAATCAAGAGCATTGAGTATGTGACATTGGTCCGCTCAGATTCCGCATAA

Genome Context

Tertiary structure

AON97603.1
ColabFold structure
Source ColabFold
pLDDT 81.4
Oligomeric state monomer