GenBank accession
AIW03308.1 [GenBank]
Protein name
tail spike protein
RBP type  Evidence    Probability
TSP       DepoScope   1.00
TF        RBPdetect   0.83
TF        RBPdetect2  0.93
TSP       GenBank     1.00
TF        Phold       1.00
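The evidence sources above disagree on the RBP type (TSP versus TF). A minimal sketch of how the predictions could be grouped per type is shown here, reading each entry as an (RBP type, evidence, probability) triple. The `summarize` helper and the use of a mean are illustrative assumptions, not something the record itself defines, and probabilities from different tools are not necessarily comparable.

```python
from collections import defaultdict

# (RBP type, evidence source, probability) as listed in the record.
PREDICTIONS = [
    ("TSP", "DepoScope", 1.00),
    ("TF", "RBPdetect", 0.83),
    ("TF", "RBPdetect2", 0.93),
    ("TSP", "GenBank", 1.00),
    ("TF", "Phold", 1.00),
]


def summarize(predictions):
    """Group predictions by RBP type (hypothetical helper, for illustration).

    Returns a dict mapping each type to its sources, their count, and the
    mean of the reported probabilities.
    """
    grouped = defaultdict(list)
    for rbp_type, source, probability in predictions:
        grouped[rbp_type].append((source, probability))

    summary = {}
    for rbp_type, items in grouped.items():
        probabilities = [p for _, p in items]
        summary[rbp_type] = {
            "sources": [s for s, _ in items],
            "n": len(items),
            "mean_probability": sum(probabilities) / len(probabilities),
        }
    return summary


if __name__ == "__main__":
    for rbp_type, info in summarize(PREDICTIONS).items():
        print(rbp_type, info["n"], round(info["mean_probability"], 2), info["sources"])
```

The grouping only makes the disagreement explicit; it does not decide which annotation is correct.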
Protein sequence
MAEDIKKVKLLFRKGTQAELSNLDAGEPGLATDTQNLYVGTGGDSKVQLAKQADLTSLQTKVNNNKSQTDQTLTDHSNKLTQQQSSIEGQYNALQNHEGRIVANEQALPNKVDTTTYNGLKSTVDTLKTNTENSISQLNQDVSALEGTVAQNTSDIEYLKANGGGGGGGSSEDLTELQSEVSTLKSDVSTNKSDISTLKQTTQTQGTAISAAQGDISKNKTDIQTLQATKKEVDDAKGTGTLKEKFDSIEKSQNSEEFEVTTARKVFTLTSGQYVPNSKTLKVYIQGILQSPQDYVQTNSTTVTFKEDVPAGNIVTLEWLQGKLPTQFGHNSTHELGGPDEIDLAKLKNYQEKVVKPIADVYKRSDIFFNMLETTAKGDGATNDTSVFTTLENSFTDKIIELNGKTYLVDSLPTKNKYVNGRFLVGGSYFDASFTINVKSNHGVIALGIGAAKSSPTFPVYSGTDKFYKNIAIGGYALQNSIGSFNNIAIGWNAMDAATSGEFYNIAIGNEALWSVKKTNTSDSFAASRNIAIGMNAMRFNINGHHNVALGRNNLQCSKNGSRNTAIGVNAMAGIAPLDLTGVIADYTKYDSNDTTAIGAEALLNSVGNENTGVGAYAANNLVKATRNVAVGKNALYSLQKDMTVDGNNKIYWSKTGSYVWSGTKITVTMSGHGLQNGNLISLKLTTGSNLKTSEENQYTISNVTTNTFDIVSPLSNNTVGNCSSSWYSDMTANTKTADNNNAIGNYAMENSVSGQNNTAIGTWTLRNIIGEFNSFVGNLSGTNLTSGSFNSALGYGALRAMQDGTTATNLTNVTALGYNTRASGDNQVQLGDANTTTYAFGPVQSRSDKRDKLDIQDTDLGLEFIKKVPIRKFRYNYRDLYEEGDNSKGDKAGKRFHQGVIAQEVKKVMDDLGVDFGGYQDHKVNGGSDVLSIGYEEFIPVSMKAIQELAEKDKLKDQKIEKLESAIASLLERVEKLEGGN
Physico-chemical properties
protein length: 982 AA
molecular weight: 105813.90370 Da
isoelectric point: 5.52318
aromaticity: 0.07026
hydropathy: -0.50418
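The record does not say how these values were computed. The sketch below assumes the common definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the GRAVY score (mean Kyte-Doolittle value per residue). Under that assumption, the listed aromaticity would correspond to roughly 69 aromatic residues out of 982. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(sequence: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in sequence) / len(sequence)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition).

    Raises KeyError for non-standard residue codes such as X or B.
    """
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record.
    fragment = "MAEDIKKVKL"
    print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Passing the full protein sequence from the record to these functions is a quick way to check whether the listed values follow these definitions.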

Domains [InterPro]

Taxonomy

       Name                         Taxonomy ID  Lineage
Phage  Bacillus phage Mater [NCBI]  1540090      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Bacillus megaterium [NCBI]   1404         Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus

Coding sequence (CDS)
GenBank protein accession
AIW03308.1 [NCBI]
GenBank nucleotide accession
KM236245.1 [NCBI]
CDS location
range 88111 -> 91059
strand -
CDS
ATGGCAGAGGATATTAAAAAGGTTAAACTACTTTTTCGTAAAGGTACTCAGGCAGAGCTAAGCAACTTAGACGCAGGGGAACCGGGACTAGCTACAGATACACAAAACCTGTATGTAGGTACTGGAGGGGATAGTAAAGTTCAGCTGGCTAAACAAGCGGATTTAACCTCTTTACAGACAAAAGTAAATAACAATAAATCACAAACTGATCAGACACTGACTGATCATAGTAATAAGCTAACTCAACAACAATCAAGTATAGAAGGTCAATATAACGCTTTACAAAATCATGAGGGGCGTATTGTTGCTAATGAACAGGCCTTACCAAATAAGGTAGATACTACTACGTATAACGGTCTTAAAAGTACTGTAGACACTCTAAAGACAAATACAGAGAATAGTATATCGCAGCTAAACCAAGATGTAAGTGCTCTAGAGGGTACCGTTGCTCAAAATACCTCAGACATTGAATACCTTAAAGCTAATGGAGGCGGTGGGGGCGGAGGTAGTAGCGAAGACCTTACTGAGCTTCAGTCAGAGGTATCGACATTAAAATCAGATGTCTCTACTAACAAGTCTGATATTTCTACCTTAAAGCAAACGACTCAAACTCAAGGTACAGCTATTTCAGCAGCTCAAGGAGACATCTCTAAAAATAAAACAGACATCCAAACGTTGCAGGCTACTAAGAAAGAGGTAGACGATGCTAAAGGAACTGGTACTCTTAAAGAGAAGTTCGACAGCATTGAAAAGTCTCAAAATTCAGAAGAGTTTGAAGTTACTACCGCTAGGAAGGTATTTACGCTTACTTCAGGCCAATACGTACCAAACAGTAAAACATTAAAGGTGTACATACAAGGTATCTTACAATCTCCTCAAGACTATGTACAAACAAACTCTACTACCGTCACTTTCAAAGAGGACGTACCAGCAGGGAACATTGTTACGCTTGAATGGCTGCAAGGTAAACTACCTACACAATTCGGGCATAACTCTACTCACGAACTAGGTGGCCCGGATGAAATTGACTTAGCTAAACTTAAAAACTATCAAGAGAAAGTCGTAAAGCCGATCGCAGACGTGTACAAACGATCAGATATCTTCTTTAACATGCTAGAAACAACAGCTAAAGGTGATGGAGCTACAAATGATACGAGCGTATTCACTACACTAGAGAACTCGTTTACAGATAAAATCATCGAGCTAAATGGTAAAACATATCTAGTAGATAGTTTACCAACAAAGAATAAATATGTCAATGGTCGTTTCCTAGTAGGCGGTTCATACTTCGATGCTAGCTTTACCATTAATGTCAAGTCTAACCACGGAGTTATTGCTCTTGGTATTGGTGCAGCTAAGTCATCTCCAACGTTCCCTGTTTACTCCGGGACTGACAAGTTCTATAAGAATATTGCTATTGGAGGGTATGCACTACAAAATAGTATTGGTTCTTTCAATAACATTGCTATCGGGTGGAATGCAATGGACGCAGCTACTTCGGGGGAGTTCTATAACATTGCTATCGGTAATGAAGCTTTATGGAGTGTAAAGAAAACAAATACTTCAGATAGCTTTGCAGCAAGTCGTAATATTGCTATCGGCATGAACGCTATGCGCTTTAACATTAACGGACACCATAACGTAGCATTAGGTCGTAACAATTTACAATGTTCAAAAAACGGTTCGCGCAACACAGCGATTGGTGTTAATGCTATGGCAGGTATTGCTCCATTAGACCTTACAGGAGTTATTGCAGACTATACCAAATATGATAGCAACGATACAACGGCTATCGGAGCAGAAGCTTTACTTAACAGTGTAGGGAATGAGAACACAGGTGTCGGTGCTTATGCAGCAAACAACCTTGTCAAAGCTACACGTAACGTAGCAGTAGGTAAGAACGCATTGTATAGCCTCCAAAAAGATATGACAGTAGATGGTAACAATAAAATCTATTGGAGTAAGACAGGCTCGTATGTCTGGTCTGGAACAAAGATTAC
AGTTACAATGTCTGGTCATGGTCTTCAGAATGGAAACCTAATTTCACTTAAACTTACGACTGGCAGTAACTTGAAAACTTCTGAAGAAAATCAGTACACAATCTCCAACGTAACTACTAATACATTTGACATTGTTTCCCCGTTATCTAACAACACGGTAGGTAACTGTTCTTCTTCTTGGTATAGCGACATGACTGCTAACACGAAGACTGCGGATAACAATAATGCGATTGGTAACTATGCAATGGAGAACAGCGTAAGCGGCCAGAACAATACAGCTATCGGTACATGGACACTCCGTAATATTATCGGAGAGTTTAACAGCTTCGTAGGTAACTTGTCAGGAACTAACTTAACTAGCGGAAGCTTTAACTCTGCTTTAGGTTATGGTGCGCTTCGAGCTATGCAGGATGGTACTACAGCAACTAACCTAACAAATGTAACAGCTCTTGGTTATAACACTCGAGCAAGCGGAGACAATCAAGTTCAGTTAGGGGATGCTAATACTACTACATATGCTTTCGGGCCTGTACAGAGTCGTTCAGATAAACGAGACAAGCTTGATATTCAAGACACGGATTTAGGTCTCGAGTTTATCAAGAAGGTTCCTATCCGTAAGTTCCGATATAACTACCGCGATCTTTACGAGGAAGGAGATAACAGTAAGGGAGACAAAGCTGGTAAACGTTTCCACCAAGGGGTTATCGCTCAGGAAGTCAAGAAAGTTATGGATGACCTTGGCGTAGACTTTGGGGGGTACCAAGACCACAAAGTAAACGGCGGTAGTGATGTTTTATCCATTGGTTATGAGGAATTTATCCCGGTATCCATGAAAGCTATCCAAGAGTTAGCGGAAAAGGATAAGCTAAAAGATCAGAAAATTGAAAAACTTGAATCTGCTATCGCTAGTCTACTTGAGCGTGTAGAGAAATTAGAAGGAGGGAACTAG
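The CDS location and strand can be checked against the protein length with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes the stop codon: 91059 - 88111 + 1 = 2949 nt, which is 983 codons, or 982 residues plus a stop. Because the strand is "-", the CDS corresponds to the reverse complement of that genomic interval. All function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, assuming one terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive range and orient it according to the strand."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


if __name__ == "__main__":
    print(cds_length(88111, 91059))               # nucleotides
    print(expected_protein_length(88111, 91059))  # residues
```

With the full nucleotide record loaded as a string, `extract_cds(genome, 88111, 91059, "-")` would be expected to reproduce the CDS shown above, provided the coordinate assumption holds.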

Tertiary structure

PDB ID
dac8ae1212740c0165c800b605adb5f849c0353c6926c6093af3668a71cde3a6
Source: ESMFold
Method: ESMFold
Resolution: 0.7101
Oligomeric state: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
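The confidence bands above can be applied to per-residue pLDDT scores as in the following sketch. The legend uses strict inequalities, so the handling of the exact boundary values (90, 70, 50) is an assumption here: a boundary score falls into the lower band. The function names are hypothetical, and scores are assumed to be on a 0 to 100 scale.

```python
from collections import Counter

BANDS = ("Very high", "High", "Low", "Very low")


def classify_plddt(score: float) -> str:
    """Map one pLDDT score (0-100) to its confidence band.

    Boundary values (90, 70, 50) go to the lower band; this is an
    assumption, since the legend does not specify them.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


def band_fractions(scores):
    """Fraction of residues in each band for a list of per-residue scores."""
    if not scores:
        raise ValueError("no scores given")
    counts = Counter(classify_plddt(s) for s in scores)
    return {band: counts[band] / len(scores) for band in BANDS}


if __name__ == "__main__":
    example_scores = [95.0, 80.0, 60.0, 30.0]  # made-up values for illustration
    print(band_fractions(example_scores))
```

A summary like this gives a quick view of how much of a predicted model falls into each confidence band.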