UniProt accession
A0A2S1GSC9 [UniProt]
Protein name
Putative tail fibers protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MDAQGNITGTGTKWKEPLSLIRTGATIVFLTSPIKLAVINTIVSDTSMTAISTDGSAVPNGNYVILLSDSLTVDGMAQDVAETLRYYQGKETQIEEALEFFENFDLQQLIDLKNQTQKFRNDAEGFKNNAASSATAANNAKTGAETARNQAQSAQQAAASSAQAASGSATTASQKATAATNEANRAKGYADSLQPNTFMKKDQNLADLLNKVTARYNLEVPFVPRESLTASENLNTKVGYANVGFHLNAATANATPANNYPVQAAGVLVVLYSGANGANEATQIYHPYNADEFYKRRGMPSGSTVNWTSWVKFESSNQVDNRISNKLDEFGLNTTGVIAPNNANDLQRNGFFAGAGIPGVNYARPYAPGIVMRRVNDVYQAQLDDNGRWVARFYTPSLGWGNWNKSVIEGDYGVGYQYAARPTVSANSFFSENDGGTLWAASNGAGFQACYDPARLAQFMVNPSGNAYCRWLQTGNPQTPKSQVPWNQLQIAGTSDERVKDIKGSMNIESALDNINRMDFKLFKYTFDSPERSARRGVIAQQVMKIDKEYVHINGNTPGMMCLDLNPLVTDSMAAIKALRARDVENKERISKLESEVEDLKTLVADLLKAR
Physico‐chemical
properties
protein length:611 AA
molecular weight: 66507,29650 Da
isoelectric point:5,89627
aromaticity:0,08674
hydropathy:-0,43715

Domains

Domains [InterPro]
DC_0253
STR
1–611
Coil
Unmapped
137–157
IPR030392
CHP
495–551
A0A2S1GSC9
1 611
Architecture
STR
STR 1-611
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS_IME347
[NCBI]
2496546 Uroviricota > Caudoviricetes > Drexlerviridae > Badaguanvirus > Badaguanvirus IME347
Host Escherichia coli BL21(DE3)
[NCBI]
469008 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AWD92259.1 [NCBI]
Genbank nucleotide accession
MH051918 [NCBI]
CDS location
range 33915 -> 35750
strand -
CDS
ATGGACGCTCAAGGCAACATTACAGGCACCGGGACGAAGTGGAAAGAGCCTCTTTCCTTGATTCGCACTGGTGCAACGATCGTATTCCTTACGAGTCCAATTAAGCTGGCGGTGATTAACACGATCGTAAGCGACACTTCAATGACGGCTATTTCAACTGACGGCTCGGCGGTTCCGAACGGGAATTATGTGATCCTGCTCAGTGATTCGCTGACGGTTGACGGAATGGCGCAGGACGTTGCGGAAACCTTGCGTTACTATCAGGGCAAAGAAACACAGATCGAAGAGGCTTTGGAATTCTTTGAAAACTTCGATCTTCAGCAATTAATCGATCTGAAAAATCAGACTCAGAAATTCCGAAACGATGCGGAAGGCTTTAAGAACAACGCCGCATCAAGCGCAACGGCAGCGAACAACGCAAAGACAGGTGCGGAAACAGCAAGGAATCAGGCTCAATCTGCACAGCAAGCCGCAGCATCATCAGCGCAGGCTGCATCTGGAAGTGCGACAACGGCGAGCCAGAAGGCGACGGCTGCAACTAACGAGGCCAATCGCGCGAAAGGTTACGCTGACTCACTTCAGCCAAACACGTTCATGAAGAAGGATCAGAACCTTGCAGATCTGCTAAATAAGGTTACGGCAAGGTATAACCTTGAAGTGCCATTCGTGCCGCGCGAAAGTCTGACGGCAAGCGAAAATCTAAACACAAAAGTTGGTTATGCTAACGTAGGGTTTCACTTGAATGCGGCAACGGCTAACGCCACTCCAGCGAATAACTATCCAGTGCAAGCGGCTGGTGTTCTCGTTGTGCTTTATAGTGGCGCGAACGGGGCGAATGAGGCGACGCAAATTTATCACCCGTACAACGCCGATGAATTTTACAAGCGGCGAGGCATGCCGTCAGGCTCTACTGTAAACTGGACTTCGTGGGTTAAATTTGAATCATCAAATCAAGTCGACAATCGAATCAGCAACAAATTAGACGAATTCGGATTGAATACCACTGGAGTGATCGCGCCAAACAACGCAAACGACCTTCAAAGAAACGGATTTTTCGCTGGTGCTGGCATACCCGGAGTTAACTATGCAAGGCCGTATGCTCCCGGAATTGTAATGAGGCGAGTTAATGATGTTTATCAGGCTCAGCTTGACGATAACGGCAGGTGGGTTGCTAGATTCTACACGCCTAGTTTAGGGTGGGGTAATTGGAATAAATCGGTAATTGAGGGGGATTATGGAGTTGGATATCAGTACGCCGCGAGGCCGACCGTTTCAGCAAACTCCTTCTTTTCCGAGAATGACGGCGGGACTTTGTGGGCTGCGTCAAATGGTGCTGGATTTCAAGCATGTTATGATCCTGCAAGATTGGCGCAATTTATGGTAAACCCATCGGGGAACGCATATTGTCGTTGGCTGCAAACTGGAAACCCGCAAACCCCAAAATCACAAGTTCCTTGGAATCAGCTTCAGATAGCGGGAACTTCTGACGAGCGAGTCAAGGATATCAAAGGCAGTATGAACATTGAGTCGGCGCTTGACAATATCAATCGCATGGATTTCAAGCTGTTCAAATACACCTTTGATTCGCCTGAAAGGTCTGCCCGTAGAGGTGTTATCGCACAGCAGGTAATGAAGATCGATAAAGAATATGTTCACATTAACGGTAACACGCCTGGGATGATGTGCTTAGATCTCAACCCGTTAGTAACTGATAGCATGGCTGCAATCAAGGCGCTTCGTGCGCGTGACGTTGAAAACAAGGAGAGAATCAGCAAGCTTGAAAGTGAGGTTGAGGATCTTAAAACACTTGTCGCTGACTTACTGAAGGCGAGATAG

Genome Context

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

PDB ID
0f07358a29bc0516618b5c17912e7191f134c3a200fee1d12980d3be899d11ff
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7848
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50