GenBank accession
AUE23423.1 [GenBank]
Protein name
tail fiber protein
RBP type  Evidence  Probability
TF  GenBank  1.00
TF  Phold  1.00
TF  RBPdetect  0.89
Protein sequence
MRMNNLSDVAAGASYVTSIGSGGYWLLQLLDKVTEIKNEAVEAKDQSEDARDEARSYALQASELGNLYASVSEGLAATTDGQYFQVPQGSGSYVAFKVYKNNAGVAQEVAAVPGTGAIIGTIREFPTLAAAQTDADAGNILVGSTAYYRSPEDNALAIEVINNAGTLVETGRKMPSSKLVESVLSMISADGITRDAFMQWADKNGLIIAKWLQSDEGVSFSSALLNFGPEGFIAKLAGIGADKIFTSFIEMISGDDFIVSDKNGLIISQIINSIVKLPGFMLNIPFLNLYAKKLNLLPGMTLGADDATITLSGPLQTVDQNGLIAFSQDAKGKVKFSDMGKRGSGGGAATGAMKESQVIDLLDGAGAAYASKRHNQVRFPVPRATNLKKLTYWFLVGQSFVNGGGSSFAIPDTTDMGNIMLGQSPRGSTFVKGLPSYDFTPVGGNVFYPLQEVRQTDAGVISTTSGSHGQTIAKGFADELKRRYNERTRQQNNTDHIFGVACCGVSGASIADLTKGAAAGYYNRFLTALSGVAAAAAAAGYDWEVGGLIYMQGEQDNGTTTEIYLPQLQTMHDNMIADAMAASGQKATPIFLINQIGNSFISGRNFGVVEAQRRFVENNPLAFMVGSYTGLPNPVDHLFANSYRWLGAQFAKVADRVMWGNDEANFQMVGAYWSGNTAYIGFSTRVPPLTFKSAYVTYTETMYDDKGFTVSDGSGVLTGSNLTVSIVSDNVVKIVAARELTGTVTIMLGDGTAHAGGTQHR
Physico-chemical properties
protein length: 761 AA
molecular weight: 80831.08660 Da
isoelectric point: 5.20483
aromaticity: 0.09198
hydropathy: -0.04744
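The aromaticity and hydropathy values can in principle be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions it uses, so both are assumptions, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 30 residues of the protein sequence above; paste the full
    # sequence here to compare against the values listed in the record.
    fragment = "MRMNNLSDVAAGASYVTSIGSGGYWLLQLL"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Small differences from the listed values are possible if the database used another scale or rounding convention.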

Domains

Domains [InterPro]
ID  Type  Range
DC_0547  STR  16–761
Coil  Unmapped  26–53
SSF52266  STR  392–655
IPR005181  ENZ  490–653
Architecture
STR 16–761
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
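The domain coordinates listed in this section can be turned into span lengths and an overall coverage figure. The sketch below assumes the ranges are 1-based and inclusive, which the record does not state explicitly. The helper names are illustrative.

```python
import re

PROTEIN_LENGTH = 761  # from the physico-chemical properties

# (identifier, type, range) as listed in the Domains section
DOMAINS = [
    ("DC_0547", "STR", "16–761"),
    ("Coil", "Unmapped", "26–53"),
    ("SSF52266", "STR", "392–655"),
    ("IPR005181", "ENZ", "490–653"),
]


def parse_range(text: str) -> tuple[int, int]:
    """Parse 'start–end' (en dash or hyphen) into a pair of integers."""
    start, end = (int(part) for part in re.split(r"[–-]", text.strip()))
    if start > end:
        raise ValueError(f"start exceeds end in range {text!r}")
    return start, end


def span_length(text: str) -> int:
    """Number of residues in an inclusive 1-based range."""
    start, end = parse_range(text)
    return end - start + 1


def covered_fraction(domains, length: int) -> float:
    """Fraction of the protein covered by the union of all listed ranges."""
    covered = set()
    for _, _, rng in domains:
        start, end = parse_range(rng)
        covered.update(range(start, end + 1))
    return len(covered) / length


if __name__ == "__main__":
    for name, kind, rng in DOMAINS:
        print(name, kind, rng, span_length(rng))
    print(round(covered_fraction(DOMAINS, PROTEIN_LENGTH), 4))
```

Under these assumptions the STR annotation spans 746 of the 761 residues, and the other three ranges fall entirely inside it.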

Taxonomy

Role  Name  Taxonomy ID  Lineage
Phage  Raoultella phage Ro1 [NCBI]  2053702  Uroviricota > Caudoviricetes > Vequintavirinae > Mydovirus Ro1
Host  Raoultella ornithinolytica [NCBI]  54291  cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
AUE23423.1 [NCBI]
GenBank nucleotide accession
MG250486.1 [NCBI]
CDS location
range 111781 -> 114066
strand +
CDS
ATGCGAATGAACAACCTTTCAGACGTAGCGGCGGGAGCCTCCTACGTCACTTCTATTGGCAGCGGTGGTTACTGGCTGTTGCAGCTTCTCGATAAAGTCACAGAGATCAAAAACGAGGCAGTTGAGGCAAAAGACCAGTCAGAGGATGCGAGAGATGAGGCCAGATCTTATGCATTGCAGGCCTCTGAGCTAGGGAACTTATATGCATCTGTTAGTGAAGGTCTTGCTGCAACTACGGATGGACAATACTTCCAAGTACCTCAAGGAAGCGGCAGCTATGTAGCATTCAAAGTGTACAAGAATAATGCAGGGGTTGCTCAAGAAGTCGCGGCAGTGCCTGGCACCGGGGCTATAATCGGGACAATCCGCGAATTTCCCACCCTGGCGGCAGCACAGACTGATGCTGACGCTGGAAATATCCTGGTTGGGTCAACAGCCTATTACCGAAGCCCGGAGGACAACGCGCTTGCTATCGAGGTAATTAACAACGCAGGGACTTTGGTCGAAACCGGGCGAAAAATGCCCAGCTCTAAGCTAGTTGAGTCTGTGCTCAGTATGATAAGTGCAGATGGGATCACTCGGGATGCCTTCATGCAATGGGCTGATAAAAATGGGCTTATCATTGCTAAATGGCTTCAGTCAGATGAAGGCGTTTCATTTTCATCTGCGTTGCTAAATTTTGGTCCTGAAGGGTTTATTGCTAAACTGGCAGGCATTGGCGCTGATAAAATATTCACCAGTTTTATTGAGATGATCAGCGGTGATGATTTTATAGTAAGTGACAAAAATGGACTGATTATATCTCAGATAATCAACAGCATCGTAAAACTACCTGGTTTTATGCTTAATATTCCTTTCCTGAATCTATATGCTAAAAAATTAAATCTTCTTCCTGGTATGACGCTAGGGGCTGACGATGCGACAATTACGCTCTCCGGCCCACTTCAGACGGTAGACCAGAATGGGCTGATTGCTTTTTCTCAAGATGCCAAAGGGAAGGTTAAATTCTCTGATATGGGTAAGCGTGGCTCAGGAGGTGGAGCAGCCACGGGGGCCATGAAGGAGAGCCAGGTTATTGACCTGCTGGATGGGGCTGGTGCTGCATATGCATCGAAGCGACATAACCAAGTTAGATTCCCGGTGCCACGCGCGACGAATCTAAAGAAACTCACATACTGGTTTCTGGTAGGGCAGTCATTTGTTAATGGCGGGGGTAGTAGTTTCGCCATCCCTGACACAACGGACATGGGCAACATCATGCTCGGTCAGTCACCTCGCGGCAGTACGTTCGTTAAGGGGCTTCCCTCCTATGATTTTACGCCGGTCGGCGGGAACGTGTTTTACCCATTGCAGGAGGTCCGCCAGACCGACGCTGGGGTTATTTCCACGACCAGCGGCAGTCATGGGCAAACTATCGCCAAGGGGTTTGCTGATGAATTGAAGCGCCGCTATAACGAACGTACCCGGCAGCAAAATAACACCGACCATATCTTCGGCGTGGCCTGCTGCGGCGTGTCTGGTGCATCAATCGCCGATCTGACAAAAGGTGCTGCGGCGGGTTATTACAACCGGTTCCTGACGGCGCTTTCAGGCGTTGCAGCCGCGGCGGCCGCAGCGGGTTATGATTGGGAGGTTGGCGGGCTGATTTACATGCAGGGTGAGCAGGACAACGGTACAACCACCGAAATCTATTTGCCTCAGCTACAGACAATGCATGACAACATGATCGCTGATGCTATGGCTGCATCCGGGCAGAAGGCAACACCCATTTTCCTGATTAACCAGATCGGCAACAGCTTCATCTCGGGTCGCAACTTTGGTGTGGTCGAAGCCCAGCGCCGGTTCGTTGAAAACAACCCTCTGGCCTTTATGGTTGGCAGCTACACAGGTCTTCCCAATCCTGTCGATCACCTGTTTGCGAATTCATATCGCTGGCTGGGCGCACAGTTCGCTAAAGTCGCGGACCGTGTAATGTGGGGCAACGATGAAGCTAACTTCCA
GATGGTAGGTGCCTACTGGTCTGGTAATACTGCGTACATCGGTTTCTCAACCCGAGTCCCGCCGCTGACATTTAAGTCTGCTTATGTCACATACACTGAAACGATGTATGACGATAAAGGATTTACCGTGTCTGACGGCTCCGGGGTGCTTACAGGTTCAAATCTTACAGTTTCAATCGTCAGTGATAACGTTGTGAAGATTGTTGCGGCTCGCGAGCTAACCGGGACGGTGACTATCATGCTCGGTGACGGAACGGCGCACGCGGGGGGTACACAACATCGCTGA
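The CDS coordinates and the protein length can be cross-checked. An inclusive range of 111781 to 114066 gives 2286 nucleotides, or 762 codons, which is 761 amino acids plus one stop codon and matches the listed protein length. The sketch below assumes the standard genetic code and inclusive 1-based coordinates. The function names are illustrative. Whitespace is stripped before translation because the sequence above is split across two lines.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (assumed to apply to this CDS).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(111781, 114066)
    print(n, n // 3 - 1)  # nucleotides, residues excluding the stop codon
    # First eight codons of the CDS above, followed by its stop codon.
    print(translate("ATGCGAATGAACAACCTTTCAGACTGA"))
```

Pasting the full CDS into `translate` should reproduce the protein sequence given earlier, provided the assumed genetic code is the one used for this entry.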

Genome Context

Tertiary structure

PDB ID: 929b8e0ea78eafd0c50f54935b8e7f978a68cc7b9ef4491ebb3d97f8436e893f
Source: ESMFold
Method: ESMFold
Resolution: 0.7203
Oligomeric state: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
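The confidence bands above can be applied to per-residue pLDDT values with a small helper. This is a minimal sketch: the legend leaves the exact boundary values (90, 70, 50) unassigned, so placing each boundary in the lower band is an assumption, as is the function name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Boundary values are assigned to the lower band (assumption): a score of
    exactly 90 is 'High', exactly 70 is 'Low', exactly 50 is 'Very low'.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 42.1):
        print(value, plddt_band(value))
```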

Literature

Title: Antibacterial composition for extension of chilled fish shelf life and decreasing of risk of food-borne infections, bacteriophage strains for its preparation
Authors: Zulkarneev,E.R., Aleshkin,A.V., Rubalsky,O.V., Kiseleva,I.A., Rubalskii,E.O. and Lebedev,S.N.
Date: 2019-10
PMID: none given
Source: GenBank