Genbank accession
QNR53179.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MLPIPLLKLGRHFLDKRGGIKELAVLSYPIDNSDGTSNTVSYVNSLAILFDDGQLNITGYNRFGECRTGDLDTINYPNEAAWNVDHVWRADRAFVIRTFDNKFFYIGCTAGLIGSEAAGGNDVCVREWTPLPEQIVTGLHLDTHPERLVEVMGGVNNTVWVIAAPEADGMLHLYGSGNNTYGSLHVDKNQHATPVKIGETSENPETGPWKNPSINCEVHDNSVIFGGPRGFWIAGYDFLRNNSRNDLVWPPVQITRNDLRGIPADEEWKGFMCGPNGAIIATQRMHRPDDQQLVNVFYGQDVWGDDTWRSLNITYTHEVIMARGYGTSGIFFNNGTKQYRGFSRNLCNDIGAQSANNSPRANFIHYQALATAKYVEQRLPVSWTESVYFQGIHREGFLGTFTVVNGKLWWSGIPRGSFAGSNNLFGGRLNSQGFTEVPENWYKNVPVDSWGVDDIFDVNGVSSVSNIYIGDTVKMKLKPQPEGATFIIDKIELVNAAGTVVTDANYQFSTNWNHGGANEVVVTQYNRNVNRRGLYSVKITYHDKHGTGRTYTTRTLNWNTIVPAYPSNGKWHTVGRNKQFHVNDTVYFGLNGTQPAVEGDTSYMVRLHRMDAGSVTYDVTQEIYDQRRNTYNNLVQTQAVWEFNPNGRGGKMLQVNEQNGTSLTVHEHPDPWPDPGKPHPGARTLKIVSHDVGYFGIRWEATVRYADGTTNNIGITLGGTSEDNSLKIAYTPRGISIDNMDVVRNGYGDVNVKVTLGEHLGGERIIMYAFDHDPRTNGTYADQAWSYLISAPEPNTKEFYYGMKRDVCKKTGTHDWIAICVKDERTAWDEPVNRWFIGIPTRSDKYVAEYIVCMGGTNLNMCWNEDVNKYSDYDYMRDYSCNLWFNQTTGYPPRQARVNPAIFTDTQVFLTKQANEVQTFKNKYDPNKWFYNCYGAFFWGPGELPGGGSCFEAATYASDRFMGQVKKYKIIPGPETLGNAVDPYIIMSDLRSNMDGTSISMQVPVTNGYKRVVMIIKCDLIGKQVLAENGTSTHLFETALHYQFADSPFAGDKRISDTDAVRCKKFMLGGGWWWYEFDLTDKFTDTSKVVTGLRLDLGENMHKSVCDGTYGDPTIYLKYVSFEHPEDVVYGPKLRLFGNWIAKDRVGMGKKVRGFLMDAGTEDMLVNAVWPELPGTQWNNSAKSINWFNIHRAMWTTNCYLWKELNDGAFGFTGGRRMAIICWTTLQRCYDHDYEIGGRAWKNIRDRIITNFADPNGGGAYGFDCSRLIHLNGSSAYKKEGYSGSMIEWGLVKDARVLMGQQLAAAIGPNAVQSVKPAWFDIPLWSPGAPGTAAINPTTGDLEISWEDLKQVGGWDKTGYQVQWWRADGSLAADEFVKDNFYTMSSAKAQQLFGQATPSTITMSMCCKDNRTGALGPRVAKVFSGIKWNLPVQSVSWKQIGDNKLLVTPACQFNATLNVDPAVAANSAKASDFSVSNTAMADVRKIDTLNARITCKNTYGTFQIINNFTDADSKVVRTASQTLSLGTLAYAALITEQSATLQGGGVGKSIATPVWKPNEWVVFDLTVDFSSDNNWTWVRNCLSQLMGGPSSISDSHDSTDPSVFQVGKTHPETGETLPDRKYALVCISYGKADVTFSGTHTYNGTYNFSRKYSLKAGNIIDEVGALYNPGNGIGIVGGQLQMQEPSITPSNVSGIRKTWESSNTNIATVDATTGLVTFKATGNVTIKFVVTDDAGRKTSSTSFTVKQMAPQWFIWEGNPLDGAYPRPAGTSNMRVFTGPVMNNPTTTTNPTFFGAYIPEIIGLPRNQIQLLFGAGVDGLATFGYSDNIDAARSSGWIGFKFQWNPGGRTLGIASIGVMLPGNQQYHLEAYTNFPS
Physico‐chemical
properties
protein length:1875 AA
molecular weight: 208263,19640 Da
isoelectric point:6,21417
aromaticity:0,11573
hydropathy:-0,38421

Domains

Domains [InterPro]
IPR009091
STR
41–197
G3DSA:2.60.40.1080
STR
1675–1746
IPR003343
STR
1677–1733
QNR53179.1
1 1875
Architecture
STR
RBD
STR
RBD
STR 1-1664 | RBD 1665-1674 | STR 1675-1746 | RBD 1747-1875
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QNR53179.1
1 1875
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,0268
Central domain 11 362 353 0,9404
C-terminal 363 1875 1512 0,1501
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-10
Central
11-362
C-terminal
363-1875

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vb_EcoM_bov22_2
[NCBI]
2763523 Uroviricota > Caudoviricetes > Vequintavirinae > Vequintavirus slur16 >
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QNR53179.1 [NCBI]
Genbank nucleotide accession
MT884010 [NCBI]
CDS location
range 45786 -> 51413
strand -
CDS
ATGCTTCCAATCCCTTTGCTAAAACTTGGTCGTCATTTCCTCGATAAGAGGGGTGGAATTAAGGAACTTGCTGTTCTGTCATACCCAATAGACAACAGTGATGGTACAAGCAACACAGTATCCTATGTAAATAGCTTAGCTATACTTTTTGATGATGGTCAGCTTAATATTACTGGCTATAATAGGTTCGGAGAGTGTCGGACTGGAGATCTTGATACAATAAACTATCCTAATGAGGCAGCTTGGAACGTAGACCATGTTTGGAGAGCTGACCGTGCATTTGTAATCCGCACATTCGATAATAAATTTTTCTATATTGGTTGCACAGCTGGTCTTATCGGATCGGAGGCAGCAGGTGGAAACGATGTTTGTGTCAGAGAGTGGACACCTTTGCCGGAACAGATCGTAACAGGGTTGCATTTGGACACCCATCCTGAGAGACTTGTTGAAGTAATGGGTGGCGTAAACAATACAGTGTGGGTAATTGCTGCACCAGAGGCTGATGGTATGCTTCACCTGTACGGCTCTGGCAATAATACTTACGGCTCTCTTCATGTTGACAAAAATCAGCATGCAACCCCTGTAAAGATCGGTGAGACATCGGAAAATCCTGAAACAGGCCCTTGGAAAAACCCAAGCATCAACTGCGAGGTGCATGACAACTCTGTTATTTTCGGTGGCCCTAGGGGCTTCTGGATTGCTGGATACGACTTTCTGAGAAATAACAGTAGAAACGACCTTGTTTGGCCTCCTGTGCAGATCACTCGTAACGATTTGAGGGGCATTCCTGCAGATGAGGAGTGGAAAGGGTTTATGTGTGGTCCTAACGGGGCAATAATTGCCACACAGAGGATGCATAGACCCGATGACCAGCAGCTTGTTAACGTATTCTACGGGCAAGACGTTTGGGGAGATGACACATGGCGTTCCTTAAATATCACCTACACCCATGAGGTAATAATGGCTCGTGGGTACGGCACAAGTGGCATTTTCTTCAATAATGGCACTAAGCAGTATCGTGGATTCTCTCGCAACTTGTGTAATGATATTGGTGCACAATCTGCAAACAATAGCCCTAGGGCCAATTTTATCCACTATCAAGCTCTTGCTACAGCTAAATATGTTGAGCAACGCCTGCCAGTTAGCTGGACAGAGAGTGTTTATTTCCAAGGCATCCACAGGGAAGGCTTCTTAGGGACCTTTACTGTTGTTAATGGTAAACTTTGGTGGTCAGGCATCCCAAGGGGAAGTTTTGCAGGCTCTAACAACCTGTTTGGCGGCAGACTTAACAGTCAGGGGTTCACAGAGGTTCCTGAGAATTGGTATAAAAATGTCCCAGTAGACAGTTGGGGTGTTGATGATATTTTCGACGTTAACGGTGTTAGCAGCGTTAGCAATATTTACATTGGTGACACGGTCAAGATGAAATTGAAGCCTCAGCCAGAAGGGGCTACGTTCATTATTGACAAGATTGAGCTGGTAAATGCTGCAGGCACGGTTGTTACAGATGCAAATTACCAATTCTCCACCAACTGGAACCACGGCGGGGCCAACGAAGTTGTTGTTACCCAGTATAACAGAAATGTTAACAGACGTGGCCTGTACTCTGTAAAAATAACCTATCATGACAAGCATGGCACAGGCAGAACTTACACGACAAGGACTTTGAACTGGAACACCATAGTTCCTGCGTATCCGTCCAACGGCAAGTGGCATACTGTCGGTAGAAACAAACAGTTCCATGTCAACGATACTGTATACTTCGGACTGAATGGGACTCAGCCTGCAGTAGAAGGGGATACATCGTATATGGTCAGACTCCACAGAATGGATGCTGGATCAGTCACTTATGATGTTACCCAAGAGATTTATGATCAGAGACGAAATACCTACAACAACTTGGTGCAGACACAGGCCGTATGGGAGTTCAACCCTAACGGCAGAGGTGGTAAAATGTTGCAGGTTAACGAGCAAAACGGAACGTCGTTGACTGTCCACGAACACCCAGACCCCTGGCCTGATCCTGGAAAACCTCACCCTGGCGCTCGTACACTGAAAATTGTCAGCCATGATGTAGGATATTTCGGAATACGCTGGGAAGCTACAGTAAGATATGCCGACGGGACAACAAATAACATAGGGATTACACTTGGTGGCACTAGTGAGGATAACTCGCTGAAGATTGCCTATACCCCTAGAGGCATCTCTATTGACAATATGGATGTTGTTCGCAACGGTTACGGCGATGTTAATGTCAAAGTTACTCTTGGCGAACATCTTGGTGGGGAAAGGATCATCATGTATGCCTTTGATCATGACCCTCGCACCAACGGAACCTATGCAGACCAGGCGTGGAGCTACCTGATAAGCGCACCTGAGCCAAATACTAAAGAGTTCTACTACGGTATGAAGCGCGATGTGTGCAAGAAAACAGGAACACATGACTGGATTGCTATCTGCGTTAAAGATGAGCGCACAGCTTGGGATGAGCCTGTTAACAGATGGTTCATAGGTATTCCTACCAGAAGTGATAAGTATGTCGCAGAATATATTGTATGCATGGGCGGTACCAACCTAAACATGTGTTGGAACGAAGATGTAAACAAATACTCTGATTATGACTATATGAGGGATTACTCTTGTAATTTGTGGTTTAACCAAACTACAGGTTACCCTCCGCGTCAGGCAAGGGTAAATCCTGCAATCTTTACAGATACGCAGGTTTTCTTGACAAAACAGGCCAACGAGGTCCAGACCTTCAAGAACAAATACGATCCTAATAAGTGGTTCTATAACTGCTACGGGGCATTTTTCTGGGGACCTGGAGAATTACCTGGTGGTGGCTCTTGTTTTGAGGCTGCAACTTACGCCAGTGACAGATTTATGGGCCAGGTCAAGAAATACAAGATAATTCCAGGGCCTGAAACTTTAGGTAACGCTGTCGACCCCTATATCATCATGAGTGATCTAAGATCGAACATGGATGGAACATCCATATCTATGCAAGTTCCGGTAACAAACGGATACAAGCGTGTTGTGATGATCATTAAGTGCGATCTGATAGGTAAGCAAGTTCTTGCAGAGAACGGAACTTCCACCCACCTATTTGAGACTGCTCTGCACTATCAGTTTGCTGATTCACCTTTTGCAGGGGATAAGAGAATATCTGACACAGACGCTGTGCGATGCAAGAAATTTATGCTAGGTGGCGGGTGGTGGTGGTATGAATTTGACTTAACAGACAAATTCACCGACACGTCTAAAGTAGTGACGGGGCTTCGTCTAGACCTTGGCGAAAACATGCACAAAAGTGTTTGTGATGGCACCTACGGTGACCCAACAATCTATTTAAAGTATGTCTCTTTTGAGCACCCAGAGGATGTTGTTTATGGTCCAAAACTGAGACTGTTTGGCAATTGGATTGCTAAAGACAGGGTAGGTATGGGCAAGAAGGTAAGAGGTTTCTTAATGGATGCCGGTACAGAGGACATGCTAGTCAATGCAGTGTGGCCTGAATTACCCGGAACTCAGTGGAACAACTCCGCTAAATCTATAAACTGGTTCAATATCCACAGGGCTATGTGGACAACCAACTGTTACCTGTGGAAAGAGCTGAATGACGGCGCTTTCGGGTTCACAGGTGGCAGAAGAATGGCAATCATTTGCTGGACAACGTTGCAAAGATGTTATGACCATGACTACGAGATTGGAGGTAGGGCATGGAAAAATATTAGGGATAGAATCATAACCAACTTTGCAGATCCTAACGGTGGTGGAGCGTATGGCTTCGACTGCAGCAGGCTCATCCATTTGAACGGATCCTCTGCCTACAAGAAAGAAGGTTACTCTGGGTCTATGATAGAATGGGGCCTGGTGAAAGATGCTCGCGTGCTGATGGGTCAACAACTTGCTGCTGCAATAGGTCCTAATGCAGTGCAGTCGGTTAAACCAGCGTGGTTCGATATCCCATTGTGGTCACCAGGAGCACCGGGAACTGCTGCAATCAACCCTACCACTGGGGATCTGGAAATCTCCTGGGAGGACTTGAAGCAGGTCGGTGGTTGGGATAAAACGGGGTATCAGGTCCAGTGGTGGAGGGCTGATGGATCTTTAGCCGCTGATGAGTTTGTTAAGGACAATTTCTACACCATGTCCTCTGCAAAAGCACAGCAATTATTTGGTCAGGCCACTCCGTCAACGATCACCATGTCTATGTGCTGTAAAGACAACAGGACTGGAGCTTTGGGGCCAAGGGTTGCTAAGGTTTTCTCAGGTATTAAATGGAATCTCCCTGTTCAAAGTGTTTCATGGAAGCAAATAGGTGATAACAAGCTGCTGGTTACCCCTGCCTGTCAGTTCAACGCAACTCTTAACGTTGATCCTGCTGTTGCGGCAAACTCAGCTAAAGCTTCTGACTTCTCTGTGTCTAACACTGCTATGGCAGATGTGAGGAAGATTGACACACTGAACGCCAGAATTACCTGTAAAAACACTTATGGCACATTCCAAATCATCAACAACTTCACAGATGCTGATTCTAAGGTAGTAAGGACAGCAAGCCAGACTCTGAGTTTAGGGACTCTGGCCTATGCGGCCCTGATCACCGAACAGTCGGCAACACTCCAAGGAGGCGGTGTAGGCAAGTCCATTGCAACACCTGTGTGGAAGCCGAATGAGTGGGTTGTGTTTGATTTGACTGTGGACTTCTCCAGTGATAATAACTGGACATGGGTAAGGAATTGTTTGTCACAGTTGATGGGCGGTCCAAGCTCCATTAGCGATAGTCACGACTCTACTGACCCGAGCGTGTTCCAGGTGGGTAAAACTCACCCAGAGACTGGAGAAACGTTGCCTGACAGGAAGTATGCGTTGGTTTGCATCTCTTACGGGAAGGCAGATGTCACCTTCTCAGGGACACACACTTATAACGGAACCTACAATTTCTCAAGGAAGTATAGTCTCAAAGCAGGGAACATTATAGACGAGGTTGGTGCACTGTATAATCCTGGCAACGGCATAGGGATTGTGGGTGGTCAACTGCAGATGCAGGAACCTTCTATTACCCCTTCCAACGTATCTGGGATTAGAAAGACTTGGGAAAGTAGCAATACCAACATTGCAACAGTGGATGCTACCACAGGACTGGTAACATTTAAAGCTACTGGGAATGTCACCATAAAGTTTGTAGTTACAGATGATGCAGGCCGCAAAACGTCCTCAACATCATTCACTGTCAAACAGATGGCACCTCAGTGGTTTATTTGGGAAGGAAATCCTCTTGACGGGGCCTACCCTAGACCTGCAGGAACGTCTAATATGAGGGTCTTTACCGGTCCGGTAATGAACAATCCGACCACAACCACCAACCCTACATTTTTTGGTGCTTACATACCTGAAATTATTGGCCTACCAAGGAACCAGATACAACTGCTTTTCGGAGCAGGTGTTGATGGTCTAGCCACTTTTGGGTACAGTGACAACATTGATGCAGCCAGAAGCTCGGGTTGGATTGGATTTAAGTTTCAGTGGAACCCTGGTGGTAGAACATTAGGTATTGCAAGCATAGGTGTTATGCTACCTGGTAACCAGCAATACCATCTCGAAGCTTATACAAACTTCCCTAGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
e15cd9203c2a62bf99fb7fc883d08e78512dff2ed6a209397c530bc8f2ff6fc3
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,4240
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genomes of novel bovine T4, rv5-like and Dillon viruses effective against Escherichia coli O157 Svab,D., Falgenhauer,L., Chakraborty,T. and Toth,I. 1975-07-29 GenBank