GenBank accession
QZI79662.1 [GenBank]
Protein name
tail spike protein
RBP type | Evidence | Probability
TF | UniProt/TrEMBL | 1.00
TF | Phold | 1.00
TSP | DepoScope | 1.00
TSP | RBPdetect | 0.90
TSP | RBPdetect2 | 0.81
Protein sequence
MSYTYTDHVANGSQVTFPFRFAGRDKAYISATDVQVGFVEAGTWLPATGWTLSGTNQITFLTPPTEGKQIRIRRIVDKINPYAYFERGNVLDINSLNNSFIKNLQVSQEILDGFFPEGFYFKADLDMGGHKIINLHEGSESGDAVNYDQFKELSDRVNEIESDLTGLSHRTVPMYYIATGGESRWEIGHWTFDAAILFINGVFQNQNLGAFTISNNGFNFAEPLQKGDEVYALVGSRIAAPNEYVTISDLKDPDKAWRLIGRKYPFATDGTFMSKVVHDVDYSNLFTGATGLSQGFCIVDTPQGARMFFHQPIPSTTKVRFVETTFNPEGDNENPSVISFSQELDDIGHQTVCGVYEDGAVRLYTLSADEKSYYIIDWQGNNTSNVHMRKVQVPTEGVYSKDLFVSVGMSEDTKTLVFHATTNTQAFGNKDDARAIYLFDRKTLDSTLTRGLKRRFEVGAPAAPDMAFQGVACDDYFLYIYHGYNGVLLSHKIFVFTLSGEHVRDIPVDSVRVKYGERMYGDETLGYPVLQEPEGLAMYKGKLYMLCMDFWYKNASVVTFAGRTFATRKSSAFSGYSPLQASYWTPTKLIPPTGAPDYSKTATYSCSNATKLSKAVVSLEIDDGTGFPASIGCSLPESSASVYLNNRAMNMAIGPNENFQFGVFHQNLQSYKNLLQLNRENPNTDGSAAVWRLFGGAFTDDIQNERFVQIKHRLNPTQDAMELRADVDLTSGGGINLYSMNDSASPGRVRLYCTDGYSYWSALLSPASPSFHPDQDKTLNLGTANNRWNLVHCQGIRIISNDELQKQVFLETTRKKGALQISTSGNLGVWDSSAGTYVVATRPDGTGFSQVEMSFKGDVFPEINNTYNLGRENKAWANAYFQNAPTVVCDARLKTDARELTAAEKSAFLEISKLPAVWQWLAKLEVEGEDARLHSGCTVQAAISVMEKHGLDWTKYSAFCYDKWDAKEAVYDIVDGKAILVEEAVEAGDRYRLRREELMWWCMKAQNAWIESIEGRLAKLEDKLKEV
Physico-chemical properties
protein length: 1027 AA
molecular weight: 114767.37830 Da
isoelectric point: 5.31180
aromaticity: 0.11782
hydropathy: -0.33905
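
The aromaticity and hydropathy values above can be approximated from the sequence alone. The sketch below assumes (this record does not state it) that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are illustrative, and `FRAGMENT` is only the first 20 residues of the protein sequence, so its values are not expected to match the full-length figures.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Assumption: first 20 residues of QZI79662.1, used only as a short demo input.
FRAGMENT = "MSYTYTDHVANGSQVTFPFR"

if __name__ == "__main__":
    print(f"aromaticity: {aromaticity(FRAGMENT):.5f}")
    print(f"hydropathy (GRAVY): {gravy(FRAGMENT):.5f}")
```

Passing the full protein sequence instead of `FRAGMENT` gives values that can be compared with the figures listed above, provided the database uses the same definitions.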

Domains

Domains [InterPro]
ID | Type | Range
DC_0041 | STR | 1–472
IPR030392 | CHP | 889–951
cd10144 | CHP | 890–1008
Architecture
ATT 1-113 | STR 114-976 | RBD 977-1023
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout: QZI79662.1, residues 1-1027 (N-terminal, Central, C-terminal)
Domain | Start | End | Length (AA) | Confidence
N-terminal | 1 | 251 | 251 | 0.9955
Central domain | 252 | 465 | 215 | 0.9243
C-terminal | 466 | 1027 | 561 | 0.7562
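
The Start, End, and Length columns of the segmentation table can be cross-checked with a few lines of code. The sketch below assumes 1-based, inclusive coordinates, which this record does not state explicitly. Under that assumption the spans come out as 251, 214, and 562 residues, so the listed lengths for the central and C-terminal domains (215 and 561) may follow a slightly different boundary convention. `Domain`, `span_length`, and `tiles_sequence` are illustrative names.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def span_length(domain: Domain) -> int:
    """Number of residues covered by an inclusive [start, end] range."""
    return domain.end - domain.start + 1


def tiles_sequence(domains: List[Domain], protein_length: int) -> bool:
    """True if the domains cover residues 1..protein_length with no gap or overlap."""
    expected_start = 1
    for domain in sorted(domains, key=lambda d: d.start):
        if domain.start != expected_start:
            return False
        expected_start = domain.end + 1
    return expected_start == protein_length + 1


# Coordinates copied from the segmentation table.
DOMAINS = [
    Domain("N-terminal", 1, 251),
    Domain("Central domain", 252, 465),
    Domain("C-terminal", 466, 1027),
]

if __name__ == "__main__":
    for d in DOMAINS:
        print(d.name, span_length(d))
    print("covers full protein:", tiles_sequence(DOMAINS, 1027))
```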
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-251, Central 252-465, C-terminal 466-1027

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Escherichia phage vB_EcoP-101118B1 [NCBI] | 2865797 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
QZI79662.1 [NCBI]
GenBank nucleotide accession
MZ234020.1 [NCBI]
CDS location
range 32352 -> 35435
strand +
CDS
ATGAGTTATACATATACAGACCATGTAGCTAATGGCTCACAGGTTACATTCCCTTTTCGTTTTGCAGGCAGAGATAAGGCATACATTAGTGCGACAGATGTCCAAGTGGGTTTTGTAGAAGCAGGCACTTGGCTGCCAGCAACAGGTTGGACGCTAAGTGGCACCAATCAGATCACTTTCTTAACCCCACCGACAGAAGGTAAACAAATACGAATCAGACGTATTGTAGATAAAATAAACCCATATGCTTACTTCGAGCGGGGTAATGTTTTAGATATTAACTCTTTAAATAACTCATTCATTAAGAACCTGCAAGTCAGTCAAGAAATCCTTGATGGCTTCTTCCCAGAAGGTTTCTACTTCAAGGCAGACTTAGACATGGGTGGTCATAAGATTATAAACCTACATGAGGGTTCTGAGTCAGGTGATGCTGTAAATTATGATCAGTTTAAAGAGTTATCAGACCGCGTTAATGAGATTGAATCAGATTTAACAGGTTTGTCTCACAGAACAGTACCTATGTACTATATCGCAACTGGTGGAGAGTCACGTTGGGAGATCGGTCACTGGACTTTCGACGCTGCAATATTGTTTATTAATGGTGTATTCCAGAACCAAAACCTAGGGGCTTTCACCATCAGCAACAATGGTTTTAACTTTGCAGAACCATTGCAGAAAGGGGATGAAGTATATGCCCTTGTAGGCAGCCGCATTGCAGCACCCAATGAGTATGTAACAATCTCTGACTTGAAAGACCCTGATAAAGCGTGGAGGCTGATCGGACGCAAGTACCCTTTTGCGACAGATGGAACTTTTATGTCAAAGGTCGTACATGACGTAGACTACTCTAATCTCTTCACTGGAGCGACAGGTCTTAGTCAAGGCTTCTGTATCGTAGACACGCCGCAAGGTGCGCGCATGTTCTTCCACCAACCTATCCCAAGCACTACTAAGGTTCGTTTCGTTGAAACAACCTTTAACCCTGAAGGGGACAATGAAAACCCTTCTGTTATTTCATTCAGTCAGGAACTGGATGATATTGGACACCAGACTGTGTGTGGTGTATATGAGGATGGGGCGGTGCGTTTATACACCTTAAGTGCAGATGAAAAATCTTATTATATCATAGATTGGCAAGGTAATAATACGTCTAATGTACATATGCGTAAGGTGCAGGTTCCTACGGAAGGTGTGTATTCTAAGGACTTATTTGTCTCAGTGGGTATGTCAGAAGATACTAAAACGCTGGTATTCCATGCAACCACCAATACTCAGGCTTTTGGTAACAAGGATGATGCGAGGGCTATTTACCTCTTCGATAGAAAAACATTGGATTCAACCTTAACTAGAGGTCTTAAGCGTAGATTCGAAGTAGGAGCACCTGCTGCACCAGATATGGCCTTCCAAGGGGTTGCTTGTGATGATTACTTCTTGTATATATATCATGGATATAATGGTGTCCTGTTGTCTCACAAGATCTTCGTCTTCACCCTCTCAGGGGAGCATGTACGGGATATTCCCGTTGATAGTGTGCGTGTAAAATATGGAGAAAGGATGTATGGTGACGAGACTTTAGGGTATCCTGTATTGCAGGAACCGGAAGGTTTGGCAATGTATAAAGGCAAACTTTATATGTTGTGTATGGATTTCTGGTACAAGAATGCAAGTGTAGTTACTTTCGCTGGCAGGACTTTTGCTACAAGGAAATCTTCAGCGTTTTCTGGGTACAGCCCATTACAAGCATCATATTGGACACCTACTAAACTAATACCACCCACGGGTGCACCTGATTATAGCAAGACAGCCACCTACTCCTGCTCTAATGCTACTAAGCTCAGTAAAGCTGTTGTCAGTTTAGAAATAGATGATGGCACAGGCTTCCCTGCAAGTATTGGGTGTTCATTGCCAGAGAGTAGCGCCAGTGTATACCTAAACAACCGAGCGATGAATATGGCTATCGGGCCAAATGAAAACTTCCAGTTTGGGGTATTTCACCAGAA
CCTACAGTCATACAAAAACCTGTTACAGCTCAACAGAGAGAACCCAAATACGGATGGATCAGCGGCTGTGTGGAGGTTGTTCGGAGGTGCGTTCACAGATGACATTCAGAATGAAAGATTTGTTCAGATTAAGCATAGACTTAATCCCACACAAGATGCCATGGAGCTTCGTGCGGATGTTGACCTTACGAGCGGTGGTGGGATTAACTTGTATTCTATGAACGACTCTGCCTCTCCTGGTCGAGTAAGGCTATATTGTACTGACGGGTATTCTTATTGGTCAGCTCTACTGAGTCCTGCATCCCCATCTTTTCACCCTGATCAAGATAAGACTCTAAACTTAGGTACAGCAAACAACCGTTGGAACCTAGTCCATTGTCAAGGTATCCGTATCATCTCTAACGACGAGTTACAGAAGCAAGTCTTTCTTGAAACTACCCGTAAAAAAGGTGCACTACAGATCTCCACTTCCGGCAATCTAGGGGTGTGGGATAGTAGCGCAGGTACTTATGTTGTGGCAACCAGACCTGATGGTACAGGCTTTTCACAGGTAGAGATGAGCTTCAAAGGTGATGTTTTTCCTGAAATAAATAATACTTACAATTTAGGTCGTGAGAATAAGGCATGGGCTAATGCCTACTTTCAAAACGCACCGACTGTTGTCTGTGATGCCAGACTTAAAACAGATGCTAGGGAGTTAACAGCCGCAGAGAAATCCGCATTCTTAGAGATCTCTAAACTTCCGGCAGTTTGGCAATGGCTTGCTAAACTGGAAGTAGAAGGAGAGGATGCCCGCTTACACTCTGGATGTACTGTCCAAGCTGCTATCTCGGTTATGGAGAAACATGGCCTAGATTGGACTAAGTATAGTGCTTTCTGTTATGACAAATGGGATGCCAAGGAGGCTGTCTATGATATAGTGGATGGTAAGGCTATCCTTGTTGAAGAGGCAGTTGAGGCTGGCGACCGTTATCGTCTTCGTAGAGAAGAGCTGATGTGGTGGTGCATGAAGGCACAGAATGCTTGGATTGAATCTATTGAGGGGAGGCTGGCGAAATTAGAAGATAAACTCAAGGAGGTGTAA
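
The CDS coordinates and the protein length are linked by simple arithmetic, sketched below. The sketch assumes the range is 1-based and inclusive and that the CDS ends with a single stop codon (the sequence above ends in TAA). `cds_length`, `protein_length`, and `translate` are illustrative helpers that use the standard genetic code, and the two test strings are the first 18 and last 12 nucleotides of the CDS.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def cds_length(start: int, end: int) -> int:
    """Nucleotide count of an inclusive, 1-based range."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Residue count, assuming the range ends with one stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    print(cds_length(32352, 35435))       # 3084 nucleotides
    print(protein_length(32352, 35435))   # 1027 residues
    print(translate("ATGAGTTATACATATACA"))  # MSYTYT
    print(translate("AAGGAGGTGTAA"))        # KEV*
```

The result of 1027 residues agrees with the protein length reported under the physico-chemical properties.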

Genome Context

Tertiary structure

PDB ID: 37b41c25146812e78f49bb3ef313a7b03691ac3ff626b28678e3faab63cd1753
Source: ESMFold
Method: ESMFold
Resolution: 0.5950
Oligomeric state: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
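
The confidence bands above map a per-residue pLDDT score to a label. The sketch below is one way to express that mapping. The bands leave the exact boundary values open, so the handling here is an assumption: 90 is labeled High, and 70 and 50 are labeled Low. `confidence_band` is an illustrative name.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the model confidence label.

    Boundary handling is an assumption: 90 -> High, 70 -> Low, 50 -> Low.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Hypothetical scores, used only to show the mapping.
    for score in (95.2, 81.0, 63.5, 41.7):
        print(score, confidence_band(score))
```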

Literature

Title | Authors | Date | PMID | Source
Naturally bred epsilon2 phages have an improved host range and effectivity in uropathogenic E. coli over their ancestor phages | Saez, D., Loose, M., Mutti, M., Visram, Z., Hitzenhammer, E., Dippel, D., Tisakova, L., Schertler, S., Wittmann, J., Corsini, L. and Wagenlehner, F. | 2020-05-19 | | GenBank