Genbank accession
ASZ77104.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,82
TF
Evidence RBPdetect2
Probability 0,73
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
QGKAVWSLGTEINSGTFVLHHYKEDGTQGHTSRFNADGTVNFPDNVQVGGGEATIARNGNIFSDIWKTFTSAGDITNIRDAIATRVSKEGDTMTGRLTLKTNSDAVVIDYPADEAGYVKGKKGGADNWYVGNGGADNGLAFWSFQSQGGININPNGEVILSPQGTSIFNINRDRIHMNGAHWVARKSGAWGDQWGLEAPYFLDFGSVGEDSYYPIIKGRSVITGQGYTTSVDLGIRRTPQAWGQAIIRVGNAERGDGPVGVFEFHSSGLFYAPSLVQTPAIGVGTVNGLGSPSIAIGDNDTGLSHGGDGRINMVADGMHIASWSGSYHIHEGLWDTTGALWTETGRAIISFGHLVQQHDGYSTFVRDVYVRSDIRVKKDLVKFENASQTLSKINGYTYMQKRGLDEEGNQKWEPNAGLIAQEVQAILPELVEGDPDGEALLRLNYNGVIGLNTAAINEHTAEIAELKSEIEELKALVKSLLK
Physico‐chemical
properties
protein length:482 AA
molecular weight: 51978,02610 Da
isoelectric point:5,27093
aromaticity:0,08921
hydropathy:-0,33402

Domains

Domains [InterPro]
DC_0339
STR
1–413
G3DSA:6.20.80.10
STR
104–159
IPR030392
CHP
372–431
DC_2320
RBD
399–482
ASZ77104.1
1 482
Architecture
STR
RBD
STR 1-413 | RBD 414-482
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
ASZ77104.1
1 482
Domain Start End Length (AA) Confidence
N-terminal 1 33 33 0,0026
Central domain 34 232 200 0,2484
C-terminal 233 482 249 0,9056
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-33
Central
34-232
C-terminal
233-482

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage ECO4
[NCBI]
2025816 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Tequatrovirus
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Genome Context

Genome Context

Tertiary structure

PDB ID
314b3cb53c542a438ef9a5062d5b362046e639ec4d7bf34fc8e1c348dffac1ce
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7236
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50