GenBank accession: ATJ04759.1
Protein name: tail spike protein
RBP type  Evidence    Probability
TSP       GenBank     1,00
TF        Phold       1,00
TSP       DepoScope   1,00
TSP       RBPdetect   0,74
TSP       RBPdetect2  0,84
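The page does not say how the per-tool calls are combined into a single type. As a rough illustration only, the sketch below sums the listed probabilities per predicted type after converting the decimal commas used on this page. The function names and the summing rule are assumptions made for this example, not the database's method.

```python
from collections import defaultdict

# Rows copied from the evidence listing above: (RBP type, evidence source, probability).
EVIDENCE = [
    ("TSP", "GenBank", "1,00"),
    ("TF", "Phold", "1,00"),
    ("TSP", "DepoScope", "1,00"),
    ("TSP", "RBPdetect", "0,74"),
    ("TSP", "RBPdetect2", "0,84"),
]


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '0,74' to a float."""
    return float(text.replace(",", "."))


def tally_by_type(rows):
    """Sum probabilities per RBP type (hypothetical scoring rule)."""
    totals = defaultdict(float)
    for rbp_type, _source, prob in rows:
        totals[rbp_type] += parse_decimal_comma(prob)
    return dict(totals)


totals = tally_by_type(EVIDENCE)
best = max(totals, key=totals.get)  # type with the largest summed probability
print(best, round(totals[best], 2))
```

Under this rule four of the five sources support TSP and one (Phold) supports TF; a different weighting could be substituted in `tally_by_type`.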
Protein sequence
MANNPKYAAVVAGKGSTNLISDVLVDFSTSDARQAHGVTVEGSDNVINNVLMSGCDGTNSLGQAQTATIARFIDTANNNYASVFPSYSATGVITFESGSTRNFVEVKHPGRRNDLLSATGTIEGKVTIDGTSNSNVVHAPALGQYIGSMSGRFEWRIKSMSLPSGVLTSADKYRMLGDGAVSLAVGGGTSSQVRLFTSDGTYRTVSLTNGNVRLPTSSTGYLQLGSNAMTPDSTNTYALGSASRAWSGGFTQSAFTVVSDARDKTEPLNISDALLDAWSEVDFVQFQYLDRIEEKGADSARWHFGIIAQRATEAFERHGIDAHRYGFLCFDSWDDVYEEDANGSRKLITPAGSRYGIRYEEVLILEAALMRRTIKRMQEALAVMSK
Physico-chemical properties
protein length: 386 AA
molecular weight: 41582,74120 Da
isoelectric point: 5,59787
aromaticity: 0,08549
hydropathy: -0,24326
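For readers who want to recompute values of this kind from a sequence, here is a minimal sketch. It assumes that aromaticity means the fraction of F, W and Y residues and that hydropathy means the mean Kyte-Doolittle score (GRAVY). The page does not state which definitions it uses, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


toy = "MANNPKYAAVVAGK"  # first 14 residues of the sequence above, used as a toy input
print(len(toy), round(aromaticity(toy), 5), round(gravy(toy), 5))
```

Passing the full protein sequence instead of `toy` gives whole-protein values, which may differ from the figures listed above if the page uses other definitions.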

Domains

Domains [InterPro]
Accession  Type  Range
IPR012334  STR   1–259
cd10144    CHP   259–372
IPR030392  CHP   259–319
IPR036388  RBD   260–382
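Several of these annotated ranges overlap. The sketch below counts shared residues for each pair, assuming the ranges are 1-based and inclusive (the page does not state its coordinate convention). The helper names are hypothetical.

```python
# Annotated ranges copied from the listing above: (accession, type, start, end).
DOMAINS = [
    ("IPR012334", "STR", 1, 259),
    ("cd10144", "CHP", 259, 372),
    ("IPR030392", "CHP", 259, 319),
    ("IPR036388", "RBD", 260, 382),
]


def shared_residues(a_start, a_end, b_start, b_end):
    """Number of positions two inclusive ranges have in common."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def overlapping_pairs(domains):
    """Return (accession_a, accession_b, shared) for every overlapping pair."""
    pairs = []
    for i, (id_a, _type_a, s_a, e_a) in enumerate(domains):
        for id_b, _type_b, s_b, e_b in domains[i + 1:]:
            n = shared_residues(s_a, e_a, s_b, e_b)
            if n:
                pairs.append((id_a, id_b, n))
    return pairs


for id_a, id_b, n in overlapping_pairs(DOMAINS):
    print(f"{id_a} / {id_b}: {n} shared residue(s)")
```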
ATJ04759.1, residues 1–386
Architecture
STR 1-259 | RBD 260-384
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (ATJ04759.1, residues 1–386)
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,0035
Central domain 11 199 190 0,8344
C-terminal 200 386 186 0,9852
Legend: N-terminal Central domain C-terminal
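A quick way to sanity-check a segmentation table like this one is to compare each listed length with the span implied by its start and end. The sketch below assumes 1-based, inclusive coordinates, which the page does not confirm; the function name is hypothetical.

```python
# Rows copied from the segmentation table above: (domain, start, end, listed length).
ROWS = [
    ("N-terminal", 1, 10, 10),
    ("Central domain", 11, 199, 190),
    ("C-terminal", 200, 386, 186),
]


def length_mismatches(rows):
    """Return (domain, implied length, listed length) where the two disagree."""
    out = []
    for name, start, end, listed in rows:
        implied = end - start + 1  # inclusive span
        if implied != listed:
            out.append((name, implied, listed))
    return out


for name, implied, listed in length_mismatches(ROWS):
    print(f"{name}: implied {implied} AA, listed {listed} AA")
```

Under the inclusive assumption, the central and C-terminal rows each differ from their listed length by one residue, while the listed lengths still sum to 386. This may indicate that the boundaries follow a different convention (for example half-open intervals) rather than an error in the segmentation itself.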
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-10
Central 11-199
C-terminal 200-386

Taxonomy

       Name                    NCBI Taxonomy ID  Lineage
Phage  Escherichia phage ZG49  1897641           Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Escherichia coli        562               cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

No CDS data available.

Genome Context

Tertiary structure

PDB ID 5f119be3d8ff75dc68191beeb5a1ee1ee9928de36870be867dd516fb7fa59189
Source ESMFold
Method ESMFold
Resolution 0,7309
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
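The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the legend uses strict inequalities and does not say where values of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption, as is the function name.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score to the confidence label used in the legend."""
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```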