Genbank accession
AVD99004.1 [GenBank]
Protein name
tail spike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MALTNLIYNASKNNDREQWRRSLVEAGLTLVDGSFEEGATANSKTDAVWHIAGGQCYTWDATFPKTVPAASTPETSGGVDSGAWVSVDDVVLRQTLSSDEGFDVVSGKIGLPGAVKQSLKEALSVSVNIISFGADPTGIKDSSDAIQAAIDAVYALGGGEVFIPKCTGSAGNQGVGYRVTKSIMLKPFVNLRGEGINSCLKAYSDLGKGLLVLMHGGGISGRYLYNFLLQGNGTGIGIGTDIESTAEAEQQIYGFDIQGVCAEVFEIGMQLQGLWHSTLRNCTTSSCRMGLHLWGQNVSINITGCHFRRDHYSTLGNTFGIRIQPRVYAWSPDQSSGSRSEAIIIDGETMCISVDYGIYVDDCLDLQMSNLDLDYIRKTAVAIINVNGGFNLSDSWIAGDAASTEQFFAVSLNANSIQQQKSITGLHCNLHNANPLNNNIGINITKSVGPVYIGQSTFVGGWSGIAIFNHTDGGIVIDSCTLANQLYLTGSSDVVVQNSGLTGIEETLKPTNSHHYYRSNRGTPATNGVVNVPVAGGATSGTLAIPNADTTKSYSVRPWADNPATVNDTASCSAGVITVNRPIAVGVPLPTTVEYSLF
Physico‐chemical
properties
protein length:598 AA
molecular weight: 63318,13690 Da
isoelectric point:5,04238
aromaticity:0,07692
hydropathy:-0,02993

Domains

Domains [InterPro]
G3DSA:2.10.10.80
ATT
28–88
G3DSA:2.10.10.80
ATT
30–88
IPR040775
RBD
32–88
AVD99004.1
1 598
Architecture
ATT
STR
ATT 28-88 | STR 89-598
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
AVD99004.1
1 598
Domain Start End Length (AA) Confidence
N-terminal 1 140 140 0,9916
Central domain 141 505 366 0,9923
C-terminal 506 598 92 0,9532
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-140
Central
141-505
C-terminal
506-598

Taxonomy

  Name Taxonomy ID Lineage
Phage Shigella phage SFN6B
[NCBI]
1785176 Uroviricota > Caudoviricetes > Autographivirales > Slopekvirinae > Drulisvirus
Host Shigella flexneri
[NCBI]
623 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Genome Context

Genome Context

Tertiary structure

PDB ID
72df907ab292f6de2b00253a56ea601234c5e5b5a051345e8fe588f8b9dda799
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8271
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50