UniProt accession
Q9XJP3 [UniProt]
Protein name
Tail spike protein
RBP type
Type | Evidence | Probability
TSP | UniProt/Swiss | 1.00
TSP | DepoScope | 1.00
TSP | RBPdetect | 0.91
TSP | RBPdetect2 | 0.95
Protein sequence
MTDIITNVVIGMPSQLFTMARSFKAVANGKIYIGKIDTDPVNPENQIQVYVENEDGSHVPASQPIVINAAGYPVYNGQIVKFVTEQGHSMAVYDAYGSQQFYFQNVLKYDPDQFGPDLIEQLAQSGKYSQDNTKGDAMIGVKQPLPKAVLRTQHDKNKEAISILDFGVIDDGVTDNYQAIQNAIDAVASLPSGGELFIPASNQAVGYIVGSTLLIPGGVNIRGVGKASQLRAKSGLTGSVLRLSYDSDTIGRYLRNIRVTGNNTCNGIDTNITAEDSVIRQVYGWVFDNVMVNEVETAYLMQGLWHSKFIACQAGTCRVGLHFLGQCVSVSVSSCHFSRGNYSADESFGIRIQPQTYAWSSEAVRSEAIILDSETMCIGFKNAVYVHDCLDLHMEQLDLDYCGSTGVVIENVNGGFSFSNSWIAADADGTEQFTGIYFRTPTSTQSHKIVSGVHINTANKNTAANNQSIAIEQSAIFVFVSGCTLTGDEWAVNIVDINECVSFDKCIFNKPLRYLRSGGVSVTDCYLAGITEVQKPEGRYNTYRGCSGVPSVNGIINVPVAVGATSGSAAIPNPGNLTYRVRSLFGDPASSGDKVSVSGVTINVTRPSPVGVALPSMVEYLAI
Physico-chemical properties
Protein length: 623 AA
Molecular weight: 67065.69120 Da
Isoelectric point: 5.09513
Aromaticity: 0.08668
Hydropathy: -0.03435
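
The aromaticity and hydropathy values can in principle be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the page does not state which definitions it uses. The function names and the 30-residue fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


# First 30 residues of the sequence listed above, used only as a short demo;
# pass the full 623-residue sequence to compare against the reported values.
fragment = "MTDIITNVVIGMPSQLFTMARSFKAVANGK"

if __name__ == "__main__":
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Running the same two functions on the full sequence gives numbers that can be compared with the reported aromaticity and hydropathy. A mismatch would suggest the page uses different definitions.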

Domains

Domains [InterPro]
Entry | Class | Residues
IPR009093 | ATT | 1–100
G3DSA:2.170.14.10:FF:000001 | Unmapped | 2–110
IPR036730 | ATT | 2–110
IPR036730 | ATT | 7–109
Architecture
ATT 1-113 | STR 114-623
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout
Domain | Start | End | Length (AA) | Confidence
N-terminal | 1 | 175 | 175 | 0.9911
Central domain | 176 | 549 | 374 | 0.9922
C-terminal | 550 | 623 | 74 | 0.9798
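
Domain lengths follow from the 1-based, inclusive start and end coordinates as end - start + 1. The sketch below recomputes them from the listed boundaries and checks that the three domains cover all 623 residues without gaps or overlaps. The `Domain` type and the `tiles_protein` helper are hypothetical names introduced for illustration, not part of the database.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count as residues.
        return self.end - self.start + 1


PROTEIN_LENGTH = 623
domains = [
    Domain("N-terminal", 1, 175),
    Domain("Central domain", 176, 549),
    Domain("C-terminal", 550, 623),
]


def tiles_protein(doms: list[Domain], protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for dom in sorted(doms, key=lambda d: d.start):
        if dom.start != expected_start:
            return False
        expected_start = dom.end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for dom in domains:
        print(dom.name, dom.length)
    print("covers full protein:", tiles_protein(domains, PROTEIN_LENGTH))
```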
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-175), Central (green, 176-549), C-terminal (pink, 550-623).

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Shigella phage Sf6 [NCBI] | 10761 | Uroviricota > Caudoviricetes > Lederbergvirus >
Host | Shigella flexneri [NCBI] | 623 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)
Genbank protein accession
AAD33394.2 [NCBI]
Genbank nucleotide accession
AF128887 [NCBI]
CDS location
range 20 -> 1891
strand +
CDS
ATGACAGACATTATAACCAATGTTGTAATTGGGATGCCTTCGCAACTCTTCACTATGGCTCGTTCTTTTAAAGCCGTAGCCAATGGCAAAATTTATATCGGTAAAATTGACACTGACCCGGTAAATCCTGAAAACCAGATTCAGGTTTATGTAGAGAATGAAGACGGATCTCACGTCCCTGCTTCTCAGCCAATTGTTATCAATGCTGCCGGATATCCTGTGTACAACGGGCAGATTGTCAAATTTGTAACTGAGCAAGGCCATTCAATGGCTGTATATGATGCGTATGGTTCGCAGCAGTTCTATTTTCAGAATGTGCTGAAGTACGACCCTGATCAGTTCGGTCCGGATTTAATTGAGCAACTAGCTCAATCCGGTAAGTATTCGCAGGATAACACCAAAGGCGATGCCATGATTGGCGTCAAGCAGCCTTTACCAAAAGCAGTTTTAAGAACTCAGCATGACAAAAATAAAGAAGCAATAAGTATCCTGGATTTTGGTGTTATTGATGATGGTGTGACAGATAATTACCAGGCAATACAAAATGCAATAGATGCCGTTGCTTCACTACCCTCCGGCGGGGAGCTGTTTATCCCTGCGAGCAACCAAGCGGTTGGGTATATTGTTGGATCCACTTTGCTTATTCCTGGCGGTGTTAACATCAGAGGGGTTGGTAAGGCATCGCAACTCCGAGCAAAAAGCGGACTTACAGGATCTGTGTTAAGGCTGTCTTATGATTCAGACACTATCGGCCGTTATCTGAGAAATATACGAGTAACTGGTAATAACACCTGCAATGGTATTGACACAAACATTACAGCAGAAGACTCTGTCATCAGACAGGTTTATGGCTGGGTATTTGATAATGTAATGGTGAATGAAGTTGAAACCGCTTATTTAATGCAAGGGCTCTGGCACTCAAAATTTATAGCATGTCAGGCTGGAACCTGTAGAGTCGGTCTTCACTTTTTAGGCCAGTGCGTAAGTGTTAGTGTCAGCTCCTGCCATTTCAGCAGAGGAAATTATTCTGCTGATGAAAGCTTTGGCATCAGGATTCAGCCTCAAACGTATGCGTGGTCGTCAGAGGCAGTTAGGTCAGAAGCAATAATTTTAGACAGTGAGACCATGTGCATTGGTTTTAAAAATGCCGTCTATGTTCATGATTGCCTTGATTTGCATATGGAACAACTGGATTTAGATTATTGCGGCTCAACAGGCGTGGTTATAGAGAATGTAAACGGAGGATTTTCTTTCTCAAACTCATGGATAGCAGCAGATGCCGATGGCACTGAACAATTTACGGGGATATATTTTAGAACGCCGACCTCAACGCAGTCACATAAAATTGTCAGTGGTGTTCATATCAACACTGCAAACAAAAACACGGCTGCAAACAATCAAAGTATAGCGATAGAACAGTCGGCGATCTTCGTCTTTGTAAGTGGTTGTACGTTAACTGGTGATGAATGGGCTGTAAATATTGTCGACATCAATGAATGTGTTTCTTTCGATAAGTGCATATTCAATAAGCCTCTACGCTATCTTCGTAGCGGTGGCGTGTCAGTTACTGACTGTTATTTGGCTGGCATTACCGAGGTGCAGAAACCGGAAGGCAGATACAATACGTATCGTGGCTGTTCAGGCGTACCGTCTGTAAATGGGATCATTAATGTCCCAGTTGCGGTAGGTGCCACAAGCGGATCTGCAGCTATACCGAACCCGGGGAACCTGACATACAGAGTAAGAAGCCTTTTTGGTGACCCTGCATCAAGTGGTGACAAGGTTAGTGTTTCCGGGGTGACAATTAATGTTACTCGCCCAAGCCCAGTAGGAGTCGCGCTGCCTTCAATGGTTGAGTATCTGGCCATCTGA
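
The CDS coordinates and the protein length can be cross-checked: an inclusive range of 20 to 1891 spans 1872 nucleotides, or 624 codons, which is 623 amino acids plus one stop codon. The sketch below does this arithmetic and translates the first and last few codons of the CDS above. It assumes the standard codon assignments (the bacterial table differs only in permitted start codons), and `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates as listed under "CDS location" (1-based, inclusive).
cds_start, cds_end = 20, 1891
cds_length = cds_end - cds_start + 1
codons = cds_length // 3

# First 18 and last 12 nucleotides of the CDS shown above.
cds_head = "ATGACAGACATTATAACC"
cds_tail = "CTGGCCATCTGA"

if __name__ == "__main__":
    print("nucleotides:", cds_length, "codons:", codons)
    print("N-terminus:", translate(cds_head))
    print("C-terminus:", translate(cds_tail))
```

Passing the full CDS to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records correspond.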

Genome Context

Gene Ontology

GO ID | Description | Category | Evidence (source)
GO:0098024 | virus tail, fiber | Cellular Component | IDA:UniProtKB (UniProt)
GO:0052775 | endo-1,3-alpha-L-rhamnosidase activity | Molecular Function | IDA:UniProtKB (UniProt)
GO:0044409 | symbiont entry into host | Biological Process | IDA:UniProtKB (UniProt)
GO:0098994 | symbiont entry into host cell via disruption of host cell envelope | Biological Process | IEA:UniProtKB-KW (UniProt)
GO:0098995 | symbiont entry into host cell via disruption of host cell envelope lipopolysaccharide | Biological Process | IEA:UniProtKB-KW (UniProt)
GO:0019062 | virion attachment to host cell | Biological Process | IDA:UniProtKB (UniProt)

Tertiary structure

PDB ID | Source | Method | Resolution | Oligomeric State

Literature

PMID | Source
12424253 | PubMed