UniProt accession
A0A7D5JQB5 [UniProt]
Protein name
Tailspike protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
TSP
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MNKMFTQPTGPVAKQTNKQVIARHFGVKQSEVVYFSVGALLTGYKVIYDKVTQRAYSLPADIGSGVTAVSLSPAGVLVHSAGSVDLGALAVTREEYVTLPGSFNTGVTVNTKNELVVFAGGKYRWDGVLPKVVPAGSTPTSTGGVSPNAWVSVGDATLRQEMSSETGASMVSRNASPLDRIIRASLFEYLTESDQQALLTTSGAVVSADYALKAAIAAGVMVLDIPRNLGIIELGSDPATLPLGFSLIGWGCRRPYTVSDDSSFLNCGVVIRVAAGASFPFYSTGRHVFRDIVFDGRDKTTYLFYSADKSTEFNGTRLEGCGIYRFAIGIGWNKEGSARYIATVKAHFCSLSGNGDGVKNLIDSMMIGCTINANDRGVALVGGANSNFFGGCRNEWNAGDNWYAYESIENQISGELCDRAGRGGVVAGKDSSWIVTGVVVARSGTNQPMNDNYSANFVIIDNGKIVISGVRTRNGAKEWGQGGTNSPSFNVSVLGSGNGTLLISGSDMTGFVTSALNQKAPTLNKAITGNLGMDDDVNVGMSQVIKGRRIIGAQSSGRLPATVGATLSFTKPNIAQDLYDTYVTRAILIECRIGNTAQGDYIKIPIGIRHETNFYLDIITSGIVASSGRIGLSGTGVTVSLSINSSTGDISVVLTSVDGLVRDVNVSLLPSM
Physico‐chemical
properties
protein length:672 AA
molecular weight: 70738,13980 Da
isoelectric point:7,99339
aromaticity:0,07738
hydropathy:0,05625

Domains

Domains [InterPro]
G3DSA:3.30.2020.50
ATT
1–95
DC_1953
STR
1–668
IPR040775
RBD
90–155
A0A7D5JQB5
1 672
Architecture
ATT
STR
ATT 1-165 | STR 166-668 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A7D5JQB5
1 672
Domain Start End Length (AA) Confidence
N-terminal 1 207 207 0,9933
Central domain 208 548 342 0,9900
C-terminal 549 672 123 0,9891
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-207
Central
208-548
C-terminal
549-672

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoP_SP5M
[NCBI]
2750853 Uroviricota > Caudoviricetes > Schitoviridae > Gamaleyavirus > Gamaleyavirus Sp5m
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QLF80674.1 [NCBI]
Genbank nucleotide accession
MT682708 [NCBI]
CDS location
range 62323 -> 64341
strand -
CDS
ATGAATAAGATGTTTACCCAGCCAACAGGCCCGGTAGCAAAACAGACTAATAAACAAGTGATAGCTCGTCACTTTGGAGTTAAGCAATCCGAAGTTGTTTATTTCTCAGTTGGTGCTCTGCTTACTGGCTATAAAGTTATCTATGACAAAGTGACACAGCGTGCTTATTCCTTACCTGCTGACATTGGTTCAGGGGTGACTGCTGTAAGCCTTAGCCCTGCTGGTGTGTTGGTACATTCTGCCGGTAGTGTGGACTTAGGTGCACTAGCTGTTACTCGTGAGGAATATGTAACCTTACCTGGTTCATTTAATACTGGAGTAACTGTTAATACTAAGAATGAACTGGTTGTTTTTGCTGGTGGTAAATACCGTTGGGATGGTGTACTACCTAAGGTAGTTCCTGCTGGTTCAACCCCTACATCAACCGGTGGTGTTAGCCCTAATGCGTGGGTTAGTGTTGGTGATGCAACGCTTAGGCAGGAGATGTCTTCTGAAACAGGGGCGTCAATGGTGTCCCGGAACGCCTCCCCTCTGGACAGAATCATTCGTGCGTCACTTTTTGAATACCTTACCGAATCGGATCAACAAGCATTACTTACAACTTCTGGAGCAGTAGTCTCAGCGGACTATGCGTTAAAAGCAGCTATAGCTGCAGGAGTTATGGTCCTTGATATACCGAGGAACCTGGGGATTATTGAACTTGGTAGCGATCCCGCTACGCTTCCATTGGGATTCTCACTTATAGGGTGGGGATGTCGTCGACCATACACTGTATCAGATGACAGTAGCTTCCTTAATTGCGGGGTCGTTATCCGTGTAGCAGCTGGAGCAAGCTTTCCGTTTTATTCAACTGGTCGTCATGTGTTCCGGGATATTGTTTTTGATGGTCGAGATAAAACAACGTATCTTTTTTATTCCGCAGATAAATCTACCGAGTTCAACGGTACTCGACTTGAAGGGTGCGGGATTTACCGTTTTGCTATTGGTATTGGTTGGAATAAGGAAGGATCAGCAAGATACATCGCAACAGTGAAAGCACACTTTTGTTCACTATCTGGAAATGGGGACGGAGTTAAAAATTTAATAGACTCTATGATGATTGGTTGCACAATCAATGCCAATGACCGAGGCGTAGCTCTTGTCGGTGGGGCAAATAGTAACTTTTTCGGGGGATGCCGTAACGAATGGAACGCTGGTGATAACTGGTATGCGTATGAATCAATAGAAAACCAGATTTCAGGTGAACTGTGCGACAGGGCTGGACGAGGAGGGGTAGTTGCTGGCAAGGATTCGTCATGGATTGTAACCGGAGTTGTCGTCGCTCGCAGTGGTACTAATCAACCAATGAATGATAACTACTCAGCAAACTTTGTTATTATTGATAACGGTAAAATAGTAATCTCTGGGGTGAGAACCCGAAACGGAGCTAAAGAGTGGGGTCAGGGTGGGACAAATTCTCCATCGTTTAACGTATCGGTTCTTGGCTCTGGTAATGGGACACTCCTGATATCTGGAAGTGATATGACCGGTTTTGTTACTTCGGCACTTAACCAGAAGGCTCCTACATTAAATAAAGCTATAACCGGAAATCTCGGTATGGATGATGATGTAAATGTTGGAATGTCACAGGTTATCAAAGGAAGACGGATTATTGGTGCACAGTCATCAGGTAGATTACCAGCCACTGTGGGAGCAACGTTATCATTCACTAAGCCCAATATTGCGCAAGATTTATATGATACATACGTCACCAGAGCCATATTGATTGAGTGCAGAATTGGCAATACTGCACAAGGTGATTATATAAAAATACCTATAGGGATAAGGCACGAAACCAATTTCTATCTTGATATCATCACCAGCGGAATTGTCGCAAGCTCTGGAAGGATTGGTCTTTCTGGAACAGGGGTTACTGTTAGCTTATCTATTAATAGTTCAACTGGTGACATCTCAGTTGTGTTAACTAGTGTTGATGGGCTTGTGAGAGATGTTAATGTCTCTCTACTGCCATCAATGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
32ae5e6ffa1a603f8df5cbe09970f15ff5be029683f90e5b5357137ccfe79c07
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7756
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50