Genbank accession
WRQ13383.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MGYFQMTRNVEELFGGVVTAPHQIPFTYKSNVGGETFLSLPFYPVTGVVTINGGMQVPLDNFEIEGNTLNLGRALSKGDVVYCLFDKILSPEDTAKGIRIYKFQAVGGETEFTPDFTSYGVQSLYIGGEYKTPEIEYSYDSTTGKVSLQTALTAGVWVVAEMSVKQPNISPAFDRSIQEIARSANVKDSEVIVSTDTISLLDGKKVVYDIATQTSYGLPTIPDGSVISSVSAGKLNYNPGDVQVDLLPLEDSFINVINTLGRNDGAKYIGECHSVADLRNTEPTMDGQRIILKQHTAGTLLGGGVFRALIDGTGKTDNNGTVIKTVGGAAWLRVNADRVNPFMFGALGGSNDDTIPVQSCVDSGKATQLTGVHYVSNIQLKYNTSSIYGSGLHYSRLHQLPSATGNCITIKDTCSLIVLDAFGVYGTGAQQGTSFTAGTTGIYVETPSGLSADYPFHTTADPRRDLCISKVHIAGFDEYGLNIDSGNFSVTTDSLLVNHINQVGVRCATTDWTWTNIQVNTCGKQCLVLDGCGNGRIIGGKFIWANWQPYGTVGQFPGITINNSQNMVINGIEVQDCGGNGIEISDSYSISMNGLNTNRNGINANNTFYNIVFNKSDAVINGFVGLNYAANSGSGANSSAGNFQFLSNDCSVTINGVVETGYMGINFIGDNNIINPTNSDLSINGLVNYSKTGLQTMNETPTFDGVSTTPVYVSVPSSVGQVNGLRLSQANKDKLLYSRTAGPEGITMAAVVVPTISGAEVFNFMAIGSGFSDTSNSLHLQLVIDASGKQTIALLLGGDGTTQILSGDLPNDLKLQSGVPYHIAIGAKPGYFWWSILNIQTGKRIRRSFRGAYLAVPFNSIFGLTSSLTFFSDSNAGGDACSGVGAKVYVGMFSSENDYVASRYYNLINPVDPTKLISYRILDSSI
Physico‐chemical
properties
protein length:926 AA
molecular weight: 98798,63870 Da
isoelectric point:4,99753
aromaticity:0,09179
hydropathy:-0,03855

Domains

Domains [InterPro]
DC_0041
STR
4–444
G3DSA:3.30.2020.50
ATT
161–250
WRQ13383.1
1 926
Architecture
STR
ATT
STR
STR 4-160 | ATT 161-250 | STR 251-926
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WRQ13383.1
1 926
Domain Start End Length (AA) Confidence
N-terminal 1 353 353 0,9952
Central domain 354 775 423 0,9800
C-terminal 776 926 150 0,4947
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-353
Central
354-775
C-terminal
776-926

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage vB_SenAc-pSK20
[NCBI]
3093916 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella enteritidis
[NCBI]
149539 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WRQ13383.1 [NCBI]
Genbank nucleotide accession
OR729889.1 [NCBI]
CDS location
range 140348 -> 143128
strand -
CDS
ATGGGGTATTTTCAAATGACCAGAAATGTAGAAGAATTATTCGGCGGCGTAGTCACAGCTCCCCACCAGATTCCTTTCACGTATAAATCAAATGTCGGTGGAGAAACTTTCCTTTCCTTGCCGTTCTATCCTGTCACTGGCGTAGTCACAATCAACGGTGGTATGCAAGTTCCCTTAGACAACTTTGAAATCGAAGGAAATACGTTGAATCTCGGGCGCGCATTGTCCAAAGGCGATGTTGTGTATTGCTTATTCGATAAAATTCTTTCGCCAGAAGATACAGCCAAAGGTATCCGCATATACAAATTTCAGGCCGTAGGAGGTGAAACCGAGTTCACTCCTGATTTCACATCTTATGGAGTCCAATCTCTTTATATCGGTGGCGAGTACAAAACCCCCGAAATTGAATATTCCTATGACAGCACGACAGGAAAAGTATCTTTGCAAACTGCACTGACTGCAGGCGTTTGGGTAGTCGCTGAAATGTCTGTTAAACAACCGAATATCAGTCCGGCGTTTGACCGAAGTATTCAAGAAATCGCCCGTTCTGCTAATGTAAAAGACTCTGAAGTCATCGTTAGTACGGACACCATATCTTTGTTGGATGGGAAGAAAGTTGTTTATGATATAGCGACGCAAACCAGTTATGGTTTACCAACCATTCCTGATGGTTCTGTCATTTCTTCTGTATCTGCTGGGAAATTGAATTACAACCCAGGTGATGTGCAGGTTGATTTGTTGCCTTTAGAAGATTCATTTATTAATGTGATAAACACTCTGGGGCGCAATGATGGTGCCAAGTATATTGGAGAATGCCATTCTGTTGCTGATCTCAGGAATACTGAACCCACTATGGATGGACAACGCATTATTCTTAAGCAACACACTGCGGGTACTCTTCTTGGTGGAGGGGTATTCCGTGCGTTAATTGATGGTACAGGAAAGACTGATAATAACGGTACTGTGATCAAAACTGTTGGCGGCGCGGCATGGTTACGTGTTAATGCTGATAGAGTTAACCCATTCATGTTTGGTGCTTTGGGTGGTTCTAATGATGATACTATTCCAGTACAATCTTGTGTGGATAGTGGTAAGGCCACACAATTAACTGGTGTACATTACGTTAGCAATATCCAGTTAAAATATAATACGTCGTCTATTTATGGGTCTGGATTACATTACTCAAGGTTGCATCAGTTGCCTTCTGCTACTGGGAATTGTATTACCATAAAAGATACATGCTCCCTTATTGTATTAGACGCCTTTGGGGTATATGGCACAGGTGCACAACAAGGCACGTCATTTACTGCGGGCACAACAGGTATCTATGTAGAAACTCCTTCAGGTCTCTCAGCCGATTATCCGTTCCACACTACCGCAGACCCAAGACGCGACTTGTGTATTTCTAAGGTCCATATAGCAGGTTTTGATGAATATGGGTTAAATATTGATAGTGGTAACTTTAGTGTTACTACAGATTCTCTTTTAGTCAACCACATCAATCAGGTGGGTGTCCGTTGTGCTACTACTGATTGGACTTGGACAAATATCCAGGTTAATACCTGCGGTAAACAATGTCTGGTTCTTGATGGTTGTGGTAATGGTCGTATTATTGGCGGTAAATTCATTTGGGCTAACTGGCAACCTTATGGTACAGTAGGACAGTTCCCAGGCATTACTATTAATAACAGCCAGAATATGGTTATTAATGGTATTGAGGTACAAGATTGTGGCGGGAATGGCATTGAGATTAGCGATTCATATTCAATTTCCATGAACGGATTGAACACCAATCGTAACGGCATCAATGCTAACAACACTTTCTACAACATCGTATTTAACAAAAGTGATGCAGTTATCAACGGATTCGTAGGACTCAATTATGCCGCGAATAGTGGTTCAGGTGCTAACTCTAGTGCAGGCAATTTTCAGTTCCTGTCTAATGATTGTAGTGTCACCATTAATGGTGTGGTTGAGACTGGTTATATGGGCATTAACTTTATTGGTGATAACAATATTATCAACCCCACCAATTCCGACCTGAGCATTAACGGATTGGTTAATTATTCCAAGACTGGTTTGCAAACCATGAACGAGACCCCTACATTTGATGGTGTTAGCACTACACCTGTTTATGTAAGTGTCCCATCTTCTGTAGGGCAAGTAAATGGTCTGAGACTATCACAAGCCAACAAAGATAAATTACTGTATTCAAGAACAGCAGGTCCAGAAGGTATTACCATGGCTGCTGTTGTAGTACCCACCATATCTGGAGCTGAAGTATTTAACTTCATGGCCATTGGTTCAGGGTTTAGTGATACATCCAACAGTCTTCATCTTCAATTAGTTATAGACGCTTCTGGAAAACAAACAATTGCTTTGCTATTGGGGGGCGATGGTACAACCCAAATTTTATCTGGGGATTTACCTAACGACCTTAAACTACAAAGTGGTGTACCATATCATATAGCTATTGGTGCTAAACCTGGATATTTCTGGTGGAGTATTCTTAATATTCAGACGGGTAAGAGAATCAGACGGTCATTCCGAGGCGCTTATTTAGCCGTACCATTTAATTCTATATTCGGATTAACTTCTTCATTAACATTCTTCTCGGATAGCAATGCTGGTGGGGATGCTTGTTCTGGGGTGGGCGCTAAAGTGTATGTTGGTATGTTCTCTTCTGAGAACGATTATGTAGCTTCACGATACTACAACCTGATTAATCCTGTAGACCCTACTAAGTTAATTAGTTACCGTATATTGGATTCTTCTATTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
ab05130c5154a064bd3bc295be74aa089067ef8263d8d6738b4374802f4a8c09
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7225
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequence of Salmonella virus vB_SenAc-pSK1 Kim,Y., Park,S.Y. and Kim,J.H. 2012-09-15 GenBank