Genbank accession
YP_009320872.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
TF
Evidence RBPdetect2
Probability 0,84
Protein sequence
MALKTKIIVQQILNIDDTTTTASKYPKYTVVLGNSISSITAGELTAAVEASAASAAAAKDSEIAAKESETNAKDSENLAAIYATSSETSATQSAASAAEAERQAGLSQKSAEASATSAEESKGFRNSAELAAQNAETSRRLAEQAKTAAQQAQTAAETAKTGAETAKAGAEAAATTAGEHAASAKQSELNAKESETNAAGSAVEAGDKAVDATTEANRAKAEADRAAQIVDSKLDKEDISGFIKVYKTKAEADADVSNRVLGEKILVWNQTDSKYGWYKVAGTAEAPVLELVEIEQKLVSINNVHADDAGNVQITLPGGNPSLWLGEVTWFPYDKDSGVGYPGVLPADGREVLRVDYPDTWEAIEAGLIPSVSEAEWQAGATLYFSTGDGSTTFRLPDMMQGQAFRAPTKGEEDGGAIKEQIPYITTVNGIGPADDTGAIKLPYVAMVNGTIRPDENGNLALGNVVTKNVWNGTDGEVLLRGAFGLGGAGLILNEPDAVSFFKAMRAFGSGYYRNDNESNLVIPKYSAGFYSKVGDTHTFICSAYSNGVAFVASASDRDLDEESTVHTNILYGTANKPDLNNDTQGVLGLDKGGTGASTVSSAKTNLEVDRIKQLVGGTHITSQDQNIVFMVQDTKNWGVYDHSENKWISLPVEHGGTGATEPVQARKNLGAASAGINSDITQLRNMEGWPLSIKNGGIITRKYHAVPSDGFYLGSEVFSAQVQLDNSENPDTPRLEALFYAEGNYAQSRTERATVAAYRRTADGSLTATKYANLYMDSGAWNAERMQALGYSKGWDDDSFGFLAPFQASDVTGNDHGFVPIISAMTQSTGGYPMRATTGLISRGTSTWPAYAFRLRGDSNWGCTYQFHMTGDIDGWGSDYNNVVFNFTYTKNAVSDINLKDNIQDVSGEESLENIEKMEFKKFTYKFDKKKNIRRGVIAQQIELIDPQYVKAIGNPETDDITLTLDTNPLLMDALAAVKVLSERNKSLETKLQEMSTVIENINEKLNLMTRLSNIEAELDKMKGTS
Physico‐chemical
properties
protein length:1027 AA
molecular weight: 109977,38040 Da
isoelectric point:4,77308
aromaticity:0,07498
hydropathy:-0,40166

Domains

Domains [InterPro]
DC_0608
ATT
2–790
DC_1942
ATT
724–1026
IPR030392
CHP
896–993
YP_009320872.1
1 1027
Architecture
ATT
ATT 2-1026 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009320872.1
1 1027
Domain Start End Length (AA) Confidence
N-terminal 1 499 499 0,8991
Central domain 500 698 200 0,2288
C-terminal 699 1027 328 0,8987
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-499
Central
500-698
C-terminal
699-1027

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage 100268_sal2
[NCBI]
1813783 Uroviricota > Caudoviricetes > Demerecviridae > Epseptimavirus > Epseptimavirus 100268sal2
Host Salmonella enterica
[NCBI]
28901 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009320872.1 [NCBI]
Genbank nucleotide accession
NC_031902.1 [NCBI]
CDS location
range 87826 -> 90909
strand -
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTCTGAACATAGATGACACTACAACTACTGCTAGTAAGTATCCTAAATATACAGTAGTTTTAGGTAATTCTATTAGTTCTATTACTGCTGGTGAATTAACTGCTGCTGTAGAAGCCTCTGCTGCTTCTGCTGCGGCAGCAAAAGATTCCGAAATTGCAGCAAAAGAATCTGAAACAAATGCTAAGGACTCGGAGAACCTAGCTGCAATTTATGCTACTTCTTCTGAAACATCTGCAACTCAATCTGCGGCATCTGCCGCGGAAGCTGAGAGACAAGCTGGTCTGTCACAAAAGAGTGCTGAAGCATCCGCTACATCGGCGGAAGAATCTAAAGGATTCAGGAATTCTGCTGAATTAGCTGCTCAAAATGCGGAGACCAGTCGTAGACTTGCCGAACAAGCTAAAACTGCGGCTCAACAAGCTCAGACAGCTGCAGAAACTGCTAAGACTGGTGCTGAAACAGCAAAAGCAGGTGCAGAAGCTGCCGCTACAACTGCTGGAGAACATGCTGCTTCCGCTAAACAATCAGAATTAAATGCTAAAGAATCAGAAACTAATGCGGCTGGTTCTGCTGTAGAAGCAGGAGATAAGGCTGTAGATGCTACTACTGAGGCAAATCGTGCTAAGGCTGAAGCCGATCGCGCAGCTCAGATTGTAGATAGCAAGCTGGATAAAGAAGATATATCTGGTTTCATCAAAGTTTATAAGACAAAAGCAGAAGCTGATGCAGATGTCTCAAATCGTGTTCTGGGTGAAAAGATTCTAGTCTGGAACCAAACTGATTCAAAATATGGTTGGTATAAGGTTGCTGGTACTGCTGAGGCTCCTGTCCTAGAATTAGTAGAAATTGAACAGAAACTCGTATCAATCAATAACGTTCATGCAGATGATGCTGGTAACGTACAGATCACTCTTCCAGGAGGTAACCCTTCTTTATGGTTAGGTGAAGTTACTTGGTTCCCTTACGATAAAGATTCGGGTGTTGGATACCCTGGTGTTCTGCCAGCTGATGGTCGTGAAGTACTTCGTGTTGACTATCCTGATACTTGGGAAGCCATCGAAGCTGGTCTAATTCCTTCTGTTTCAGAAGCTGAATGGCAGGCCGGTGCAACTCTCTACTTCTCCACTGGTGATGGTTCCACAACCTTCCGTTTACCAGATATGATGCAAGGGCAGGCTTTTCGTGCACCTACTAAGGGAGAAGAAGACGGCGGTGCTATTAAGGAACAGATTCCTTATATCACTACTGTGAATGGAATTGGTCCTGCTGATGATACTGGAGCTATTAAGCTTCCTTACGTGGCAATGGTCAATGGAACTATTCGTCCAGATGAGAACGGAAATCTAGCACTAGGTAACGTAGTAACCAAAAACGTTTGGAACGGTACTGATGGAGAAGTATTACTAAGAGGTGCTTTTGGTCTTGGTGGTGCTGGTTTAATCCTTAATGAGCCTGATGCTGTTTCCTTCTTTAAAGCAATGCGTGCTTTCGGTTCAGGGTATTATAGAAATGATAACGAGAGTAACTTAGTAATCCCTAAGTATTCTGCTGGATTCTACTCTAAAGTTGGAGATACCCATACCTTTATTTGTTCTGCCTATAGCAATGGTGTTGCTTTTGTAGCTTCTGCTTCTGATAGAGATTTAGATGAAGAATCTACAGTACATACTAATATTCTTTATGGTACTGCAAATAAGCCTGATTTAAATAATGATACTCAAGGTGTCTTAGGTCTCGATAAAGGAGGAACAGGAGCTTCTACAGTTAGTTCTGCCAAAACTAATCTAGAAGTGGATAGAATAAAGCAGCTAGTAGGAGGAACACATATAACCTCTCAGGATCAAAATATAGTGTTTATGGTTCAGGACACTAAAAACTGGGGGGTATACGACCATTCTGAAAACAAGTGGATATCTTTACCTGTTGAACACGGTGGTACTGGAGCTACTGAGCCAGTTCAAGCTAGAAAAAATTTAGGGGCTGCAAGTGCCGGTATTAACTCCGATATTACTCAGCTACGTAATATGGAGGGTTGGCCTCTTAGCATTAAGAACGGTGGTATTATTACTCGTAAATACCACGCAGTTCCTTCGGATGGGTTTTACTTAGGATCGGAAGTTTTTTCTGCTCAGGTCCAACTGGATAATTCAGAGAATCCAGATACCCCTCGTTTAGAGGCATTATTTTACGCTGAAGGTAACTATGCCCAATCTCGTACTGAAAGGGCTACAGTAGCTGCTTATAGGCGTACTGCAGATGGTTCGTTAACTGCTACTAAATATGCTAACCTGTACATGGATTCAGGGGCTTGGAATGCAGAACGTATGCAAGCTCTAGGGTATTCTAAAGGTTGGGACGATGATTCCTTTGGTTTCCTCGCTCCTTTTCAAGCTTCTGATGTAACGGGCAATGATCATGGATTCGTACCTATTATTAGTGCTATGACTCAATCCACAGGTGGCTACCCAATGAGGGCTACTACTGGATTAATATCTCGAGGTACTAGTACTTGGCCTGCCTATGCATTTAGACTTCGTGGAGATTCTAATTGGGGATGTACCTATCAGTTCCACATGACAGGGGATATTGACGGATGGGGGTCAGACTACAATAACGTTGTCTTTAACTTTACTTACACAAAAAATGCGGTGTCGGATATCAATCTAAAAGATAATATTCAAGACGTATCGGGTGAAGAATCTCTTGAAAACATTGAAAAAATGGAGTTCAAGAAATTCACCTATAAGTTTGATAAGAAAAAGAATATTAGACGTGGTGTTATTGCACAACAAATAGAGTTGATTGATCCTCAATATGTTAAAGCTATTGGCAATCCTGAAACCGACGATATTACTTTAACCCTAGATACCAATCCATTGTTAATGGATGCTTTAGCTGCTGTTAAGGTATTATCTGAAAGAAATAAGTCTCTTGAGACTAAATTACAAGAGATGTCTACTGTTATTGAAAACATTAACGAAAAGCTTAATTTGATGACTAGATTATCAAATATAGAAGCAGAGTTAGATAAAATGAAAGGTACTAGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
7bb05a5d2075f4164a4596cefdd2e40e60c2332f0ad9850e4c7cd564bf617b77
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5800
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete Genome Sequences of a lytic Siphoviridae Bacteriophages infecting several subspecies of Salmonella enterica Paradiso,R., Lombardi,S., Iodice,M.G., Riccardi,M.G., Orsini,M., Bolletti Censi,S., Galiero,G. and Borriello,G. 2016-12-29 GenBank