Genbank accession
XXS07644.1 [GenBank]
Protein name
long tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,83
TF
Evidence RBPdetect2
Probability 0,94
Protein sequence
MATLKQIQFKRSKTAGQRPAASVLAEGELAINLKDRVLFTKDDQGNIIDLGFAKGGSIDGNVIHKGNYNQTGDYTLNGTFTQTGNFNLTGIARVTRDIIAAGQIMTEGGELITKSSGTAHVRFHDSADRERGIIFSPANDGLTTQVINIRVQDYKAGSESTFAFNGNGLFSSPEVFGWKSVSTPVIYTNKVITNKKVKDDYDIYSMADNVPLSESTTAINHLRVMRNAVGSGIFHEVKDNDGITWYSGDGLDAYLWSFTWSGGIKSSHSISIGLTPGPKDYSILGPSSIALGDNDTGFKWHQDGYYFSVNNGTKTFLFSPSETTSLRKFIAGYSTNGTDLTTPPTENYALATVVTYHDNNAFGDGQTLLGYYQSGNYHHYFRGKGTTNINTHGGLLVTPGNIDVIGGSVNIDGRNNNSTLMFKGYTMGQSSVDNMYIAVWGNTFTNPSEGTRKNVMEISDDIGWMHYIQRNKDNTVEAVLNGQQTINENIIAKKDIWVDRAVHTIGEITTNAVNGLRIWNNDYGVIFRRSEESLHIIPTAFGEGETGDIGPLRPLSVALNSGKVTIPDLQSSYNTFAANGYIKFAGHGAGAGGYDIQYSQAAPIFQEIDDAAVSKYYPIVKQKFLNGKAVWSLGTEINSGTFVLHHLKEDGSQGHTSRFNADGTVNFPDNVQVGGGEATIARNGNIFSDIWKSFTSAGETTNIRDAIATRVSKEGDTMTGKLTLSAGNDALILTAGEGASSHIRSDVGGTGNWYIGKGGGDNGLGFYSYITQGGVYITNNGEISLSPQGQGTFNFNRDRLHINGTQWVAHQAGDWGNQWRQEAPIFVDFGNVGNDSYYPIIKGKSGITNEGYISGVDFGMRRTTNQWAQAIIRVGNQENGSDPQAIYEFHHNGVLYAPNMVQAGARLSAGGGDPVWTGPCLVIGDNDTGLVHGGDGRINMVANGAHIASWSSSYHSHPGLWDSNGAFWTEVGKAIISHGHLVQANDSYSTYVRDVYVRSDIRVKKDLVKFENASQTLSKINGYTYMQKRGLDEEGNQKWEPNAGLIAQEVQAILPELVEGDPDGEALLRLNYNGVIGLNTAAINEHTAEIAELKSEIEELKALVKSLLK
Physico‐chemical
properties
protein length:1109 AA
molecular weight: 120234,70940 Da
isoelectric point:5,64323
aromaticity:0,09648
hydropathy:-0,37971

Domains

Domains [InterPro]
DC_0538
STR
1–713
IPR048390
ATT
449–556
IPR030392
CHP
999–1097
DC_0594
RBD
1006–1109
XXS07644.1
1 1109
Architecture
STR
ATT
STR
RBD
STR 1-448 | ATT 449-556 | STR 557-1036 | RBD 1037-1109
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XXS07644.1
1 1109
Domain Start End Length (AA) Confidence
N-terminal 1 218 218 0,2754
Central domain 219 418 201 0,4355
C-terminal 419 1109 690 0,7277
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-218
Central
219-418
C-terminal
419-1109

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage Ec_O157kw
[NCBI]
3447159 Viruses >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XXS07644.1 [NCBI]
Genbank nucleotide accession
PV857777.1 [NCBI]
CDS location
range 105146 -> 108475
strand -
CDS
ATGGCTACTTTAAAACAAATACAATTTAAAAGAAGCAAAACGGCAGGTCAACGTCCAGCTGCTTCAGTATTAGCCGAAGGTGAATTGGCTATAAACTTAAAAGACCGTGTACTTTTTACTAAAGATGACCAGGGAAATATTATTGATTTAGGTTTTGCTAAAGGCGGTAGTATTGACGGGAATGTTATTCATAAAGGCAATTATAACCAAACTGGCGATTATACTTTAAACGGTACATTCACCCAGACTGGTAATTTTAATTTAACCGGTATTGCTCGAGTAACTCGTGATATTATTGCTGCTGGGCAGATTATGACTGAAGGCGGAGAACTTATTACAAAAAGTTCAGGAACGGCACACGTTCGTTTTCATGATTCCGCCGACCGTGAACGCGGTATTATTTTTTCTCCTGCTAATGACGGTTTAACTACACAAGTAATTAACATCAGAGTTCAAGATTACAAAGCCGGTTCAGAAAGCACTTTCGCTTTTAATGGAAATGGTTTGTTTTCTTCACCAGAAGTTTTTGGGTGGAAATCTGTATCAACTCCGGTAATTTATACCAATAAAGTTATCACCAATAAAAAAGTTAAAGATGATTATGACATCTATTCGATGGCAGACAATGTTCCATTGTCTGAAAGCACTACTGCTATTAATCATCTTCGTGTTATGCGTAATGCAGTTGGTTCTGGTATTTTCCATGAAGTTAAAGATAATGATGGAATAACATGGTATAGTGGAGATGGATTAGACGCTTATCTTTGGTCATTTACTTGGAGCGGCGGAATTAAATCAAGTCACTCAATTTCCATTGGTTTAACACCCGGACCTAAGGATTACTCAATATTAGGACCGTCTAGTATCGCTTTAGGAGATAATGATACTGGATTTAAATGGCATCAAGACGGATATTATTTCAGTGTTAACAATGGCACAAAAACGTTTTTATTTAGTCCAAGCGAAACAACTAGCCTAAGAAAATTTATAGCTGGATATTCTACTAACGGAACCGATTTAACTACTCCTCCAACTGAAAATTATGCTCTTGCTACTGTAGTGACATACCATGATAATAACGCGTTTGGGGATGGTCAGACTCTTTTAGGATATTATCAAAGCGGTAACTATCATCATTATTTCCGCGGCAAGGGCACTACAAACATTAATACTCATGGCGGTTTGTTAGTTACTCCAGGCAATATTGACGTTATTGGTGGTTCTGTTAATATCGATGGTAGAAATAATAATTCAACTTTAATGTTTAAAGGCTATACCATGGGTCAAAGCTCCGTTGATAACATGTATATAGCTGTTTGGGGAAATACTTTTACTAATCCAAGTGAAGGCACCCGTAAAAATGTCATGGAAATTTCTGATGATATTGGATGGATGCATTATATTCAACGTAATAAAGATAATACGGTTGAAGCCGTGTTAAATGGTCAACAAACAATTAATGAAAATATTATTGCGAAAAAGGATATTTGGGTTGACCGAGCAGTTCATACCATTGGCGAAATCACTACAAATGCTGTTAATGGTCTTCGTATTTGGAACAATGATTACGGAGTTATTTTTAGACGCTCAGAAGAAAGTCTTCATATTATTCCTACAGCATTTGGCGAAGGAGAAACCGGTGATATTGGGCCTTTACGTCCTCTCAGCGTAGCTTTAAATTCCGGTAAAGTTACTATTCCAGATTTACAGTCAAGTTATAATACGTTCGCTGCAAATGGTTATATTAAATTTGCTGGTCATGGGGCTGGTGCCGGTGGTTATGATATTCAGTATTCACAAGCTGCTCCTATTTTCCAAGAAATCGATGATGCTGCTGTAAGCAAATATTATCCTATTGTTAAACAGAAGTTTTTAAACGGTAAAGCCGTTTGGTCTTTAGGTACTGAAATTAATTCTGGTACATTTGTTTTACATCATTTGAAAGAAGATGGTTCACAAGGCCATACATCAAGATTTAATGCTGATGGTACAGTTAATTTCCCCGATAACGTTCAAGTTGGCGGCGGTGAAGCTACTATTGCTCGTAATGGTAATATTTTCTCAGATATTTGGAAATCGTTTACTTCTGCGGGAGAAACCACAAATATTCGCGATGCAATAGCTACTCGTGTTTCTAAAGAAGGCGACACGATGACTGGTAAATTGACTTTATCGGCAGGCAATGATGCTCTCATTTTAACTGCGGGCGAAGGTGCTTCATCACATATCCGTAGTGATGTAGGTGGTACAGGTAACTGGTATATAGGCAAAGGCGGCGGCGACAATGGTCTAGGTTTTTATAGTTACATTACACAAGGCGGTGTATACATAACAAATAACGGCGAAATATCGCTTTCTCCTCAAGGTCAAGGAACATTTAATTTTAATAGAGATCGCCTTCATATAAACGGTACACAATGGGTTGCGCACCAAGCTGGTGATTGGGGAAACCAATGGCGACAAGAAGCGCCAATATTTGTAGATTTTGGCAATGTCGGTAATGATAGTTATTACCCGATTATCAAAGGAAAATCTGGTATTACTAATGAAGGATACATATCGGGTGTTGATTTTGGTATGCGACGCACTACTAACCAATGGGCTCAGGCTATTATCCGTGTTGGTAACCAGGAAAATGGTTCTGACCCACAAGCTATCTATGAATTTCACCACAATGGAGTTCTGTATGCTCCTAATATGGTTCAAGCTGGAGCAAGATTATCAGCTGGCGGTGGTGACCCTGTATGGACCGGCCCGTGTCTTGTTATTGGTGATAATGATACTGGATTAGTTCATGGTGGTGACGGCCGAATCAATATGGTTGCAAATGGAGCGCATATTGCTTCGTGGTCTTCATCTTATCATTCTCATCCTGGCCTTTGGGATTCAAATGGAGCTTTTTGGACAGAAGTTGGCAAAGCAATTATTTCTCACGGCCATCTTGTCCAGGCGAATGACAGTTATTCCACATATGTCCGCGATGTTTATGTCCGTTCTGATATTCGTGTTAAAAAAGACCTTGTTAAATTTGAAAATGCTTCACAAACACTTTCAAAAATTAACGGTTACACTTATATGCAGAAGCGAGGCCTAGATGAAGAAGGCAATCAGAAATGGGAACCTAACGCCGGTTTGATAGCTCAAGAAGTTCAAGCTATTTTGCCTGAATTAGTTGAAGGTGACCCTGATGGCGAAGCTTTACTTCGTTTGAACTATAACGGTGTAATTGGTTTAAATACAGCTGCAATCAATGAGCATACTGCAGAAATTGCGGAACTTAAATCAGAAATTGAAGAACTTAAAGCATTAGTTAAATCATTGTTAAAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
d6aa352bd9503517ac147900bef4b959f681b9e0cf3e9c6b2674a2e79408b674
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,7128
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50