Genbank accession
URC10088.1 [GenBank]
Protein name
side tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,69
TF
Evidence RBPdetect2
Probability 0,66
Protein sequence
MNMAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSRPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATRATDAAGSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSEMNAAASQKSAATSASTATTKASEAATSARDASASKVAAKSSETSAASSAGSAASSATAAGNSAKAAKTSETNADNSAQAAADSQTASANSATAAKKSETNAKNSEAAAKVSETNAKASENKAKEYLDKVGGLVSPMTQYDWPVVTGNESFYIKIAKLSDPGSGNCHVTLMVTNAGNYGSPYGNIDFIEISARGLPSLLSADNVSRHLSIRRLGSIGLTDNNQMRYGLVKGDGFIEVWAFQRAFINGAKVAVLAQTARPELYIPDGFVKQTAAPSGYVESPVVRIYDQLNKPTKADLGLSNAMLTGAFGLGGSGISTNGKMSDVEILKALRDKGGHFWRGDKPTGSTATIYSHGSGIFSRCGDTWSAINIDYSTAKIKIYAGNDARLNNGTFSINELYGSANKPSKSDVGLGNVTNDAQVKKSGDVMSGDLDILKETPSIRLKSAKGTAHLWFMNNDGSERGVVWSPENNESLGEIHIRAKNTKGESSGDFIVRHDGRVEARNLKITYKISAATAEFANTSTSSDNTTVSIKGSQHTPLVLTSNNTIKNLSIGFKVDDVDQKYLGIAGDGDLYFGSYSDHTKNSKVITQAKLDSGVTVGGKTTFSDFATFNAGMAGSIEPETIDNKTIDLNDLIIANTVAGSVKYYQCKTVAGGAYITNKPDGVSGNFLLRVESTRKTTGSDYAIMQTLIGSDTKRIYVRFVVNGNWTEWSQVVVSGWNQDVTVRSLTSTTPSKLGGGRVDVLGSTSDYSSMNCAVRGVDSTGTNSAWSVGTSESTGKMLFLKNHRSSAQVLLNGDDGAVQLLSGTVNGATAQALTINKDEVNSTADLVIRKQTGTGNRFALLNSGNSELPVSIRVWGSSTRQNVFEVGTSAAYLFYAQKTTDGQNLTEPPRVSWRVFYL
Physico‐chemical
properties
protein length:1109 AA
molecular weight: 115992,12590 Da
isoelectric point:8,58366
aromaticity:0,06312
hydropathy:-0,33255

Domains

Domains [InterPro]
IPR013609
ATT
3–136
G3DSA:2.60.40.1120
STR
7–99
URC10088.1
1 1109
Architecture
ATT
STR
ATT 3-136 | STR 137-1098 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
URC10088.1
1 1109
Domain Start End Length (AA) Confidence
N-terminal 1 128 128 0,9905
Central domain 129 381 254 0,9033
C-terminal 382 1109 727 0,5557
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-128
Central
129-381
C-terminal
382-1109

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS-640R1
[NCBI]
2946111 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
URC10088.1 [NCBI]
Genbank nucleotide accession
ON470612.1 [NCBI]
CDS location
range 50096 -> 53425
strand +
CDS
GTGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCACAATCCAGCTGAAAGCAAAACGTAACAGTACCACGGTGGTGGTGAACACGCTGGCCTCAGAAAATCCGGATGAAGCCGGGCGTTACAGCATGGACGTTGAGTACGGTCAGTACAGCGTTATTCTGTTGGTGGAAGGATTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGATTCCCGACCCGGTACGCTGAATGATTTTCTCGGTGCCATGACGGAGGATGATGCCCGTCCTGAGGCACTGCGCCGTTTTGAACTGATGGTGGAAGAGGTGGCGCGTAACGCGTCCGCGGTGGCACAGAACACGGCAGCCGCGAAGAAGTCAGCCAGCGATGCCAGCACATCAGCCCGTGAGGCGGCAACCCGTGCGACTGATGCTGCAGGCTCAGCACGTGCAGCCAGCACGTCAGCCGGACAGGCCGCGTCGTCGGCTCAGTCAGCGTCTTCCAGCGCAGGAACGGCATCAACAAAGGCCACTGAAGCATCAAAAAGTGCTGCCGCCGCAGAGTCTTCAAAAAGCGCGGCAGCCACCAGTGCCGGTGCAGCGAAAACGTCGGAAATGAATGCCGCAGCATCACAAAAATCTGCGGCCACATCTGCATCCACCGCGACCACGAAAGCGTCAGAAGCTGCCACCTCAGCCCGGGATGCGTCGGCTTCAAAAGTGGCGGCAAAATCATCAGAAACGAGCGCTGCCTCGAGCGCCGGCAGTGCAGCTTCCTCGGCAACGGCAGCAGGAAATTCCGCGAAGGCCGCAAAAACGTCTGAGACGAATGCGGATAACAGCGCACAGGCGGCAGCAGACTCACAAACTGCATCGGCAAACTCCGCGACAGCAGCCAAAAAATCAGAAACCAACGCGAAAAATAGTGAGGCAGCAGCAAAGGTCAGCGAAACCAACGCTAAAGCGTCAGAGAACAAGGCGAAAGAATATCTCGACAAGGTCGGGGGACTCGTCAGCCCGATGACGCAATACGATTGGCCCGTTGTTACTGGTAATGAGTCTTTTTACATAAAGATCGCGAAACTTTCCGATCCCGGAAGCGGCAATTGCCATGTAACGCTAATGGTTACTAATGCCGGTAACTACGGCTCCCCTTACGGAAACATTGACTTTATCGAGATCTCGGCGCGCGGTCTGCCTTCTTTGCTTAGTGCGGATAATGTTTCTCGTCATCTGAGTATACGCCGCTTAGGGTCAATCGGGCTGACCGATAACAACCAGATGCGTTACGGCCTGGTTAAAGGTGACGGCTTTATTGAGGTTTGGGCCTTCCAGCGCGCATTTATCAACGGCGCAAAGGTTGCGGTGCTGGCGCAGACGGCACGCCCGGAATTATACATTCCAGACGGATTTGTTAAGCAAACCGCCGCGCCTTCTGGATATGTTGAAAGCCCCGTTGTAAGGATTTACGACCAGTTAAACAAGCCGACTAAAGCAGATTTGGGTCTTTCTAATGCTATGCTTACAGGCGCTTTCGGTCTTGGCGGTAGCGGGATATCAACAAACGGCAAGATGAGCGATGTAGAGATCTTAAAAGCTCTGCGTGACAAAGGTGGTCATTTCTGGCGCGGTGATAAGCCGACCGGAAGCACGGCGACCATTTATAGCCACGGTTCTGGTATATTCTCGCGGTGCGGCGATACGTGGTCAGCGATCAATATCGACTACTCAACCGCGAAGATTAAGATCTATGCCGGCAACGATGCCCGGCTTAACAACGGGACTTTTAGCATCAATGAGCTATACGGCTCGGCAAACAAGCCGTCGAAATCGGATGTTGGACTTGGCAACGTAACTAACGATGCGCAGGTGAAAAAATCCGGCGATGTTATGTCTGGTGATCTTGATATATTGAAAGAAACGCCATCTATCAGGCTAAAATCAGCAAAAGGAACCGCTCATCTGTGGTTCATGAACAACGACGGAAGCGAGCGCGGCGTTGTTTGGTCGCCTGAAAACAACGAATCACTTGGCGAAATCCACATCAGGGCGAAAAACACAAAAGGTGAATCAAGTGGTGATTTTATTGTTCGCCACGACGGGAGGGTTGAGGCCCGCAATCTAAAAATAACTTACAAAATCAGCGCAGCCACCGCAGAATTTGCAAACACAAGCACCAGTTCCGATAACACTACGGTAAGCATCAAAGGATCTCAGCATACGCCTTTGGTTTTAACGAGCAACAACACAATTAAAAACTTGTCCATTGGGTTTAAGGTTGATGATGTTGATCAAAAATACCTGGGTATAGCTGGTGACGGTGATTTGTATTTTGGTAGTTATTCTGACCACACAAAAAACAGCAAAGTAATCACACAAGCAAAACTCGATAGCGGGGTGACGGTAGGCGGTAAAACAACCTTTTCTGACTTTGCCACATTTAACGCAGGTATGGCGGGATCTATCGAGCCGGAAACCATTGACAACAAGACTATTGATTTAAACGACTTGATCATTGCTAATACAGTGGCTGGATCTGTTAAATACTATCAATGCAAAACTGTTGCAGGTGGTGCATATATTACCAATAAGCCTGACGGCGTAAGCGGTAACTTTTTGCTACGTGTAGAATCTACTCGTAAAACTACGGGTTCAGATTATGCGATCATGCAAACGCTGATTGGCAGCGACACAAAACGCATATACGTTCGCTTTGTTGTCAATGGAAACTGGACGGAGTGGAGTCAGGTAGTTGTTTCAGGATGGAATCAGGATGTAACCGTCAGGTCGTTAACCTCGACGACTCCATCAAAATTAGGCGGCGGGCGTGTTGATGTGCTGGGGAGTACGTCAGATTACAGTAGTATGAATTGTGCTGTGCGCGGTGTTGATAGCACTGGAACCAATTCGGCGTGGTCGGTAGGCACATCAGAAAGCACAGGCAAAATGTTGTTCCTGAAAAACCACAGAAGCAGCGCTCAAGTGCTGTTAAATGGCGATGATGGAGCGGTTCAACTACTAAGTGGCACTGTTAACGGTGCTACAGCGCAGGCGCTAACCATCAACAAAGATGAGGTTAACTCAACTGCCGATTTAGTAATTAGAAAACAAACAGGGACTGGCAATCGTTTTGCTTTACTTAATTCAGGTAATTCAGAACTACCAGTTAGTATCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACAACAGACGGGCAAAACCTTACTGAACCGCCCCGGGTTTCCTGGAGAGTGTTTTATCTGTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
b08c22588581c62f9ac0c5d8d4e654b390ce86d87eb08a172a82ba64da9fedce
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5733
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50