Protein
- UniProt accession
- A0AB39CBS3 [UniProt]
- Protein name
- Tail fiber
- RBP type
- TF
- Protein sequence
-
MIVYNNQAPDAVNNVGQFGATEGSIGAYKQAAEYAADSKYWALLAESKFGTIDDLIAEVERLYQQGVLMKQDIEDLKQDFKDQDARLMTLIAQTNAAVSDANNAVALINQKLIEVQNQLDVLLGMSVDVTTLPPGTPATGSFNPNTGVISLGIPEGDPGKDGSVKDLDTAPTGVPELGDLGFYVDKDDNTVHKTTLDNIANLIPSVRSVSINGGPALDGEVALTLNKETVGLGNVLNVAQYSRQEINDKFDKTTKTYQSKAEADADAQYRQVGEKVLVWEATKYEFYTVAANKTLTPVKTEGRILTVNSRSPDSSGNIDITIPTGNPSLYLGEMVMFPYDPTKNISYPGVLPADGRLVSKESASDLGPSLVSGQLPVVSETEWQAGAKQYFSWGKLADGITDADSTNFINIRLPDWTGGEAIRAPDSDKDSQYNGSVQAQKPYVVTVNNQAPDEITGNVNISRSILGAAASGTNSDITSLTGLTTALSVTQGGTGAKDAASARSNLGLSSIATLNTIPIANGGTGATTVDAARSNLSIDRVDQASGESRLLSPNKETYLFVDNNGWGCYSTSPGRVGDLALGVERGGTGAKNAASARSNLGLGSVSTLDNVPIANGGTGAGDAAGARFNIGALSNTPANSGVGGTGDRVQHASGNGLFTLDLFNCYWYMQPEDTNFWIAHGVSYAGSGGEASQYGRISYAIKIADGTIKYVHCLTNKNTTVDVNGFIKAASPVVHIYSDGRYETNEESEGVNVIRQGVGEYLITGCLGLNADAKWGGIDGGFEIPTDRNKQPRVWLDYEIKEDGSILVKTYHRTHPTSPVFSRNEIEGFSDGDPIDIPSDAFISVRVEMPSE
- Physico-chemical properties
- protein length: 852 AA
- molecular weight: 90195.70430 Da
- isoelectric point: 4.59449
- aromaticity: 0.07042
- hydropathy: -0.30200
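The page does not say how these values were computed. As a minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues, hydropathy as the mean Kyte-Doolittle value, often called GRAVY), two of them could be reproduced as follows. The function names are illustrative and not part of the database. Under this definition, the listed aromaticity of 0.07042 would correspond to 60 of the 852 residues.

```python
# Kyte-Doolittle hydropathy scale (standard published values per residue).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence listed above, for illustration only.
    prefix = "MIVYNNQAPDAVNNVGQFGA"
    print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Passing the full 852-residue sequence instead of the prefix allows a direct comparison with the listed values. Small differences would suggest the page uses a different scale or rounding.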
Domains [InterPro]
- Coil (Unmapped): 73–93
- DC_0709 (STR): 607–852
- Sequence axis: 1–852
Architecture
- STR: 183–852
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage EC.W8-1 [NCBI] | 3236638 | Uroviricota > Caudoviricetes > Mktvariviridae > Kuravirus |
| Host | Escherichia coli ATCC 25922 [NCBI] | 1322345 | Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Escherichia > Escherichia coli |
Coding sequence (CDS)
- Genbank protein accession
- XDJ04158.1 [NCBI]
- Genbank nucleotide accession
- PQ030848 [NCBI]
- CDS location
- range 42979 -> 45537, strand -
CDS
ATGATCGTTTATAATAACCAAGCACCTGATGCAGTGAATAACGTTGGGCAGTTTGGTGCTACTGAAGGCTCTATCGGAGCTTACAAGCAAGCAGCAGAATATGCAGCTGACTCCAAATATTGGGCACTGCTAGCAGAATCTAAGTTTGGTACAATTGATGATTTAATCGCTGAAGTAGAGCGTCTATATCAACAGGGTGTTCTGATGAAGCAGGATATTGAAGATCTTAAGCAGGATTTTAAGGATCAAGATGCTCGTTTGATGACTCTGATTGCCCAAACCAACGCAGCAGTATCGGATGCTAATAATGCCGTTGCTCTCATTAACCAAAAACTTATTGAAGTCCAGAATCAGCTTGATGTTCTGTTAGGAATGTCTGTTGATGTAACAACTCTCCCTCCGGGAACTCCAGCTACTGGCTCTTTTAATCCTAACACTGGTGTAATCTCTTTAGGTATTCCAGAAGGTGATCCCGGAAAAGATGGTTCTGTTAAGGATTTGGATACAGCACCTACAGGTGTTCCAGAGCTAGGTGATTTAGGTTTCTATGTTGACAAAGATGACAACACCGTCCACAAAACTACTCTAGATAACATTGCTAACTTAATCCCATCTGTCCGTTCTGTCTCTATTAACGGCGGTCCAGCTCTTGATGGAGAGGTTGCTCTAACGCTTAACAAAGAGACGGTAGGTTTAGGAAATGTTCTGAACGTTGCTCAGTACAGTCGTCAAGAGATTAATGACAAGTTTGATAAAACCACCAAGACATATCAATCTAAAGCAGAAGCTGATGCTGACGCTCAGTATCGCCAAGTTGGTGAGAAGGTCTTAGTTTGGGAAGCTACTAAGTATGAATTTTATACTGTTGCTGCTAACAAAACATTAACTCCTGTTAAAACTGAAGGTAGAATTCTTACTGTTAATTCCCGTTCTCCAGACTCCAGTGGTAATATTGACATTACCATTCCAACAGGAAACCCTTCTCTATATCTTGGTGAGATGGTAATGTTCCCTTATGACCCAACAAAGAATATCTCCTATCCGGGCGTTCTTCCAGCCGATGGTCGTCTAGTGTCTAAAGAGTCTGCTTCAGATTTAGGCCCATCCCTTGTCAGTGGACAACTTCCTGTTGTTTCTGAAACTGAATGGCAAGCAGGGGCTAAACAATACTTCTCTTGGGGTAAATTGGCAGATGGTATTACCGATGCAGATTCTACTAATTTTATCAACATTCGACTTCCTGATTGGACTGGAGGGGAGGCAATAAGAGCACCAGATTCTGATAAAGACTCTCAGTACAATGGGTCTGTACAGGCTCAGAAACCTTATGTTGTTACTGTAAATAACCAAGCTCCTGATGAGATTACTGGTAATGTGAACATCTCCAGATCAATTTTAGGTGCTGCCGCTTCTGGTACAAACTCTGATATAACATCCCTCACGGGACTCACTACAGCACTGTCTGTCACACAAGGAGGCACTGGAGCTAAAGACGCTGCTAGTGCTCGTTCTAATCTAGGTCTAAGCTCCATAGCTACACTCAACACAATTCCTATAGCTAATGGTGGTACAGGGGCAACTACTGTTGATGCTGCTCGTTCCAATTTATCTATAGATAGAGTTGATCAAGCCTCTGGGGAGAGTAGATTACTATCTCCTAACAAAGAAACCTACCTCTTTGTGGATAATAATGGGTGGGGATGTTACAGTACCTCGCCTGGAAGAGTTGGCGATTTGGCTCTAGGTGTTGAAAGAGGAGGCACTGGAGCTAAAAATGCTGCTAGTGCTCGTTCTAATCTAGGGTTGGGAAGTGTTTCTACTCTGGATAATGTACCTATTGCTAATGGTGGGACAGGGGCTGGAGATGCTGCTGGTGCAAGGTTCAATATTGGAGCGCTAAGTAACACCCCAGCAAATTCAGGCGTAGGTGGTACTGGGGATCGTGTCCAACATGCCTCCGGGAACGGCCTGTTTACTTTAGACTTGTTTAACTGCTACTG
GTATATGCAGCCAGAGGATACTAACTTCTGGATTGCCCACGGTGTGTCATATGCAGGGTCTGGTGGCGAAGCCTCACAATACGGTCGCATATCTTACGCAATAAAGATTGCAGACGGAACAATAAAATATGTCCATTGTCTTACAAACAAAAACACCACTGTTGATGTAAACGGATTTATCAAAGCGGCATCTCCGGTAGTTCATATTTACTCCGACGGTCGATATGAGACGAACGAGGAATCTGAGGGGGTTAATGTCATTCGGCAAGGTGTTGGTGAGTACTTGATTACCGGTTGCCTTGGCTTAAACGCAGATGCAAAATGGGGAGGCATTGACGGCGGCTTTGAAATCCCTACTGACCGCAACAAACAACCTAGGGTTTGGCTGGACTATGAGATTAAAGAAGATGGTTCTATTTTAGTTAAAACCTATCACAGAACCCATCCAACCTCTCCGGTTTTTTCTAGAAATGAGATAGAAGGTTTTTCTGACGGTGATCCTATTGATATCCCGAGTGATGCTTTTATTTCAGTTCGTGTTGAAATGCCTTCTGAGTAA
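The CDS location above lies on the minus strand, so the coding sequence is the reverse complement of the genomic interval. A minimal sketch of that extraction follows, assuming the range uses 1-based inclusive coordinates as in GenBank feature tables. The helper names are illustrative. Under that assumption the interval spans 45537 - 42979 + 1 = 2559 nt, i.e. 853 codons, which matches 852 residues plus a stop codon.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def cds_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive interval and orient it by strand.

    Assumes start <= end are forward-strand coordinates;
    strand is "+" or "-".
    """
    if not (1 <= start <= end <= len(genome)):
        raise ValueError("interval outside genome")
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


if __name__ == "__main__":
    n = cds_length(42979, 45537)
    # Nucleotide length, codon count, residues excluding the stop codon.
    print(n, n // 3, n // 3 - 1)
```

With the full PQ030848 nucleotide record loaded as `genome`, `extract_cds(genome, 42979, 45537, "-")` should begin with `ATG` and end with a stop codon if the coordinate assumption holds.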
Genome Context
Tertiary structure
PDB ID
8a9bdfa27fc0611753d16063b55c5509e63acc178133ec1cc959008df975b453
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
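The confidence bands above can be expressed as a small lookup. This is a sketch only: the page uses strict inequalities and does not say where the boundary values 90, 70 and 50 fall, so placing each boundary in the lower band is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the page's confidence label.

    Assumption: exact boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 41.7):
        print(value, plddt_band(value))
```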