Protein
- Genbank accession
- UKL15177.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MGKGGGGKARTPVEDKDTIESSQMISVIDLWSEGQIYGLVDGLKSVRLDNTAVVAEDGSVNIPGVDISYNLGTEDQDYLDGFPQNASEISVGVEVKQSAPVIRTITDPRIDMLRINLYSRALFVITNDGDTKRTDLKMRVETRKGSEPWEVKARIDFIDQKSRNEFSFKAEIWDLPATPFDVRVVRETSDDQTGDFQQIQNSSFWRSYSQVINQKYRWPLTAYMGLKFDSKSFEGAIPRRNYIVRGMIVKVPSNYDPETRKYNGFWNGQFKAAYTNNPAWIVYDIIKNPRYGLGRRGINVEETEMYKAAQWCDQLVPDGRGGMEPRVTCNCYITEQRNAWDLITDIMSCFRAMPLWNGQQFVPSLDIAKDVVATYNNSNVINGTFEYSASSMEDRHSVIEVRYANKANNYEQDTVQITDDLMIEQYGWNVLKVEAFGTDTESQAYRFGSYLLETERLERKTVSFSTGAEGLRNLPGDVIAVADSRQYGRIIGGRILSVSEDRKSIEIDDEVEIPNNAETIIIVIGDDRKPVELLCTNNPGNAKVLNFSTTCPESLGRLSPWSLKINNSGLKLWRCVSVKENDDGTYAINCVEHVPKKNEIVDNGVKFNPPEETLYGNNLPPVENISVEAVVENPNANVRVYWDAPRTARQIRYNVRIYRSGNLVTNQNIDNPSFSFMADTAGTYRAEIRCLGSDGKLGDSVDVVFVIAEPSMPSDVSWRASNFTVTLRPIPGGLVTIGEVYEWFIGSTEQEVLAMNNNLGEAFVLNQVGLKPNTEYWFGVRAVNMIGRSAIKTVLTKTAFETESLEGLIDIALPKTDYIKEMNKDIEGLGELASLRVVDKNGGRPRVTGVYLNAGDAGNNIASVIDFVADAVSISSPDTLERWVYFDSTNRRLVLGGEIQAVSGRLKNVVIEENCVIEGKLSVANIEGYAMEGQSYEFNISNTGGSKTINYGGNAKIPVRLFGQVWARQYKNQKTRVTVNGKTINQMEVSVIVNNNGTVTTRTYTWLYTFVVDLSINQGAEIFVSAGSLDQGNSESSTYRTQFWIAPQSNGFTSN
- Physico-chemical properties
- protein length: 1055 AA; molecular weight: 117824.82110 Da; isoelectric point: 4.97423; aromaticity: 0.09289; hydropathy: -0.37403
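The page does not state how the aromaticity and hydropathy values were computed. A common convention, assumed here, is the relative frequency of F, W and Y for aromaticity and the mean Kyte-Doolittle score (GRAVY) for hydropathy. The function names below are illustrative only, and this minimal sketch is not claimed to reproduce the exact figures listed above.

```python
# Kyte-Doolittle hydropathy scale (assumed scale; the page names no method).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence above, used only as a demo input.
fragment = "MGKGGGGKARTPVEDKDTIE"
print(f"aromaticity={aromaticity(fragment):.5f} hydropathy={gravy(fragment):.5f}")
```

Passing the full 1055-residue sequence instead of the fragment gives whole-protein values that can be compared with the listed ones.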
Domains
Domains [InterPro]
| Entry | Type | Range |
|---|---|---|
| DC_0323 | STR | 1–1034 |
| IPR053171 | Unmapped | 3–793 |
| IPR055385 | ATT | 87–213 |
| IPR003961 | STR | 619–695 |
| IPR013783 | STR | 620–803 |
| IPR036116 | STR | 632–796 |
| IPR057587 | ATT | 712–802 |
Architecture
STR 1-86 | ATT 87-213 | STR 214-333 | ATT 334-498 | STR 499-711 | ATT 712-802 | STR 803-1034 |
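The architecture line is a simple pipe-separated list of `LABEL start-end` segments. As a minimal sketch, assuming 1-based inclusive coordinates, it can be parsed and checked for gaps or overlaps as follows (the helper names are hypothetical):

```python
import re

# Architecture string copied from the record above.
ARCHITECTURE = (
    "STR 1-86 | ATT 87-213 | STR 214-333 | ATT 334-498 | "
    "STR 499-711 | ATT 712-802 | STR 803-1034 |"
)


def parse_architecture(line: str) -> list[tuple[str, int, int]]:
    """Return (label, start, end) tuples in the order they appear."""
    return [
        (label, int(start), int(end))
        for label, start, end in re.findall(r"([A-Za-z]+)\s+(\d+)-(\d+)", line)
    ]


def is_contiguous(segments: list[tuple[str, int, int]]) -> bool:
    """True if every segment starts right after the previous one ends."""
    return all(prev[2] + 1 == nxt[1] for prev, nxt in zip(segments, segments[1:]))


segments = parse_architecture(ARCHITECTURE)
# Total residues labelled ATT, counting both endpoints of each range.
att_total = sum(end - start + 1 for label, start, end in segments if label == "ATT")
print(len(segments), is_contiguous(segments), att_total)
```

The segments cover residues 1 to 1034 without gaps, so the last 21 residues of the 1055-residue protein fall outside the annotated architecture.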
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 501 | 501 | 0.8556 |
| Central domain | 502 | 918 | 418 | 0.0509 |
| C-terminal | 919 | 1055 | 136 | 0.8899 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-501
- Central: 502-918
- C-terminal: 919-1055
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Serratia phage vB_SmaS_Bonzee [NCBI] | 2914027 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
Genbank protein accession: UKL15177.1 [NCBI]
Genbank nucleotide accession: OM135608.1 [NCBI]
CDS location: range 22434 -> 25601, strand +
CDS
ATGGGAAAAGGTGGAGGCGGTAAGGCAAGAACACCCGTCGAAGACAAAGACACCATCGAGTCTAGTCAGATGATTTCTGTAATTGACCTCTGGTCAGAGGGTCAGATTTACGGACTTGTTGACGGACTTAAAAGTGTTCGCCTTGATAATACTGCGGTAGTTGCTGAAGACGGTTCAGTAAATATCCCCGGTGTTGATATTTCATATAACCTTGGGACGGAAGACCAAGATTATTTAGATGGCTTTCCGCAAAACGCTTCGGAAATAAGCGTTGGAGTTGAGGTAAAGCAGTCGGCTCCTGTAATTAGAACCATTACAGACCCGCGCATAGATATGCTTCGCATAAATCTCTATAGCCGTGCTTTGTTCGTAATAACCAATGACGGCGACACGAAGCGCACTGATTTAAAGATGCGAGTAGAAACAAGAAAAGGCTCTGAGCCTTGGGAAGTTAAAGCGCGCATTGATTTCATAGACCAGAAATCAAGAAACGAGTTTTCTTTCAAGGCTGAAATATGGGATTTACCGGCGACGCCATTTGACGTGCGAGTAGTTCGCGAAACGTCAGACGACCAGACCGGCGATTTCCAGCAGATTCAAAACTCGTCTTTCTGGCGTTCGTATTCTCAGGTAATTAACCAAAAATACCGCTGGCCTTTAACTGCGTATATGGGTCTTAAATTCGACTCGAAATCATTCGAGGGCGCAATACCAAGAAGAAATTATATTGTACGCGGGATGATTGTCAAGGTTCCATCAAACTATGACCCGGAAACCAGGAAATATAACGGTTTCTGGAATGGTCAATTTAAGGCGGCTTATACAAATAACCCTGCGTGGATTGTTTACGACATTATCAAAAACCCTAGATACGGCCTTGGGCGTCGTGGAATTAACGTTGAAGAAACTGAAATGTATAAAGCCGCGCAATGGTGTGACCAGCTAGTGCCTGACGGTCGTGGTGGAATGGAGCCGCGAGTTACATGTAATTGCTATATCACTGAACAGAGAAACGCCTGGGACTTGATAACCGACATAATGAGCTGCTTCCGTGCTATGCCGTTATGGAACGGACAGCAATTTGTTCCGTCTCTTGATATTGCAAAAGACGTTGTCGCGACATATAACAATTCAAACGTAATTAACGGCACGTTTGAATATAGCGCATCTTCTATGGAAGACCGTCACTCAGTTATAGAGGTTCGTTACGCGAATAAGGCCAACAACTACGAACAGGATACAGTTCAAATAACTGATGACCTGATGATAGAACAATACGGATGGAACGTGCTGAAGGTTGAGGCCTTTGGTACTGATACCGAATCACAGGCTTATCGATTTGGCTCTTATCTGCTGGAGACTGAAAGACTGGAAAGAAAAACGGTTTCATTCTCAACTGGCGCAGAGGGTTTAAGGAATCTTCCTGGCGACGTAATAGCTGTTGCTGATTCACGCCAGTATGGCAGGATTATTGGAGGCAGGATTCTGTCTGTATCCGAAGACAGAAAGAGCATTGAGATTGATGATGAAGTTGAAATCCCAAACAACGCAGAAACTATAATTATAGTTATCGGCGATGACAGAAAGCCAGTTGAGCTTTTATGCACTAACAATCCTGGCAACGCTAAGGTTTTGAATTTTTCCACCACTTGCCCCGAAAGTTTGGGTCGCTTGTCTCCTTGGTCGTTAAAAATAAATAACAGCGGTCTGAAGCTTTGGCGATGCGTTAGCGTTAAGGAAAACGACGATGGGACTTACGCCATTAACTGCGTTGAGCACGTTCCTAAAAAAAACGAAATTGTAGATAACGGCGTAAAATTTAACCCTCCTGAAGAAACTTTGTATGGTAACAACCTGCCGCCAGTTGAGAATATCAGCGTTGAGGCTGTAGTAGAAAACCCTAACGCAAACGTTCGAGTTTATTGGGATGCACCAAGAACGGCGAGGCAAATTCGCTACAACGTTAGAATCTATCGCTCTGGTAATTTGGTCACAAACCA
AAATATCGATAATCCATCATTTTCTTTCATGGCGGATACGGCGGGAACATACCGCGCGGAAATTCGCTGCCTCGGCTCTGATGGGAAACTAGGTGATAGCGTTGATGTTGTTTTTGTCATTGCTGAACCCTCAATGCCGTCAGATGTTTCATGGCGCGCATCAAACTTCACGGTTACGTTAAGGCCAATTCCTGGAGGACTGGTTACTATAGGTGAGGTTTACGAGTGGTTTATAGGCTCAACCGAGCAAGAAGTTTTGGCTATGAATAACAATCTTGGCGAAGCGTTCGTTTTAAACCAGGTTGGATTAAAGCCAAATACTGAATATTGGTTCGGTGTTCGAGCTGTAAACATGATTGGCCGCTCGGCGATAAAAACGGTATTGACGAAAACGGCGTTCGAAACCGAATCACTTGAAGGTCTTATTGATATTGCGCTGCCTAAGACTGACTACATCAAGGAAATGAACAAAGACATAGAGGGTCTTGGAGAGCTTGCGTCTCTTAGGGTTGTTGACAAGAACGGCGGAAGGCCTCGCGTAACTGGTGTTTATCTAAACGCTGGTGATGCAGGAAACAACATAGCTTCGGTGATTGATTTTGTAGCTGATGCCGTCTCAATCTCAAGTCCTGACACGCTAGAGCGCTGGGTGTACTTTGATTCAACCAACAGACGTTTGGTGCTTGGTGGTGAGATACAGGCGGTTTCTGGTCGACTGAAAAACGTTGTCATAGAAGAAAACTGCGTCATAGAAGGAAAGTTGTCAGTGGCGAACATCGAAGGCTATGCGATGGAGGGGCAAAGCTACGAGTTCAACATAAGCAACACTGGAGGGAGTAAGACCATAAACTACGGAGGGAACGCAAAGATACCTGTTAGGCTGTTCGGCCAAGTGTGGGCTAGACAGTACAAAAACCAAAAGACTAGGGTAACGGTAAACGGAAAAACCATAAACCAAATGGAGGTATCAGTTATCGTTAACAACAACGGCACGGTGACGACCAGGACTTACACCTGGCTTTATACGTTTGTTGTTGACCTGAGCATAAACCAGGGGGCTGAAATCTTCGTCAGCGCCGGGTCGCTAGACCAGGGCAACAGCGAATCCTCAACATACAGGACGCAGTTTTGGATTGCCCCCCAATCCAATGGATTCACCTCTAACTAA
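The CDS coordinates and the protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive and that the final codon is a stop codon (the sequence above ends in TAA); the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_protein_length(nt_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return codons - 1 if has_stop else codons


nt_length = cds_length(22434, 25601)            # 3168 nucleotides
aa_length = expected_protein_length(nt_length)  # 1056 codons minus the stop
print(nt_length, aa_length)  # 3168 1055
```

The result, 1055 residues, agrees with the protein length reported in the physico-chemical properties.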
Genome Context
Tertiary structure
PDB ID
2afc4fc7532c43e306321a47256ab98dccebeacc68b2af56662c6b69c21adf28
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
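The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch with a hypothetical function name. The legend does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption made here.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"   # includes exactly 90 (assumption)
    if score > 50:
        return "Low"    # includes exactly 70 (assumption)
    return "Very low"   # includes exactly 50 (assumption)


# Hypothetical per-residue scores, for illustration only.
for score in (95.2, 81.0, 63.5, 41.7):
    print(score, plddt_band(score))
```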