Protein
View in Explore- Genbank accession
- QIW88875.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MIKNSVVTGRKGGSSKPHTPQEMEDNLISINKIKVLLAVSDGEVDANFSLKDLYLNDVPVIAPSGEVNYEGVTAEFRPGTQTQDYIKGFNDTAAEFTVGRELKTTTPYVISVTNKQLSAVRVKILMPRGVTTKENGDMVGVVVKWAVDMAVDGGNYQEVLSDVIDGKTISGYDKTKRIDIPAFNSQVLLRVRRVTADSTDARVVDAINVQSYAEVIDAKFRYPLTGLVYVEFNSELFPQIPSISTKKKWKIINVPSNYDPVLREYSGTWDGSFKKAWSNNPAWVLYDLITNQRYGLDQRELGIAVDKWSLYDAGQYCDQKVPDGHGGTEPRYLCDVVIQSQVEAYNLVRDICSIFRGMSFWNGESLSIVIDRPREASYIFTNDNVVDGSFAYTFASEKSMYTTCNVTFDDEQNMYSQDIEGVFDLNASLRFGHNPTSITAIGCTRRSEANRRGRWVLKTNLRSTTVSFATGLEGMIPMIGDVVAIADNFWSSNLTLSLSGRVMEVSGLQVFTHFKVDARAGDFIIVNKADGNPVRRTISKVSADGKTIELNVGFGFDVQPNTVFAIDRTDVALQQYVVTGITKGDGDDEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVDPDNMQPPKNIKVSSYSRIVQGMSVETMVISWDKVQYASKYDVQWRKDNGNWMNVPRTANKEVEVEGIYAGNYHVRVRSISSGGNTSPWSEVVSVGLTGKIGKPDKPTVVIASDDQVFGIRVKWGFPEGSGDTAYTELQQIPDNGAGGHAGEENASLLTMIPYPQYEYWHSTLPAGYVNWYRARLVDRIGNVSDWSDIVRGMASDDVESIIGEIKVDIENSEGFKYLQQNAIESNGAIQAQAESILENAIANDTDVRRMTKENGRRKAEYVQAVNLIADETQARVEALTQLKAQIDDEVVASITEVQTALATETEARTTADTALSAQLGDTQAALNEKLDSWVDAESAGAQYGVKLGLKYNGVEYSAGMSMELVGSGAGVKSQFIFDANRFAISNGVGSGSGQWALPFVVESNQVFIQSAVIKDGSITNAKIGNRIQSNNYVQGSHGWAIDKSGFAELSNATVRGSLYANNGNFAFNGTSNTVQINGNGITVNLPGGGRVVVGVWS
- Physico‐chemical
properties -
protein length: 1127 AA molecular weight: 123630,64220 Da isoelectric point: 4,88016 aromaticity: 0,09051 hydropathy: -0,29423
Domains
Domains [InterPro]
DC_0014
STR
1–1127
STR
1–1127
IPR053171
Unmapped
3–881
Unmapped
3–881
IPR055385
ATT
93–218
ATT
93–218
IPR013783
STR
627–721
STR
627–721
IPR003961
STR
628–722
STR
628–722
IPR003961
STR
628–716
STR
628–716
IPR003961
STR
647–715
STR
647–715
1
1127
Architecture
STR 1-92 | ATT 93-218 | STR 219-339 | ATT 340-502 | STR 503-1127
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1127
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 503 | 503 | 0,8829 |
| Central domain | 504 | 1057 | 555 | 0,0628 |
| C-terminal | 1058 | 1127 | 69 | 0,7666 |
Note: Constraints were applied during segmentation.
Fixed 41 C-terminal predictions appearing before Central domain
Fixed 41 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-503
1-503
Central
504-1057
504-1057
C-terminal
1058-1127
1058-1127
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage CJ19 [NCBI] |
2723904 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QIW88875.1
[NCBI]
Genbank nucleotide accession
MT176427.1
[NCBI]
CDS location
range 11986 -> 15369
strand -
strand -
CDS
ATGATTAAAAATAGTGTAGTGACGGGCCGAAAGGGTGGCAGCAGCAAGCCGCATACCCCACAGGAGATGGAAGATAACCTGATCTCAATCAACAAGATTAAGGTTTTGCTCGCTGTTAGTGATGGCGAGGTTGACGCGAATTTCTCATTAAAGGACTTATATCTTAATGACGTTCCTGTAATTGCACCATCCGGCGAGGTTAACTATGAAGGCGTAACGGCTGAGTTTCGCCCTGGAACGCAGACTCAGGATTACATCAAAGGCTTCAACGACACGGCGGCTGAGTTCACTGTAGGCCGTGAATTGAAAACAACGACGCCTTACGTGATCTCGGTAACGAATAAACAGCTATCAGCCGTCCGCGTAAAAATCCTGATGCCTCGCGGCGTTACAACGAAAGAAAACGGCGATATGGTCGGCGTGGTCGTGAAGTGGGCCGTAGATATGGCGGTTGATGGCGGGAACTACCAGGAAGTTTTATCAGACGTCATCGACGGAAAGACCATTAGCGGATATGACAAAACAAAGCGAATCGACATCCCAGCGTTCAATAGTCAGGTTCTATTGCGTGTGCGCCGAGTTACGGCTGATTCAACTGATGCGCGTGTAGTTGACGCGATAAACGTTCAGTCATATGCGGAAGTGATTGATGCTAAATTCCGTTACCCTCTGACCGGTCTTGTTTACGTTGAGTTTAACAGCGAGTTGTTCCCGCAGATCCCAAGTATTAGCACGAAAAAGAAATGGAAGATCATCAACGTTCCTAGTAACTACGATCCAGTACTGAGAGAATATAGCGGCACATGGGATGGTAGTTTTAAAAAAGCATGGTCAAATAATCCGGCGTGGGTGCTTTATGATTTGATCACCAATCAGAGATACGGACTCGACCAAAGAGAACTAGGGATCGCGGTAGATAAGTGGTCACTCTATGACGCTGGTCAATACTGCGATCAGAAAGTACCAGACGGTCACGGCGGAACAGAGCCTCGCTATCTTTGCGATGTTGTGATTCAGTCTCAGGTTGAAGCGTACAATCTGGTGCGAGACATTTGCTCAATATTCCGAGGAATGAGTTTCTGGAACGGAGAGAGCCTTTCAATTGTCATCGACCGACCGAGAGAGGCGTCTTATATTTTCACTAACGATAACGTCGTTGATGGCTCATTTGCCTATACGTTCGCAAGTGAAAAAAGCATGTACACAACCTGCAACGTCACGTTCGACGACGAGCAGAACATGTACAGCCAGGACATTGAGGGTGTGTTCGACCTTAACGCCTCGTTACGCTTCGGTCATAACCCGACAAGCATCACAGCAATCGGATGTACTCGACGCAGTGAAGCGAATCGACGGGGCCGCTGGGTTCTCAAAACAAACCTTCGCAGCACGACGGTGAGTTTTGCAACTGGCCTTGAGGGTATGATTCCGATGATTGGCGACGTGGTGGCAATCGCTGATAACTTCTGGTCAAGCAACCTGACGCTTTCGTTATCAGGTCGCGTGATGGAGGTGTCTGGCCTGCAAGTGTTCACACACTTTAAGGTAGATGCACGCGCTGGTGATTTCATTATCGTGAACAAGGCTGATGGTAATCCGGTGCGCCGGACTATTTCAAAAGTGTCTGCGGACGGGAAAACGATTGAGCTTAACGTTGGGTTTGGCTTTGATGTTCAACCAAACACAGTATTCGCAATCGACCGCACTGATGTTGCCTTGCAGCAATACGTTGTAACCGGAATCACGAAGGGCGACGGCGACGATGAGTTCACGTACAGCATTACGGCAGTTGAATACGATCCGAACAAATACGACGAGATTGACTACGGCGTAAATATTGATGACCGACCAACCAGCATTGTAGATCCTGACAACATGCAGCCACCTAAAAATATCAAGGTGTCTTCATATTCAAGAATTGTCCAGGGAATGAGCGTAGAAACTATGGTCATCTCATGGGATAAAGTGCAGTACGCGAGCAAGTACGATGTGCAGTGGCGCAAGGACAATGGCAACTGGATGAACGTACCGCGCACAGCAAACAAAGAAGTCGAGGTTGAAGGGATTTACGCAGGCAATTATCACGTACGCGTAAGGAGCATCTCAAGCGGTGGAAATACGTCTCCATGGTCTGAAGTTGTCAGCGTCGGATTGACGGGTAAGATTGGGAAACCAGATAAACCAACAGTAGTAATAGCATCTGACGATCAGGTGTTTGGGATTAGGGTTAAATGGGGATTCCCTGAAGGCTCTGGTGATACTGCATACACGGAGTTGCAGCAGATACCAGATAATGGTGCTGGAGGTCACGCTGGTGAGGAGAATGCGAGTCTATTAACAATGATTCCATACCCACAGTACGAGTACTGGCACTCGACACTGCCAGCAGGATATGTGAACTGGTACAGAGCGAGATTAGTGGACAGAATTGGGAACGTATCTGACTGGTCTGACATTGTTCGAGGAATGGCTAGTGATGACGTGGAATCGATCATTGGTGAAATTAAGGTTGATATCGAGAATTCCGAGGGGTTTAAGTATCTACAGCAGAACGCCATTGAGTCAAACGGAGCTATTCAGGCGCAAGCGGAATCAATTCTTGAGAACGCCATTGCAAACGACACGGATGTGCGTCGAATGACTAAGGAAAACGGCAGGAGGAAGGCTGAATATGTTCAGGCAGTGAATCTTATTGCAGATGAAACTCAGGCTCGTGTAGAGGCTCTCACGCAGCTTAAAGCGCAGATTGATGATGAGGTCGTAGCCTCCATAACTGAAGTTCAGACGGCACTAGCAACCGAAACGGAAGCTAGGACTACTGCCGACACTGCTTTAAGCGCTCAGCTTGGCGACACGCAGGCCGCACTGAATGAAAAACTTGACTCATGGGTAGATGCTGAATCAGCTGGCGCTCAGTACGGCGTTAAATTGGGCCTCAAATACAATGGGGTTGAATATAGCGCAGGGATGAGCATGGAACTTGTTGGCAGCGGTGCTGGTGTTAAGAGTCAGTTCATATTTGACGCTAACAGATTCGCGATCAGTAATGGGGTTGGTTCTGGTTCCGGTCAGTGGGCATTGCCTTTTGTTGTTGAGAGCAACCAGGTGTTCATACAGAGCGCCGTAATCAAGGACGGTTCAATCACGAACGCTAAGATTGGTAACAGGATCCAATCTAACAACTACGTTCAAGGAAGTCATGGGTGGGCTATTGATAAATCTGGTTTTGCTGAACTAAGTAATGCCACCGTTAGAGGAAGCCTTTACGCTAACAATGGTAACTTCGCGTTTAACGGAACAAGCAACACTGTTCAGATTAACGGTAACGGAATCACTGTTAACCTGCCAGGTGGAGGAAGGGTTGTAGTTGGAGTATGGAGTTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
fa0ba4f879124ba53ed81742fc44e20806e4f27e57223617daeacd2fa6ccc4a4
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50