Protein
View in Explore- UniProt accession
- A0AAE8B2N3 [UniProt]
- Protein name
- Tail fiber protein
- RBP type
-
TFTFTSPTFTF
- Protein sequence
-
MTTYTNIPQDITKLKDEKMDKNNNLSDLADRAAAWLNVRPIGSTPLAGDPVGDYDAATKRWVENKINTGTVGPTMNGVMNYGVGDFHLRDSRAYIQPYEVVSDGQLLNRADWPELWAYAQMVGAIDDSVWLADKFQRGRYSLGDGTTTFRVPDRNGVQQGSIRALYGRGDGGNSGANGQLFESAAPNITGIVPSYSSTSYAQVFGAAAAGAFFVNNGIFPSGDGDAIPSTGNIALTGRYNTLNFDASHSSPIYGASTDEILTRNFVGVWVIRASGGFVAANTQWQVINADAVRPGSVVKASSGQVKAQYKIGTTVEAEAGLRCDAYMDDVYYAIMSVYNKTKGVTKNLAFDDTGTLNSDRYSARYGTMMAWSETGLVKGTFSTEEQAIGTNYAFNSILSGSQYSNAGYKTSAHFGLIHNDLGSFADSCWHVSGDADNKYGVRLKIQPNNNSIYFYSWWPNGAATYTLQLNAISDSRLKHDIKAIDATKSIEVLKGLEFQSFIYNNDEESRIRRGVIAQQVETIEPLYVKTRRFYNDDGVEQEQKELDTTPMLLDTMHVVQDLIKRVEDLEEELKQLRANLVQ
- Physico‐chemical
properties -
protein length: 582 AA molecular weight: 63914,22360 Da isoelectric point: 5,10621 aromaticity: 0,10309 hydropathy: -0,39330
Domains
Domains [InterPro]
DC_1651
STR
10–524
STR
10–524
Coil
Unmapped
11–31
Unmapped
11–31
IPR030392
CHP
473–573
CHP
473–573
DC_0409
RBD
504–582
RBD
504–582
1
582
Architecture
STR 10-524 | RBD 525-582
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
582
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 287 | 287 | 0,3082 |
| Central domain | 288 | 486 | 200 | 0,1114 |
| C-terminal | 487 | 582 | 95 | 0,9975 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-287
1-287
Central
288-486
288-486
C-terminal
487-582
487-582
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage SelmaRatti [NCBI] |
2852006 | Uroviricota > Caudoviricetes > Demerecviridae > Tequintavirus > Tequintavirus selmaratti |
| Host |
Escherichia coli K-12 [NCBI] |
83333 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QXV84327.1
[NCBI]
Genbank nucleotide accession
MZ501103
[NCBI]
CDS location
range 23377 -> 25125
strand +
strand +
CDS
ATGACTACTTATACTAATATCCCACAGGATATTACAAAACTTAAAGACGAAAAGATGGATAAAAACAACAATTTATCCGATCTTGCAGACCGTGCAGCAGCATGGTTAAATGTGCGGCCTATTGGGTCCACCCCTCTTGCGGGGGATCCTGTTGGGGACTATGATGCCGCAACTAAACGTTGGGTAGAAAATAAGATTAACACCGGTACAGTCGGTCCCACCATGAATGGTGTTATGAATTACGGCGTTGGTGATTTCCACCTTCGAGATAGTCGTGCGTATATTCAACCATACGAAGTCGTTTCTGACGGACAGCTTCTTAATAGGGCTGACTGGCCTGAATTGTGGGCGTACGCGCAAATGGTTGGGGCCATAGATGATTCAGTTTGGTTAGCAGATAAATTTCAGCGTGGCAGATACTCATTGGGTGATGGAACAACAACCTTTCGTGTTCCAGATAGAAATGGGGTGCAACAAGGATCAATACGTGCTCTTTATGGTCGTGGTGATGGTGGGAATAGTGGTGCAAATGGCCAACTATTTGAATCAGCTGCGCCTAATATAACAGGTATTGTTCCTAGCTACTCAAGTACTAGCTACGCTCAGGTATTTGGGGCTGCTGCTGCCGGTGCATTCTTTGTTAATAATGGGATTTTTCCTAGTGGGGATGGGGATGCGATCCCCAGTACAGGAAATATTGCATTAACTGGTCGCTATAATACATTAAATTTTGACGCCTCACACTCAAGCCCGATCTATGGGGCATCAACTGACGAAATCCTTACACGAAACTTTGTCGGCGTTTGGGTGATTCGCGCATCTGGTGGATTTGTTGCCGCGAATACGCAATGGCAAGTCATCAATGCTGATGCTGTTCGCCCAGGGTCTGTAGTGAAGGCATCTTCTGGGCAGGTAAAGGCACAATACAAAATTGGGACGACAGTTGAAGCAGAAGCAGGATTGCGATGCGATGCATACATGGATGATGTGTATTACGCAATAATGTCTGTATACAACAAAACAAAAGGGGTGACAAAAAACCTTGCCTTTGATGATACAGGTACATTAAACTCTGATAGATATAGCGCCCGCTATGGGACTATGATGGCTTGGAGTGAAACAGGTCTTGTTAAAGGTACATTTAGTACAGAAGAACAGGCTATTGGAACAAATTACGCTTTTAACAGTATTTTGAGTGGTTCCCAGTATTCTAATGCGGGGTATAAAACATCTGCACATTTTGGATTAATTCATAATGACTTGGGTTCCTTTGCTGATTCTTGCTGGCACGTATCGGGAGACGCAGATAACAAATATGGTGTACGTTTAAAAATTCAACCGAACAATAATAGTATTTATTTCTATTCTTGGTGGCCCAATGGCGCAGCAACGTATACTCTGCAGTTAAATGCTATATCTGATTCTCGTTTAAAGCATGATATTAAAGCTATAGATGCAACTAAGTCCATCGAGGTATTAAAAGGATTAGAATTCCAGTCCTTTATATACAACAATGATGAAGAATCCCGTATCCGACGTGGGGTTATTGCTCAGCAAGTAGAGACTATAGAGCCACTCTATGTAAAGACTAGAAGATTCTATAATGATGACGGAGTAGAACAGGAGCAAAAGGAATTAGATACAACTCCAATGCTTCTTGACACTATGCATGTTGTGCAAGATCTTATTAAAAGGGTTGAAGACTTAGAGGAAGAACTCAAGCAATTAAGAGCAAACCTAGTGCAGTAA
Genome Context
Genome Context
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0098015 | virus tail | Cellular Component | IEA:UniProtKB-KW (UniProt) |
Tertiary structure
PDB ID
661761749902ca6c84921f52194fa1726d63322e4c6266ac35f0b9569a8eabb4
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50