Protein
View in Explore- UniProt accession
- A0A7D7F8Z3 [UniProt]
- Protein name
- Long tail fiber protein Gp37
- RBP type
-
TFTSPTSP
- Protein sequence
-
MADLSRIQFKRTSTKGRRPDAGTMNPGELAINLADQYLLTKNDSGAIINLSCPPVYDRDVTMAGKVKGNNYILSKTANYLEDQTARDLNYFGAFRTNGLDGLLELTLNVPHSSGVQHGRGFTFQYGHTGSRVETYGYNKEGQKAFSYKMYHEGDKPTPGELNVYSKQEVDRMFVKNVKLSTGSGDIVEGYFKLATAMIPQNGLSVFFRIHGGNGYNVTAYDQVDIVEIVIRSGNGRPKGVNVIAYRRNTNKSFDVLAVNTSGDNYDIYVKYQRYTDNVIVEFGKSVGVNLVVHDVPDFVAERPAGDNVIGGRAVTLFNTENKRGVLSFDDNTQNSYDIVHMSNDKGTGRKYIRKFRSNYNEMIWHETVQGNTYRLATGTTDASEVFRITSRTMFTGKGVFDAGQNVLRLERPSNQSNYIEWQDRRNGADVRQGWIGFGGAETNDFQWYSDHAKNSFMLDANGQCSIITGATKIVYTNGQYYSANSDAYRMIYGNYGAFWRNDGTKVYLLSTAENDRFGGWNGNRPFIYDLTSGNVQLGGDGNEDALTLECASRAARFSNDVYIKKGLLTFDAGRAGSRDYIRFNHWGDSNNARDNVLYIEDSQGRHFSTERAMGTGALKAYFLGDLEVGGKFTWGKNTATSSFNIRAWGNDSRKQVLECADESGWHWYTQRPGGPGTNEIIFNVNGKVQSQGFETSGNIKITGPDIEFRRTGNKHIWFRDPDGLELGLIYCDDNGVIRFRGQKQTQDWIFGNKTIQLGTSATVSGSGNGLIRGQVQGGAWTQWRDRAAGLLVDCQNTENSAHNVWKATHWGKFHIAAMGIHVSGGNIANALARLHVNDATFDHSASGDFQASRNGSFNDVYIRSDARLKINKEEYKENATDKVNRLTVYTYDKVKSLTDRTVIAHEVGIIAQDLEKELPEAVTTSKIGDPDKPEEILTISNSAVNALLIKAFQEMSEELKAVKAELAELKKN
- Physico‐chemical
properties -
protein length: 972 AA molecular weight: 108220,90530 Da isoelectric point: 7,31156 aromaticity: 0,10597 hydropathy: -0,57243
Domains
Domains [InterPro]
DC_0932
STR
31–565
STR
31–565
IPR048390
ATT
445–526
ATT
445–526
IPR030392
CHP
864–966
CHP
864–966
1
972
Architecture
STR 31-444 | ATT 445-526 | STR 527-972
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
972
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 251 | 251 | 0,1924 |
| Central domain | 252 | 450 | 200 | 0,3887 |
| C-terminal | 451 | 972 | 521 | 0,6507 |
Note: Constraints were applied during segmentation.
Fixed 203 C-terminal predictions appearing before Central domain
Fixed 203 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-251
1-251
Central
252-450
252-450
C-terminal
451-972
451-972
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage vB_EcoM_011D4 [NCBI] |
2735300 | Uroviricota > Caudoviricetes > Pantevenvirales > Krischvirus > Krischvirus RB49 |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QMP82575.1
[NCBI]
Genbank nucleotide accession
MT478991
[NCBI]
CDS location
range 103078 -> 105996
strand +
strand +
CDS
ATGGCAGATTTAAGCAGAATCCAATTTAAACGTACTAGCACTAAAGGCCGTCGACCTGATGCTGGTACGATGAATCCGGGGGAATTGGCAATCAACCTTGCGGATCAATATCTTCTTACTAAAAATGATTCCGGTGCTATTATCAATTTAAGTTGTCCTCCGGTTTATGACCGCGATGTTACAATGGCAGGTAAGGTTAAAGGTAATAATTATATCTTAAGTAAAACCGCTAACTATTTGGAAGACCAGACAGCGCGAGATCTTAACTACTTTGGTGCTTTCCGTACCAATGGTCTTGATGGTCTTCTCGAACTCACGCTAAACGTTCCTCATTCTTCTGGTGTTCAGCATGGTCGAGGATTTACTTTCCAGTATGGACACACTGGATCGCGTGTAGAAACTTATGGCTATAATAAAGAAGGTCAAAAAGCATTTAGCTATAAAATGTATCATGAAGGTGATAAACCAACTCCAGGAGAATTGAACGTTTATAGCAAACAAGAAGTTGACCGTATGTTTGTTAAGAACGTTAAACTTTCTACAGGTTCTGGTGATATCGTTGAAGGCTATTTTAAATTAGCAACTGCAATGATTCCACAAAACGGTCTTAGCGTATTTTTCCGTATTCATGGTGGTAACGGATATAACGTTACTGCATACGACCAAGTTGATATTGTAGAAATTGTTATTCGCAGTGGAAATGGTCGCCCTAAAGGTGTTAACGTTATTGCATATCGCCGAAATACAAACAAATCATTTGATGTTTTGGCTGTTAATACTTCTGGTGATAACTATGATATCTATGTGAAATATCAGCGTTACACTGATAACGTTATTGTTGAATTTGGTAAAAGTGTTGGTGTTAATCTGGTAGTTCATGACGTTCCAGATTTTGTTGCTGAACGTCCTGCTGGCGATAATGTTATTGGCGGTCGTGCTGTAACTCTTTTTAACACAGAAAATAAGCGTGGTGTGTTGAGTTTTGACGATAACACACAAAACAGCTATGATATTGTTCATATGAGTAATGATAAAGGTACTGGTCGGAAATATATTCGTAAATTCCGTAGTAACTATAATGAAATGATCTGGCACGAGACGGTTCAAGGTAATACATATCGATTGGCAACCGGAACGACTGATGCATCGGAAGTTTTTAGAATTACTAGCCGCACCATGTTTACTGGTAAAGGTGTATTTGATGCCGGACAAAACGTTCTTAGATTAGAGCGTCCTAGTAACCAATCAAACTATATTGAATGGCAAGATCGCCGAAATGGTGCTGATGTTCGTCAAGGTTGGATCGGTTTTGGTGGTGCTGAAACAAATGACTTCCAATGGTATAGCGATCACGCAAAAAACTCATTCATGTTGGATGCTAACGGTCAATGCTCTATTATTACTGGTGCGACCAAAATTGTATATACCAACGGACAATATTATTCCGCTAACTCTGATGCATATCGTATGATCTATGGCAATTATGGTGCATTCTGGCGTAATGACGGGACTAAAGTTTATCTTCTGTCTACGGCTGAAAATGACCGTTTTGGTGGCTGGAACGGAAACCGACCATTCATTTACGATCTTACTTCCGGTAACGTTCAATTAGGCGGTGATGGTAACGAAGATGCATTAACGTTAGAATGTGCTTCTCGTGCCGCTCGCTTTAGTAATGACGTTTACATTAAGAAAGGGCTTTTGACTTTCGACGCTGGGCGCGCTGGATCTCGCGATTATATTCGATTTAATCATTGGGGTGATAGTAATAATGCCCGTGATAACGTTTTGTACATAGAAGATAGTCAAGGCCGACATTTTAGCACAGAACGTGCGATGGGTACTGGTGCTCTTAAAGCATACTTCTTAGGCGATCTTGAAGTCGGTGGTAAGTTTACTTGGGGTAAAAATACAGCTACATCTAGCTTTAATATTCGTGCATGGGGTAATGATTCCCGTAAACAAGTATTAGAATGCGCGGATGAAAGTGGTTGGCATTGGTATACCCAACGTCCAGGCGGTCCGGGAACAAATGAGATTATATTCAATGTTAACGGGAAAGTTCAATCTCAGGGTTTTGAGACTTCCGGTAATATTAAGATAACTGGACCAGACATTGAGTTTCGTCGCACTGGCAATAAGCACATCTGGTTTAGAGATCCTGACGGCTTAGAATTGGGTTTGATTTATTGTGACGACAACGGCGTTATTCGTTTCCGTGGTCAAAAACAAACTCAAGATTGGATATTTGGCAATAAGACGATCCAATTAGGAACTAGTGCTACTGTTAGTGGATCTGGTAACGGTTTGATTCGCGGACAAGTTCAAGGTGGTGCTTGGACACAATGGAGAGATCGTGCTGCTGGCTTATTGGTTGATTGTCAGAATACCGAAAACTCTGCACATAACGTGTGGAAAGCGACACATTGGGGCAAATTCCACATTGCAGCTATGGGTATTCACGTTTCAGGTGGAAATATAGCAAACGCATTGGCGCGTCTACATGTTAATGATGCAACTTTTGACCATAGTGCGAGCGGTGACTTCCAGGCAAGCCGTAATGGTAGCTTTAACGATGTTTATATTCGTTCTGATGCTCGCCTTAAAATCAATAAGGAAGAGTATAAGGAGAATGCCACCGATAAAGTTAATCGCTTGACTGTGTACACCTACGATAAGGTTAAGTCTTTGACGGACCGCACTGTTATCGCTCATGAAGTTGGTATTATTGCTCAGGATCTTGAAAAAGAATTGCCGGAAGCAGTAACAACTTCTAAGATCGGCGATCCTGATAAGCCAGAAGAGATCTTAACAATTTCTAACTCTGCTGTCAACGCTCTTTTAATTAAGGCGTTTCAGGAAATGAGCGAAGAACTTAAAGCCGTTAAAGCTGAACTAGCGGAACTTAAAAAGAATTAA
Genome Context
Genome Context
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0098024 | virus tail, fiber | Cellular Component | IEA:UniProtKB-KW (UniProt) |
| GO:0019062 | virion attachment to host cell | Biological Process | IEA:UniProtKB-KW (UniProt) |
Tertiary structure
PDB ID
f104bc409eb886dfa5d001acf6e965188d2605dac18645ad8bcd4354a1d08fdf
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50