Protein
View in Explore- Genbank accession
- QHB37168.1 [GenBank]
- Protein name
- minor tail protein
- RBP type
-
TSP
- Protein sequence
-
MADFTGAYSGRPNHYLLLRVNGTAPAVAWEAWAIRGSGAVSFALDCFTWNVGIFGYAYGGCHNLDFRNTSQILLASGSTPASGTGYCGANHLNASIFGSAYAEGYFTAASAPPPPNIYCISPNPREITQTGMTVAFCSTGDGGSPITSWGLQRATDAAFTQNVQIVASSGTTVYNDMVPGTTYYFRARGQNAIGVGGWSGTVSATTLPAVPPGMTVAPALSGQSAVVTLTPPGGISGVTSYKVEYRNKAGGPTTTLTGASPMTVTGLTPGQTYEWRALANFGSAPSPWTAWTEYFQPNPNTNPGNYFDGATADTADLDFRWNATANNSPSTAYGKHATGWADFVQAAEVSGGTGAQYRVTGALARYPEGQAAGSFSARFAFFGDANVAGFRAGTDPVVGGAEVSEGGVYWCSIYAQPSRSQRLAAGVSWYDALGNLISRQIGAAQVVAPGAPVRLAMSAMCPEDGIATVEAVDVAGDGWSKWLGGETITVDGGMVTVGLQYPYFDGAAPDTAQYDYAWLAAAHASPSTRTTLDATNDDPLADPDCPPPPAPPSPPVITDDCITEVGVWRRYWVQVPPGEVAKWLATIPTLTLTTGEQAAREVRIRVFENPDDTPASTFEPGEWVSEQIVRYIPPNTSLRIDGVSQRVRASVNGAPLVAADHLLYGTGGGPASWPVLACGIGYLIALDVPLDAALGNLTTDLALTRRML
- Physico‐chemical
properties -
protein length: 708 AA molecular weight: 73947,46550 Da isoelectric point: 4,68850 aromaticity: 0,10311 hydropathy: -0,06031
Domains
Domains [InterPro]
DC_2279
ATT
9–311
ATT
9–311
IPR013783
STR
98–203
STR
98–203
IPR003961
STR
111–196
STR
111–196
IPR003961
STR
112–206
STR
112–206
IPR003961
STR
113–209
STR
113–209
IPR036116
STR
114–295
STR
114–295
IPR003961
STR
211–283
STR
211–283
1
708
Architecture
ATT 9-311 | STR 337-660 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
708
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 10 | 10 | 0,0086 |
| Central domain | 11 | 207 | 198 | 0,5644 |
| C-terminal | 208 | 708 | 500 | 0,4077 |
Note: Constraints were applied during segmentation.
Fixed 3 C-terminal predictions appearing before Central domain|Sequence started with non-N-terminal domain|N-terminal too short, forced to 10 residues
Fixed 3 C-terminal predictions appearing before Central domain|Sequence started with non-N-terminal domain|N-terminal too short, forced to 10 residues
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-10
1-10
Central
11-207
11-207
C-terminal
208-708
208-708
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Microbacterium phage Terij [NCBI] |
2686229 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QHB37168.1
[NCBI]
Genbank nucleotide accession
MN813684
[NCBI]
CDS location
range 24246 -> 26372
strand +
strand +
CDS
ATGGCTGACTTCACAGGGGCGTACTCCGGGCGCCCAAACCACTACCTCCTCCTCCGCGTCAACGGCACGGCGCCCGCGGTGGCATGGGAGGCGTGGGCGATCCGCGGGAGTGGCGCCGTCTCGTTCGCGCTCGACTGCTTCACCTGGAACGTCGGCATCTTCGGGTACGCCTACGGCGGATGCCACAACCTCGACTTCCGCAACACGTCGCAGATTCTCCTCGCGTCGGGCTCCACCCCGGCGAGCGGGACCGGCTACTGCGGGGCAAACCACCTGAACGCGAGCATCTTCGGGTCGGCGTACGCGGAGGGCTACTTCACCGCGGCCAGCGCACCGCCCCCACCGAACATCTACTGCATCAGCCCGAACCCGCGAGAGATCACGCAGACCGGCATGACCGTCGCGTTCTGCTCGACCGGCGACGGCGGCTCCCCGATCACGTCGTGGGGGCTCCAGCGGGCGACGGACGCCGCGTTCACGCAGAACGTCCAGATCGTCGCCTCCAGCGGCACGACCGTCTACAACGACATGGTGCCCGGCACGACGTACTACTTCCGAGCGCGGGGGCAGAACGCGATCGGCGTCGGCGGCTGGTCCGGGACCGTCTCGGCGACGACGCTCCCCGCCGTCCCGCCGGGCATGACCGTGGCCCCGGCGCTGTCCGGGCAGAGCGCCGTCGTCACGCTGACCCCTCCCGGCGGCATCTCCGGCGTCACCTCGTACAAGGTCGAGTACCGCAACAAGGCGGGCGGGCCGACGACGACCCTCACCGGCGCGAGCCCGATGACCGTGACCGGCCTGACGCCGGGGCAGACGTACGAGTGGAGGGCGCTGGCGAACTTCGGGAGCGCACCGAGCCCGTGGACGGCGTGGACCGAATACTTCCAGCCGAACCCGAACACGAACCCCGGCAACTACTTCGACGGCGCGACGGCAGACACCGCCGACCTCGACTTCCGGTGGAACGCGACCGCGAACAACAGCCCGAGCACGGCGTACGGAAAGCACGCGACCGGCTGGGCCGACTTCGTGCAAGCCGCGGAGGTATCGGGTGGAACCGGCGCGCAGTACCGGGTGACCGGCGCGCTCGCCCGCTACCCGGAGGGGCAGGCGGCAGGGTCGTTCTCCGCACGGTTCGCGTTCTTCGGAGACGCGAACGTGGCAGGCTTCCGCGCAGGCACGGACCCCGTGGTGGGCGGCGCCGAGGTGAGCGAGGGCGGCGTCTACTGGTGCTCGATCTACGCACAGCCCTCCCGATCGCAGAGGCTCGCCGCCGGAGTGTCCTGGTACGACGCGCTCGGCAACCTCATCTCCCGGCAGATCGGCGCCGCGCAGGTCGTCGCGCCAGGAGCGCCCGTACGGCTCGCGATGAGCGCCATGTGCCCCGAGGACGGCATCGCGACCGTGGAAGCGGTAGACGTGGCCGGAGACGGCTGGAGCAAGTGGCTGGGCGGCGAGACGATCACCGTGGACGGCGGCATGGTCACCGTCGGCCTCCAGTACCCGTACTTCGACGGCGCCGCACCCGACACGGCGCAGTACGACTACGCCTGGCTGGCCGCGGCGCACGCGAGCCCGAGCACGCGGACGACGCTCGACGCGACGAACGACGACCCGCTGGCCGACCCGGACTGCCCGCCGCCGCCCGCACCGCCGAGCCCGCCCGTCATCACGGACGACTGCATTACGGAGGTTGGCGTCTGGCGCCGGTACTGGGTGCAGGTGCCGCCCGGCGAGGTCGCGAAGTGGCTGGCGACGATCCCGACGCTGACGCTGACGACCGGAGAGCAGGCGGCGCGCGAGGTCCGCATCCGCGTGTTCGAGAACCCGGACGACACCCCGGCGAGCACGTTCGAGCCGGGAGAATGGGTCAGCGAGCAGATCGTCCGTTACATCCCGCCGAACACGTCGCTCCGCATCGACGGCGTGAGCCAGCGGGTGCGCGCGAGCGTGAACGGCGCACCGCTCGTCGCCGCCGACCACCTCCTCTACGGGACCGGCGGAGGCCCGGCGTCGTGGCCGGTGCTCGCGTGCGGGATCGGCTATCTGATCGCGCTCGACGTGCCGCTGGACGCCGCACTCGGTAATCTAACCACGGACCTCGCCCTGACCCGAAGGATGCTCTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
c1e93c67d466a079ee335c18e583eb76db9d0917349b7b73fcdfd3648b026bfe
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50