Protein
- GenBank accession: QEG13812.1 [GenBank]
- Protein name: hypothetical protein
- RBP type: TSP
- Protein sequence:
MMELKTFFAQDKFGNLVPGATVTVYNAGTTNLATGLKDQSGAALSNPFNADQSGKIAYQAPNGTYDMVIGTAEGTTSRIALQFFDYQSVVDAQNQTSANLAAQQALLTQTQDIINGAGDQSTLVRLNQPDGLNLIGAAPEGYGNPPTTPSHLNDVIKRVTPFQFGAKGDGVTDDSDAIQAAHDWCMQQSSRDAADPTRRIGGYVLDLNGPKYWLIKKPLKLNFGSINIEFGKSTLDCRQFPVGADANNHTVVISATYAGAYQQTRAWWSDLNMLGPGQESFCTAVRCNMTSAVPGGYERLCNVLYGGGIEQFGRGLSVGSNCYFLTCYNLSFGHCNEAYYFEPGGKNYGEQITFYSCVFNAGNYQVTTYGGMSNFYGCSFDYCYHQQMKLLGGMVNCVNCWWEGYGPVDYIINIPESTTSVRLNIIGGLFSFKAGATGAESRATSNPFYFGTRASVKFENVQFQKLGNAYYGDSTTAWIDGASQGVSFSGCYFPNSANSFTTVIMNSDPSVINNQFATESYTLGIGAENWLGDTWILPNSDNGLVRKNRWGYGLSNGGTTFSFTRTSNGFVNLDVTQPANSGTFEAVIGTIPVKSTGQVVHYVSGNLYLGTGDGSVNIKTYWVKLIVKGTPDNPDEPRILQMVKATDVNYPIGGTSGQLHSFRWPPSMPTGTQSPASTTVAPDWATHCIMTLDISKIKRNMGGIRINEVFMRQI
- Physico-chemical properties:
  - protein length: 714 AA
  - molecular weight: 77379.59350 Da
  - isoelectric point: 5.58013
  - aromaticity: 0.11625
  - hydropathy: -0.23964
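The page does not say how the aromaticity and hydropathy values were computed. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY); the function names are illustrative and not taken from the source:

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: "aromaticity" counts phenylalanine, tryptophan and tyrosine.
AROMATIC = frozenset("FWY")


def aromaticity(sequence: str) -> float:
    """Fraction of residues that are F, W or Y."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in sequence) / len(sequence)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy; raises KeyError on non-standard letters."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)
```

Passing the protein sequence listed above to both functions gives values that can be compared with the reported ones; a mismatch would suggest the page uses a different definition.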
Domains
Domains [InterPro]
| Entry | Type | Range (AA) |
|---|---|---|
| DC_1436 | ATT | 2–200 |
| IPR012334 | STR | 119–605 |
| DC_0715 | STR | 155–608 |
| IPR011050 | STR | 157–540 |
Architecture
ATT 2–200 | STR 201–608
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
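The architecture line is non-overlapping, while the listed domain hits overlap (ATT 2–200 against STR hits starting at 119, 155 and 157). The page does not describe how one is derived from the other. The sketch below shows one possible rule, given as an assumption: merge hits of the same type into a single span, then trim each later span so that it starts after the previous one ends. The function name and tuple layout are hypothetical.

```python
from typing import Dict, List, Tuple

Hit = Tuple[str, int, int]  # (domain type, start, end), 1-based inclusive

# Domain hits as listed on this page.
HITS: List[Hit] = [
    ("ATT", 2, 200),
    ("STR", 119, 605),
    ("STR", 155, 608),
    ("STR", 157, 540),
]


def flatten_architecture(hits: List[Hit]) -> List[Hit]:
    """Merge same-type hits, then trim overlaps in favour of the earlier span."""
    merged: Dict[str, Tuple[int, int]] = {}
    for dom_type, start, end in hits:
        if dom_type in merged:
            cur_start, cur_end = merged[dom_type]
            merged[dom_type] = (min(cur_start, start), max(cur_end, end))
        else:
            merged[dom_type] = (start, end)

    # Order spans by start position along the protein.
    spans = sorted((start, end, dom_type) for dom_type, (start, end) in merged.items())

    layout: List[Hit] = []
    previous_end = 0
    for start, end, dom_type in spans:
        start = max(start, previous_end + 1)  # push past the previous span
        if start <= end:  # drop spans that are fully covered
            layout.append((dom_type, start, end))
            previous_end = end
    return layout
```

With the four hits listed for this protein, this rule yields ATT 2–200 followed by STR 201–608, which matches the architecture line. Other rules could produce the same result.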
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 175 | 175 | 0.9937 |
| Central domain | 176 | 524 | 350 | 0.9949 |
| C-terminal | 525 | 714 | 189 | 0.9778 |
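The table can be checked for internal consistency. The sketch below assumes 1-based inclusive coordinates, so a span has length end - start + 1; the page does not state its convention, and the helper names are illustrative.

```python
from typing import List, Tuple

Row = Tuple[str, int, int, int]  # (domain, start, end, listed length)

# Rows copied from the segmentation table on this page.
SEGMENTS: List[Row] = [
    ("N-terminal", 1, 175, 175),
    ("Central domain", 176, 524, 350),
    ("C-terminal", 525, 714, 189),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive span (assumed convention)."""
    return end - start + 1


def length_mismatches(rows: List[Row]) -> List[Tuple[str, int, int]]:
    """Return (domain, listed length, inclusive length) where the two differ."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in rows
        if listed != inclusive_length(start, end)
    ]


def covers_protein(rows: List[Row], protein_length: int) -> bool:
    """True if the spans tile residues 1..protein_length without gaps or overlaps."""
    expected_start = 1
    for _, start, end, _ in sorted(rows, key=lambda row: row[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1
```

Under the inclusive convention the spans tile residues 1–714 exactly, but the central and C-terminal rows span 349 and 190 residues, while the Length column lists 350 and 189. Both sets of lengths sum to 714, so the table alone does not show whether the boundary at 524/525 or the listed lengths are off by one.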
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Erwinia phage vB_EamM_TropicalSun [NCBI] | 2591372 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
- GenBank protein accession: QEG13812.1 [NCBI]
- GenBank nucleotide accession: MN013090 [NCBI]
- CDS location: range 14463 -> 16607, strand +
CDS
ATGATGGAATTAAAAACCTTCTTTGCGCAAGATAAGTTCGGCAACCTTGTGCCTGGCGCAACAGTAACAGTCTACAACGCCGGCACAACCAACCTCGCAACTGGCCTGAAAGATCAGAGTGGCGCGGCATTGTCAAACCCGTTTAACGCAGATCAGAGCGGTAAAATCGCATATCAGGCTCCAAACGGAACATATGACATGGTCATTGGTACTGCAGAGGGCACGACTAGTCGAATTGCCCTTCAATTCTTCGATTACCAAAGTGTAGTGGATGCGCAGAACCAAACTAGTGCCAACTTGGCGGCGCAACAAGCCCTGCTTACTCAGACCCAGGACATCATCAACGGAGCCGGTGACCAATCGACTCTGGTCAGACTCAACCAGCCGGATGGCCTAAACCTTATCGGCGCGGCACCAGAGGGGTATGGCAATCCGCCGACTACCCCATCGCACCTGAATGACGTCATAAAGCGAGTGACGCCATTCCAATTTGGCGCAAAGGGCGACGGTGTTACTGACGATTCCGATGCGATACAAGCAGCGCATGATTGGTGCATGCAACAATCTTCAAGGGACGCGGCGGATCCCACTCGAAGGATTGGCGGATATGTACTTGATTTGAATGGGCCAAAATACTGGCTGATCAAGAAGCCGCTGAAATTAAACTTCGGCAGTATCAACATCGAGTTTGGCAAGTCAACCCTTGATTGCCGCCAGTTCCCTGTCGGAGCCGACGCAAACAATCACACGGTCGTAATCAGCGCAACATACGCTGGCGCGTATCAGCAAACACGTGCGTGGTGGAGCGACCTGAACATGTTGGGGCCAGGTCAGGAATCGTTCTGCACGGCAGTTCGTTGTAACATGACTTCAGCCGTTCCTGGTGGATACGAGCGCCTTTGCAACGTCCTGTACGGCGGCGGAATCGAACAATTCGGGCGTGGCCTCTCTGTTGGCTCCAACTGCTATTTCTTGACTTGTTACAACCTTTCGTTTGGGCATTGTAATGAGGCTTATTACTTCGAGCCTGGTGGCAAGAACTATGGCGAACAAATCACATTCTATTCTTGTGTATTCAACGCCGGAAACTACCAGGTCACAACGTATGGTGGCATGAGCAACTTCTATGGATGCTCATTCGACTATTGCTATCACCAGCAAATGAAGTTGTTGGGCGGCATGGTGAACTGCGTTAATTGTTGGTGGGAGGGTTATGGCCCAGTTGATTATATCATCAACATCCCAGAGTCAACGACGTCAGTGCGATTGAACATCATCGGCGGATTGTTCTCATTCAAAGCTGGCGCGACTGGCGCTGAGTCAAGGGCGACGAGCAACCCATTCTATTTCGGAACCAGGGCGTCAGTTAAATTTGAGAATGTGCAGTTCCAGAAGTTGGGCAATGCGTATTATGGTGACAGCACCACCGCATGGATTGACGGAGCGTCGCAGGGCGTTTCATTTAGTGGATGTTATTTCCCGAACAGCGCAAACTCTTTCACGACCGTGATTATGAACTCAGACCCATCGGTGATCAACAACCAATTTGCAACCGAAAGCTACACGCTTGGTATTGGGGCTGAAAACTGGCTGGGTGATACGTGGATACTTCCGAATTCCGACAACGGATTGGTTAGAAAGAATCGCTGGGGATATGGGCTGTCAAATGGCGGCACCACATTTTCCTTCACAAGAACATCAAACGGATTCGTGAACCTTGATGTCACCCAGCCAGCCAACTCAGGCACCTTTGAAGCGGTCATAGGCACCATTCCTGTAAAAAGTACAGGCCAGGTTGTTCACTATGTATCCGGCAATTTGTATCTTGGCACAGGCGACGGATCAGTGAACATCAAGACTTACTGGGTTAAATTAATCGTCAAGGGAACGCCAGACAATCCTGATGAACCTAGAATCCTTCAGATGGTTAAGGCCACTGACGTGAACTATCCAATCGGAGGAACTTCAGGGCAGCTTCACTCATTCAGGTGGCCGCCATC
AATGCCGACCGGCACCCAATCGCCTGCATCAACTACAGTTGCGCCTGATTGGGCAACCCATTGCATTATGACTTTGGATATTTCAAAGATCAAACGTAATATGGGCGGCATCCGAATCAATGAAGTGTTCATGCGCCAAATCTAA
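The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive and that the CDS ends with a single stop codon (the sequence above ends in TAA); the function names are illustrative:

```python
def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based inclusive range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Amino-acid count implied by a CDS range, excluding the stop codon if present."""
    nucleotides = cds_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nucleotides // 3
    return codons - 1 if has_stop_codon else codons
```

For the range 14463 -> 16607 this gives 2145 nucleotides, or 715 codons, which corresponds to 714 amino acids plus the stop codon and agrees with the reported protein length.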
Genome Context
Tertiary structure
PDB ID
84f84200b64fc4d5a4e5a25c820053b393dadd1f84555c8b00210d19d3c0e272
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
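The confidence bands above can be expressed as a small lookup. This is a sketch only: the legend uses strict inequalities and does not say where the exact values 90, 70 and 50 fall, so assigning them to the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the confidence label in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # assumption: exactly 90 counts as "High"
        return "High"
    if plddt > 50:  # assumption: exactly 70 counts as "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 counts as "Very low"
```

Applied to the per-residue scores of a predicted model, this reproduces the four-colour confidence scheme of the structure viewer.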