Protein
- Genbank accession
- ANH50889.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
- TF
- Protein sequence
-
MLGEDDELSFCLRDGDVINVYCQPSGAIGDLIGAILKPVTKIFSFLTPKVSTPKTDKSSKTSPNTGLKAQTNIARNGEARPDNFGQIRAFPDLLQESLFEYINNIKYVTEFMNFGLGKYDVSSVRYSESNLGSLAGASYTIYQPGEVIPVVYEPYAFDDVDGQELYGPNELDTDPPPVVIETATTTTVTETEFAGGQIAVKIPKNSAFDYFVDLSMPHDVVFKLNITYAQGGGASVTENVTLSGRLASATETDDGGLPPVNYWYTFIINSINYSGAPISSLNGVTINNTYFNLTDNQPIVSGPYFSPIDGDQLWVHLQHQTNDGNDFSVLIEWWKIDDDNVQIPGTYQSMNYYRDVDRNDTFYYTIKLTPSAGTGRYAIQMRRTNNSSDTSILQLEEIHSIVTRTNVSYPDDTVVKVVVRATENATGSRDRKYNALITRHTIGYNRDTGTVRYTLAPSRSFADAVLHNWLITAGNPENTIDIVKLYEIADSLPDERLGCFDYTFDDEDKSIGERLQTICDAARVTAFWDDGVMSFSRDEKREYPATVFNTRNTQSDGYKLSYDISLPGTYDGVNVEYRDPTTNKQANVYYRITDSGIVEGEPTKAKKFDMLYVRNRYQAVDRAILECRRLIYSRRSMEIKALSDGEYVNVGDMIQVVDMYDDVQQTGVIEARNGNVFTTSEQLTADDNLYVVITSSDGSTSDRLPATVTGLHTFTCNLPTDFQLNIWDGASVQSESRYVLSTEKELDTTLWVVSQKNPGSDGTTTLTMSEYSDDMYEYVIPSS
- Physico-chemical properties
- protein length: 783 AA; molecular weight: 87403.74180 Da; isoelectric point: 4.47797; aromaticity: 0.10473; hydropathy: -0.41047
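The aromaticity and hydropathy values can be recomputed from the protein sequence alone. The following is a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which tool produced its values, and the function names here are illustrative.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); an assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence above, for illustration only.
    peptide = "MLGEDDELSF"
    print(f"aromaticity: {aromaticity(peptide):.5f}")
    print(f"hydropathy:  {gravy(peptide):.5f}")
```

Under this assumption, the reported aromaticity of 0.10473 over 783 residues corresponds to 82 aromatic residues (82 / 783 ≈ 0.10473).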
Domains
Domains [InterPro]
DC_0118: STR, residues 2–625
NF040662: Unmapped, residues 311–777
Sequence axis: residues 1–783
Architecture
STR 2-625 | RBD 641-782 |
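The architecture line is a pipe-separated list of `NAME start-end` fields. A minimal sketch of parsing it and slicing out a domain follows, assuming coordinates are 1-based and inclusive (the usual convention for protein annotations, but not stated in this record). `parse_architecture` and `domain_sequence` are hypothetical helper names.

```python
import re


def parse_architecture(line: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 2-625 | RBD 641-782 |' into (name, start, end) tuples."""
    domains = []
    for field in line.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", field)
        if match:  # empty fields (e.g. after a trailing pipe) are skipped
            name, start, end = match.groups()
            domains.append((name, int(start), int(end)))
    return domains


def domain_sequence(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return protein[start - 1:end]


if __name__ == "__main__":
    for name, start, end in parse_architecture("STR 2-625 | RBD 641-782 |"):
        print(name, start, end, "length:", end - start + 1)
```

With these coordinates the STR domain spans 624 residues and the RBD spans 142.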
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Salmonella phage 64795_sal3 [NCBI] | 1813769 | Uroviricota > Caudoviricetes > Saltrevirus > |
| Host | Salmonella enterica [NCBI] | 28901 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Genbank protein accession
ANH50889.1
[NCBI]
Genbank nucleotide accession
KX017520
[NCBI]
CDS location
range 28749 -> 31100
strand +
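The CDS coordinates can be checked against the reported protein length. A minimal sketch, assuming the range is 1-based and inclusive and that the final codon is a stop codon. The helper names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Codons in the range minus one for the assumed terminal stop codon."""
    length = cds_length(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3 - 1


if __name__ == "__main__":
    print(cds_length(28749, 31100))               # 2352 nucleotides
    print(expected_protein_length(28749, 31100))  # 783 residues
```

The result, 783, agrees with the protein length listed under the physico-chemical properties.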
CDS
GTGCTCGGTGAAGATGATGAGCTTTCTTTTTGCCTGCGCGATGGCGACGTAATCAACGTTTATTGTCAGCCATCTGGCGCGATAGGAGACCTTATCGGTGCGATACTGAAACCAGTAACGAAGATTTTCTCCTTCCTTACACCGAAAGTATCAACGCCAAAAACGGATAAAAGTTCAAAAACATCACCTAATACCGGCCTGAAGGCACAGACAAACATTGCGCGCAATGGAGAGGCGCGACCTGATAACTTCGGGCAGATTCGCGCGTTTCCTGATTTGCTTCAGGAATCATTATTTGAATATATCAATAACATTAAATACGTCACCGAGTTCATGAACTTTGGTCTCGGTAAGTATGACGTATCATCCGTGCGTTATTCTGAGTCAAACCTCGGCTCACTGGCTGGCGCGAGTTACACCATCTATCAGCCGGGAGAGGTTATTCCGGTTGTGTATGAACCTTACGCATTTGATGATGTTGATGGACAGGAGCTTTACGGACCAAACGAACTGGATACCGACCCTCCTCCAGTGGTCATTGAAACTGCAACCACAACTACGGTCACAGAAACGGAATTTGCTGGAGGTCAGATTGCTGTCAAGATACCTAAGAATTCAGCATTTGATTACTTCGTTGACCTAAGTATGCCTCATGATGTGGTTTTTAAGCTAAATATCACATACGCGCAGGGAGGCGGTGCATCTGTAACTGAAAACGTTACGCTATCAGGAAGACTTGCATCTGCAACAGAAACTGACGATGGCGGGCTGCCACCAGTAAACTACTGGTACACATTTATTATTAACAGCATCAACTACTCAGGTGCGCCAATATCATCACTGAATGGCGTAACGATTAATAACACCTATTTCAATCTGACAGATAACCAGCCGATTGTTTCCGGTCCGTACTTTTCGCCGATTGATGGTGATCAGCTTTGGGTGCATCTGCAACACCAGACCAATGACGGCAATGATTTCAGCGTGCTCATTGAGTGGTGGAAGATTGACGACGATAACGTTCAGATCCCCGGAACATATCAGTCGATGAACTATTATCGGGACGTGGACAGAAACGACACGTTCTACTACACGATAAAGTTAACCCCATCCGCTGGCACTGGTCGCTACGCGATTCAGATGCGACGGACAAACAACAGTTCAGACACGTCAATCCTTCAGCTTGAGGAAATTCACTCAATCGTCACGCGCACTAACGTATCTTATCCAGATGATACGGTGGTTAAGGTGGTCGTGCGAGCAACAGAGAACGCCACAGGCAGCCGTGACAGAAAATATAATGCGTTAATCACACGTCACACCATCGGATACAACCGCGATACTGGCACTGTGCGCTACACGCTTGCACCATCACGCAGCTTTGCTGATGCAGTTCTGCATAACTGGCTTATTACCGCTGGAAATCCAGAAAACACGATCGACATAGTGAAGCTGTATGAAATTGCCGACAGCCTGCCTGATGAGCGACTTGGTTGTTTCGACTATACATTCGATGATGAAGATAAAAGTATTGGCGAACGCCTGCAAACAATCTGCGACGCAGCACGAGTCACCGCATTCTGGGATGATGGCGTGATGAGCTTTTCGCGTGATGAGAAACGAGAATATCCAGCAACAGTATTTAACACCAGAAACACGCAGAGCGACGGATACAAGTTAAGTTACGACATCAGCCTTCCAGGGACTTATGACGGGGTTAACGTCGAATACCGCGACCCAACAACGAATAAGCAAGCCAACGTTTACTACCGAATTACCGATAGCGGAATAGTCGAAGGCGAACCAACGAAAGCCAAGAAATTCGACATGCTTTATGTTCGCAATCGATATCAGGCTGTTGACAGGGCAATCCTTGAGTGTCGCCGCCTGATTTACTCCCGTCGTAGCATGGAGATTAAGGCTCTTTCGGATGGGGAATATGTGAACGTTGGTGACATGATTCAGGTTGTCGATATGTATGATGATGTGCAACAGACTGG
CGTTATTGAAGCGCGCAACGGAAACGTATTCACAACAAGCGAGCAACTAACGGCTGATGATAATCTTTATGTTGTGATTACCAGTTCTGATGGCAGCACATCAGACAGATTGCCAGCAACAGTGACCGGATTGCATACATTCACCTGCAACCTGCCGACTGATTTCCAGCTGAATATATGGGATGGAGCAAGCGTGCAATCTGAATCTCGCTATGTGCTGAGCACTGAAAAAGAACTGGATACCACTCTGTGGGTTGTCAGCCAGAAAAATCCAGGAAGTGACGGTACAACAACTCTGACCATGAGCGAATACAGTGATGACATGTACGAATATGTCATCCCGTCATCGTGA
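The CDS begins with GTG while the protein begins with M. This is consistent with a bacterial-style translation table in which GTG can act as a start codon and is read as methionine at the first position. A minimal sketch of such a translation follows, assuming NCBI translation table 11 (the record does not name the table). Only a common subset of the alternative start codons is handled, and `translate_cds` is a hypothetical helper.

```python
BASES = "TCAG"
# Standard amino acid assignments in NCBI codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a common subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a CDS, reading an initial start codon as M and
    stopping at the first in-frame stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        codon = cds[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 nucleotides of the CDS above.
    print(translate_cds("GTGCTCGGTGAAGATGATGAG"))  # MLGEDDE
```

The output matches the first seven residues of the protein sequence listed above.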
Genome Context
Tertiary structure
PDB ID
e5deb0aab5283d849183ef0ab44c2142a116e0e6bac578a07f8035f4bda94955
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
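The confidence bands above can be applied to per-residue pLDDT scores with a simple threshold function. A minimal sketch follows. The legend leaves the exact boundary values (90, 70, 50) unassigned, so this sketch assumes each boundary value falls into the lower band. `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (90, 70, 50) go to the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```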
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Complete Genome Sequences of three Siphoviridae Bacteriophages infecting Salmonella enterica enterica subsp. Enteridis | Paradiso,R., Lombardi,S., Iodice,M.G., Riccardi,M.G., Orsini,M., Bolletti Censi,S., Galiero,G. and Borriello,G. | 2016-12-29 | — | GenBank |