GenBank accession
ASH99372.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP (Evidence: DepoScope; Probability: 1.00)
TSP (Evidence: RBPdetect; Probability: 0.91)
TSP (Evidence: RBPdetect2; Probability: 0.95)
TF (Evidence: Phold; Probability: 1.00)
Protein sequence
MSSGCGDVLSLADLQTAKKHQIFEAEVITGKSGGVATGADIDYATNAVTGQTQKTLPAVLRYAGFSPVSWDFSTGGTLTVNGRDKVVYDPVSKTWYSYAGALPVTVPAGFNPVGNADWKPQTDPDLRNELTSTSGASLVGGSVYVVDTFESAKLVENVNSKTLRTLGHHTEGKGPAEYVKTGTTGTPSSGTEALFYDATGHGWRLKTDSVDYYMFGAELDGATDDTLPVVAAHTYANANKIPIVQNGGNAFVTPNDSSRQIVFNTDAAFTGGFKFSSNSITGRGFVVRSVADEITLSQSDVVISEFVATRMKIPSLSGYANCYAAIQSTETDLNRKRDTGPALQLKRQPVVIGKDGTLDAPLWTTFSSISAIKIQSILLPTITIDGFRLRSEGALTNANPFGVQRNNVVIKNFYYEKGTTSATIPVQSLLSVEFCYNVTIEGISCDPLSVGIVDYNYVINTWTSSKISIRNMTSFDGWAQLDGNYCRNVSISDSIIDRVGSHFRCYDYTFKNLTARRGRCIAVSGGGLLHVENIKVYAESSTVSDEQYTVVALRGDYGAEWDGAVVVKNVIFDFSNFYSSSTGITASIVSAFIDSTVGAHDFMRTVVMPKSISIKDCTFIINTMPGSYLRALGVGCTSAIVSSVIYPSNIDIENITCDNSSGTNKFKVRGVEWNTLQRSEIRRHDVNIRVKGVVNIDPATYGETILDDANSTNTILTTGTNTRIYMTVRDCPWQSLRAEGNYIRTKVYGGSITYFTGTAGTSNRISFYGTEFNGTRFRGAIKVTIQDCIFNNYVDYTSASVPIGNGQSVDTNTTFIIGTATEFGATITGTVVTASTAKTGYSQSGYFQ
Physico-chemical properties
protein length: 848 AA
molecular weight: 91426.13200 Da
isoelectric point: 5.84012
aromaticity: 0.09906
hydropathy: -0.11521
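The aromaticity and hydropathy values above can be recomputed from the protein sequence. The record does not say which definitions it uses, so the sketch below makes two assumptions: aromaticity is the fraction of F, W and Y residues, and hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are illustrative and not part of any database tooling.

```python
# Kyte-Doolittle hydropathy index per residue. Using this scale is an
# assumption; the record does not name the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")  # assumed definition of "aromatic" residues


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence in this record.
fragment = "MSSGCGDVLS"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Running both functions on the full 848-residue sequence and comparing against the listed values is a quick way to check whether the assumed definitions match the ones used for this record.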

Domains

Domains [InterPro] (domain graphic not preserved; legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other)

Taxonomy

Phage
Name: Escherichia phage ST20 [NCBI]
Taxonomy ID: 2569975
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
ASH99372.1 [NCBI]
GenBank nucleotide accession
MF153391 [NCBI]
CDS location
range 19675 -> 22221
strand -
CDS
ATGTCAAGCGGATGCGGTGACGTTTTAAGCCTGGCGGATTTACAAACCGCCAAGAAGCACCAGATTTTCGAAGCCGAGGTTATCACCGGTAAATCCGGCGGGGTGGCTACTGGCGCTGATATTGATTACGCGACCAATGCTGTTACGGGTCAGACGCAGAAGACGCTCCCGGCGGTACTGCGCTATGCTGGGTTTTCTCCAGTATCGTGGGATTTCTCCACGGGCGGTACGTTGACAGTTAATGGCCGCGACAAAGTGGTGTATGACCCTGTAAGTAAAACGTGGTACTCGTATGCGGGTGCACTTCCTGTTACAGTTCCAGCCGGTTTTAACCCTGTGGGAAACGCTGACTGGAAACCGCAAACTGACCCCGATTTACGCAACGAGCTAACATCCACATCTGGTGCGTCGTTAGTAGGAGGGTCTGTATATGTCGTAGATACTTTTGAGTCGGCTAAACTTGTAGAAAACGTTAACTCTAAAACATTACGCACTTTAGGACACCACACAGAAGGTAAAGGCCCGGCGGAGTACGTTAAGACCGGAACTACAGGCACTCCGTCTAGCGGAACAGAAGCTTTATTTTACGATGCGACAGGTCACGGGTGGCGTCTAAAAACAGACAGTGTTGATTATTACATGTTTGGTGCGGAGCTTGACGGGGCAACAGACGATACATTACCGGTGGTAGCGGCGCACACCTACGCTAACGCTAATAAAATTCCTATTGTGCAAAATGGTGGTAATGCTTTCGTCACCCCAAACGACTCCTCCCGGCAGATTGTATTTAACACTGATGCCGCGTTTACCGGTGGGTTTAAGTTCAGTTCAAATTCAATCACTGGCCGTGGGTTTGTCGTACGCTCTGTTGCTGATGAGATAACCCTCTCACAGAGTGATGTTGTTATTTCTGAGTTCGTAGCAACACGGATGAAGATTCCAAGTCTGTCCGGTTACGCAAACTGTTACGCAGCTATTCAGTCTACGGAAACCGACCTAAATCGTAAACGTGATACTGGCCCTGCGTTGCAACTTAAACGTCAACCGGTAGTAATTGGCAAGGACGGAACTCTTGACGCGCCACTGTGGACGACTTTCAGCTCAATATCTGCGATTAAAATTCAAAGTATCCTTCTACCGACTATAACCATTGATGGTTTCCGGTTGCGGTCTGAGGGTGCGTTAACCAATGCCAACCCCTTCGGCGTTCAAAGGAACAACGTTGTCATAAAAAATTTCTACTATGAGAAAGGCACGACATCAGCAACAATACCGGTTCAATCCCTACTCAGCGTAGAATTTTGTTATAACGTAACTATAGAGGGTATCTCATGTGACCCGTTGAGTGTAGGTATCGTTGATTACAACTACGTTATAAACACATGGACAAGTTCTAAAATTTCCATCAGAAATATGACATCTTTTGATGGTTGGGCGCAGCTTGACGGCAACTATTGCCGAAATGTTTCTATTAGTGATTCCATAATCGACCGTGTAGGCTCTCACTTTAGGTGCTACGACTACACTTTCAAAAACCTCACGGCACGAAGGGGGCGCTGCATCGCTGTTTCCGGAGGAGGATTACTCCATGTGGAAAACATCAAGGTTTATGCGGAATCATCCACGGTATCCGATGAACAGTACACTGTCGTAGCCCTTCGAGGTGACTACGGTGCTGAGTGGGACGGAGCCGTCGTAGTTAAGAATGTTATTTTCGACTTCAGTAACTTCTACTCATCGTCTACTGGCATAACGGCATCTATAGTGTCGGCATTTATCGATTCCACCGTGGGTGCACACGATTTTATGCGTACAGTCGTTATGCCCAAGAGCATATCTATTAAAGATTGCACTTTCATCATAAACACCATGCCTGGAAGTTATCTGCGTGCATTAGGGGTTGGTTGTACCTCTGCAATAGTCTCTAGTGTTATTTACCCAAGTAATATAGATATAGAAAACATAACGTGTGATAACAGTTCTGGTACAAATAAATTTAA
GGTCCGCGGGGTGGAATGGAACACTTTACAGCGTTCTGAGATACGGCGACATGACGTGAACATCCGCGTGAAAGGTGTTGTAAATATAGACCCGGCGACATACGGAGAGACCATACTTGATGACGCTAACAGCACAAATACCATCCTTACTACCGGAACAAACACCAGAATCTATATGACTGTTCGTGACTGCCCGTGGCAGAGCTTGCGTGCGGAAGGTAACTATATACGCACAAAAGTTTATGGAGGGAGCATTACTTACTTCACAGGCACGGCGGGTACATCAAACCGTATATCTTTTTATGGAACTGAGTTCAACGGAACAAGATTTCGCGGTGCGATAAAGGTTACTATCCAGGACTGTATTTTCAACAATTATGTTGATTACACATCCGCATCTGTTCCCATAGGGAACGGGCAGAGTGTTGATACCAACACGACTTTTATCATCGGAACTGCTACAGAGTTTGGAGCTACCATAACTGGCACTGTTGTTACGGCTTCAACAGCAAAAACTGGATACTCGCAATCTGGGTATTTCCAGTAA
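The CDS coordinates and the protein length in this record can be cross-checked with a little arithmetic and a codon-by-codon translation. The sketch below assumes the coordinates are 1-based and inclusive, and uses the standard genetic code. The CDS shown above begins with ATG and ends with TAA, so it appears to be given in coding orientation already, although the feature lies on the minus strand. The helper names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding-orientation CDS; drop one trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


n_nt = cds_length(19675, 22221)
n_aa = n_nt // 3 - 1  # subtract the stop codon
print(n_nt, n_aa)  # 2547 848

# First 21 nucleotides of the CDS above, plus its TAA stop codon.
print(translate("ATGTCAAGCGGATGCGGTGACTAA"))  # MSSGCGD
```

The 2547 nucleotides correspond to 849 codons, that is 848 residues plus the stop codon, which agrees with the listed protein length of 848 AA.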

Tertiary structure

PDB ID: a34cd4ef897bf204024455b79c40c8559bd9837fa1f639aa182a08559d2dee43
Source: ESMFold
Method: ESMFold
Resolution: 0.7413
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
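The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: it assumes scores on a 0 to 100 scale, and because the legend leaves the exact boundary values (90, 70, 50) unassigned, it places each boundary in the lower band. The function names are hypothetical.

```python
from collections import Counter


def plddt_band(score: float) -> str:
    """Map one pLDDT score (0-100) to the legend's confidence band.

    Boundary values fall into the lower band; that choice is an
    assumption, since the legend does not specify it.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


def band_counts(scores):
    """Count how many residues fall into each confidence band."""
    return Counter(plddt_band(s) for s in scores)


# Made-up scores for illustration only, not taken from this model.
example_scores = [95.2, 88.0, 71.5, 64.3, 42.0]
print(band_counts(example_scores))
```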

Literature

Title: Complete genome of phage ST20
Authors: Liu, X., Liu, H. and Li, J.
Date: 2019-09-18
PMID: not listed
Source: GenBank