Genbank accession
YP_004934724.1 [GenBank]
Protein name
hypothetical protein
RBP type
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MLGWSESGPPAPIPESVLGWFDVPQLTSSEDLYLQDEGFLSWLDNNQTMFGDLGLNKLSITNSDRLHFGDEAVVKIPQSVDDRLVSQDQGAFRFPGEGFTQYGDTGMLLPRSTGSDTVAFADGDSSRLKLESGASLAYQSEGDIAFTPRTETYTQITVTGSVQQYRVPAWCNFLDIIGLSAGASGQTGHGGNGNAGKGGKAGSWDYAMRERGRSNFPSTLIILSVTIGAGGSQPANSDLAGPNAGGATIVTSPETGTILNIPGGSGTAGGQNGESPGGLTVNGVSYTGGAGGTGNGGNATAPGGAGAGGNGGFFTSRTRGGVGSAGRVWIRAYQ
Physico‐chemical
properties
protein length:334 AA
molecular weight: 34018,76460 Da
isoelectric point:4,59506
aromaticity:0,08084
hydropathy:-0,29521

Domains

Domains [InterPro]
DC_1790
STR
36–334
IPR049304
STR
159–333
YP_004934724.1
1 334
Architecture
STR
STR 36-334
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage GTE7
[NCBI]
1100814 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Gordonia terrae
[NCBI]
2055 cellular organisms > Bacteria > Bacillati > Actinomycetota > Actinomycetes > Mycobacteriales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_004934724.1 [NCBI]
Genbank nucleotide accession
NC_016166 [NCBI]
CDS location
range 30698 -> 31702
strand +
CDS
ATGCTAGGCTGGAGCGAGTCTGGACCCCCAGCGCCGATCCCAGAATCGGTGCTGGGGTGGTTTGACGTGCCCCAGCTTACTTCATCCGAAGACCTATACCTTCAGGACGAGGGCTTCCTCTCCTGGCTGGACAATAATCAGACAATGTTCGGTGATCTTGGATTGAATAAGTTGTCCATAACTAATTCAGATCGCCTACACTTCGGCGACGAGGCAGTTGTAAAGATTCCACAGTCCGTGGATGATCGACTTGTTTCCCAGGACCAAGGTGCATTCAGATTCCCGGGTGAAGGATTTACCCAGTATGGCGATACTGGAATGCTTCTGCCGAGATCGACAGGCAGCGACACTGTCGCGTTCGCTGACGGGGATTCGTCTAGACTGAAGCTCGAGTCTGGGGCCTCGCTAGCTTACCAGAGTGAGGGCGACATAGCCTTTACTCCACGGACTGAAACCTACACGCAGATAACAGTGACCGGCTCCGTCCAGCAGTACAGGGTGCCAGCCTGGTGCAACTTTTTGGACATCATAGGACTGAGTGCTGGCGCGTCCGGCCAGACTGGTCACGGCGGTAACGGAAATGCAGGCAAGGGAGGAAAAGCTGGCTCGTGGGATTACGCGATGCGCGAGCGCGGCAGGTCCAACTTCCCGTCTACGCTTATCATCCTCAGCGTTACGATCGGCGCGGGTGGATCACAGCCTGCCAACTCGGACCTCGCTGGACCTAATGCAGGTGGCGCAACCATCGTGACCTCGCCAGAAACCGGAACAATACTGAACATTCCAGGCGGCTCCGGCACGGCTGGCGGACAGAACGGCGAGTCTCCGGGCGGCCTTACTGTTAACGGCGTGTCATACACAGGAGGAGCCGGAGGCACGGGAAACGGAGGCAACGCAACCGCCCCCGGCGGGGCTGGGGCAGGCGGAAACGGCGGCTTCTTTACCTCGAGAACTAGAGGTGGAGTTGGCTCTGCCGGTCGAGTGTGGATAAGGGCGTACCAGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
323c15f2c4038028771d5b4c655465111c3a788ddec6165e186fa67b873a7a41
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6880
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50