UniProt accession
A0AAE9KIA9 [UniProt]
Protein name
Tail fibre protein gp37 trimerization region domain-containing protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,87
TF
Evidence RBPdetect2
Probability 0,92
Protein sequence
MKGYSFSELVKLAEAAMPRAGGTFDGAITAPQVNIGNSQSSDPKAATRFDWVNARLQEKLSLSGGSMSGQLIGKTTGVPNRLEQLNGQYSAFNVANIEVDPASQGGVSYAEALHSSSQVKGVGFVQHMSLGQKRQSGASYHDVYLATGGSDTGATVEWLFGSTGVLACPGAVLLGGSTQATQTNAAVRRDYLLSQLGNYLPLSGGTVQGQVVIAHDANTLNLRPKTAAAPSYLLGQDYAGNNQWYVGKGDTSADIYLHNYVGGSNLQLEDGGTIALNPGNRQVWLNGYSLSMCTAGNKEFRMYGANGEDHFALWHEPASGQAHLRVNGGGACTLQADGSFVSPARIWANDVTLNSGKASLRTATPGGHADWWTRIPALQVYADATRTAATSVWMAGELGASQIAAMDVNRPEAGAIVSLHLEGGYAQHQWTHRDYTASGAVRGLGGVFDAGSRVYSPVNVPILSATHGYNDVINGGTFNVGAYAFLQNHPDLRVGTVAPGATCDGSRLFPTSVDNDWDDMYGAPVPGVWRCQGYSNLGGAVPGGWETDWTAGKTLWIRVA
Physico‐chemical
properties
protein length:560 AA
molecular weight: 58885,61720 Da
isoelectric point:5,94646
aromaticity:0,08929
hydropathy:-0,21714

Domains

Domains [InterPro]
DC_0339
STR
20–218
G3DSA:6.20.80.10
STR
217–275
DC_1924
RBD
224–560
A0AAE9KIA9
1 560
Architecture
STR
ATT
RBD
STR 20-217 | ATT 218-272 | RBD 273-560
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage ZPAH14
[NCBI]
2924887 Uroviricota > Caudoviricetes > Chaseviridae > Shantouvirus > Shantouvirus ZPAH14
Host Aeromonas hydrophila
[NCBI]
644 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UOT58054.1 [NCBI]
Genbank nucleotide accession
OM810291 [NCBI]
CDS location
range 48712 -> 50394
strand +
CDS
ATGAAAGGCTATTCTTTTAGCGAGCTGGTAAAACTTGCAGAAGCCGCGATGCCTCGCGCTGGCGGCACCTTCGATGGGGCAATCACCGCTCCTCAAGTGAACATCGGCAACTCCCAATCAAGCGACCCCAAAGCTGCAACCCGATTCGACTGGGTCAATGCACGACTGCAAGAAAAGCTGTCCCTGTCTGGCGGCTCAATGAGCGGCCAGCTCATTGGCAAAACCACCGGTGTGCCAAACCGACTAGAACAACTTAACGGCCAGTATTCGGCGTTCAATGTCGCGAACATTGAAGTTGATCCAGCCTCCCAAGGCGGAGTATCTTACGCCGAGGCACTCCACTCGAGCTCCCAAGTCAAGGGCGTAGGGTTCGTGCAGCACATGTCTCTCGGCCAGAAACGTCAGAGTGGCGCAAGCTACCATGACGTGTATCTTGCTACTGGTGGGTCTGATACTGGTGCAACAGTCGAGTGGCTGTTTGGGTCTACTGGGGTACTGGCTTGCCCTGGAGCTGTGCTGTTGGGCGGAAGCACTCAAGCAACCCAAACCAACGCTGCGGTACGCCGAGACTACCTGCTGAGCCAGTTGGGCAACTATCTGCCGCTGTCAGGCGGGACTGTCCAGGGGCAAGTAGTAATCGCCCACGACGCTAACACGCTCAACCTTCGACCAAAGACTGCAGCAGCCCCAAGCTATCTCCTAGGCCAAGACTACGCTGGCAACAACCAGTGGTACGTTGGTAAGGGAGACACTTCGGCAGATATTTACCTGCATAACTACGTCGGCGGATCGAACCTACAGCTAGAAGATGGAGGTACGATTGCACTTAACCCAGGCAACCGCCAAGTGTGGCTGAATGGGTACTCCCTGTCGATGTGTACAGCGGGCAACAAAGAGTTCCGAATGTACGGAGCTAATGGGGAAGACCACTTCGCCCTCTGGCATGAGCCCGCTTCTGGGCAAGCACACCTGCGAGTCAATGGTGGAGGCGCATGCACCCTGCAGGCGGATGGGTCTTTTGTGAGCCCTGCGCGGATCTGGGCCAACGATGTTACCTTGAATTCAGGGAAAGCCTCCTTACGGACGGCCACACCTGGAGGCCATGCGGACTGGTGGACCCGGATCCCCGCGTTACAAGTGTACGCGGACGCCACCAGAACAGCAGCAACAAGCGTATGGATGGCCGGAGAGCTTGGCGCGTCACAAATCGCAGCGATGGACGTGAATAGACCAGAGGCTGGGGCAATTGTATCCTTGCACCTCGAAGGAGGCTACGCGCAACACCAGTGGACACACCGAGACTACACCGCCTCGGGGGCAGTTCGTGGGTTGGGCGGGGTGTTCGACGCGGGGTCCAGAGTCTACTCTCCAGTAAACGTACCAATTCTGTCTGCTACTCATGGTTACAATGACGTGATCAATGGCGGTACATTCAACGTTGGCGCATACGCGTTCCTCCAGAACCACCCAGACCTACGGGTTGGGACTGTAGCCCCCGGAGCAACCTGTGACGGCAGCCGGCTGTTCCCAACCAGCGTGGACAACGACTGGGATGATATGTACGGAGCTCCAGTGCCAGGCGTCTGGCGCTGCCAAGGCTACTCAAACCTCGGTGGTGCAGTACCTGGTGGCTGGGAAACTGACTGGACTGCAGGTAAAACACTATGGATTCGAGTAGCATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
08f327cd1335ec894720a2bb5db70b23e2371bc6830c16af9932ae76a29efe46
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6016
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50