Genbank accession
QAU03635.1 [GenBank]
Protein name
short tail fibers protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MSTNTLKHISDKSEFKTFDPTGSNFDPSITNVQDALASISAIGVTGNIPSASETEQGIIRLATQQEVIDGTDTFSAVTPATLKGRLDIPTQATETYVGITRYATNAEAIAGTETQAAIVASSLKATIDYTFTNRLATENTTGVLKISTLPAALAGTDDTTAMTPLKTAQAIGAATSALPTYASATQTTEGIVRIATNAEVANGTLTNGVAISPSGLKSLTSTQGRAGIIRLATPQEASAGSDSNIALSPSTLLSRTGTTGRLGVVKLSTTVGSGDGNTALAYNANVISTTGGTINGTLNVNGTLRRNGRDVVTIDQLKDSVPIGTIVMWGGQINNIPAGWAVCDGGNSGEQGFRNVVGSKWGSTGARPDFRGLYPRGANQTNDGGLKEWDANVRIRDAKGNDAKGKPKLGVGCGSYGHGTVQAQQLRFHKHAGGFGEHDNAGAFGNTVRSNFAGTRKGLDWDNRSYFTNEGYEIDGSGSRDSRTTLNSEGLIGNENRPWTMSILFIIKVA
Physico‐chemical
properties
protein length:510 AA
molecular weight: 52973,04530 Da
isoelectric point:6,98224
aromaticity:0,05490
hydropathy:-0,28608

Domains

Domains [InterPro]
DC_0176
STR
1–435
SSF69349
STR
249–333
G3DSA:2.10.280.10
Unmapped
249–285
IPR015173
STR
250–308
QAU03635.1
1 510
Architecture
STR
ATT
STR
STR 1-309 | ATT 310-371 | STR 372-509 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage AnYang
[NCBI]
2499909 Uroviricota > Caudoviricetes > Pantevenvirales > Tevenvirinae > Dhakavirus
Host Escherichia coli O157
[NCBI]
1045010 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QAU03635.1 [NCBI]
Genbank nucleotide accession
MK234886.1 [NCBI]
CDS location
range 57082 -> 58614
strand +
CDS
ATGTCTACAAATACACTAAAACACATAAGTGATAAATCAGAATTTAAAACATTCGACCCTACCGGGTCGAATTTTGACCCCTCTATTACTAATGTCCAGGATGCATTGGCGTCTATTTCCGCTATAGGTGTTACAGGTAATATACCTTCGGCTTCTGAAACAGAACAGGGTATTATTAGATTAGCGACCCAACAAGAAGTTATTGATGGTACTGATACTTTCTCTGCGGTAACTCCGGCCACTTTAAAAGGTCGTTTAGATATTCCTACTCAAGCAACAGAAACTTATGTGGGTATCACCCGTTATGCCACTAATGCTGAAGCCATTGCCGGAACAGAAACCCAGGCTGCAATCGTAGCATCGTCTTTGAAAGCTACTATAGATTATACATTCACCAATCGTTTAGCCACAGAGAACACTACAGGCGTACTCAAGATTTCAACCCTGCCTGCTGCTCTAGCAGGTACAGACGATACGACGGCAATGACACCTCTGAAAACCGCTCAGGCAATTGGTGCCGCTACATCCGCATTACCGACTTATGCCAGTGCTACTCAAACTACTGAGGGTATTGTTAGAATCGCAACAAATGCTGAAGTGGCAAATGGTACATTGACCAACGGTGTAGCGATTTCTCCTTCCGGGTTAAAATCTTTAACTTCTACCCAAGGACGAGCAGGTATTATTAGATTAGCCACCCCGCAAGAAGCTTCTGCCGGCAGCGATTCCAATATAGCTTTATCTCCTTCAACTCTTCTTTCTCGTACTGGCACTACCGGAAGGTTGGGTGTTGTTAAATTATCGACTACGGTCGGTTCTGGTGATGGTAATACGGCATTGGCATATAATGCTAACGTTATTTCTACCACAGGTGGCACCATTAATGGTACTTTAAACGTTAATGGTACTTTACGTCGCAATGGGCGTGATGTTGTTACTATCGACCAATTAAAAGATTCTGTCCCTATCGGTACTATTGTTATGTGGGGAGGCCAGATTAATAATATTCCTGCGGGTTGGGCTGTTTGTGATGGCGGTAACTCCGGTGAACAAGGATTCCGAAATGTAGTTGGTAGTAAATGGGGTTCAACCGGAGCACGTCCTGACTTCCGTGGATTATATCCTAGAGGCGCAAACCAGACTAATGATGGTGGATTAAAAGAATGGGACGCTAACGTCAGAATTAGAGATGCTAAAGGTAATGATGCTAAAGGTAAACCTAAATTAGGTGTAGGCTGTGGTTCATATGGTCACGGCACTGTACAAGCCCAGCAACTTCGTTTCCATAAACACGCCGGCGGCTTTGGTGAACACGACAATGCGGGTGCATTTGGTAATACCGTTCGTTCTAATTTTGCCGGCACTCGTAAAGGCCTGGACTGGGATAACCGTTCTTACTTTACAAACGAAGGATACGAAATTGATGGTTCAGGTTCACGAGATTCAAGAACTACTCTAAACAGTGAAGGGTTAATTGGTAACGAAAACCGTCCTTGGACTATGTCCATCCTATTCATTATTAAAGTTGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
02d08d41c8a81aa9b825f18cd52af1886f1f1a1e099d01462d4fcadcd42da1be
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6678
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50