UniProt accession
A0AAF0C0E1 [UniProt]
Protein name
Tail fiber protein
RBP type
Type | Evidence | Probability
TF | UniProt/TrEMBL | 1.00
TF | GenBank | 1.00
TF | Phold | 1.00
TF | RBPdetect | 0.90
TF | RBPdetect2 | 0.90
Protein sequence
MSRNLMPKSGAMAPYVVVNRDAAVAGVFSIDGEAGAVVLTSKYLQITKYTTDKTATDAAIKSINDSIGNINTALGGINTTLATKAAKGANNDITELNALTKAITIAQGGTGAATLENARTNLQVERLRQQDNGTFITSPDGKYSLFIYNSGDFGLIDSASAAVSAMKVAFGGTGGTTPKTARKGLNVPVGALAEIVPDGDNVLNYVAVAGQSGYYSSGELVVNGPPKQEGWWTYNFHCHGVDINGAAQYGVLRAVGLSGSSWINVLDGTGNWRGWQEQFNAQTTVQLSNGGTGATSFEGARTNLGIDRFKQSATETMMYAPGVGYRITARPNGEWGMWRDDTGGWIPLSITAGGTGANDASQARKNLGLTEWFVLTSIDQAYPNGFNFDTISVNCKYTVPPSSGSNIVGTRPFQQISHWEDAWFFLETLIHNDSRYRMQRATLFTGAWKNSVAVRTMDNNTWSPWIPVKDAFGSRGNTVGNRVTFLSVGTVSGDAAGLIGGFSRIR
Physico-chemical properties
protein length: 506 AA
molecular weight: 53821.42200 Da
isoelectric point: 8.33739
aromaticity: 0.09486
hydropathy: -0.20968
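The record does not say how aromaticity and hydropathy were computed. The values are at least consistent with two common definitions: 0.09486 × 506 ≈ 48 residues, which fits aromaticity as the fraction of F, W, and Y, and -0.20968 × 506 ≈ -106.1, which fits hydropathy as the mean Kyte-Doolittle score (GRAVY). The sketch below implements those two definitions under that assumption; the function names are illustrative, and the input is only the first eight residues of the sequence above.

```python
# Kyte-Doolittle hydropathy scale; assumed (not confirmed) to be the scale
# behind the "hydropathy" field of this record.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Toy input: the first eight residues of the protein sequence above.
prefix = "MSRNLMPK"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 4))
```

Passing the full 506-residue sequence to the same functions is the natural way to check the assumption against the listed values.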

Domains [InterPro]

Domain ID | Type | Residues
DC_1346 | ATT | 1–165
DC_1346 | ATT | 148–347
A0AAF0C0E1: residues 1–506
Architecture
ATT 1-347 | STR 348-495
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
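The architecture line appears to collapse the two overlapping ATT hits (1–165 and 148–347) into a single ATT block spanning 1-347. A minimal sketch of that merge is shown below, assuming 1-based inclusive coordinates; `merge_ranges` is a hypothetical helper, not the database's own code. It also counts how many of the 506 residues fall outside the ATT and STR blocks.

```python
def merge_ranges(ranges):
    """Merge overlapping or adjacent 1-based inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous block: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


att_hits = [(1, 165), (148, 347)]   # the two DC_1346 ATT hits
att = merge_ranges(att_hits)
str_block = (348, 495)              # STR block from the architecture line
protein_length = 506

covered = sum(end - start + 1 for start, end in att + [str_block])
unmapped = protein_length - covered
print(att, covered, unmapped)
```

Under these assumptions the merged ATT block is 1-347, and the 11 residues after position 495 are not assigned to any block.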

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Salmonella phage GSW6 [NCBI] | 3025422 | Uroviricota > Caudoviricetes > Demerecviridae > Epseptimavirus > Epseptimavirus GSW6
Host | Salmonella enterica subsp. enterica serovar Infantis [NCBI] | 595 | Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Salmonella > Salmonella enterica

Coding sequence (CDS)

GenBank protein accession
WCX68662.1 [NCBI]
GenBank nucleotide accession
OQ362005 [NCBI]
CDS location
range 27206 -> 28726
strand -
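The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends with a single stop codon; `cds_length` is an illustrative helper name.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


start, end = 27206, 28726
n_nt = cds_length(start, end)
codons, remainder = divmod(n_nt, 3)
residues = codons - 1  # assumes the final codon is a stop codon

print(n_nt, codons, remainder, residues)
```

With these coordinates the range covers 1521 nt, or 507 codons with no remainder, which matches the 506 AA protein plus one stop codon.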
CDS
ATGTCACGTAATTTAATGCCTAAATCTGGCGCAATGGCGCCTTACGTTGTAGTTAATAGAGATGCTGCAGTTGCTGGTGTTTTCTCTATTGATGGAGAGGCTGGTGCTGTTGTACTAACTTCCAAATATCTACAAATTACAAAATATACTACTGATAAAACTGCTACTGATGCCGCTATCAAGAGTATTAACGATTCAATTGGTAATATCAATACTGCTCTTGGGGGTATTAATACTACTTTAGCTACTAAAGCTGCAAAAGGTGCTAATAACGATATTACAGAACTAAATGCACTAACAAAAGCTATTACTATTGCTCAAGGCGGTACAGGGGCAGCAACGCTAGAAAATGCTAGAACAAATCTACAAGTAGAACGCCTGCGTCAGCAAGATAATGGAACTTTTATTACCTCTCCTGATGGTAAATATTCTTTATTTATCTATAATAGCGGAGATTTTGGTTTAATTGATAGTGCTAGTGCTGCTGTTAGTGCTATGAAAGTTGCTTTTGGTGGTACGGGTGGAACTACTCCTAAAACTGCAAGAAAAGGTCTTAACGTCCCTGTTGGTGCCTTGGCTGAGATAGTGCCCGATGGAGATAACGTATTAAACTATGTAGCTGTTGCTGGCCAGAGTGGGTATTACTCTTCCGGGGAATTAGTAGTTAATGGACCTCCCAAACAAGAGGGTTGGTGGACTTACAACTTCCATTGTCATGGTGTGGACATAAATGGTGCTGCTCAGTACGGGGTATTACGTGCCGTGGGCTTATCCGGCAGTTCTTGGATTAATGTTCTAGATGGCACTGGTAACTGGAGGGGCTGGCAAGAACAGTTTAATGCCCAAACTACTGTTCAGCTTTCTAATGGTGGTACGGGTGCTACTTCGTTCGAGGGTGCTAGAACCAATTTAGGTATCGATAGATTTAAACAATCTGCCACAGAAACTATGATGTATGCTCCGGGAGTTGGATATAGAATCACTGCAAGACCCAATGGAGAGTGGGGGATGTGGCGTGATGATACTGGTGGATGGATTCCTTTATCAATTACTGCTGGTGGTACGGGTGCTAATGATGCGTCACAAGCTAGAAAAAATCTTGGACTAACAGAATGGTTTGTATTGACCTCTATTGATCAGGCATACCCAAATGGATTTAACTTTGACACTATTAGTGTAAACTGTAAATATACAGTTCCTCCTTCTTCTGGCTCCAATATTGTTGGTACCAGACCTTTCCAACAGATTTCTCACTGGGAAGATGCATGGTTCTTCTTAGAGACTTTAATACACAATGATTCTAGGTATAGGATGCAACGTGCTACATTGTTTACTGGTGCTTGGAAGAATTCAGTAGCTGTACGTACCATGGATAATAATACTTGGAGTCCTTGGATACCTGTTAAGGATGCTTTTGGATCCCGAGGCAACACTGTAGGTAATAGGGTTACTTTTCTATCCGTAGGAACGGTATCCGGAGATGCTGCTGGTTTAATAGGGGGGTTCAGTAGAATCAGGTAG
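Although the CDS lies on the minus strand, the sequence above is already shown in coding orientation: it starts with ATG and ends with the stop codon TAG. The sketch below translates a stretch of it with the standard codon assignments, which are assumed here to apply to this phage; `translate` is an illustrative helper, and only the first 39 nt and the last 12 nt of the CDS are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCACGTAATTTAATGCCTAAATCTGGCGCAATGGCG"  # first 13 codons
cds_end = "AGAATCAGGTAG"                               # last 4 codons
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first 13 residues (MSRNLMPKSGAMA) and the last three residues (RIR) of the protein sequence listed in this record.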

Genome Context

Tertiary structure

PDB ID
58a6104365057d8fa2d0cf1b2fab173b9860c34352e35bd65707ee5d701aef74
Source: ESMFold
Method: ESMFold
Resolution: 0.7909
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
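The confidence bands can be applied to per-residue pLDDT scores with a small lookup. The sketch below is an illustration only: `plddt_band` is a hypothetical helper, scores are assumed to be on a 0-100 scale, and because the legend uses strict inequalities, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100 scale) to the confidence band used above."""
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Hypothetical example scores, not taken from this model.
for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```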