GenBank accession
YP_004327377.1 [GenBank]
Protein name
structural protein with Ig domain
RBP type
TF: evidence GenBank, probability 1.00
TF: evidence RBPdetect, probability 0.82
Protein sequence
MPTITVLVAPEVVRNKPETERNHVVTGVAKGWQKTSLNQDPDEILTECKGLDALLTKSNLQADGVTKVDPTKPVGFQVSYEIHDPTAVLTTGLAITPATASGEIGQIVELLATVAPANATYQGVNWYSGDLTKAIHIGGGKFKLLQSGSVTVYGVTVEGNHTDSTVITVAGLLSLTTDLAASQDVADGADATFTIVAAGGTAPYSYVWYYSDTPGGEGVVIDAGVNPTAATASLVNHAVTAASEGEYWCVVEDADGHSVTSARCELAVV
Physico-chemical properties
protein length: 269 AA
molecular weight: 27817.73730 Da
isoelectric point: 4.48064
aromaticity: 0.05948
hydropathy: 0.10186
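
The record does not say how these values were derived. The sketch below assumes aromaticity is the fraction of Phe, Trp, and Tyr residues and hydropathy is the mean Kyte-Doolittle score (GRAVY); both definitions are assumptions, and the function names are illustrative. Under that definition, the listed aromaticity of 0.05948 would correspond to 16 aromatic residues out of 269.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp, or Tyr (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Illustrative input: the first 20 residues of the protein sequence above.
# Pass the full sequence to compare against the values listed in the record.
fragment = "MPTITVLVAPEVVRNKPETE"
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Molecular weight and isoelectric point need residue masses and pKa tables, so they are left out of this sketch.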

Domains

Domains [InterPro]
DC_0078 STR 1–195
G3DSA:2.60.40.1080 STR 86–170
DC_2066 RBD 166–269
IPR036179 RBD 181–268
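
As a minimal sketch, the four domain annotations above can be held in a small structure to check that each range fits inside the 269-residue protein and to see how much of the sequence each domain type covers. The `Domain` class and `covered` helper are hypothetical names introduced for illustration only.

```python
from typing import NamedTuple, Optional


class Domain(NamedTuple):
    source_id: str  # identifier as listed in the record
    kind: str       # domain type code, e.g. STR or RBD
    start: int      # first residue, 1-based inclusive
    end: int        # last residue, 1-based inclusive


PROTEIN_LENGTH = 269
DOMAINS = [
    Domain("DC_0078", "STR", 1, 195),
    Domain("G3DSA:2.60.40.1080", "STR", 86, 170),
    Domain("DC_2066", "RBD", 166, 269),
    Domain("IPR036179", "RBD", 181, 268),
]


def span(d: Domain) -> int:
    """Number of residues in an inclusive 1-based range."""
    return d.end - d.start + 1


def covered(domains, kind: Optional[str] = None) -> set:
    """Residue positions covered by domains of the given kind (all kinds if None)."""
    positions = set()
    for d in domains:
        if kind is None or d.kind == kind:
            positions.update(range(d.start, d.end + 1))
    return positions


for d in DOMAINS:
    in_bounds = 1 <= d.start <= d.end <= PROTEIN_LENGTH
    print(d.source_id, d.kind, span(d), in_bounds)
print(len(covered(DOMAINS, "STR")), len(covered(DOMAINS, "RBD")), len(covered(DOMAINS)))
```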
Architecture: YP_004327377.1 (1–269)
STR 1–269
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage
Name: Salmonella phage Vi01 [NCBI]
Taxonomy ID: 2991283
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Salmonella enterica subsp. enterica serovar Typhi [NCBI]
Taxonomy ID: 90370
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

GenBank protein accession
YP_004327377.1 [NCBI]
GenBank nucleotide accession
NC_015296 [NCBI]
CDS location
range 5503 -> 6312
strand +
CDS
ATGCCAACAATCACAGTATTAGTCGCGCCGGAAGTTGTCCGCAACAAACCGGAAACCGAACGCAATCATGTCGTGACGGGTGTTGCAAAGGGTTGGCAAAAAACCAGCCTAAATCAAGATCCTGATGAAATCTTGACCGAATGTAAAGGTCTTGATGCATTGCTGACCAAAAGCAATCTGCAAGCGGACGGTGTCACCAAAGTTGATCCGACCAAGCCAGTAGGTTTCCAGGTTTCTTATGAAATCCATGATCCGACTGCTGTTCTGACCACGGGTCTGGCCATTACTCCGGCTACTGCCAGCGGGGAAATAGGCCAAATCGTTGAATTGCTGGCCACTGTTGCCCCTGCGAACGCAACATATCAAGGGGTTAACTGGTATTCTGGCGACCTGACGAAGGCGATTCATATCGGCGGTGGTAAATTCAAACTGCTTCAGTCTGGTTCTGTTACAGTTTATGGTGTTACTGTAGAAGGAAATCACACCGATTCCACCGTCATTACCGTCGCAGGGTTGTTGTCGTTGACAACCGATCTGGCTGCTTCTCAGGACGTGGCCGACGGCGCTGATGCGACATTCACTATCGTTGCTGCTGGCGGCACCGCGCCGTACTCTTATGTGTGGTATTACTCTGACACCCCTGGCGGCGAAGGTGTGGTGATTGATGCTGGTGTTAACCCAACAGCAGCGACTGCGTCTCTGGTCAACCACGCTGTCACCGCTGCTTCTGAGGGTGAGTACTGGTGTGTTGTAGAAGACGCGGACGGCCATTCTGTAACTTCCGCCCGTTGTGAACTGGCTGTGGTGTAA
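
The CDS coordinates and the protein length can be cross-checked with a short sketch. It assumes the range is 1-based and inclusive and that the standard codon assignments apply; neither is stated in the record, and the helper names are illustrative.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(5503, 6312)
print(n_nt, n_nt // 3 - 1)  # 810 nt, 269 residues once the stop codon is excluded

# First 33 nucleotides of the CDS above.
prefix = "ATGCCAACAATCACAGTATTAGTCGCGCCGGAA"
print(translate(prefix))  # MPTITVLVAPE
```

The 810-nucleotide span gives 270 codons, which matches the 269-residue protein plus one stop codon, and the translated prefix matches the start of the listed protein sequence.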

Genome Context

Tertiary structure

PDB ID: 3760fa67107f8ed08a46ef51e62cd7f1010ac4e31650f7c0b22323957d5f01e7
Source: ESMFold
Method: ESMFold
Resolution: 0.7293
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
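
The confidence bands can be expressed as a small lookup, sketched here. The record leaves the exact boundary values (90, 70, 50) unassigned, so the handling below is an assumption: 90 and 70 fall into the lower band and 50 counts as Low. The function name is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to its confidence band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt >= 50:  # assumption: exactly 50 is treated as Low
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```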