GenBank accession
XIF73150.1 [GenBank]
Protein name
tail protein
RBP type
TF (Evidence: GenBank, Probability: 1.00)
TF (Evidence: RBPdetect, Probability: 0.80)
Protein sequence
MPTITVLVAPEVVRNKPETERLHTVTGTARGWEKTSLNQDPDEILTECKGLDALLTKSNLQADGVTKVDPTKPVGFQLSYEIHDPSAVLTTGLTITPETANGEVGQVVELKAVVAPENATYQGVNWYSGDLTKAVHIGGGKFKLLAPGSVTVYGVTIEGDHTDSTVITVAGALSLSTDLAASQDVTTGESATFTIAATGGTTPYKHTWYFSDVPGGEGEVIDAGTNPTAATASLVITAVAAENEGEYWCVVEDADGHSVTSTRCEMAAV
Physico‐chemical properties
protein length: 269 AA
molecular weight: 28052.86480 Da
isoelectric point: 4.46086
aromaticity: 0.05576
hydropathy: -0.06691
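
The aromaticity and hydropathy values above can be approximately reproduced from the protein sequence. The sketch below assumes that aromaticity is the fraction of Phe, Trp and Tyr residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, so both are assumptions, and the function names are illustrative. Under the first assumption, the reported 0.05576 is consistent with 15 aromatic residues out of 269.

```python
# Kyte-Doolittle hydropathy scale (ASSUMED to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (ASSUMED definition: Phe, Trp, Tyr).
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Return the fraction of residues that are Phe, Trp or Tyr."""
    seq = seq.upper()
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the record's sequence, used only as a short demo;
    # pass the full sequence to compare against the values listed above.
    fragment = "MPTITVLVAP"
    print(len(fragment), aromaticity(fragment), gravy(fragment))
```

Passing the full protein sequence to `aromaticity` and `gravy` gives values that can be compared with the listed ones; a mismatch would suggest the record uses different definitions.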

Domains

Domains [InterPro]
Domain               Type  Residues
DC_0078              STR   1–192
G3DSA:2.60.40.1080   STR   86–170
IPR008964            RBD   88–163
IPR003343            STR   91–158
IPR003599            STR   180–269
IPR036179            RBD   181–263
PF13927              STR   182–252
cd00096              STR   191–262
Sequence XIF73150.1: residues 1–269
Architecture
STR 1–269
Legend (domain types): ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped

Taxonomy

Phage
Name: Salmonella phage vB_CECAV_050 [NCBI]
Taxonomy ID: 3373739
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Salmonella enterica subsp. enterica serovar Copenhagen [NCBI]
Taxonomy ID: 486990
Lineage: Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Salmonella > Salmonella enterica

Coding sequence (CDS)
GenBank protein accession
XIF73150.1 [NCBI]
GenBank nucleotide accession
PQ306799 [NCBI]
CDS location
range 5406 -> 6215
strand +
CDS
ATGCCAACGATCACTGTATTAGTAGCACCGGAAGTTGTTCGCAACAAACCGGAAACTGAGCGTCTTCACACTGTAACAGGTACTGCTCGTGGTTGGGAAAAGACCAGTCTTAACCAAGACCCAGACGAGATCCTGACTGAGTGTAAAGGTCTGGATGCACTGCTGACCAAGAGCAATTTGCAGGCCGATGGCGTCACCAAAGTGGATCCAACCAAGCCCGTTGGCTTCCAGCTGTCTTATGAAATCCACGACCCAAGCGCTGTGCTGACCACTGGCCTGACCATCACCCCTGAAACTGCTAATGGTGAAGTCGGTCAGGTTGTTGAACTGAAGGCGGTCGTCGCTCCGGAGAACGCGACCTACCAGGGTGTCAACTGGTACTCTGGGGATCTGACCAAAGCTGTGCACATCGGTGGTGGTAAATTCAAGCTGCTGGCCCCTGGCTCGGTAACAGTCTATGGCGTGACCATCGAAGGTGACCACACCGACTCTACCGTCATTACAGTGGCAGGTGCTCTGTCTCTGTCTACCGATCTGGCTGCTTCTCAGGATGTAACAACAGGTGAATCCGCAACCTTCACCATCGCTGCTACAGGCGGCACTACTCCATACAAACATACTTGGTACTTCTCTGATGTCCCTGGTGGTGAAGGTGAGGTGATCGATGCTGGCACTAACCCCACTGCTGCAACTGCGAGCCTGGTTATCACTGCAGTTGCTGCCGAAAACGAGGGTGAGTACTGGTGTGTTGTTGAAGACGCTGACGGCCACTCTGTGACCTCTACTCGTTGCGAAATGGCTGCGGTCTAA
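
The CDS location and the protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends with one stop codon that is not translated; the function names are illustrative, not part of any database API. With the range 5406 -> 6215 this gives 810 nt, i.e. 270 codons, which matches the listed 269 AA plus a stop codon.

```python
# Standard stop codons (ASSUMES the standard / bacterial genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_nt: int) -> int:
    """Residues encoded by a CDS of cds_nt nucleotides, excluding the stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_nt // 3 - 1


def looks_like_complete_orf(cds: str) -> bool:
    """Rough check: whole codons, ATG start, terminal stop codon.

    Note: some bacterial and phage genes start with GTG or TTG, so a False
    result here does not by itself mean the annotation is wrong.
    """
    cds = cds.upper()
    return (
        len(cds) >= 6
        and len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    n = cds_length(5406, 6215)
    print(n, expected_protein_length(n))
```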

Genome Context

Tertiary structure

PDB ID
abadff59799dd4ec0d2be5654e749ba1a73fc8060c836718b13a3829123a0481
Source ESMFold
Method ESMFold
Resolution 0.7240
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
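
The confidence bands listed above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: `plddt_band` is a hypothetical name, scores are assumed to be on a 0 to 100 scale, and because the legend uses strict inequalities, the handling of the exact boundary values (90, 70, 50) is an assumption; here they fall into the lower band.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (assumed 0-100 scale) to the legend's confidence band.

    Boundary values 90, 70 and 50 are assigned to the lower band; the legend
    does not specify this, so it is an assumption of this sketch.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Illustrative scores only, not taken from the model in this record.
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```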