UniProt accession
S5M6S2 [UniProt]
Protein name
Putative tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MSKSSYIGANVEVDSDTFTSFINKVNAVIADMSSTVVTAAPVAQANSTNGASTSGNAHVEGVLSAGVLVAPELRGGTMSVPATMVISTNTNPKANTSVSLGTDTARYTNVFMTNANTRNLFANTSQTNNLTVFTAANVNALDANTADITTLTSNSATITTIVNDTLTSNSATITTLSSNTFTSNSATIATADVGALTANTATVKTGTVTTLTSNSATITTLTSNTITVNTDLTVNRNLSIANNISANSLNVTKDVFIAGNLVVQGVTSLASDQALSVNTSIAEFLTVQQVALFNGNTTIGNAATDSLTITAKVASNIIPSGNTYSLGNTTNKFATAHITSMDSPTFTSVMNQTGAAPHYQLEETDTNTVGRVIVSGGQLYVQAGAANSGTTTSSGIIRFAGLNNADVSTLAVRSGGNWQSIYHQGNDGDGSGLDADLFEGQHGPYYLDAANFTGTLADARLPTSMAGKTFTSEVTATSIRLTSAGDITPTSTNHAFQIGLDSGVNVAINTNEIVCRNNGVFTSLMIPGGVTFDPAVTLSVDNGGTGANTAAGARTNLGLAAVAASGSATDLTTGTLNNARLPSSMTGKTFTSNTKISGMLTINHATSAELRLEMNGILSGRVYRDAGGGLVMRRYNSTTGAAEGYIQIVGNGADDLKYNGSTIWHDGNAALDIASVTGMSGAVMWFARNSAPAGFLKANGAAVSRTTYAALYAAIGTTFGAGDGSTTFNLPDLRGEFIRGWDDGRGIDVSRVFGSAQSANIQSHTHSIDPPSTTTTSDTHTHTWSGTTSSDAHTHTFSGTTNTTGAHTHKITTEDNTNLGSLVQSSAGGTDSTTNTGSAGDHSHTFSGTTSSESHTHTVSGTTSGDTHSHTVDIAAFTSGAAGSGTDTRPRNIALLACIKY
Physico‐chemical
properties
protein length:901 AA
molecular weight: 91994,52650 Da
isoelectric point:5,39012
aromaticity:0,05327
hydropathy:-0,09545

Domains

Domains [InterPro]
DC_1346
ATT
1–274
IPR037053
ATT
672–734
SSF88874
STR
680–901
IPR011083
ATT
681–738
S5M6S2
1 901
Architecture
ATT
STR
ATT
STR
ATT 1-274 | STR 439-671 | ATT 672-738 | STR 739-901
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
S5M6S2
1 901
Domain Start End Length (AA) Confidence
N-terminal 1 57 57 0,8930
Central domain 58 256 200 0,3753
C-terminal 257 901 644 0,9026
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-57
Central
58-256
C-terminal
257-901

Taxonomy

  Name Taxonomy ID Lineage
Phage Sinorhizobium phage phiM12
[NCBI]
1357423 Uroviricota > Caudoviricetes > Emdodecavirus >
Host Sinorhizobium meliloti
[NCBI]
382 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Hyphomicrobiales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AGR47756.1 [NCBI]
Genbank nucleotide accession
KF381361 [NCBI]
CDS location
range 50274 -> 52979
strand -
CDS
ATGTCCAAATCTTCTTACATCGGCGCAAATGTTGAAGTAGATAGCGATACCTTTACCTCGTTTATTAACAAGGTGAACGCTGTCATTGCGGACATGAGTTCGACTGTTGTTACAGCGGCTCCGGTTGCGCAAGCCAATTCCACCAATGGGGCCTCTACATCCGGCAACGCACACGTCGAAGGTGTGTTGTCGGCTGGCGTGCTTGTTGCTCCGGAACTTCGCGGCGGCACCATGAGTGTTCCCGCGACAATGGTCATTTCGACCAACACCAATCCGAAGGCAAACACATCGGTAAGTTTGGGCACGGATACCGCACGATACACGAACGTCTTCATGACGAACGCCAACACGCGCAATCTGTTTGCTAACACTTCTCAAACAAATAACCTTACGGTGTTTACAGCCGCCAACGTCAACGCATTGGATGCGAACACGGCGGACATCACTACACTGACTTCGAACTCGGCAACGATCACAACGATTGTCAACGACACCCTGACTTCGAATTCGGCAACGATCACAACGCTGTCTTCCAACACGTTCACATCGAACTCGGCAACCATTGCGACCGCCGATGTTGGGGCCTTGACGGCTAACACTGCTACGGTAAAAACCGGCACAGTGACGACACTGACTTCGAATTCGGCGACGATCACGACGCTGACTTCTAACACGATCACGGTCAATACAGATTTGACGGTCAACCGCAATCTGTCTATTGCCAACAACATCAGCGCCAACTCTCTGAATGTCACTAAAGACGTCTTTATTGCTGGCAATCTGGTGGTTCAAGGTGTTACCTCGCTGGCATCCGATCAGGCACTTTCGGTCAACACGTCCATCGCCGAATTCCTCACTGTTCAGCAAGTCGCGCTCTTTAACGGCAACACGACCATCGGCAATGCGGCAACCGATTCCTTGACGATCACCGCTAAGGTGGCGAGTAACATCATTCCGTCCGGCAACACATATTCGCTCGGTAACACAACGAATAAGTTTGCGACGGCACACATCACGTCGATGGACTCGCCGACATTTACTTCGGTGATGAACCAAACCGGCGCGGCTCCCCATTATCAGTTGGAAGAAACCGACACCAATACTGTTGGGCGGGTGATCGTATCGGGCGGTCAACTGTACGTTCAGGCTGGTGCCGCCAACTCTGGCACGACCACGAGTTCCGGCATCATTCGTTTTGCTGGTCTTAACAACGCGGACGTTAGCACCCTTGCGGTCCGTTCTGGTGGCAATTGGCAATCGATCTATCACCAAGGCAATGACGGCGACGGCTCCGGTCTTGATGCCGATCTTTTCGAAGGTCAACACGGTCCTTACTATTTGGACGCGGCTAACTTCACCGGTACGCTTGCGGATGCTCGGCTGCCGACGTCGATGGCGGGTAAGACGTTCACGAGCGAAGTTACGGCGACTTCGATCCGATTGACATCGGCGGGCGACATCACCCCAACATCCACCAACCACGCATTTCAAATTGGATTGGATTCCGGTGTCAACGTTGCCATTAACACCAATGAAATTGTATGTCGCAACAATGGTGTCTTTACATCGCTGATGATCCCCGGTGGTGTCACTTTTGACCCCGCAGTCACGCTTTCGGTTGATAATGGCGGTACTGGCGCGAATACGGCGGCTGGCGCACGCACAAACCTTGGTCTTGCTGCGGTTGCGGCTTCCGGCTCGGCAACTGATCTTACCACGGGTACACTGAACAACGCACGTCTTCCGTCTTCCATGACTGGTAAGACGTTCACTTCGAATACCAAAATCAGCGGTATGCTCACAATAAACCATGCCACGTCGGCAGAACTTCGTTTGGAAATGAATGGCATTCTTTCCGGGCGGGTTTATCGTGATGCTGGTGGTGGTCTCGTTATGCGTCGGTATAACTCCACAACCGGTGCTGCCGAAGGTTATATTCAAATCGTTGGCAATGGTGCCGATGATCTGAAATACAATGGTTCTACAATTTGGCATGACGGCAATGCCGCTCTTGATATCGCCAGCGTCACCGGTATGTCTGGTGCGGTCATGTGGTTTGCTCGAAACAGCGCTCCTGCCGGATTTTTGAAGGCAAACGGTGCGGCGGTTTCGCGTACGACCTATGCGGCGCTGTATGCGGCAATCGGTACGACGTTTGGTGCTGGTGACGGCTCCACGACGTTTAACCTTCCCGATCTTCGCGGCGAATTCATCCGTGGTTGGGATGATGGTCGCGGCATTGACGTTAGTCGTGTGTTTGGTTCGGCTCAAAGTGCGAACATTCAGAGCCACACTCACTCTATCGATCCGCCATCAACGACAACTACTTCTGACACTCACACGCACACGTGGTCTGGTACGACGTCGTCTGATGCTCATACTCACACGTTCAGTGGCACGACCAACACCACTGGTGCTCACACTCACAAGATCACGACAGAAGATAACACCAACCTTGGTAGTTTGGTCCAATCGTCTGCCGGTGGCACCGACAGTACCACCAACACTGGGAGTGCTGGTGATCACAGTCACACGTTCTCTGGTACGACGTCGTCTGAATCGCATACCCACACGGTTTCGGGTACGACGTCGGGCGACACCCATAGCCACACAGTTGACATTGCGGCGTTTACGTCTGGTGCTGCCGGTTCGGGCACGGACACCCGTCCGCGAAACATCGCGCTTTTGGCATGTATTAAATATTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
cf7b7104dcec72131c4ffc6a8a1c8872425976e8f4a9df31957ab8bea55e6f48
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5630
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50