GenBank accession
YP_516200.1 [GenBank]
Protein name
tail spike protein with colonic acid degradation activity
RBP type
TSP
Evidence RBPdetect
Probability 0.87
TF
Evidence RBPdetect2
Probability 0.79
TF
Evidence Phold
Probability 1.00
Protein sequence
MADITPNIVVSMPSQIFTQRREFKSCSNGKIYIGKIDTDPTIAENQIPVYLENEDGKYTRVAQPLIIGTGGYPVYNGQIAKFVTEEGHAMAVYDGDGVQQFYFPNVLKYEPKKAIKELYDALDTKFDKTGGTVNGSLHVTQALHVGGNDSGLRANGDGNVAFYAKNAKVAAWNQDKLHWLNDVEIDGMLKAKAATFGGNVTVSWGGRTATFHENGDVAGPIWGGALSGYLHNTFVTNMRLTGIAWTGNIGNGEHLWHNGGVIVGIQSTANYATNLRVAVRYLQKWVGGTWVGIVSD
Physico-chemical properties
protein length: 296 AA
molecular weight: 32131.73900 Da
isoelectric point: 6.44500
aromaticity: 0.10473
hydropathy: -0.23885
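The aromaticity and hydropathy values listed here are per-residue averages over the protein sequence. The following is a minimal sketch of how such values are commonly computed, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which tool or scale produced its numbers, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Under this definition, an aromaticity of 0.10473 over 296 residues corresponds to about 31 aromatic residues. Both functions expect a non-empty sequence made up of the 20 standard one-letter amino acid codes.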

Domains

Domains [InterPro]
YP_516200.1: residues 1-296
Legend (annotation sources): Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other

Taxonomy

Phage
Name: Sodalis phage phiSG1 [NCBI]
Taxonomy ID: 373126
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Sodalis glossinidius str. 'morsitans' [NCBI]
Taxonomy ID: 343509
Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Sodalis

Coding sequence (CDS)

GenBank protein accession
YP_516200.1 [NCBI]
GenBank nucleotide accession
NC_007902.1 [NCBI]
CDS location
range 21687 -> 22577
strand +
CDS
ATGGCTGATATTACGCCAAACATTGTCGTATCCATGCCGTCACAAATTTTCACCCAGCGCCGGGAATTTAAGTCCTGTAGCAACGGCAAAATTTACATCGGCAAAATCGACACCGACCCCACAATAGCTGAAAATCAAATACCCGTTTATCTGGAAAATGAAGACGGAAAATATACCCGAGTCGCCCAGCCACTTATCATCGGCACCGGCGGCTACCCGGTCTACAACGGGCAGATTGCCAAATTCGTCACTGAAGAAGGCCACGCTATGGCGGTGTATGACGGCGATGGCGTACAGCAGTTCTATTTCCCCAACGTGCTGAAATACGAGCCGAAGAAAGCAATAAAAGAACTGTATGACGCGCTAGACACGAAATTCGATAAAACTGGCGGCACCGTTAATGGCAGCCTGCATGTTACCCAGGCATTGCATGTTGGGGGTAATGACTCGGGGTTGCGCGCCAATGGCGACGGCAACGTGGCGTTTTATGCTAAAAATGCCAAGGTGGCAGCGTGGAATCAAGACAAGCTGCACTGGTTAAACGATGTCGAAATAGATGGCATGCTGAAGGCAAAAGCAGCGACATTTGGCGGCAATGTCACCGTCTCATGGGGCGGCAGAACTGCAACCTTTCACGAAAACGGAGATGTGGCGGGTCCCATTTGGGGCGGTGCATTGAGTGGCTATCTCCATAACACCTTCGTCACCAACATGCGTCTGACCGGCATCGCCTGGACGGGGAATATTGGCAATGGTGAACACCTCTGGCACAACGGCGGTGTTATCGTCGGCATTCAGTCAACAGCGAATTACGCCACCAACCTTCGCGTCGCCGTTCGCTACCTTCAAAAATGGGTGGGTGGCACTTGGGTAGGGATTGTTAGTGATTAA
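The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range of 21687 to 22577 spans 891 nucleotides, or 297 codons, which is 296 amino acids plus one stop codon. The sketch below illustrates that check and a plain codon-by-codon translation. It assumes the standard amino acid assignments (which the bacterial genetic code shares for elongation codons); the record itself does not name a translation table, and the helper names are hypothetical.

```python
from itertools import product

# Standard codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a forward-strand CDS codon by codon; '*' marks a stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


n_nt = cds_length(21687, 22577)             # nucleotides covered by the range
n_codons = n_nt // 3                        # includes the terminal stop codon
n_term = translate("ATGGCTGATATTACGCCAAAC") # first seven codons of the CDS
```

Translating the first seven codons of the CDS gives MADITPN, which matches the start of the listed protein sequence.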

Tertiary structure

PDB ID
4d385cd2883a471f4f2bbb0407eaba910b2cb67520f48f2169d96d55247962c0
Source ESMFold
Method ESMFold
Resolution 0.8331
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
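The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows, assuming scores on a 0-100 scale. The legend uses strict inequalities and so leaves the boundary values 90, 70 and 50 unassigned; placing them in the lower band is an assumption made here, and the function name is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100 scale) to the legend's confidence band.

    Boundary values (90, 70, 50) fall into the lower band; this is an
    assumption, since the legend does not say where they belong.
    """
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"
```

For example, `plddt_band(95.0)` returns "Very high" and `plddt_band(60.0)` returns "Low".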