Genbank accession
UJD18031.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,90
Protein sequence
MTETTPEVVPIRVRHKRMPASEWARSDFVLYDGELGVESDTGKVKVGNGSDYFSALQYLTGPKGDRGETGPVGPKGADGVMRFENLTSQQREGLRGDNGHSLNANVRIEGSYRNGATSQLNLIADVYYDGTRLTSGYTVDYYYRGFGNNNWQSLLNQTPDANGKFGQWNASQRSGGWLEVYIVVTHNGIKAAASTRLDNVSDGARGANGATGPAGPAGPAGARGADGAPGQNIINQNSGQPMKYWSGTRAQYDAIPNKDSNTIYDIYSSS
Physico‐chemical
properties
protein length:270 AA
molecular weight: 28973,27880 Da
isoelectric point:6,31523
aromaticity:0,09259
hydropathy:-0,70926

Domains

Domains [InterPro]
SSF69349
STR
11–133
IPR041352
ATT
13–51
IPR056923
RBD
242–267
UJD18031.1
1 270
Architecture
ATT
STR
STR
ATT 11-51 | STR 52-133 | STR 147-270
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage MissE1
[NCBI]
2911158 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Streptococcus mitis
[NCBI]
28037 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
UJD18031.1 [NCBI]
Genbank nucleotide accession
OL774874.1 [NCBI]
CDS location
range 6231 -> 7043
strand -
CDS
ATGACGGAAACAACGCCTGAAGTAGTACCTATCAGGGTACGACATAAACGGATGCCAGCTAGTGAGTGGGCAAGAAGTGACTTTGTGTTGTATGACGGAGAGCTAGGGGTAGAAAGCGACACTGGTAAGGTCAAGGTCGGAAATGGTAGTGACTATTTCTCAGCTCTTCAATATCTGACTGGTCCTAAAGGTGACCGTGGAGAAACAGGACCAGTAGGGCCAAAAGGAGCGGACGGTGTTATGCGATTCGAGAACCTGACAAGTCAGCAGAGAGAGGGCTTGCGAGGAGATAACGGTCATAGCTTAAATGCCAATGTTCGTATTGAGGGAAGCTATCGAAACGGTGCGACTAGTCAGTTGAATTTGATCGCAGACGTCTACTATGACGGAACACGGTTAACTAGTGGCTATACTGTTGATTACTACTATCGAGGTTTTGGAAATAACAACTGGCAAAGCTTGCTAAACCAAACGCCTGATGCAAATGGAAAGTTTGGCCAGTGGAATGCTTCTCAGCGTTCAGGAGGCTGGCTTGAGGTCTACATCGTTGTAACGCACAACGGCATTAAAGCAGCTGCTAGCACACGACTTGATAATGTCAGCGACGGTGCAAGGGGGGCGAATGGTGCAACGGGTCCAGCAGGACCGGCAGGACCGGCAGGAGCTAGAGGAGCGGATGGAGCGCCAGGGCAAAACATCATCAATCAAAACAGTGGGCAACCGATGAAGTATTGGTCTGGTACAAGGGCTCAATATGACGCGATTCCTAACAAGGATAGCAATACTATCTATGACATCTATAGTAGTTCGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
1e5a03740806095b6be0ea077814d033b0e487f10c9f8dea0e3f4630ca34158e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8316
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50