GenBank accession: WDS60755.1 [GenBank]
Protein name: tail spike protein
RBP type: TF
Evidence: GenBank
Probability: 1.00
Protein sequence
MITVLNANGQTVAHFVNNVSEGVPYFEPTLTENIETLVSTFSFSVPLDCDESQYLKGLNKVLVKDKDGDLRQFNIIHTEEVFQEVDSRILVECEDFSISEMNDTVIYPFNGHNLGDTLTKAVQGTGWGVEYAADTWQEGEEPFILAEYTNMREVFGNIQKTYDVDFKFTAERTAFNQTKRIVKVYKNRGLQTGRYFTYDRDVLGITRDVQYDTIKTAILPYYTGVDGKVWTLKGMVPVNPIEGITKDKESPLVVHNQAHADYDEPFFFKAMPFKASSTNPEQVYRQGVEELLKHIAPIYTYTVNVILLNRVQGWEGETLALGDTVWMKERVGSREIGLEARVIEYIYHEDDPSLDEVTFTNFREIDTYDTSDIAGIRDALNDLKDQVDSNTVIIETTREQISKLEEGQSGIIEDLGNKNSISIGDTPKPNPIDGDTWFSTRVNEAGQTIHEIKVWDGVEKVWKLSMDTSKAFEAEDTAKAAQKDAEESLDKANQAVADADTAKTAAQEALDRYNNLMISGRNLALNSQKITVPDTGAGTTARRKTIPLSIPSKLNTDYKLKFKYKLTEGTLPEGITVGIYNVSKLSWASNIVTIPTDGKDEGELLAELTTNATEGDALLIYQGIRSDVKNGDNFDFTEVYLVEGDKIGDWQPAPEDAISSITNINGEITSLVTKTDGLETSYSQISQTVDEIQLTVGDKADKSQITQLQDQINLRVEKDDVINQINVSNEGIIIDGAKVQITGKTYIEDAVITDAMISDLSATKLTAGVIDASKINVTNIDASQIKAGTIQGIDIIGSKITNPFEIGSEGYTLDGQTVMERAQVKIDYSVSETGQKGWSILHARGIQNQLLNQNGTINSFSSLASDGLSIQDSLGNSGYLSAELLMQFSNTGKKIYPGNSWVTNTDRIVPSLTMNKCAIGWLFLWQPYDTTGGKPYTWDYTYYLVPKAHANFNSGKGINMRLQGAGKGGGADDTVYKYVYVSNDSITGTANNGTGNGAKWVLTSVFSV
Physico-chemical properties
protein length: 1008 AA
molecular weight: 111350.89510 Da
isoelectric point: 4.61734
aromaticity: 0.08532
hydropathy: -0.37302
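
The aromaticity and hydropathy figures can in principle be recomputed from the protein sequence. The sketch below assumes aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not say which definitions or tools it used, so both are assumptions, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")  # assumed definition of "aromatic" residues


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    demo = "MITVLNANGQ"  # first ten residues of the protein sequence
    print(len(demo), round(aromaticity(demo), 5), round(gravy(demo), 5))
```

Passing the full 1008-residue sequence instead of the ten-residue demo gives values that can be compared with the listed ones. Small differences would point to a different scale or definition.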

Domains [InterPro]

DC_1956 | STR | 1–571
IPR007119 | Unmapped | 34–359
IPR010572 | ENZ | 118–364
WDS60755.1 | 1–1008
Architecture
STR 1-571 | STR 603-1008
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
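
The architecture string can be split into typed residue ranges, which makes it easy to see which parts of the 1008-residue protein carry no annotation. The sketch below assumes the `TYPE start-end | TYPE start-end` layout shown above and 1-based inclusive coordinates. The names `parse_architecture` and `uncovered` are hypothetical helpers and not part of any database API.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'STR 1-571 | STR 603-1008' into (type, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between the coordinates.
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments: list[tuple[str, int, int]], length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    pos = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


if __name__ == "__main__":
    arch = parse_architecture("STR 1-571 | STR 603-1008")
    print(arch, uncovered(arch, 1008))
```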

Taxonomy

Phage
Name: Enterococcus phage vB_EfKS5 [NCBI]
Taxonomy ID: 3025409
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes

Host
Name: Enterococcus faecalis [NCBI]
Taxonomy ID: 1351
Lineage: cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

GenBank protein accession: WDS60755.1 [NCBI]
GenBank nucleotide accession: OQ297175.1 [NCBI]
CDS location: range 33114 -> 36140, strand -
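
The CDS coordinates can be checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes its stop codon. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of encoded residues, optionally excluding a terminal stop codon."""
    nucleotides = cds_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nucleotides // 3 - (1 if has_stop else 0)


if __name__ == "__main__":
    # Coordinates taken from the record: range 33114 -> 36140.
    print(cds_length(33114, 36140), protein_length(33114, 36140))
```

For this record the range spans 3027 nt, i.e. 1009 codons, which corresponds to 1008 residues plus one stop codon.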
CDS
ATGATTACAGTATTAAACGCAAACGGACAAACAGTAGCACACTTTGTCAACAACGTTAGCGAAGGAGTACCATACTTTGAGCCTACTTTGACGGAGAATATAGAGACGTTGGTATCAACGTTCTCTTTCTCTGTTCCTTTAGACTGTGACGAAAGCCAGTATCTAAAAGGCTTGAATAAAGTATTAGTAAAGGATAAGGACGGAGATTTAAGACAGTTTAATATCATTCATACGGAAGAGGTTTTCCAAGAAGTTGACTCACGTATTCTAGTGGAGTGTGAAGACTTCTCAATTAGCGAAATGAATGACACTGTAATCTATCCTTTCAATGGGCACAACTTAGGAGACACATTAACAAAAGCCGTTCAAGGAACTGGTTGGGGAGTTGAGTATGCCGCAGACACTTGGCAAGAAGGAGAAGAGCCTTTCATACTAGCTGAGTATACGAACATGCGAGAAGTGTTTGGAAACATTCAGAAGACGTATGACGTAGACTTTAAATTTACAGCTGAGAGAACAGCCTTCAATCAAACTAAACGGATTGTTAAAGTGTATAAAAACAGAGGTCTTCAAACAGGCCGTTACTTCACCTATGACAGAGACGTTCTAGGAATCACCCGTGACGTGCAGTATGATACAATCAAAACAGCCATCCTACCTTACTACACAGGTGTTGATGGAAAAGTATGGACGCTTAAAGGAATGGTTCCAGTTAACCCTATCGAAGGCATTACCAAAGATAAGGAAAGCCCTTTGGTAGTCCACAATCAAGCTCATGCAGACTATGATGAACCTTTCTTCTTCAAAGCAATGCCTTTCAAAGCTTCATCAACAAACCCTGAGCAAGTATATCGACAAGGTGTAGAAGAGCTTTTAAAACACATTGCTCCAATCTATACTTACACTGTTAATGTAATCCTCCTTAATCGTGTCCAAGGATGGGAAGGGGAAACCTTAGCACTTGGCGATACCGTATGGATGAAAGAGCGTGTAGGCTCTCGTGAAATTGGATTGGAAGCACGTGTTATTGAATACATTTATCACGAAGACGACCCAAGCCTTGATGAAGTAACCTTTACTAACTTCCGTGAAATCGACACTTACGACACCTCTGATATTGCAGGTATCAGAGACGCATTGAATGACCTTAAAGACCAAGTTGATTCTAACACAGTTATTATAGAAACTACTAGAGAGCAGATTAGTAAGCTTGAAGAAGGTCAGTCAGGTATCATTGAAGACTTAGGAAACAAGAACTCAATAAGCATTGGTGACACGCCTAAGCCAAATCCTATTGACGGTGACACATGGTTCTCTACACGTGTTAACGAAGCTGGCCAAACAATCCATGAGATTAAAGTTTGGGACGGCGTTGAGAAAGTATGGAAGCTTTCTATGGATACTTCTAAAGCCTTTGAAGCAGAGGACACCGCTAAGGCTGCTCAGAAAGACGCTGAGGAATCTCTCGACAAGGCTAACCAAGCTGTAGCTGACGCTGACACTGCTAAGACAGCTGCTCAAGAAGCTTTAGATAGATACAACAACCTTATGATAAGCGGTCGAAACCTTGCCCTTAACTCTCAAAAGATTACCGTGCCTGACACAGGAGCTGGTACAACAGCTCGTAGGAAAACTATTCCATTATCGATTCCATCCAAGTTAAACACAGACTATAAACTGAAGTTTAAGTATAAGCTTACTGAAGGTACTTTGCCAGAAGGTATCACAGTTGGTATCTATAACGTTTCTAAGCTATCGTGGGCATCTAATATAGTTACTATCCCTACTGATGGTAAAGATGAAGGAGAGCTGTTAGCAGAGCTTACTACAAACGCTACTGAAGGAGACGCCTTACTTATCTATCAAGGAATTAGAAGCGATGTTAAGAATGGTGATAACTTTGACTTTACAGAAGTATACCTTGTCGAAGGAGATAAGATTGGAGATTGGCAACCAGCTCCTGAAGACGCTATCTCAAGCATTACGAACATCAATGGTGA
AATCACTTCCCTAGTAACTAAGACAGATGGATTGGAAACAAGCTATAGTCAAATCTCACAAACAGTTGATGAAATCCAATTGACAGTCGGGGACAAAGCTGATAAGAGTCAAATCACACAGCTTCAAGACCAGATTAACCTACGTGTCGAGAAAGATGATGTAATCAACCAGATTAATGTCTCTAATGAAGGTATTATCATCGATGGAGCAAAGGTTCAGATTACAGGTAAGACTTACATCGAAGACGCTGTTATCACAGACGCCATGATTAGTGACTTATCAGCTACTAAGTTAACAGCAGGTGTTATTGACGCTTCTAAAATCAATGTAACAAACATTGACGCAAGCCAAATCAAAGCTGGTACTATCCAAGGTATTGACATAATCGGTTCTAAGATTACTAACCCATTTGAAATAGGCTCAGAAGGATATACACTAGATGGTCAAACTGTTATGGAAAGAGCGCAGGTTAAAATTGACTACAGCGTTTCAGAAACTGGTCAAAAAGGTTGGTCAATACTTCATGCAAGAGGTATTCAAAATCAGCTGCTTAACCAAAACGGTACCATTAACTCGTTTTCATCTCTTGCTTCAGATGGATTATCTATTCAAGATTCTTTAGGAAATAGTGGTTACTTATCAGCCGAGTTGCTAATGCAGTTCTCCAACACTGGCAAGAAGATTTACCCCGGAAATTCATGGGTAACTAACACTGATAGAATTGTACCATCATTAACAATGAACAAATGCGCTATTGGATGGCTGTTCTTATGGCAACCTTACGATACTACTGGTGGTAAGCCTTATACATGGGACTATACTTACTATTTAGTTCCTAAAGCTCATGCTAACTTTAACAGTGGTAAAGGTATTAATATGCGACTACAAGGCGCAGGTAAAGGTGGAGGAGCAGATGATACTGTGTATAAGTATGTATACGTATCTAATGACTCTATAACAGGAACGGCCAATAATGGTACTGGGAATGGAGCTAAATGGGTTCTTACCAGTGTATTTTCAGTTTAA
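
The CDS is listed in coding orientation (it starts with ATG and ends with TAA) although the feature lies on the minus strand, so it can be translated directly without reverse-complementing. The sketch below builds the standard codon table from the conventional TCAG ordering and translates up to the first stop codon. It assumes the standard codon assignments, which bacterial translation table 11 shares with the standard code, and `translate` is a hypothetical helper rather than a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS shown above.
    print(translate("ATGATTACAGTATTAAACGCA"))
```

Translating the full CDS and comparing the result with the listed protein sequence is a quick consistency check between the two fields.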

Genome Context

Tertiary structure

PDB ID: a77afdb8e30f4fb59bc9de6b07c04baee5c800bdca7a772142da2878975c8a6b
Source: ESMFold
Method: ESMFold
Resolution: 0.7105
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
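
The confidence legend maps a per-residue pLDDT score to one of four classes. The sketch below is one way to encode it. The legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) undefined, so assigning each boundary to the lower class is an assumption, as is the function name.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence class used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 falls in "High"
        return "High"
    if score > 50:  # assumption: exactly 70 falls in "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_class(value))
```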