GenBank accession
YP_010673012.1 [GenBank]
Protein name
cell adhesion domain protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.89
TSP RBPdetect2 0.83
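The five predictions listed for this protein do not agree on a single RBP type: two sources report TF and three report TSP. As a rough illustration only, the sketch below tallies the rows by type. The choice of summary (count and mean probability per type) is an assumption made for this example, not a scoring scheme described by the record.

```python
from collections import defaultdict

# (type, evidence, probability) rows copied from the record.
predictions = [
    ("TF", "GenBank", 1.00),
    ("TF", "Phold", 1.00),
    ("TSP", "DepoScope", 1.00),
    ("TSP", "RBPdetect", 0.89),
    ("TSP", "RBPdetect2", 0.83),
]

def summarize(rows):
    """Group predictions by RBP type; return {type: (count, mean probability)}."""
    grouped = defaultdict(list)
    for rbp_type, _evidence, prob in rows:
        grouped[rbp_type].append(prob)
    return {t: (len(p), sum(p) / len(p)) for t, p in grouped.items()}

summary = summarize(predictions)
for rbp_type, (count, mean_prob) in summary.items():
    print(f"{rbp_type}: {count} sources, mean probability {mean_prob:.2f}")
```

A plain tally like this treats every source as equally reliable, which may not hold in practice.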
Protein sequence
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKELTPLSFISMPFDYYSAGNSFLVVGTDKKLYKLTDEGLTDISRKVATATKKATATLKIYPVVSSITPKESSVSMTFNKTKVLEVSVTPEDAQNTNLVWSVSNSAYGSIVVDPTDSKRATLTSKAVEGNLVVTVRTADESISTQIAVNIIDGDSGIFLSQDTLTVRKGGTSTLTAITGKPSVTWTSSNPSIVSVTPNSNSLTAVLRASGEGNVTITADNGTKTASCVVSAIPQIDSISLSQENVTMNRGTQYILTATVNPANAPNKAITWTSSNPNIATVSGSSTEATITGLVAGYTQITATTVEGKRTATCEVQVGLASRMARSLSYSITPEAPVEEPVIEKEDVVYFASENTGIDTTGMAEGNNFYDYSNVMDLEGFGRAALLANDPPLSGVTLDIIDASLDVGEEIVLTATASPTGNYSYKWTVDKSGYVSTTNTSSPTLKLTALRKGEVKVTCTVSQMVQKDYDAFEDYPWYHTIISNCAVATTHYETPQVKEFDSEYFVDLPGWGEQTVVDASGNPSVKKYNWKCERIRAFNNRLFALNMRESNASGVTTHYPLRLRWSNFAEENKAPELWDDYAYDRAVSSDLAANIVGQTEALENGYAGYIDLADSNGSLIEVLPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGGHFVVTQNDVILHNGASKKSIASNRVKNMLINEICLVNPIATKVHLHQDKKEVWILYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPHSYCIGLVDPPVLERGPIWADFQEVTWDDPSIDKLVWRKDATNFRQRVTIVGSFLRGFYQVDVGALDYFYDRSNDTVIERPLEMRLERTGIDFDNVTNEWNQKHINRFRPQVTGTGTYMFEAGGSQFSNEYGHNHSTKEFRVGVDRHVSVRLNHPYLFYNVIDNDVNSNASMNGLTIEFAVGGRR
Physico-chemical properties
protein length: 999 AA
molecular weight: 109681.43100 Da
isoelectric point: 4.98406
aromaticity: 0.09409
hydropathy: -0.24334
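The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value, often called GRAVY), is given below. The listed aromaticity is consistent with that assumption, since 94 / 999 rounds to 0.09409. The function names and the short `prefix` input are illustrative; the prefix is only the first residues of the protein sequence, not the whole protein.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")

def aromaticity(seq):
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)

def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)

# First residues of the protein sequence, used only as a small demo input.
prefix = "MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARF"
print(len(prefix), round(aromaticity(prefix), 5), round(gravy(prefix), 5))
```

Running the same functions on the full 999-residue sequence would allow a direct comparison with the values listed in the record.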

Domains [InterPro]
DC_0191 STR 66–999
IPR008964 RBD 210–284
IPR003343 STR 214–284
IPR003343 STR 291–370
Architecture
STR 66–999

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 544 544 0.8533
Central domain 545 760 217 0.2103
C-terminal 761 999 238 0.6147
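If Start and End are read as 1-based inclusive coordinates, the Length column does not equal End - Start + 1 for two of the three rows, although both sets of lengths sum to 999. The sketch below recomputes the spans to make the discrepancy explicit. The inclusive convention is an assumption; the record does not state which column is authoritative or how the boundaries are defined.

```python
# (name, start, end, listed_length) rows copied from the segmentation table.
domains = [
    ("N-terminal", 1, 544, 544),
    ("Central domain", 545, 760, 217),
    ("C-terminal", 761, 999, 238),
]

def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1

# Collect rows whose listed length disagrees with the coordinate span.
mismatches = [
    (name, listed, span_length(start, end))
    for name, start, end, listed in domains
    if span_length(start, end) != listed
]

for name, listed, computed in mismatches:
    print(f"{name}: listed {listed} AA, coordinates give {computed} AA")

total_listed = sum(listed for *_rest, listed in domains)
total_computed = sum(span_length(s, e) for _n, s, e, _l in domains)
print(total_listed, total_computed)
```

One reading that would reconcile the columns is a central domain ending at residue 761, but that is a guess and cannot be confirmed from the record alone.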
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1–544
Central 545–760
C-terminal 761–999

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage EK010 [NCBI] 2742112 Uroviricota > Caudoviricetes > Mktvariviridae > Suseptimavirus > Suseptimavirus EK010
Host Escherichia coli [NCBI] 562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
YP_010673012.1 [NCBI]
GenBank nucleotide accession
NC_070981 [NCBI]
CDS location
range 57600 -> 60599
strand -
CDS
ATGGCTTTATATCCAATAAAATCATTAGGGGCAGTTGGGGTTATCGCTGACCAAGCACCTACTGACTTAGCACCAAATGCTTTCACTAACGCTATTAATGCTCGTTTTGTTGAGCAAAGGGTTTTTAAGACGGGGGGCAATGCCCCTCTTTCTTACGTTGATGAAGATAAGGAGCTTACCCCCCTCTCTTTCATCTCTATGCCTTTTGACTATTATAGCGCTGGCAACAGTTTCCTTGTAGTTGGTACAGATAAGAAGTTGTACAAATTAACTGATGAAGGGCTTACTGACATCAGTCGCAAGGTTGCTACAGCAACTAAGAAAGCAACAGCGACACTTAAGATTTATCCGGTAGTATCTTCTATTACCCCTAAAGAATCCTCTGTTTCTATGACTTTCAATAAAACAAAGGTTTTGGAAGTTAGTGTAACTCCAGAAGATGCACAAAATACTAACCTAGTTTGGTCAGTAAGTAACTCAGCCTATGGCAGTATTGTTGTTGATCCTACAGATTCTAAGAGAGCAACTCTTACTTCTAAAGCTGTTGAGGGTAATTTAGTAGTTACAGTTCGAACGGCAGATGAATCCATTTCTACTCAGATTGCAGTTAACATTATTGATGGAGATTCTGGAATCTTCTTAAGTCAGGATACATTGACAGTGAGAAAAGGAGGAACTTCAACACTTACTGCTATTACAGGAAAACCAAGCGTCACTTGGACAAGCTCTAATCCAAGTATTGTGTCAGTCACTCCTAACTCAAACTCTTTAACCGCAGTGTTGAGGGCTTCAGGAGAAGGTAACGTAACCATTACAGCTGATAATGGGACAAAAACTGCATCTTGTGTAGTTTCTGCAATTCCTCAGATTGATAGCATCTCTTTAAGTCAAGAGAATGTAACAATGAATAGGGGGACTCAGTACATACTAACAGCTACGGTTAATCCAGCCAATGCCCCTAACAAAGCAATCACTTGGACAAGTTCTAACCCCAACATTGCTACTGTTAGTGGTTCTAGTACAGAAGCAACCATTACTGGCCTAGTAGCGGGGTATACTCAAATCACGGCAACAACTGTAGAAGGTAAACGTACCGCTACTTGTGAAGTTCAAGTTGGTTTGGCTTCTAGAATGGCTCGTTCTTTGTCTTACTCTATTACTCCTGAAGCTCCGGTAGAAGAGCCTGTGATTGAGAAAGAAGATGTGGTATATTTTGCATCAGAGAACACTGGAATTGATACAACAGGTATGGCAGAAGGTAACAATTTCTATGACTACTCTAACGTAATGGACCTTGAAGGTTTTGGCAGAGCTGCTCTTCTAGCTAATGACCCTCCTTTAAGTGGCGTGACTTTGGATATTATAGATGCCTCTCTGGATGTTGGTGAAGAGATTGTATTGACAGCAACAGCTTCTCCGACTGGTAACTATTCCTATAAATGGACTGTAGATAAGAGTGGTTATGTGTCTACTACCAACACTAGTAGCCCTACTCTCAAGCTCACAGCTCTCCGTAAGGGTGAAGTGAAAGTTACCTGCACTGTAAGTCAGATGGTTCAAAAAGACTACGATGCTTTTGAAGATTACCCGTGGTATCATACTATCATCTCTAACTGTGCAGTTGCAACCACCCATTATGAAACTCCTCAAGTTAAAGAGTTTGATTCTGAATACTTTGTTGACTTGCCGGGATGGGGCGAACAGACAGTAGTAGATGCATCAGGGAACCCTTCTGTTAAGAAGTATAACTGGAAGTGTGAGCGCATTCGTGCATTTAACAACAGATTGTTTGCTCTGAATATGAGAGAATCTAATGCTTCTGGCGTTACAACACATTACCCGTTGCGTCTTCGTTGGTCTAATTTTGCAGAAGAGAACAAGGCTCCAGAGCTGTGGGATGATTATGCATATGATAGGGCTGTCAGCTCTGATCTTGCTGCTAATATTGTCGGACAGACTGAAGCTCTTGAGAATGGGTATGCAGGGTATATTGATCTTGC
AGATTCGAACGGTAGTCTTATAGAAGTGTTACCTCTGAAGGACTATTTGTTTGTTTACACTGAATTTGAGACATACATTGGGTCACCCACTAATAACACATATCAACCTCTGATGTTTAAGAAGTTGTTTAACGATTCAGGAATTCTTGCTCCCGAATGTGTTGTGGAAGTAGAGGGTGGTCATTTTGTAGTTACTCAGAACGATGTTATTCTTCATAATGGGGCATCTAAGAAGTCAATTGCTTCCAATCGTGTTAAGAATATGCTAATCAATGAGATTTGTTTAGTTAATCCCATAGCTACTAAAGTTCACCTACACCAAGATAAGAAAGAAGTTTGGATTCTATACGTAGGACCGGGAGAGCCAAAAGAGAGTTTCGCTTGTACCAAAGCTGCTGTTTGGAACTACGAATTTGATACTTGGTCTTTCCGTACTATTCCACACTCTTATTGTATTGGTTTGGTTGATCCTCCTGTTCTTGAGCGTGGTCCTATTTGGGCAGATTTTCAAGAAGTCACTTGGGATGATCCATCTATTGATAAACTTGTCTGGAGAAAGGATGCAACAAACTTCCGTCAGAGGGTTACAATAGTAGGTTCTTTCTTGAGAGGGTTCTATCAAGTAGATGTTGGTGCTTTAGATTATTTTTATGACAGATCAAACGATACAGTCATAGAAAGGCCTTTAGAAATGAGGCTAGAGAGGACTGGTATTGACTTTGATAATGTCACTAATGAATGGAATCAAAAACACATCAACAGGTTTAGACCACAAGTGACAGGAACAGGCACTTATATGTTCGAAGCAGGTGGAAGTCAATTCTCTAATGAGTATGGACATAATCACTCAACTAAAGAGTTTAGGGTTGGGGTAGACCGTCATGTGTCAGTAAGATTGAACCATCCATACCTATTCTATAATGTTATAGATAATGATGTTAACAGTAATGCATCTATGAATGGACTCACTATAGAGTTTGCTGTTGGCGGTCGAAGGTAA


Tertiary structure

PDB ID
5fd407fec4974ecf5730e1e823996ef8ed416e1950ad2980874e0543e573349e
Source ESMFold
Method ESMFold
Resolution 0.7705
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
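The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. The following sketch assumes scores on a 0 to 100 scale. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band here is an assumption, and the function name is hypothetical.

```python
def plddt_band(score):
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Boundary values (exactly 90, 70 or 50) fall into the lower band;
    the legend leaves this unspecified.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"

# Example scores chosen for illustration only, not taken from the model.
for example in (95.2, 81.0, 63.5, 42.0):
    print(example, plddt_band(example))
```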

Literature

Title Authors Date PMID Source
Complete Genome Sequence of the phage EK010 isolated from swine sewage Shahin,K., Bao,H., Soleimani-Delfan,A. and Wang,R. 2022-06-16 GenBank