Genbank accession
YP_007005418.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,87
TF
Evidence RBPdetect2
Probability 0,94
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MAIYKTGQASVSADGVVTGYGTKWKDALSLIRKGCTIAFATSPTTFATISDIRTDTEMTVTDAPGVEIPRGDYVILLTTSITVDGLAQDVAETLRYYQGRETQYEQFVEFLENFDWEKFETVTQDVKANADAAQASADAAKTSETKAAASASAAKTSETNAANSAASIGNAERNAAASAAAAKTSETNAAASASAAAGSASAAKTSETNAKTSETNAASSATAANNSKTAAATSATNAAGSATSASNSASAAKTSETNAASSASAAKTSETNAAASASAAAGSATKAKNEADRAQGLADSLDTSKLMMKGNNLSDVASVPQARTNLGLGDSSNVTFNVVTTSGDINAIRNSDTPTRSPAITSRVVGSDGTVLVQAELWADTSSNSIALVNRNTTGPRFFTIKNDGSVNPSGRIISNYGADYAMNLAAPILDARKGYVNTIASSNSLSCYVCFKNADERNRGIIYCNDNQVINIRPDNVGTGAIGRTLSINGANGVCTAVQFSSTSDERAKFWIKPVTDALDKVCSLKGVTYSMHTTTQNTVRNAGLIAQDVQKVLPEAVHVGEVGKTLDKNCFEVEDPLSLDYNAMSALYVEAFKEVRSEMQAMRDEIQSLKAEIELLKNPQ
Physico‐chemical
properties
protein length:622 AA
molecular weight: 64532,34140 Da
isoelectric point:5,06154
aromaticity:0,05145
hydropathy:-0,26720

Domains

Domains [InterPro]
DC_0036
ATT
1–187
DC_0401
STR
179–245
IPR030392
CHP
505–560
YP_007005418.1
1 622
Architecture
ATT
STR
ATT 1-187 | STR 188-622
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_007005418.1
1 622
Domain Start End Length (AA) Confidence
N-terminal 1 140 140 0,9895
Central domain 141 339 200 0,7581
C-terminal 340 622 282 0,9274
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-140
Central
141-339
C-terminal
340-622

Taxonomy

  Name Taxonomy ID Lineage
Phage Cronobacter phage ESP2949-1
[NCBI]
2920894 Uroviricota > Caudoviricetes > Drexlerviridae > Kyungwonvirus Esp29491 >
Host Cronobacter sakazakii
[NCBI]
28141 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_007005418.1 [NCBI]
Genbank nucleotide accession
NC_019509.1 [NCBI]
CDS location
range 38767 -> 40635
strand +
CDS
ATGGCTATTTACAAAACCGGCCAGGCATCAGTAAGCGCCGATGGCGTTGTCACTGGTTACGGCACTAAATGGAAAGACGCGCTTTCCCTGATCCGCAAAGGCTGCACCATTGCTTTTGCCACCAGCCCGACCACTTTCGCAACTATCAGTGACATTCGAACCGATACTGAGATGACGGTAACGGATGCCCCCGGCGTGGAAATCCCGCGCGGCGACTACGTGATCCTGTTAACCACTTCAATAACCGTTGACGGCCTGGCGCAAGATGTGGCGGAAACCCTTCGCTACTATCAGGGCCGTGAGACCCAATACGAGCAATTCGTCGAGTTCCTTGAAAACTTCGATTGGGAAAAGTTTGAAACGGTCACGCAAGATGTGAAGGCTAACGCTGACGCCGCCCAGGCAAGCGCAGACGCTGCCAAGACCAGCGAAACCAAAGCTGCCGCCAGCGCATCGGCTGCAAAAACGAGTGAGACGAATGCCGCCAATAGTGCGGCAAGCATTGGCAATGCTGAAAGAAATGCTGCTGCCAGCGCTGCCGCTGCCAAGACCAGCGAAACGAACGCGGCGGCAAGCGCCTCTGCGGCGGCTGGTAGTGCTTCTGCGGCAAAGACCAGCGAGACAAATGCAAAGACCAGCGAAACCAACGCTGCGTCCAGCGCTACCGCCGCAAACAACAGCAAAACCGCCGCTGCGACTTCCGCAACCAATGCCGCAGGTAGTGCTACCTCTGCGTCTAACAGCGCATCAGCGGCCAAGACCAGCGAAACCAATGCTGCGTCCAGCGCTTCTGCGGCAAAGACCAGTGAAACCAATGCCGCCGCCAGCGCATCGGCTGCTGCCGGATCTGCAACGAAGGCGAAAAATGAAGCTGATCGAGCACAAGGTCTGGCAGATAGCCTGGATACATCCAAGCTGATGATGAAGGGAAACAACCTTTCAGATGTTGCCAGTGTTCCACAAGCAAGAACTAATCTTGGCCTGGGTGATTCCAGCAACGTCACCTTCAATGTCGTTACAACCTCTGGTGATATTAACGCAATAAGGAATAGTGACACGCCAACGAGAAGCCCAGCTATCACATCAAGGGTTGTCGGAAGTGATGGAACCGTTCTTGTTCAGGCTGAATTGTGGGCTGATACCAGTTCGAACTCAATTGCGCTGGTTAACAGAAACACAACCGGGCCACGATTTTTTACTATTAAAAACGATGGATCTGTTAACCCATCCGGCAGGATAATATCTAATTATGGCGCTGACTACGCTATGAATCTTGCCGCTCCTATACTGGATGCAAGAAAAGGATATGTAAACACGATCGCGTCGTCAAACAGCTTAAGCTGTTACGTTTGTTTTAAGAATGCTGACGAAAGAAATCGCGGAATCATATATTGCAACGATAACCAGGTTATCAACATTCGACCGGACAACGTTGGAACTGGCGCAATTGGCAGAACATTAAGCATCAATGGAGCAAATGGTGTTTGTACTGCCGTACAGTTTTCGTCAACTTCTGATGAGCGCGCAAAGTTCTGGATCAAGCCAGTAACAGACGCTCTAGATAAGGTTTGTTCGTTGAAAGGCGTTACATATTCAATGCACACAACGACGCAAAACACTGTCAGGAACGCCGGTCTTATTGCCCAGGATGTACAGAAAGTATTGCCGGAAGCTGTTCACGTTGGCGAGGTTGGCAAGACGCTTGATAAAAATTGCTTTGAGGTAGAAGATCCGTTAAGCCTTGACTACAACGCTATGTCAGCGCTTTATGTAGAAGCATTTAAGGAGGTCAGATCGGAAATGCAGGCGATGAGAGATGAAATTCAGTCTCTAAAAGCGGAAATTGAGTTGCTTAAGAATCCACAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
fe93540bbd154ad88b43a69ef8a5a4f0c8ebf2d1081c89a7c8259034bb018c2c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7091
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50