Genbank accession
QYU43879.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MSSGCGDVLSLADLQTAKKHQIFEAEVITGKSGGVATGADIDYATNQVTGQTQKTLPAVLRDAGFSPVSWDFSTGGTLTVNDRDKVVYDPVSETWYSYAGTLPVVVPASFNPVGNANWKPQTDPNLRNDLASSTAGLGASLVSFSNGNTVETLSDADGAKNIGSGERSLLARNNDIKHSGDFSTLQAAVDASLPKNDLLISPGEYTEKVTIGSAQLKGVGGAVVLKTPADFTNTVQVNLSTPHWQFRHSGGFAIDGSGTTGAVGISFDPSDQYSGRHNFSDLYIHNINKAIQKPSGNIGNTWRNIGVSTCDWGYYAISGSEMHCGADTLYNIHFDGISTYAVYLNGTADNGGIGGWWLKDSIIEASGGGGIYLKSKSGDCPTSPCGVSNVWTEAIATSSAVQVDGVAQKPRVLKLVDTAIFFAEYSYLNNIELSNSNLVTYGCRFDNADGNQDIVVDAQSTIVAHDVYLNGSSGKDVIVESVASQSATVATTNLSLRGNLTRGRVFNTPTGNKLKAITFDSGSHNFSGSGTVNGSTVSDGLHAATCTEFSFPGSGLYEMVASRTTLTSGRWYVWGVNSRLQSGTADVSITSGITMGSVYTKSGEWISTFGVGKASANGTVSLYVSTAGGSGAVIRFSDFFIAEFTTQAQALAFANSRMSLA
Physico‐chemical
properties
protein length:661 AA
molecular weight: 69245,64470 Da
isoelectric point:5,17510
aromaticity:0,08926
hydropathy:-0,12360

Domains

Domains [InterPro]
DC_0018
ATT
1–255
G3DSA:2.10.10.80
ATT
65–133
QYU43879.1
1 661
Architecture
ATT
STR
RBD
ATT 1-255 | STR 256-358 | RBD 359-661
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QYU43879.1
1 661
Domain Start End Length (AA) Confidence
N-terminal 1 186 186 0,9937
Central domain 187 505 320 0,9810
C-terminal 506 661 155 0,9950
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-186
Central
187-505
C-terminal
506-661

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoS-phiEc4
[NCBI]
2863833 No lineage information
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QYU43879.1 [NCBI]
Genbank nucleotide accession
MZ576218.1 [NCBI]
CDS location
range 22256 -> 24241
strand +
CDS
ATGTCAAGCGGATGCGGTGACGTTTTAAGCCTGGCGGATTTACAGACCGCCAAGAAACATCAGATTTTCGAGGCCGAGGTTATCACTGGCAAATCCGGCGGTGTAGCTACGGGTGCAGACATTGATTACGCAACAAATCAGGTTACCGGGCAGACGCAGAAGACGCTTCCTGCCGTGTTGCGCGACGCGGGTTTTTCTCCGGTGTCATGGGATTTCTCCACAGGCGGCACGTTAACCGTTAACGACCGGGACAAGGTGGTGTATGACCCTGTTAGCGAAACATGGTACTCGTACGCAGGTACGTTACCAGTTGTTGTCCCTGCATCGTTTAATCCAGTCGGTAACGCTAACTGGAAGCCGCAGACAGACCCAAATTTGCGTAATGATTTGGCGTCAAGTACCGCCGGTTTGGGGGCTTCACTGGTGTCTTTTTCCAACGGTAACACCGTTGAAACATTATCTGACGCGGACGGTGCCAAGAACATAGGCAGCGGCGAAAGGAGTTTGCTTGCACGTAACAATGATATTAAGCATTCAGGAGACTTCTCTACCCTGCAAGCTGCAGTAGATGCATCGTTACCGAAAAATGACTTACTTATTTCCCCTGGTGAATATACTGAAAAAGTTACTATTGGTAGCGCACAGTTAAAAGGCGTTGGTGGGGCTGTAGTACTAAAAACACCAGCAGACTTTACAAACACTGTTCAGGTTAATTTATCGACGCCTCATTGGCAATTCCGCCATAGTGGCGGTTTTGCTATAGATGGTTCTGGTACAACCGGGGCCGTAGGTATTAGTTTTGACCCGTCAGACCAATATTCCGGACGTCATAATTTTAGTGATTTATACATACACAACATCAACAAGGCAATTCAGAAGCCTTCCGGGAACATCGGTAATACATGGAGAAATATTGGGGTATCTACGTGTGATTGGGGGTATTACGCGATTAGTGGCTCAGAGATGCATTGCGGGGCCGATACCCTCTACAATATCCACTTCGACGGCATCTCCACCTATGCCGTCTACCTAAACGGCACTGCCGATAACGGCGGGATAGGCGGATGGTGGCTTAAAGACTCCATTATTGAGGCTTCCGGAGGTGGCGGGATATATTTAAAAAGCAAATCAGGTGACTGCCCTACATCTCCATGCGGGGTATCCAATGTATGGACGGAAGCGATTGCAACATCATCGGCTGTTCAGGTAGACGGCGTGGCGCAAAAACCGCGAGTTCTCAAGTTGGTAGACACTGCGATATTCTTTGCTGAGTATTCTTATCTCAACAACATCGAGCTATCCAACTCCAACTTAGTAACTTATGGTTGCCGTTTTGACAACGCTGATGGTAATCAGGATATTGTAGTGGATGCACAAAGCACCATTGTAGCCCACGATGTGTATTTAAACGGCAGTTCTGGAAAGGACGTTATTGTAGAGTCTGTCGCGTCGCAATCCGCAACGGTAGCCACTACAAACTTGTCTCTGCGAGGAAATTTAACAAGGGGACGAGTGTTTAATACGCCGACGGGGAATAAGTTAAAAGCTATCACATTTGACTCAGGTAGCCATAATTTTTCTGGTAGCGGTACCGTTAACGGTTCGACTGTATCTGACGGACTTCACGCCGCTACATGTACCGAATTTTCATTCCCTGGGTCCGGTTTATATGAAATGGTAGCGTCAAGAACGACACTTACATCCGGTAGATGGTACGTGTGGGGCGTCAACTCTCGCCTCCAATCAGGAACAGCAGATGTGAGTATAACGTCAGGTATTACTATGGGTAGCGTTTATACTAAGTCTGGTGAGTGGATTAGCACATTTGGGGTGGGTAAAGCTTCCGCTAACGGTACTGTAAGTCTGTACGTCTCTACTGCGGGAGGTTCAGGTGCTGTTATCAGATTTAGCGACTTCTTCATCGCTGAGTTTACGACGCAGGCTCAGGCTTTAGCTTTCGCTAACTCCCGAATGTCGTTGGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
6579df7234d590f50bfaea599291d8ca7a51ee0f623b975900cf2ef6f83ba86f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7563
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Genomic characterization of two Escherichia phages belonging to the genus Kagunavirus Rodea Montealegre,G.E., Gonzalez-Villalobos,E., Balcazar,J.L. and Molina-Lopez,J. 2016 GenBank