GenBank accession
AXF38930.1 [GenBank]
Protein name
tail fibers protein
RBP type Evidence Probability
TF GenBank 1.00
TF Phold 1.00
TSP DepoScope 1.00
TSP RBPdetect 0.91
TSP RBPdetect2 0.95
Protein sequence
MSTITQFPSGSTQYRIEFDYLARTFVVVTLVNSSNPTLNRVLEVGRDYRFLNPTMIEMLVDQSGFDIVRIHRQTGTDLVVDFRNGSVLTASDLTNSELQAIHIAEEGRDQTVDLAKEYADAAGSSAGNAKDSEDEARRIAESIRAAGLIGYITRRSFEKGYNVTTWSEVLLWEEDGDYYRWDGTLPKNVPAGSTPETSGGIGLGAWVSVGDAALRSQISNPEGAILYPELQMARWRDEGDVRGWGAKGDGSADDTEAFKAALATGKNLYIPVGIYIIKETLYFKNQVIKGGGISPSPSLGTILAISHNEAAFKYDVNSGYSMGGYLGGFFIDYGENKPDNYGGRKGIDIGDASQTAWPSQFIIENIIVRGAYFGIHDVTGAFQYTMRNVLAINCWEGFRKHIGTTVLMDTCYALNCYQAFNFANVYNMTMNNCAMDGCNDIQGLQAFDINNCKGMVINGMYSENCEIHHNGHASIYIHGDSTVTLNGYALHSHKVLASSGEAYFLRAHESSRVTVDGIFFGEDLTSTAVFMYPVLSSGDARVKLGTTRLKLWTGATGGASLAALGNSLIEYDNTVFVPTVTVGWCSNNGVVAKGTLAVNTTLEPGADLNAGNITLTGDYKPTKGDVLVYGATFDVKSCSIILKPIGENLCNVYIKNLSAGGSATLLGDLMVQAIRR
Physico-chemical properties
protein length: 676 AA
molecular weight: 73218.38440 Da
isoelectric point: 5.09166
aromaticity: 0.09763
hydropathy: -0.11361
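The page does not say how the aromaticity and hydropathy values were derived. A minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY); the function names are illustrative only:

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence listed above.
peptide = "MSTITQFPSG"
print(len(peptide), aromaticity(peptide), round(gravy(peptide), 5))
```

Passing the full 676-residue sequence instead of the ten-residue prefix gives values that can be compared with the ones listed, provided the page uses the same definitions.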

Domains

Domains [InterPro]
ID Type Range
DC_0369 STR 1–670
IPR040775 RBD 156–211
IPR024535 ENZ 241–292
Architecture
STR 1-10 | ATT 11-113 | STR 114-149 | ATT 150-220 | STR 221-670
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
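For downstream use, the pipe-separated architecture string can be turned into structured segments. A minimal sketch, assuming each segment has the form `TYPE start-end`; `parse_architecture` is a hypothetical helper, not part of any tool named on this page:

```python
import re

# One segment looks like "ATT 11-113": an uppercase type code and a residue range.
SEGMENT_RE = re.compile(r"([A-Z]+)\s+(\d+)-(\d+)")


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split an architecture string on '|' and parse each non-empty segment."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # tolerate a trailing separator
            continue
        match = SEGMENT_RE.fullmatch(part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        kind, start, end = match.groups()
        segments.append((kind, int(start), int(end)))
    return segments


arch = "STR 1-10 | ATT 11-113 | STR 114-149 | ATT 150-220 | STR 221-670"
for kind, start, end in parse_architecture(arch):
    print(kind, start, end)
```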

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout (AXF38930.1, residues 1–676)
Domain Start End Length (AA) Confidence
N-terminal 1 253 253 0.9935
Central domain 254 576 324 0.9931
C-terminal 577 676 99 0.9755
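The Length column can be recomputed from Start and End. A minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention). Under that assumption the central and C-terminal lengths come out as 323 and 100 rather than the listed 324 and 99, so the table may use a different boundary convention for those two rows.

```python
PROTEIN_LENGTH = 676

# (name, start, end) as listed in the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 253),
    ("Central domain", 254, 576),
    ("C-terminal", 577, 676),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(domains, key=lambda d: d[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


for name, start, end in DOMAINS:
    print(name, inclusive_length(start, end))
print("covers whole protein:", tiles_protein(DOMAINS, PROTEIN_LENGTH))
```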
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Taxonomy

Role | Name | Taxonomy ID | Lineage
Phage | Escherichia phage Vec13 [NCBI] | 2783859 | Uroviricota > Caudoviricetes > Autographivirales > Studiervirinae > Kayfunavirus
Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
AXF38930.1 [NCBI]
GenBank nucleotide accession
MH400309.1 [NCBI]
CDS location
range 33847 -> 35877
strand +
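The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive on MH400309.1 and that the CDS includes its stop codon; the function names are illustrative:

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def protein_length_from_cds(nt_length: int, has_stop_codon: bool = True) -> int:
    """Number of encoded residues, optionally excluding a terminal stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    return codons - 1 if has_stop_codon else codons


nt = cds_length(33847, 35877)
print(nt, protein_length_from_cds(nt))
```

With these assumptions the range spans 2031 nucleotides, or 677 codons, which matches the 676-residue protein plus one stop codon.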
CDS
ATGTCCACGATTACACAATTCCCTTCAGGAAGCACTCAGTACAGGATTGAGTTCGACTACCTAGCCAGAACGTTTGTTGTTGTTACGCTGGTGAATAGCTCTAACCCTACCCTGAACCGTGTACTGGAAGTTGGTCGAGATTACCGATTCCTTAATCCAACGATGATTGAGATGTTGGTTGACCAATCAGGTTTCGACATCGTTCGCATTCACCGTCAGACTGGAACTGACTTAGTGGTAGACTTCAGGAATGGCTCAGTGTTGACAGCTAGTGACCTGACAAATTCAGAGCTTCAGGCTATCCATATTGCAGAAGAAGGTCGAGACCAAACGGTTGACTTAGCGAAGGAATATGCCGATGCTGCTGGTAGCTCTGCTGGCAACGCTAAGGATAGCGAGGACGAAGCACGCCGAATCGCTGAGAGTATCAGGGCGGCTGGTCTAATTGGTTATATTACCCGTCGCTCCTTCGAGAAAGGCTACAACGTTACAACATGGAGCGAGGTCCTGCTATGGGAAGAGGATGGTGATTATTACCGCTGGGATGGTACGCTTCCAAAGAACGTTCCTGCTGGTTCAACTCCTGAAACTTCCGGTGGGATTGGATTAGGTGCGTGGGTTAGTGTTGGTGATGCTGCTTTAAGAAGTCAGATTTCAAACCCGGAAGGGGCAATACTCTACCCAGAATTACAGATGGCGCGCTGGCGAGATGAGGGTGATGTTCGCGGATGGGGTGCTAAAGGTGATGGTTCTGCGGATGATACAGAGGCTTTCAAGGCAGCACTGGCAACAGGAAAGAATCTATATATCCCCGTTGGTATATACATCATTAAAGAGACTTTGTACTTTAAGAACCAAGTTATTAAAGGTGGGGGAATATCTCCATCACCAAGTCTTGGTACAATCCTAGCAATCTCTCACAATGAAGCAGCTTTTAAGTATGACGTTAACTCTGGATACTCCATGGGTGGCTACTTAGGCGGTTTCTTCATTGATTATGGTGAGAACAAGCCAGACAACTATGGTGGTCGCAAAGGTATTGATATTGGTGATGCATCGCAGACAGCATGGCCTTCACAATTTATCATTGAGAACATCATAGTACGTGGTGCGTACTTTGGTATTCATGACGTGACAGGGGCTTTTCAGTATACCATGCGTAACGTATTGGCGATTAATTGCTGGGAAGGTTTCCGTAAGCATATTGGTACTACAGTCCTAATGGATACATGCTACGCACTTAATTGTTATCAGGCGTTCAATTTTGCAAACGTGTACAACATGACTATGAACAATTGTGCAATGGACGGCTGTAATGATATACAAGGTCTACAGGCTTTTGATATTAACAACTGTAAAGGTATGGTAATAAATGGCATGTACAGCGAGAACTGTGAGATTCATCATAATGGACATGCCTCTATATATATTCATGGGGACTCAACAGTAACACTTAACGGATATGCACTACATTCCCACAAAGTTCTGGCTAGTTCTGGGGAGGCTTACTTCTTACGAGCACATGAAAGTTCTCGCGTTACTGTTGATGGTATCTTCTTCGGGGAAGACCTGACGTCTACTGCTGTATTCATGTACCCAGTCCTGTCTTCTGGGGATGCTAGAGTTAAGCTTGGAACTACTAGATTGAAGTTGTGGACTGGTGCTACGGGTGGGGCGTCACTGGCTGCACTAGGTAACAGTTTGATTGAGTATGACAACACAGTCTTTGTTCCAACAGTCACGGTCGGATGGTGTAGCAACAACGGAGTTGTAGCCAAAGGTACTCTTGCAGTTAACACTACGCTTGAACCAGGTGCTGACCTGAATGCCGGTAATATTACCCTCACTGGTGACTACAAGCCAACTAAGGGGGATGTGCTTGTGTATGGTGCTACCTTTGATGTTAAGTCATGTTCAATTATTCTCAAGCCAATAGGGGAGAATTTATGTAATGTGTACATTAAGAACTTAAGTGCAGGTGGTAGTGCTACTCTTCTTGG
TGACTTGATGGTACAGGCCATCCGACGCTAG

Tertiary structure

PDB ID
fad863207f9a5922aa71b42f6258efa644f058eb9e07f42d533529ef62bb83c3
Source ESMFold
Method ESMFold
Resolution 0.8295
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
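The confidence bands above can be applied to per-residue pLDDT scores. A minimal sketch, assuming scores on a 0-100 scale; the legend uses strict inequalities, so placing the exact boundary values 90, 70 and 50 in the lower band is an assumption made here:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Hypothetical per-residue scores, for illustration only.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_band(score))
```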

Literature

Title | Authors | Date | PMID | Source
Complete genome sequence of Escherichia coli bacteriophage Vec13 | Volozhantsev, N., Kislichkina, A., Denisenko, E., Verevkin, V., Myakinina, V. and Krasilnikova, V. | 2017-09-14 | | GenBank