Genbank accession
YP_010068735.1 [GenBank]
Protein name
tail fiber protein proximal subunit
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,58
TF
Evidence RBPdetect2
Probability 0,79
Protein sequence
MAEIKRKFRAEDGLDAGGDKIINVALADRTVGTDGVNVDYLIQENTVQQYDPTRGYLKDFVIIYDNRFWAAINDIPKPAGAFNSVRWRALRTDANWTTVSSGPYQLKSGELISVNTAAGNDITFTLPSSPIDGDTIVLQDIGGKPGVNQVLITAPVQSIVNFRGQQVRSVLMTHPKSQIVLIFSNRLWQMYVADYSREAVVVTPASLYQAQSNDFIVRRFTSAAPINVKLPRFANHGDIINFVDLDKLNPLYHTIVTTYDETTSIQEVGTHSIEGRTTIDGFLMFDDNEKLWRLFDGDSKARLRIITTNSNIRPNEEVMVFGANNGTAQTIELQLPTDISVGDTVKISMNYMRKGQTVKIKAAGEDKIASSVQLLQFPKRSEYPPETKWVTVQELVFNGETNYVPVLELAYIEDSDGKYWVVQQNVPTVERVDSLNDSTRARLGVIALATQAQANADLENSPQKELAITPETLANRTATETRRGIARIATTAQVNQDTTFSFADDIIITPKKLNERTATETRRGVAEIATQQETNAGTDDTTIITPKKLQARQGSESLSGIVTFVSTAGATPASSRELNGTNVYNKNTDNLVVSPKALDQYKATPTQQGAVILAVESEVIAGQSQQGWANAVVTPETLHKKTSTDGRIGLIEIATQSEVNTGTDYTRAVTPKTLNDRRATESLSGIAEIATQVEFDAGVDDTRISTPLKIKTRFNSTDRTSVVALSGLVESGTLWDHYTLNILEANETQRGTLRVATQVEAAAGTLDNVLITPKKLLGTKSTEAQEGVIKVATQSETVTGTSTNTAVSPKNLKWIVQSEPTWAATTLIRGFVKTSSGSITFVGNDTVGSTQPLESYEKNSYAVSPYELNRVLANYLPLKAKAVDSNLLDGLDSSQFIRRDIAQTVNGSLTLTQQTNLSAPLVSSSTATFGGSVTANSVLTISNTGTETHLIFEKGPQIGTNQAQTVTIRVWGNQFSGESDTTRSTVFEVGDETSNHFYSQRNKAGNITFNINGTVTPINVNASGTLNANGVATFGNSVTATGEIISRSANAFRAINGDYGFFIRNDASNTYFMLTASGDQTGGFNGLRPLAINNASGQVTIGESLIIAKGATINSGGLTVNSRIRSQGTKTSDLYTRAPTSETVGFWSIDINDSATYNQFPGYFKMVEKTNEVTGLPYLERGEEVKSPGTLTQFGNTLDSLYQDWITYPTTPEARTTRWTRTWQKTKNSWSSFVQVFDGGNPPQPSDIGALPSDNAIMGNLTIRDFLRIGNVRIIPDPVNKTVKFEWVE
Physico‐chemical
properties
protein length:1289 AA
molecular weight: 140380,97830 Da
isoelectric point:5,36056
aromaticity:0,07370
hydropathy:-0,31877

Domains

View on InterPro
YP_010068735.1
1 1289 aa
ATT 979–1092 · ATT 1139–1237 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

YP_010068735.1
1 1289 aa
Domain Start End Length (AA) Confidence
N-terminal 1 1080 1080 0,8218
Central domain 1081 1278 199 0,1403
C-terminal 1279 1289 10 0,8767
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Escherichia phage vB_EcoM_F1 [NCBI] · taxon 2750846
Host No host information

Coding sequence (CDS)

Genbank protein accession
YP_010068735.1 [NCBI]
Genbank nucleotide accession
NC_054912 [NCBI]
CDS location
range 149182 -> 153051
strand +
CDS
ATGGCCGAGATTAAAAGAAAGTTCAGAGCAGAAGATGGTCTGGACGCAGGTGGTGATAAAATAATCAACGTAGCTTTAGCTGATCGTACCGTAGGAACTGACGGTGTTAACGTTGATTACTTAATTCAAGAAAACACAGTTCAACAATATGATCCAACTCGTGGATATTTAAAAGATTTTGTAATCATTTATGATAACCGCTTTTGGGCTGCTATAAATGATATTCCAAAACCAGCAGGAGCTTTTAATAGCGTACGATGGAGAGCGTTACGTACCGATGCAAACTGGACAACTGTTTCATCAGGACCTTATCAATTAAAGTCAGGTGAATTAATTTCAGTTAATACTGCAGCAGGAAATGACATTACATTTACTTTACCATCTTCTCCAATTGACGGTGATACTATCGTTCTCCAAGACATTGGAGGAAAACCTGGAGTTAACCAAGTTTTAATTACAGCTCCGGTACAAAGCATTGTAAATTTTAGAGGTCAGCAAGTACGTTCAGTACTAATGACTCATCCAAAATCACAGATAGTTTTAATTTTTAGTAATCGTCTGTGGCAAATGTATGTCGCTGATTATAGTAGAGAAGCTGTAGTTGTAACGCCTGCGAGTCTCTATCAGGCGCAATCAAACGATTTTATCGTACGTAGATTTACTTCTGCTGCACCAATTAATGTTAAACTTCCAAGATTTGCTAATCATGGAGATATTATTAATTTCGTCGATTTAGATAAACTAAATCCGCTTTATCATACAATTGTTACTACATACGATGAAACGACTTCAATACAAGAAGTTGGAACTCATTCCATTGAAGGCCGTACAACAATTGACGGTTTCTTGATGTTTGATGATAATGAGAAATTGTGGAGATTGTTTGACGGGGATAGTAAAGCACGTTTACGTATTATAACGACTAATTCTAATATTCGTCCAAATGAAGAAGTTATGGTATTTGGTGCGAATAATGGAACAGCTCAGACAATTGAACTTCAGCTTCCGACTGATATTTCTGTTGGTGATACTGTTAAAATTTCCATGAATTACATGAGAAAAGGACAAACGGTTAAAATCAAAGCTGCTGGTGAAGATAAAATTGCTTCTTCAGTTCAATTGCTGCAATTCCCAAAACGTTCAGAGTATCCGCCTGAAACTAAATGGGTAACAGTTCAAGAATTAGTTTTTAACGGTGAAACTAATTATGTTCCAGTTTTAGAGCTTGCTTATATAGAAGATTCTGATGGAAAATATTGGGTTGTACAGCAAAACGTTCCAACTGTAGAAAGAGTAGATTCTTTAAATGATTCTACTAGAGCAAGATTAGGCGTAATTGCTTTAGCTACACAAGCTCAAGCTAATGCTGATTTAGAAAATTCTCCACAAAAAGAATTGGCAATTACTCCAGAAACGTTAGCTAATCGTACTGCTACTGAAACTCGCAGAGGTATCGCAAGAATAGCAACAACTGCTCAAGTTAACCAGGATACTACATTCTCTTTTGCTGATGATATTATCATCACTCCTAAAAAGCTGAATGAAAGAACAGCTACTGAAACTCGCAGAGGTGTTGCTGAAATTGCTACGCAGCAAGAAACTAATGCAGGAACCGATGATACTACAATCATCACTCCTAAAAAGCTTCAAGCTCGTCAAGGTTCTGAATCATTATCTGGTATTGTAACCTTTGTATCTACTGCAGGTGCTACTCCAGCTTCTAGCCGTGAATTAAATGGTACGAATGTTTATAATAAAAACACTGATAATTTAGTTGTTTCACCTAAAGCTTTGGATCAGTATAAAGCTACTCCAACACAACAAGGTGCGGTAATTTTAGCAGTTGAAAGTGAAGTAATTGCTGGACAAAGTCAGCAAGGATGGGCAAATGCTGTTGTAACGCCAGAAACGTTACATAAAAAGACATCAACTGATGGAAGAATTGGTTTAATTGAAATTGCTACGCAAAGCGAAGTTAATACAGGAACTGATTATACTCGTGCAGTCACTCCTAAAACTTTAAATGACCGTAGAGCAACTGAAAGTTTAAGTGGTATAGCTGAAATTGCGACACAAGTTGAATTCGACGCAGGCGTCGACGATACTCGTATCTCTACACCATTAAAAATTAAAACCAGATTTAATAGTACTGATCGTACTTCTGTTGTTGCTCTATCTGGATTAGTTGAATCAGGAACTCTCTGGGACCATTATACCCTTAATATTCTTGAAGCAAATGAGACACAGCGTGGTACACTTCGTGTAGCTACACAGGTCGAAGCTGCTGCGGGAACATTGGATAATGTTCTAATAACTCCTAAAAAGCTTTTAGGTACTAAATCTACTGAAGCGCAAGAAGGTGTTATTAAAGTTGCAACTCAGTCTGAAACTGTGACTGGAACGTCAACAAATACTGCTGTATCTCCAAAAAATTTAAAATGGATTGTGCAGAGTGAACCTACTTGGGCAGCAACTACTCTGATAAGAGGTTTTGTTAAAACTTCATCTGGTTCAATTACATTCGTTGGTAATGATACGGTAGGTTCAACACAGCCATTAGAATCATATGAGAAAAATAGCTATGCAGTATCACCATATGAATTAAACCGTGTATTAGCAAATTATCTGCCGTTAAAAGCAAAAGCTGTAGATAGTAATTTATTGGATGGTCTAGATTCATCTCAGTTCATTCGTAGGGATATTGCGCAGACGGTTAATGGTTCACTAACCTTAACCCAACAAACGAATCTGAGTGCCCCTCTTGTATCATCTAGTACTGCTACGTTTGGTGGATCAGTTACAGCAAATAGTGTACTAACTATTTCTAATACTGGAACAGAAACTCATCTGATTTTTGAGAAAGGACCTCAAATTGGAACTAACCAAGCGCAGACGGTAACTATTAGAGTGTGGGGTAATCAATTTAGTGGGGAATCAGATACAACACGTTCTACTGTATTTGAAGTTGGTGATGAAACGTCTAATCACTTTTATTCTCAGCGCAATAAAGCTGGAAATATAACATTTAATATCAATGGTACAGTAACACCTATAAATGTTAATGCTTCAGGAACATTGAATGCAAATGGTGTAGCAACATTTGGTAATTCAGTCACTGCAACTGGTGAAATTATTTCTCGAAGCGCAAATGCTTTCCGTGCTATTAACGGTGATTACGGATTCTTTATTCGCAATGATGCCTCTAATACCTATTTTATGCTTACTGCATCGGGTGATCAGACTGGTGGATTTAATGGATTGCGTCCTTTAGCTATCAATAATGCATCTGGCCAAGTAACGATTGGTGAAAGCTTAATCATTGCCAAAGGTGCTACTATAAATTCAGGCGGTTTGACTGTTAACTCGAGAATTCGTTCTCAGGGTACTAAAACATCTGATTTATACACCCGTGCACCAACATCTGAAACTGTAGGATTCTGGTCAATTGATATTAACGATTCAGCCACTTATAACCAGTTCCCGGGTTATTTTAAAATGGTTGAAAAAACTAATGAAGTGACTGGACTTCCATACTTAGAACGTGGTGAAGAGGTTAAATCTCCTGGTACATTGACTCAGTTTGGTAATACACTTGATTCGCTTTACCAAGATTGGATTACTTATCCAACGACCCCAGAAGCGCGCACCACTCGCTGGACACGTACATGGCAGAAAACCAAAAATTCTTGGTCAAGTTTTGTTCAGGTATTTGATGGGGGTAACCCTCCTCAACCATCTGATATCGGTGCTTTACCATCTGATAATGCTATAATGGGTAATCTTACTATTCGTGATTTCTTACGAATTGGTAATGTTCGCATTATTCCTGACCCAGTGAATAAAACGGTTAAATTTGAATGGGTTGAATAA

Genome Context

Tertiary structure

YP_010068735.1
ESMFold structure
Source ESMFold
pLDDT 56.7
Oligomeric state monomer

Literature

Title Authors Date PMID Source
Complete genome sequences of eight phages infecting swine Enterotoxigenic Escherichia coli Ferreira,A., Oliveira,H., Silva,D., Almeida,C., Burgan,J., Azered,J. and Oliveira,A. 2020-09-03 GenBank