Genbank accession
QGH77647.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,81
TF
Evidence RBPdetect2
Probability 0,97
Protein sequence
MAENMITGSKGGSSKPYVPKEMEDNLISINKIKILLAVSDGECDPDFTLRDLYLDNVPVIADDGTVNYKGVSAEFRPGTQTQDYIQGFTDTSSEVTLARDITTSNPYVISVTNKTLSAIRIKMLMPTGIKQEDNGDLVGVKVTYAVDMAVDGDSYKEVLRDTIEGKTRSGYDRSRRIDLPAFNDRVLLRVRRVTADSTSSRVTDLIKLQSYAEVIDAKFRYPLTGLVYVEFDSELFPNQIPNISIKKKWKLINVPSNYDPVMREYHGSWDGTFKKAWSNNPAWVLYDIITNQRYGLDQRELGVQVDKWSLYEAAQYCDQKVPDGKGGTEPRYLCDVVIQSQIEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYIFTNDNVVDGDFQYTTASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWILKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFHSSNLKLNLSGRVMEVSGLQVFVPFKIDARPGDFIIINKPDGKPVKRTISKVSVDGKTIELNIGFGFEVKPDTVFAIDRTDIALQQYVVTSIGKGDDDDEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDTMAAPENVQISSYSRIVQGASVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGSTSPWSRIVTASLTGKVGEPGAPVNLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTAENSSLLTLIPYPQYEYWHGTLPAGHVVWYRIRSVDKIGNVSGWTDFVRGMASDDVDSVMGDLLDKIFDTEAGQDIKENAIDSANKIKDQAQSIIQNAIANDANLKWTRVQNGKRKAEYGHALELIANETEARVTQIEELKASIDDDIVSSIKTVQEAIANESETRATQVQQLDSKFTKEIDGVKRDTAASIGEVRQTIANESEARAQAVQQLDAKFTKEINDLDEVIKTEVEANISEVKQAIANETEARVQADQSLTAKFGDVESALAEKLDSWANVDSVGAKYAMKLGLTYKGQKYSAGMIMQLSQSSQGLISQILFDANRFAIMTSSTGGTFTLPFVVENNQVFINSLLVKNGSITNAMIGNVIQSNNFVQNQQGWRLDKNGNFENYGTRAGEGATKFTNEGLKVKDANGRLRVEIGRITGSW
Physico‐chemical
properties
protein length:1192 AA
molecular weight: 132323,59220 Da
isoelectric point:4,84504
aromaticity:0,08473
hydropathy:-0,39874

Domains

View on InterPro
QGH77647.1
1 1192 aa
ATT 92–217 · ATT 341–488 · STR 627–722 · STR 724–821 · RBD 1034–1163 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

QGH77647.1
1 1192 aa
Domain Start End Length (AA) Confidence
N-terminal 1 509 509 0,9276
Central domain 510 1066 558 0,0461
C-terminal 1067 1192 125 0,3789
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
QGH77647.1 [NCBI]
Genbank nucleotide accession
MN158217.1 [NCBI]
CDS location
range 46050 -> 49628
strand +
CDS
ATGGCTGAAAATATGATAACTGGCAGTAAGGGTGGATCATCAAAACCTTATGTGCCGAAAGAGATGGAAGATAACCTGATCTCAATCAACAAGATCAAAATCCTTCTTGCCGTTTCCGATGGTGAGTGCGATCCAGATTTTACACTTCGCGATCTGTATCTTGATAATGTTCCGGTAATTGCAGACGACGGAACTGTTAACTACAAGGGTGTGAGCGCAGAGTTTCGCCCAGGGACACAGACTCAAGATTACATCCAAGGATTTACTGACACATCAAGCGAAGTGACGCTGGCGCGTGATATTACTACGTCAAATCCTTATGTGATTTCCGTAACAAACAAAACATTATCTGCTATCAGAATCAAAATGCTAATGCCAACAGGCATTAAGCAAGAGGATAACGGCGATCTTGTCGGCGTTAAGGTTACTTATGCTGTTGATATGGCTGTTGACGGAGACTCTTACAAAGAAGTATTGCGAGACACTATCGAAGGTAAAACACGTTCAGGTTACGACAGAAGCCGAAGGATTGACCTTCCGGCATTTAATGATCGCGTATTGCTTAGGGTTAGAAGGGTTACGGCAGACAGCACATCTTCTCGTGTTACTGATTTGATTAAGCTACAAAGTTACGCTGAGGTTATTGATGCAAAATTCCGTTATCCTCTGACTGGTCTTGTATACGTTGAATTTGACAGCGAATTGTTCCCAAACCAGATCCCTAACATTTCAATCAAGAAGAAATGGAAGCTGATTAATGTTCCTAGCAATTACGATCCGGTAATGCGAGAGTATCACGGTTCATGGGATGGTACGTTCAAGAAAGCGTGGTCGAACAATCCGGCGTGGGTACTTTATGACATTATCACAAACCAGCGATACGGATTAGATCAGCGAGAGCTTGGTGTACAGGTTGATAAATGGAGTCTTTACGAAGCGGCGCAATACTGCGATCAGAAAGTTCCTGACGGGAAAGGCGGTACAGAGCCGCGTTATCTATGCGACGTTGTGATTCAAAGCCAGATTGAGGCTTATCAGCTTATTCGTGATATTTGCTCAATCTTCAGAGGCATGAGCTTTTGGAATGGAGAGAGCTTGTCAATCGTCATTGATAAGCCGCGCGATCCGTCGTACATCTTCACAAATGACAATGTTGTTGATGGTGATTTTCAGTACACAACAGCAAGTGAAAAGAGCATGTACACGCAGTGCAACGTGACGTTCGATGACGAACAAAACATGTATCAACAGGACGTAGAGGGTGTATTCGACACAGAGGCCGCATTGCGTTTTGGATACAATCCTACAAGCATAACCGCGATCGGATGTACACGCAGGAGCGAGGCTAATCGGCGCGGTAGATGGATACTAAAAACCAACTTGCGCAGCACTACGGTAAACTTTGCTACTGGACTGGAAGGCATGATCCCATCAATAGGTGATGTGATTGCTATTGCTGACAACTTTCACAGCAGCAACCTTAAGTTAAACCTATCAGGGCGCGTGATGGAAGTTTCCGGCTTGCAGGTGTTCGTTCCGTTTAAGATTGACGCGCGACCAGGTGATTTCATTATCATCAACAAGCCGGACGGAAAGCCAGTTAAGCGCACAATCTCAAAAGTAAGCGTCGACGGAAAAACCATTGAACTAAACATTGGGTTTGGGTTTGAAGTTAAACCGGATACAGTTTTTGCAATCGACCGCACTGATATTGCATTGCAGCAATACGTTGTAACGAGTATCGGAAAAGGTGATGATGATGATGAATTTACATACTCCATCACGGCTGTTGAATATGACCCGAACAAATACGACGAGATTGATTATGGAGTAAACATTGACGACAGGCCAACGTCAATTGTCCAGCCTGACACAATGGCAGCACCGGAAAACGTGCAAATATCCTCATACTCGCGAATTGTCCAGGGTGCAAGCGTTGAAACAATGGTTGTGTCGTGGGATAAAGTACCTTACGCATCGCTGTATGAAATGCAATGGCGAAAAGGTGATGGCAACTGGCTGAATACACCACAGACCGCAAACAAAGAAATTGAGGTTGAAGGTATTTATTCAGGAAACTACCAGGTAAGGGTTAGATCTGTTTCTGCTTCAGGTTCTACTTCTCCGTGGTCCAGAATTGTGACAGCATCACTGACTGGTAAGGTAGGAGAGCCAGGCGCGCCAGTTAACTTAACCGCATCCGACAATGAAGTGTTTGGCATTCGTGTTAAGTGGGGGATGCCAGAAGGGAGCGGAGACACGGCATACATTGAGCTTCATCAGTCGCCGGATGGAACGGCTGAAAACTCAAGCCTGCTAACGCTGATCCCGTATCCACAATATGAATACTGGCACGGTACGCTTCCGGCTGGTCATGTTGTATGGTATCGGATCAGGAGCGTAGACAAGATCGGCAACGTTTCCGGTTGGACTGATTTTGTTAGAGGTATGGCTTCGGATGATGTTGATTCTGTTATGGGTGATCTTCTTGATAAGATTTTTGATACTGAAGCAGGCCAGGATATTAAAGAGAACGCCATTGATAGTGCAAACAAGATAAAGGACCAGGCGCAATCAATAATCCAGAATGCTATTGCAAATGATGCAAATCTTAAGTGGACAAGGGTACAAAACGGAAAGCGCAAGGCTGAATATGGTCACGCTCTTGAGCTTATCGCCAACGAAACGGAGGCGCGAGTAACTCAAATCGAAGAGTTAAAGGCTTCAATTGATGACGATATAGTTTCAAGTATCAAAACCGTTCAGGAAGCGATTGCCAACGAATCAGAGACGCGAGCTACTCAAGTTCAGCAGCTTGATTCTAAATTTACGAAGGAAATAGACGGCGTAAAACGAGATACGGCTGCAAGTATTGGTGAGGTAAGGCAAACAATTGCCAATGAATCAGAAGCGCGCGCCCAGGCAGTTCAGCAGCTTGACGCTAAGTTTACGAAAGAGATAAACGACCTTGACGAAGTTATCAAGACCGAAGTCGAGGCTAACATCTCAGAAGTGAAACAGGCGATCGCCAATGAGACAGAGGCAAGGGTTCAGGCCGACCAGTCTTTAACAGCTAAATTCGGAGACGTTGAATCAGCACTAGCAGAAAAACTTGATTCGTGGGCTAACGTTGATTCTGTTGGCGCTAAGTACGCTATGAAATTGGGCCTTACTTACAAGGGGCAGAAGTACAGCGCAGGAATGATCATGCAGTTGTCGCAATCTTCTCAAGGCCTGATCTCGCAAATCCTGTTCGATGCTAACAGGTTCGCGATCATGACTAGCTCTACTGGCGGTACGTTTACATTGCCGTTTGTAGTAGAAAATAACCAGGTGTTTATCAACAGTTTGTTGGTGAAAAACGGGTCCATTACTAACGCAATGATTGGTAACGTGATTCAGTCAAATAACTTTGTTCAGAATCAGCAAGGATGGAGGCTTGATAAAAATGGTAACTTTGAGAACTACGGAACTAGAGCGGGAGAAGGAGCTACAAAATTCACTAACGAAGGCTTAAAGGTGAAAGATGCAAATGGACGATTGAGAGTTGAGATAGGGAGAATAACCGGAAGTTGGTAA

Genome Context

Tertiary structure

QGH77647.1
ESMFold structure
Source ESMFold
pLDDT 77.8
Oligomeric state monomer

Literature

Title Authors Date PMID Source
Isolation and Characterization of Lytic Phages From Belgium Mutuku,I.M., Jana,B., Kot,W., Moodley,A. and Butaye,P. 2023-09-20 GenBank