Genbank accession
QHR73407.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,87
Protein sequence
MRIQHNIDNVFQMIEGHRKCLDNNTGITNTPTQNKLVNPDGLIAAGWAGNKINNDWAMHESQAVLILGYIEAYRASKSDFYLDRAKDAWEAYNSHLLGAYAVESKMQRLSHYPLSTDGVPSHGGFKDVVVSFTNGRGKIPAGSPTWGEYLDKAFQAYNGVLGRNTVDSDVYGGTDANPDWTTSGLSWWVEWYIAWDGNRYWANGSIADSGHTSVEFGTIQLQDKGVTGSHKVTYCVRLPQEQGGSVIPAGSPVIVNPINVVADNQAPELDGEEMWCDACYQLYTLTGEEKYYTAFKQSYNKLLNFSDINAYDKLFRKTTLIKAPFTDGTCLVAGTVPVMSRDSDGYINFRMNARQPCEIIQKGTTYKIGRDAEIVMNFGGEGVLFQPHFILNDGNNNKKQYRIGVPFGGAEVTEIVAKVTDFVEVVGKDGNPYLLPKEENIAVSGGASVSMQYATDIYNHSDNFTRCNIARGKVTFNFGREISLNSVTYRSDDEMVRVKFLDKDRWLWYADLYATNGQWITETLTLGDFKLDPTQPHHTEEEIKPFFANPKGLTSVDFSLSDRQIGNGLFDLYCVNTLPQFYIADRDAYLIWFSVWMASKTESTAKVGDCYVRNYKQGAYRHTPGVFPAGKVIDKENYLLEEKPNWPYPGMQYPAVYCMGAENIDRHRLSNTIGFLYDAQVWYNNTFKTEGPVASRYVWQRGNEGLTGWWEMVDNSKLQSRSFVAACRTIYELKKHKEPVDERLFLFCQKWAWFLNRFQSSHSGNLPTDFNTSGGYSYQDRGDRMWVVGEWLAGCCWLGLCGYSETIPQIDIVVEACMKLLQKHHFINGDNVLNGCWAVSDPVGYHSGEILRGLGLYAQYRGLYL
Physico‐chemical
properties
protein length:865 AA
molecular weight: 97666,56900 Da
isoelectric point:5,63896
aromaticity:0,12948
hydropathy:-0,39919

Domains

View on InterPro
QHR73407.1
1 865 aa
ATT 34–97 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

QHR73407.1
1 865 aa
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,1015
Central domain 11 265 256 0,9507
C-terminal 266 865 599 0,0826
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
QHR73407.1 [NCBI]
Genbank nucleotide accession
MN850627 [NCBI]
CDS location
range 98742 -> 101339
strand +
CDS
ATGCGTATTCAACACAACATTGATAACGTATTTCAGATGATTGAGGGGCACAGGAAGTGCCTCGATAACAACACAGGTATTACCAACACCCCTACACAAAATAAGCTAGTAAACCCTGATGGTCTTATAGCTGCAGGGTGGGCAGGTAATAAGATAAATAATGATTGGGCAATGCATGAGTCCCAAGCTGTGTTGATTCTGGGATACATTGAAGCATATAGGGCATCGAAAAGCGATTTCTATCTGGATAGGGCAAAAGATGCTTGGGAAGCTTACAATAGTCATCTTCTGGGTGCATATGCCGTAGAAAGTAAAATGCAAAGATTGTCCCACTACCCTCTATCCACAGATGGGGTTCCATCACACGGTGGGTTTAAAGATGTGGTGGTGTCTTTTACCAACGGTAGAGGAAAAATCCCAGCTGGATCTCCGACATGGGGGGAATACCTGGACAAGGCATTCCAAGCGTATAACGGTGTGTTAGGAAGAAACACTGTAGACTCTGATGTTTACGGTGGCACAGATGCTAACCCAGATTGGACAACATCCGGATTAAGTTGGTGGGTTGAGTGGTACATAGCATGGGATGGTAACCGCTATTGGGCTAACGGTAGCATAGCCGACAGTGGTCATACGTCTGTAGAGTTCGGCACAATCCAATTACAAGATAAGGGTGTTACAGGCAGCCACAAGGTGACCTATTGTGTAAGATTACCTCAAGAGCAGGGCGGATCGGTGATACCTGCTGGATCCCCTGTAATTGTAAATCCTATAAACGTGGTTGCAGATAATCAGGCCCCAGAGTTGGACGGGGAAGAAATGTGGTGTGACGCTTGTTATCAACTCTACACCTTGACAGGTGAGGAAAAGTATTACACCGCATTTAAACAGTCTTACAACAAGTTGCTTAATTTCAGTGATATAAACGCCTACGACAAGCTGTTTAGAAAGACCACGTTGATTAAGGCACCGTTTACAGACGGTACTTGTCTTGTTGCAGGAACAGTTCCTGTCATGAGCAGAGACAGTGACGGGTATATCAACTTTAGAATGAACGCACGCCAGCCTTGCGAAATAATTCAAAAAGGGACTACATACAAAATAGGCAGAGATGCTGAAATTGTTATGAATTTCGGAGGAGAAGGGGTTCTGTTCCAGCCACATTTCATCCTTAATGATGGCAACAATAACAAAAAACAGTACAGGATAGGGGTTCCTTTTGGAGGGGCAGAGGTAACAGAAATAGTTGCCAAGGTTACTGATTTCGTAGAGGTAGTTGGTAAAGACGGTAATCCGTACCTTTTACCTAAAGAGGAAAACATTGCCGTGAGCGGTGGTGCCAGTGTTAGTATGCAGTACGCTACTGACATTTACAACCACTCTGACAACTTCACCCGTTGTAATATAGCAAGAGGCAAAGTAACTTTCAATTTTGGAAGAGAGATTTCTTTAAACTCAGTAACTTATCGATCAGATGATGAGATGGTAAGGGTCAAATTTCTTGACAAAGACAGGTGGTTATGGTATGCTGATCTATATGCCACTAATGGACAATGGATAACAGAAACTCTTACTCTAGGAGATTTCAAACTAGATCCGACACAACCTCATCACACCGAAGAAGAGATTAAACCTTTCTTTGCTAATCCTAAGGGGCTCACCTCTGTAGATTTCAGCCTAAGCGACAGGCAGATAGGTAATGGCTTGTTTGATCTGTACTGCGTAAACACGCTGCCGCAGTTCTATATTGCAGATAGGGATGCTTATCTAATATGGTTTAGCGTATGGATGGCAAGCAAGACAGAATCAACCGCTAAGGTTGGTGATTGCTATGTCAGAAACTACAAGCAAGGTGCATACAGGCACACACCAGGTGTGTTCCCTGCCGGAAAGGTGATAGATAAGGAAAACTATCTCTTAGAGGAAAAGCCAAATTGGCCATACCCTGGCATGCAATACCCAGCAGTATACTGCATGGGTGCAGAAAACATAGATCGCCATAGATTGTCAAATACAATAGGATTTCTGTATGACGCTCAAGTCTGGTATAATAATACATTTAAGACAGAGGGTCCAGTTGCCTCGCGTTATGTATGGCAGAGAGGTAATGAAGGGCTGACTGGTTGGTGGGAGATGGTAGATAACTCTAAGCTCCAGTCAAGATCTTTTGTTGCTGCGTGCCGCACAATCTATGAGCTCAAGAAACATAAAGAACCTGTTGATGAAAGATTATTCCTGTTCTGCCAGAAATGGGCATGGTTCCTCAACAGATTCCAGTCATCTCATTCAGGAAATCTGCCTACAGACTTTAACACCTCTGGGGGATATTCTTATCAAGACAGAGGGGACAGGATGTGGGTGGTAGGTGAATGGCTTGCAGGTTGCTGCTGGCTAGGTCTGTGCGGGTATTCAGAGACAATACCTCAAATAGACATTGTTGTAGAAGCCTGTATGAAGCTACTGCAAAAGCACCACTTCATCAATGGGGATAATGTTTTAAACGGCTGCTGGGCAGTTTCTGATCCGGTGGGCTATCACTCTGGCGAAATCCTTCGCGGTCTGGGCCTTTATGCTCAATACCGTGGATTGTACCTTTAA

Genome Context

Tertiary structure

QHR73407.1
ESMFold structure
Source ESMFold
pLDDT 59.9
Oligomeric state monomer

Literature

Title Authors Date PMID Source
Exploring the Remarkable Diversity of Culturable Escherichia coli Phages in the Danish Wastewater Environment Olsen,N.S., Forero-Junco,L., Kot,W. and Hansen,L.H. 2020 GenBank