UniProt accession
A0A2Z5HRK3 [UniProt]
Protein name
L-shaped tail fiber
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,97
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MALKTKIIVQQIMNIDDTTTTASKYPKYTVVLGTSISSITASELTAAVEVSAASAAAAKDSEIAAKESEINAKDSENLSANYANSSEASATQSATSATEAKRQADLSKDSADASAISASQSAASATKAAESSAAAKTSETNALESSAAAKTSETNAKTSETNAKTSETNAAAYAAAAKTSETNAADSAASASDSKGFRDEAEAFAAQASTSALAAKNSETNTKTSEINSKASEDAAKLAQQGASDSANTATQAMTTIQGLKSDVEQLKADTQTIKEGAATEIRAAKTEAVGAIDTAKVNAIAEITPLKQAAEDAATLAGQKAVTATEQATAAAGSATTAGEQAVAASSSATRAETAANKAEQTLSISLLKDQNLADLSDKVQARINLSVDRLKQSADSSRIYDPTNRYNLVVMDTGSWGVYDDTSNVFKPLGVGQGGTGAWDAEGARNNIGALSKGGDTATAMIATKHSYPAGSTGQILGWSWRSIVEGYGIGTATADFYVNHTVGSVTYACIKPTRVTGESWEYLFSDFGEMSNIRSLIISRNTGAAGEASGNMELSYGAGQVTQYAARVDFYDGTARSVLDTYDATRLTTLLPSAGVICRRGIGGNYQANSYSFSWENPGVDVWIDSTRIGRVTLDPTSDIDYKEQVEPWDGKSALNNINQLELVTFIFKDDIKRRVRRGIIAQQAATIDPEYTHSSEDKEGNTILSLDTNVLLLDALAAIQVLSARVSKLESLLEDKPTTLPEDPAPNQDLP
Physico‐chemical
properties
protein length:755 AA
molecular weight: 78686,35350 Da
isoelectric point:4,71459
aromaticity:0,05033
hydropathy:-0,34570

Domains

View on InterPro
A0A2Z5HRK3
1 755 aa
CHP 641–737 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

A0A2Z5HRK3
1 755 aa
Domain Start End Length (AA) Confidence
N-terminal 1 140 140 0,9444
Central domain 141 396 257 0,1032
C-terminal 397 755 358 0,7976
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Coding sequence (CDS)

Genbank protein accession
AXC42470.1 [NCBI]
Genbank nucleotide accession
MH370386 [NCBI]
CDS location
range 20919 -> 23186
strand +
CDS
ATGGCACTTAAAACTAAAATTATTGTACAGCAGATTATGAACATAGATGACACTACAACTACTGCTAGTAAATATCCTAAGTACACAGTAGTTTTAGGTACTTCTATTAGTTCTATTACTGCTAGCGAACTAACAGCGGCTGTTGAGGTCTCTGCTGCTTCTGCTGCGGCAGCAAAAGATTCTGAAATTGCAGCAAAAGAATCTGAAATAAATGCTAAGGACTCTGAGAACCTATCTGCAAATTATGCTAACTCTTCAGAAGCTTCTGCAACTCAATCTGCTACTTCTGCTACTGAAGCGAAGAGACAAGCTGATTTATCTAAAGATAGTGCTGATGCCTCTGCTATTTCTGCTTCTCAATCTGCTGCGTCCGCTACTAAAGCTGCAGAATCATCAGCTGCAGCAAAAACTAGTGAAACTAACGCTCTAGAATCATCAGCTGCAGCAAAAACTAGTGAGACTAATGCAAAAACTAGTGAGACTAATGCAAAAACTAGTGAGACTAATGCAGCAGCATATGCAGCAGCAGCAAAAACTAGCGAGACTAATGCTGCTGATTCCGCTGCCTCTGCTTCTGACTCCAAAGGATTCAGGGATGAAGCAGAAGCATTCGCTGCACAAGCCTCCACATCAGCATTAGCAGCAAAAAACTCAGAAACTAATACAAAGACTAGCGAAATTAACTCAAAAGCTAGTGAAGACGCTGCTAAGCTAGCTCAGCAAGGTGCATCAGATAGCGCGAACACAGCTACGCAAGCGATGACCACAATACAGGGTCTTAAGTCCGATGTTGAACAGCTTAAAGCTGACACTCAGACCATTAAAGAAGGTGCGGCGACAGAGATTAGAGCAGCTAAGACGGAGGCAGTAGGAGCAATTGACACAGCTAAGGTAAACGCAATTGCTGAAATCACCCCATTAAAACAAGCTGCGGAAGACGCTGCCACCTTAGCTGGACAAAAGGCAGTAACGGCTACAGAACAGGCTACAGCCGCGGCTGGAAGCGCTACAACCGCAGGAGAACAAGCCGTAGCAGCATCCAGTTCCGCAACTCGAGCTGAGACCGCAGCAAACAAAGCTGAACAAACTTTGAGCATATCTTTGTTAAAGGATCAGAACCTTGCAGACTTAAGTGACAAGGTACAGGCTCGTATAAACCTAAGTGTGGATCGTCTTAAACAGAGCGCTGATAGTTCCCGCATTTATGACCCGACAAACCGCTACAACTTAGTTGTAATGGATACAGGGTCTTGGGGTGTCTATGACGATACTAGTAATGTATTTAAACCTTTAGGAGTAGGACAGGGTGGCACGGGAGCTTGGGACGCAGAGGGTGCTCGCAACAACATCGGGGCGCTGTCCAAAGGTGGTGACACTGCGACCGCCATGATTGCTACTAAACATTCTTATCCTGCTGGCTCAACAGGGCAAATTTTGGGTTGGTCGTGGCGATCAATTGTAGAAGGCTATGGTATCGGAACGGCGACTGCTGATTTTTACGTAAACCACACAGTAGGAAGCGTCACATATGCCTGCATTAAGCCTACTCGAGTGACCGGTGAAAGCTGGGAGTATCTTTTTAGCGACTTTGGCGAAATGTCTAATATTAGAAGCCTGATTATTAGTCGTAACACAGGTGCGGCAGGAGAGGCGTCCGGTAACATGGAGTTGAGTTATGGGGCGGGGCAGGTGACACAGTACGCCGCTCGTGTAGATTTTTACGATGGAACTGCACGATCTGTTTTAGATACCTACGATGCTACTCGCTTAACAACTCTCTTACCCTCGGCAGGTGTCATTTGCCGTAGAGGTATTGGTGGTAATTACCAAGCAAATAGCTATTCTTTCTCGTGGGAAAATCCGGGAGTAGATGTATGGATTGATAGTACTCGTATTGGGCGAGTAACGCTGGATCCTACAAGCGACATTGATTATAAAGAACAGGTTGAGCCATGGGACGGTAAGAGTGCGTTAAACAATATCAATCAGTTAGAGCTCGTGACGTTTATCTTCAAAGATGATATAAAGCGTCGGGTACGTCGTGGAATCATTGCACAGCAAGCGGCAACCATAGACCCTGAATACACACACTCAAGCGAGGATAAGGAGGGAAATACAATTCTTTCGCTTGATACTAACGTGCTGTTACTGGATGCGCTTGCTGCTATCCAGGTGCTAAGCGCCCGTGTCAGTAAATTAGAGTCTTTGTTAGAAGATAAACCAACCACTCTACCTGAAGACCCCGCTCCAAATCAAGACCTTCCTTAA

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail Cellular Component IEA:UniProtKB-KW (UniProt)

Tertiary structure

A0A2Z5HRK3
ESMFold structure
Source ESMFold
pLDDT 61.7
Oligomeric state monomer