Genbank accession
XRX03303.1 [GenBank]
Protein name
long tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,84
TF
Evidence RBPdetect2
Probability 0,89
Protein sequence
MAGMLINITDAGRAALVAGGNTGTAARRVVEIGLGAAPFAFDRGMQAMPNERKRVTTFGGENVAPDTVHVVIQDDTSDQYSLYAFGLYLDNGVLFAVYVQDTPILEKSPAAMMLLAGDVVFATIDAAKLEFGPATFLNPPATTERKGVIELATQAEVDAGEDDTRALTPKTAKRRYAALSGATFDGRVRVVADADERAAQLDVSPKTPGVGKTGKARLFGTFGDATLPDLSPRLVATLRAGFDAGAWGREYVDVCLNDGTNNDAASDAKQKRVARFTSGGRVLIGERVDDGKTALQVRGGVDASEGFTARAIDAGGAGGQFRAVCNGYGALIRNDGGCVYLLSTAKGAPDGTCNDYRPFSWSLTTGLVSVDGSGSGAVFGGAVNIARDLDVGRKANEAHIRLGPVDGYLYANPVSTGWWSPTGSSYQYIFADHTFRIDGRIVWHEGNLDPLDKGKGGTLAGDVSFAPGKRLVLAEGSPSAPSLAFANDGEPATGLYHAADGAFGVTCNGRAVARFSPLLAAFEQPVTVPTPPAADRSARAATTEWVRSVLSATTIGQIVFEPRTTVRPGFLKANGVLVNRADYPELWAYAQASGALVSDADWMKDRWGCFSTGDGATTFRLPELRGEFIRCWSDARGGVDAARQIGAFQGDQNHTHTHGAAASEAPDHIHTAWTDVQGWHGHHGWTNTVGDHQHISPWGENPQIYSPPWGTWGAANNRGAEGSDIDNVYGMTSPAGNHNHEFNTEGNGNHGHNVGIGGGGRHAHAITVQPDGGDESRPRNVALLALIRAY
Physico‐chemical
properties
protein length:790 AA
molecular weight: 83216,32680 Da
isoelectric point:5,67346
aromaticity:0,08354
hydropathy:-0,26291

Domains

Domains [InterPro]
DC_1182
STR
1–690
SSF88874
STR
545–789
IPR037053
ATT
552–631
XRX03303.1
1 790
Architecture
STR
ATT
STR
STR 1-551 | ATT 552-631 | STR 632-790
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
XRX03303.1
1 790
Domain Start End Length (AA) Confidence
N-terminal 1 178 178 0,9677
Central domain 179 377 200 0,2345
C-terminal 378 790 412 0,8609
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-178
Central
179-377
C-terminal
378-790

Taxonomy

  Name Taxonomy ID Lineage
Phage Burkholderia phage vB_CB1
[NCBI]
3416998 Viruses >
Host Burkholderia sp.
[NCBI]
36773 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Betaproteobacteria > Burkholderiales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XRX03303.1 [NCBI]
Genbank nucleotide accession
PQ789815.1 [NCBI]
CDS location
range 14566 -> 16938
strand +
CDS
ATGGCAGGAATGCTCATCAACATCACGGACGCCGGCCGCGCGGCACTCGTCGCCGGCGGCAACACAGGCACGGCCGCGCGCCGCGTCGTCGAAATCGGGCTCGGCGCCGCGCCGTTCGCGTTCGATCGCGGCATGCAGGCGATGCCGAACGAGCGCAAGCGCGTGACGACGTTCGGCGGCGAAAACGTCGCGCCGGATACGGTGCATGTCGTGATTCAGGACGACACGAGCGACCAGTATTCGCTGTACGCGTTCGGCCTGTACCTCGACAACGGCGTGCTGTTCGCCGTGTACGTGCAGGACACGCCGATTCTCGAAAAATCGCCTGCGGCGATGATGCTGCTCGCGGGCGACGTCGTGTTCGCGACGATCGACGCGGCGAAGCTCGAGTTCGGGCCGGCGACGTTCCTGAATCCGCCGGCGACGACCGAGCGCAAGGGCGTTATCGAGCTCGCCACGCAGGCCGAAGTCGATGCAGGCGAAGACGACACGCGCGCACTCACGCCGAAGACGGCGAAGCGGCGCTATGCGGCGCTGTCGGGCGCGACGTTCGACGGGCGCGTGCGTGTTGTCGCCGACGCTGACGAGCGCGCCGCGCAGCTCGACGTGTCGCCGAAGACGCCCGGCGTCGGCAAGACCGGCAAGGCGCGTCTGTTCGGCACGTTCGGCGACGCGACGCTGCCCGATCTGAGCCCGCGCCTGGTCGCGACGCTTCGCGCCGGATTCGATGCCGGCGCATGGGGCCGCGAGTACGTCGATGTCTGCCTGAACGACGGCACGAACAACGATGCGGCGAGCGATGCGAAGCAGAAGCGCGTCGCGCGCTTCACGTCGGGCGGCCGCGTGCTGATCGGCGAACGCGTGGACGATGGCAAGACCGCGCTGCAGGTGCGCGGCGGCGTCGACGCGTCGGAAGGCTTCACGGCGCGCGCGATCGACGCGGGCGGCGCCGGCGGCCAGTTCCGCGCGGTCTGCAACGGCTACGGCGCGCTCATCCGCAACGACGGCGGGTGCGTGTATCTGCTGTCGACGGCGAAGGGCGCACCGGACGGCACGTGCAACGACTATCGGCCGTTCTCGTGGTCGCTGACGACGGGGCTGGTGAGCGTCGACGGCAGCGGATCGGGCGCGGTCTTCGGCGGCGCCGTGAACATCGCCCGCGATCTCGACGTCGGCCGGAAGGCAAACGAAGCGCATATCAGGCTCGGTCCGGTCGACGGCTACCTCTACGCGAACCCGGTCAGCACCGGCTGGTGGTCGCCGACTGGTTCGTCCTATCAGTACATCTTCGCCGATCACACGTTCCGCATCGACGGGCGGATCGTATGGCACGAAGGCAACCTCGACCCGCTCGACAAGGGCAAGGGCGGCACGCTGGCCGGCGATGTGTCGTTCGCGCCAGGCAAGCGGCTCGTGCTCGCCGAAGGCAGTCCGTCCGCGCCGTCGCTCGCGTTCGCCAACGATGGCGAACCGGCTACCGGCCTCTATCACGCGGCGGACGGCGCGTTCGGTGTGACGTGCAACGGGCGCGCCGTCGCGCGCTTCTCGCCATTGCTCGCGGCCTTCGAGCAGCCCGTGACCGTGCCGACGCCGCCGGCGGCGGACCGGTCGGCGCGCGCCGCGACGACGGAATGGGTGCGCTCGGTCCTGTCGGCCACGACGATCGGCCAGATTGTCTTCGAGCCGAGAACGACCGTGCGGCCCGGCTTCCTGAAGGCGAACGGCGTGCTCGTGAACCGTGCCGACTATCCCGAGCTCTGGGCGTATGCGCAGGCGAGCGGCGCGCTTGTCTCCGATGCGGACTGGATGAAGGATCGGTGGGGCTGCTTCTCGACCGGCGACGGCGCGACGACGTTCCGCCTACCCGAGCTGCGCGGCGAGTTCATTCGATGCTGGTCCGATGCACGCGGCGGCGTCGACGCGGCGCGGCAAATCGGCGCATTCCAGGGCGACCAGAACCACACGCACACGCACGGCGCCGCAGCAAGCGAAGCGCCGGATCACATCCACACCGCGTGGACTGACGTGCAGGGCTGGCATGGCCACCACGGTTGGACGAACACTGTTGGCGATCACCAGCACATTTCGCCTTGGGGCGAAAACCCGCAGATATACAGCCCGCCTTGGGGCACGTGGGGCGCCGCCAACAACCGCGGCGCGGAGGGCAGCGACATCGACAACGTGTACGGGATGACGAGCCCGGCCGGCAACCACAACCACGAGTTCAACACCGAAGGAAACGGCAATCACGGGCACAACGTCGGTATCGGCGGCGGCGGCCGGCACGCGCACGCGATCACCGTTCAACCCGACGGCGGCGACGAATCCCGCCCGCGCAACGTCGCGCTGCTCGCGCTGATTCGCGCCTACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
58f0e40a2789ecda6444df7cf39f19fe44d097a748320beeca27829a643ce764
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7457
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50