GenBank accession
QAX96801.1 [GenBank]
Protein name
tail fiber protein
RBP type
Prediction Evidence Probability
TF Phold 1.00
TSP DepoScope 1.00
TF RBPdetect2 0.70
Protein sequence
MSSKLIFTMQYFRGEKKREYLADPETGLIPGTGETGGGLEEYDRAKAWDKDDVVIRDGAIFQANEDIPAGSSFIEGLTGPTWLKLGAEAVATLPDYAVGIRYLKDQAIARGRVIYRAKNDFTSAAWNPAQWETVFSEDAVRDKVADFSTAQSYVTHELVAYNGDIYAANEDMAAGAFDASKWQNLTTPNVTYVDDYAPGTVYTKDAVVMRNGQLFRATQQTATAWDEGDWQAVSSVSTVRGYWDLNATYKANEIVIANDRFYRANGDIGSGTTFTVGTGTEEWTEVSGPVTPPITADDFKASTLYAKNQVIVVNGILYRAKAEFTSLSAFSIADWDPVSYDPETAVDAWQANTDYKAGQLATLVVNGRNTLYRARADITGGASFVAAEWDDLTQTSTWRAEWDISNIYAKDDVVWRAGHLYIANDAVPINTAWVEGITGATWARLLAEPVSKISVWAATTSYQKDELLAYDGNIYRVEAPFVSGAAFTSGGLELLTAGKYTKIPAFRGSNAYVQDETIAEGGILYRAKADFTSATNFNEADWEVIADIATNILYDFSITEDYVENEIVFHLDSIWRAKGDLAPGAWDESDWEFVSEYSTFKGQHVQARDYKQDDVVIHNNRLYRANSNIAGGTAFLAGSGGNQWTEVSPPGNPSAPDWASSTAYVQHNLVIHGGVLYRCKTDHTSGSSFLASQWETLKAEAATLASTYQNGQEYVLDQLIEYNGILYRCIEAMLSSPVTFEPDKWQGVSVSYIRISAHSASAVYREGELIWHVNRLYRAKAGINPGRAFSLNDWAISEDTNYVSEYAENEDYRLGQLIRSGVDLYIANEAITGAPAVLDETKWNRVGYRPTVQGAHDDTLAYKQGDIVLKEGTWYEANADIAANVALSEGTTGATWKRVTDSMTIKAHAADAIFFPGQLCLHEERVYTAKEFLTVDSSNTFQSIKWEKLGSFVTLAEDYQDNIEYKKDQIIFHLSVLYRRIATGTDTTWTAANWQALSLTAAILNDFDNTKNYAANEVITRGGATYRVKTPGIKASFVESDWDRIDERPTFRGEWAQANAYKAADFVVRDSILYTANDDVPATTAFSVGTTGATWKRQSPENIAVEHVGAEITVTEGQPVLHRRVLYFATQTVTIPDAFDETGLKKIGRFGMTEQTFEADTWFPAGWLMRQDDKVYKAKTDFTTATAFDVADWDLVSHDYAFIVEFSTADTYYKDQIVINSKLLYRANTDVTVAGAWDSAQWDQLAPAEAARIETHDETNEYLLDDIVFKDGMLWRANSAIAANTPFVKGTSANEWRAAIELKIKAFAEGENILTGEYRLIEGHLAVAKEDIASTAAYATETEKWSILSHDISKLKMVYPVREYLPAETNINYDGSIDGSVSDTWVVTGSLPTTELSVLLWLGYEFHLGGLDVTDKVVQKGSSAPYGFGISGVWIESGDVLEVR
Physico-chemical properties
protein length: 1444 AA
molecular weight: 159850.15480 Da
isoelectric point: 4.63507
aromaticity: 0.11911
hydropathy: -0.28767
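The record does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), both can be recomputed from the protein sequence. The function names and the choice of scale are assumptions made for illustration, not something the page confirms.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KD[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of QAX96801.1, used only as a short demo input.
    fragment = "MSSKLIFTMQYFRGEKKREY"
    print(round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Passing the full 1444-residue sequence instead of the fragment gives values that can be compared with the ones listed in the record.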

Domains

Domains [InterPro]
IPR003610 CBM 193–233 (QAX96801.1, residues 1–1444)
Architecture
ATT 22-477 | STR 478-490 | ATT 499-602 | STR 603-698 | STR 705-795 | STR 804-843 | ATT 844-920 | STR 957-1044 | ATT 1045-1118 | ATT 1200-1362 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
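The architecture line can be read programmatically. The sketch below parses the `LABEL start-end` segments and lists the residue ranges that no segment covers. It assumes the coordinates are 1-based and inclusive and that uncovered residues correspond to the "Unmapped" legend entry. Neither point is stated on the page, and the helper names are hypothetical.

```python
import re

# Architecture string copied from the record.
ARCHITECTURE = (
    "ATT 22-477 | STR 478-490 | ATT 499-602 | STR 603-698 | "
    "STR 705-795 | STR 804-843 | ATT 844-920 | STR 957-1044 | "
    "ATT 1045-1118 | ATT 1200-1362 |"
)
PROTEIN_LENGTH = 1444


def parse_architecture(text):
    """Return (label, start, end) tuples; coordinates assumed 1-based inclusive."""
    return [
        (label, int(start), int(end))
        for label, start, end in re.findall(r"([A-Z]{3}) (\d+)-(\d+)", text)
    ]


def unmapped_gaps(segments, length):
    """Return (start, end) ranges of residues not covered by any segment."""
    gaps, cursor = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
gaps = unmapped_gaps(segments, PROTEIN_LENGTH)
print(len(segments), "segments;", "uncovered ranges:", gaps)
```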

Tail Spike Domain Segmentation


This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain Layout (QAX96801.1, residues 1–1444)
Domain Start End Length (AA) Confidence
N-terminal 1 1322 1322 0.8778
Central domain 1323 1433 112 0.5402
C-terminal 1434 1444 10 0.0500
Legend: N-terminal Central domain C-terminal
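A quick consistency check on the segmentation table, assuming Start and End are 1-based and inclusive (the page does not state the convention). Under that reading, the reported lengths of the central and C-terminal rows each differ by one residue from the coordinate span, while the three reported lengths still sum to 1444. The sketch only flags the disagreement and does not decide which column is correct.

```python
# Rows copied from the segmentation table: (name, start, end, reported_length).
SEGMENTS = [
    ("N-terminal", 1, 1322, 1322),
    ("Central domain", 1323, 1433, 112),
    ("C-terminal", 1434, 1444, 10),
]


def inclusive_length(start, end):
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1


def length_mismatches(segments):
    """Return (name, reported, computed) for rows whose length disagrees."""
    mismatches = []
    for name, start, end, reported in segments:
        computed = inclusive_length(start, end)
        if computed != reported:
            mismatches.append((name, reported, computed))
    return mismatches


mismatches = length_mismatches(SEGMENTS)
reported_total = sum(row[3] for row in SEGMENTS)
print(mismatches, reported_total)
```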
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-1322
Central 1323-1433
C-terminal 1434-1444

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage vB_VmeM-Yong MS31 [NCBI] 2500764 No lineage information
Host Vibrio mediterranei [NCBI] 689 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

GenBank protein accession
QAX96801.1 [NCBI]
GenBank nucleotide accession
MK308676.1 [NCBI]
CDS location
range 179218 -> 183552
strand +
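The CDS range can be checked against the protein length. Assuming the range is 1-based and inclusive and that the CDS includes the stop codon (both are assumptions about the page's conventions), the span should equal three times the number of residues plus one stop codon. The function names are illustrative.

```python
# Values copied from the record.
CDS_START, CDS_END = 179218, 183552
PROTEIN_LENGTH = 1444


def cds_length(start, end):
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def expected_cds_length(n_residues, with_stop=True):
    """Nucleotides needed to encode n_residues, optionally plus a stop codon."""
    return 3 * (n_residues + (1 if with_stop else 0))


observed = cds_length(CDS_START, CDS_END)
expected = expected_cds_length(PROTEIN_LENGTH)
print(observed, expected, observed == expected)
```

Under these assumptions the two numbers agree, which fits the listed sequence ending in a TAA stop codon.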
CDS
ATGAGTAGCAAATTAATCTTTACCATGCAATATTTCCGCGGGGAGAAAAAGCGCGAATATCTCGCTGACCCTGAAACCGGACTTATTCCTGGTACTGGTGAAACGGGCGGAGGTCTAGAAGAATACGACCGCGCCAAAGCATGGGATAAAGACGACGTCGTTATTCGCGACGGCGCTATCTTCCAAGCGAACGAAGACATTCCTGCGGGCTCTTCTTTCATTGAAGGCCTTACTGGCCCGACTTGGCTTAAGTTAGGCGCAGAAGCGGTAGCGACACTTCCCGACTACGCTGTAGGCATTCGTTATCTTAAAGACCAAGCGATTGCCCGTGGTCGAGTTATCTACCGTGCGAAGAATGACTTTACTTCTGCCGCGTGGAATCCAGCACAATGGGAAACGGTCTTCTCTGAAGACGCAGTTCGCGATAAGGTAGCGGACTTCAGTACTGCACAGTCTTACGTAACGCATGAATTAGTTGCCTACAACGGTGACATCTATGCGGCAAACGAAGATATGGCGGCAGGGGCCTTTGACGCATCTAAATGGCAGAATCTAACTACGCCTAACGTAACATACGTCGATGACTATGCCCCGGGCACAGTCTATACCAAAGACGCTGTAGTTATGCGCAATGGTCAATTGTTCCGTGCAACGCAACAAACTGCGACAGCATGGGATGAAGGCGATTGGCAAGCGGTTTCTTCAGTTTCTACTGTTCGTGGTTACTGGGACCTTAATGCAACCTATAAAGCAAACGAAATAGTTATCGCTAATGACCGCTTCTATCGTGCTAACGGCGATATCGGTTCAGGCACTACGTTCACTGTAGGTACTGGTACTGAAGAATGGACTGAGGTGTCGGGCCCTGTTACTCCGCCTATCACGGCAGATGACTTTAAAGCTTCTACGCTTTATGCTAAGAATCAGGTTATTGTAGTTAACGGCATCCTTTATCGGGCTAAGGCAGAGTTCACTTCACTCTCTGCATTTAGCATCGCGGATTGGGACCCTGTCTCTTATGACCCAGAAACGGCCGTAGACGCATGGCAAGCTAATACTGACTATAAAGCAGGGCAGTTAGCCACGTTAGTTGTAAATGGTCGTAACACGCTTTACAGAGCGCGTGCGGACATCACAGGCGGAGCGAGTTTCGTAGCGGCCGAATGGGACGACCTAACTCAAACGTCGACCTGGCGTGCTGAGTGGGATATCTCGAACATCTATGCTAAGGACGACGTTGTGTGGCGTGCAGGGCATCTGTACATTGCCAACGATGCGGTACCTATCAACACTGCATGGGTAGAAGGCATCACCGGCGCTACGTGGGCTCGTTTGTTGGCCGAACCAGTATCTAAGATTTCTGTTTGGGCAGCAACAACTTCTTATCAGAAAGATGAACTTCTAGCTTATGATGGCAACATCTATCGCGTGGAAGCCCCTTTCGTTTCTGGCGCGGCATTTACAAGCGGTGGCCTAGAACTGCTCACTGCCGGTAAGTATACTAAGATTCCTGCATTCCGTGGCTCTAACGCTTACGTCCAAGACGAGACGATTGCCGAAGGCGGAATCCTTTACCGCGCTAAAGCGGACTTTACGTCAGCTACAAACTTCAATGAAGCAGACTGGGAAGTTATCGCGGACATCGCGACAAACATCCTGTACGACTTCAGTATCACAGAAGACTATGTAGAAAATGAAATCGTTTTCCACTTAGACTCCATCTGGCGTGCTAAAGGCGACCTTGCTCCTGGGGCTTGGGACGAGTCCGATTGGGAATTCGTTTCTGAATACTCAACCTTTAAAGGTCAGCACGTCCAGGCACGTGACTACAAGCAAGACGACGTGGTAATCCACAACAACCGATTATATCGAGCAAACTCGAATATTGCGGGCGGCACTGCTTTCTTGGCAGGTAGCGGAGGTAACCAATGGACTGAGGTTTCCCCTCCAGGAAATCCTTCTGCTCCGGACTGGGCAAGCTCAACGGCATATGTTCAGCA
CAACTTGGTCATTCACGGTGGCGTCCTTTACCGCTGTAAGACAGACCATACTTCAGGAAGTAGCTTCCTCGCTTCTCAGTGGGAGACATTGAAAGCAGAAGCGGCTACCTTGGCTTCGACTTACCAGAATGGCCAAGAATATGTCCTTGACCAACTGATTGAATACAACGGCATTCTTTACCGCTGTATCGAAGCGATGCTAAGTTCACCTGTAACGTTCGAGCCAGACAAATGGCAAGGCGTATCGGTTTCTTATATCCGCATCTCTGCGCATAGTGCTTCAGCCGTTTATCGTGAAGGCGAATTGATTTGGCACGTGAATCGCTTGTATCGTGCTAAGGCAGGCATTAACCCAGGTCGTGCTTTCTCGCTTAACGATTGGGCTATCTCTGAAGATACCAACTACGTGAGCGAATATGCGGAGAATGAAGACTACCGCCTAGGCCAACTTATCCGTTCAGGTGTTGACTTATATATCGCAAACGAAGCTATTACCGGTGCTCCTGCTGTACTTGACGAAACTAAGTGGAACCGCGTAGGTTACCGCCCAACCGTTCAAGGCGCGCATGACGATACTCTTGCGTATAAGCAAGGGGACATTGTTCTTAAGGAAGGTACTTGGTACGAAGCTAACGCGGACATCGCGGCTAACGTGGCCCTGTCTGAAGGTACTACTGGCGCCACATGGAAACGTGTAACCGACAGCATGACCATCAAGGCACATGCGGCAGACGCAATCTTCTTCCCTGGTCAGCTCTGTCTACATGAAGAGCGCGTTTACACAGCGAAGGAATTCCTAACGGTAGACAGCTCTAATACGTTCCAGTCTATCAAGTGGGAGAAGCTAGGTTCGTTTGTTACTCTAGCGGAAGACTACCAAGACAACATCGAATACAAGAAAGACCAAATCATCTTCCATCTGAGCGTTCTTTACCGCCGCATTGCGACGGGCACCGATACGACTTGGACGGCAGCAAATTGGCAAGCTCTTTCTTTAACCGCGGCGATTCTAAACGACTTCGACAACACGAAGAACTATGCTGCCAACGAGGTCATCACTCGCGGAGGTGCTACCTACCGTGTTAAGACTCCTGGCATTAAAGCTAGCTTCGTAGAAAGCGATTGGGACCGCATCGATGAGCGTCCTACGTTCCGTGGCGAATGGGCTCAGGCTAATGCTTATAAAGCAGCTGACTTCGTAGTTCGCGACAGTATCCTTTACACTGCCAACGATGATGTTCCGGCAACAACTGCATTCTCAGTCGGCACTACCGGCGCTACATGGAAACGTCAGTCGCCAGAGAACATTGCAGTTGAGCACGTAGGTGCTGAGATTACGGTAACTGAGGGACAACCTGTTCTTCATCGCCGTGTTCTTTACTTCGCAACACAGACGGTAACCATTCCTGATGCGTTCGATGAAACCGGACTTAAGAAGATTGGTCGCTTCGGCATGACCGAGCAAACCTTCGAAGCCGATACTTGGTTCCCTGCTGGTTGGTTAATGCGTCAAGACGATAAAGTCTACAAAGCGAAAACGGACTTTACCACTGCGACGGCCTTTGATGTTGCTGACTGGGATTTAGTTTCTCATGACTACGCGTTCATTGTAGAGTTCAGCACAGCTGACACTTACTACAAAGACCAAATCGTTATCAACTCTAAACTACTTTACCGTGCTAATACAGACGTTACTGTAGCGGGTGCTTGGGACTCTGCCCAATGGGACCAATTGGCTCCAGCTGAAGCAGCTCGTATCGAGACTCATGACGAGACTAACGAATACCTACTTGACGACATTGTCTTCAAAGACGGTATGTTGTGGCGTGCTAACTCTGCCATCGCAGCGAACACTCCGTTCGTTAAGGGCACGAGTGCCAACGAATGGCGTGCAGCCATTGAACTTAAGATTAAAGCGTTTGCTGAAGGTGAGAACATTCTTACCGGAGAGTACCGCTTAATCGAAGGTCACTTAGCCGTAGCAAAAGAAGACATTGCAT
CTACTGCTGCGTACGCTACGGAAACTGAGAAGTGGAGTATTCTGTCTCACGACATCTCCAAACTTAAGATGGTTTACCCAGTCCGCGAATACTTACCGGCCGAGACAAACATCAACTATGATGGTTCAATCGACGGTTCAGTATCCGACACTTGGGTAGTCACGGGTAGCTTACCGACTACAGAACTTTCAGTGTTACTATGGTTAGGTTACGAGTTCCACTTAGGTGGCCTAGACGTAACTGACAAAGTAGTACAGAAGGGTTCGTCAGCACCGTATGGTTTCGGCATTAGCGGTGTCTGGATTGAGTCAGGCGACGTCCTAGAGGTACGATAA

Genome Context


Tertiary structure

PDB ID
6aebcd18c7059b4314febe4963d874eca7948bb16fc3adfa31566d58f8e8f171
Source ESMFold
Method ESMFold
Resolution 0.6411
Oligomeric State monomer
Model Confidence
Very high pLDDT > 90
High 90 > pLDDT > 70
Low 70 > pLDDT > 50
Very low pLDDT < 50
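The confidence legend maps a per-residue pLDDT score to one of four classes. A minimal sketch of that mapping follows. The legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower class here is an assumption, and `plddt_class` is a hypothetical helper name.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100) to the legend's confidence class.

    Boundary values (exactly 90, 70, 50) fall into the lower class;
    this is an assumption, since the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 42.0):
        print(value, plddt_class(value))
```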