Genbank accession
QAX96801.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect2
Probability 0,80
Protein sequence
MSSKLIFTMQYFRGEKKREYLADPETGLIPGTGETGGGLEEYDRAKAWDKDDVVIRDGAIFQANEDIPAGSSFIEGLTGPTWLKLGAEAVATLPDYAVGIRYLKDQAIARGRVIYRAKNDFTSAAWNPAQWETVFSEDAVRDKVADFSTAQSYVTHELVAYNGDIYAANEDMAAGAFDASKWQNLTTPNVTYVDDYAPGTVYTKDAVVMRNGQLFRATQQTATAWDEGDWQAVSSVSTVRGYWDLNATYKANEIVIANDRFYRANGDIGSGTTFTVGTGTEEWTEVSGPVTPPITADDFKASTLYAKNQVIVVNGILYRAKAEFTSLSAFSIADWDPVSYDPETAVDAWQANTDYKAGQLATLVVNGRNTLYRARADITGGASFVAAEWDDLTQTSTWRAEWDISNIYAKDDVVWRAGHLYIANDAVPINTAWVEGITGATWARLLAEPVSKISVWAATTSYQKDELLAYDGNIYRVEAPFVSGAAFTSGGLELLTAGKYTKIPAFRGSNAYVQDETIAEGGILYRAKADFTSATNFNEADWEVIADIATNILYDFSITEDYVENEIVFHLDSIWRAKGDLAPGAWDESDWEFVSEYSTFKGQHVQARDYKQDDVVIHNNRLYRANSNIAGGTAFLAGSGGNQWTEVSPPGNPSAPDWASSTAYVQHNLVIHGGVLYRCKTDHTSGSSFLASQWETLKAEAATLASTYQNGQEYVLDQLIEYNGILYRCIEAMLSSPVTFEPDKWQGVSVSYIRISAHSASAVYREGELIWHVNRLYRAKAGINPGRAFSLNDWAISEDTNYVSEYAENEDYRLGQLIRSGVDLYIANEAITGAPAVLDETKWNRVGYRPTVQGAHDDTLAYKQGDIVLKEGTWYEANADIAANVALSEGTTGATWKRVTDSMTIKAHAADAIFFPGQLCLHEERVYTAKEFLTVDSSNTFQSIKWEKLGSFVTLAEDYQDNIEYKKDQIIFHLSVLYRRIATGTDTTWTAANWQALSLTAAILNDFDNTKNYAANEVITRGGATYRVKTPGIKASFVESDWDRIDERPTFRGEWAQANAYKAADFVVRDSILYTANDDVPATTAFSVGTTGATWKRQSPENIAVEHVGAEITVTEGQPVLHRRVLYFATQTVTIPDAFDETGLKKIGRFGMTEQTFEADTWFPAGWLMRQDDKVYKAKTDFTTATAFDVADWDLVSHDYAFIVEFSTADTYYKDQIVINSKLLYRANTDVTVAGAWDSAQWDQLAPAEAARIETHDETNEYLLDDIVFKDGMLWRANSAIAANTPFVKGTSANEWRAAIELKIKAFAEGENILTGEYRLIEGHLAVAKEDIASTAAYATETEKWSILSHDISKLKMVYPVREYLPAETNINYDGSIDGSVSDTWVVTGSLPTTELSVLLWLGYEFHLGGLDVTDKVVQKGSSAPYGFGISGVWIESGDVLEVR
Physico‐chemical
properties
protein length:1444 AA
molecular weight: 159850,15480 Da
isoelectric point:4,63507
aromaticity:0,11911
hydropathy:-0,28767

Domains

View on InterPro
QAX96801.1
1 1444 aa
STR 145–234 · STR 242–341 · CBM 346–392 · STR 399–490 · STR 603–698 · STR 705–795 · STR 804–899 · STR 957–1045 · CBM 1052–1098 · STR 1204–1299 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

QAX96801.1
1 1444 aa
Domain Start End Length (AA) Confidence
N-terminal 1 1322 1322 0,8778
Central domain 1323 1433 112 0,5402
C-terminal 1434 1444 10 0,0500
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Vibrio phage vB_VmeM-Yong MS31 [NCBI] · taxon 2500764
No lineage information

Coding sequence (CDS)

Genbank protein accession
QAX96801.1 [NCBI]
Genbank nucleotide accession
MK308676.1 [NCBI]
CDS location
range 179218 -> 183552
strand +
CDS
ATGAGTAGCAAATTAATCTTTACCATGCAATATTTCCGCGGGGAGAAAAAGCGCGAATATCTCGCTGACCCTGAAACCGGACTTATTCCTGGTACTGGTGAAACGGGCGGAGGTCTAGAAGAATACGACCGCGCCAAAGCATGGGATAAAGACGACGTCGTTATTCGCGACGGCGCTATCTTCCAAGCGAACGAAGACATTCCTGCGGGCTCTTCTTTCATTGAAGGCCTTACTGGCCCGACTTGGCTTAAGTTAGGCGCAGAAGCGGTAGCGACACTTCCCGACTACGCTGTAGGCATTCGTTATCTTAAAGACCAAGCGATTGCCCGTGGTCGAGTTATCTACCGTGCGAAGAATGACTTTACTTCTGCCGCGTGGAATCCAGCACAATGGGAAACGGTCTTCTCTGAAGACGCAGTTCGCGATAAGGTAGCGGACTTCAGTACTGCACAGTCTTACGTAACGCATGAATTAGTTGCCTACAACGGTGACATCTATGCGGCAAACGAAGATATGGCGGCAGGGGCCTTTGACGCATCTAAATGGCAGAATCTAACTACGCCTAACGTAACATACGTCGATGACTATGCCCCGGGCACAGTCTATACCAAAGACGCTGTAGTTATGCGCAATGGTCAATTGTTCCGTGCAACGCAACAAACTGCGACAGCATGGGATGAAGGCGATTGGCAAGCGGTTTCTTCAGTTTCTACTGTTCGTGGTTACTGGGACCTTAATGCAACCTATAAAGCAAACGAAATAGTTATCGCTAATGACCGCTTCTATCGTGCTAACGGCGATATCGGTTCAGGCACTACGTTCACTGTAGGTACTGGTACTGAAGAATGGACTGAGGTGTCGGGCCCTGTTACTCCGCCTATCACGGCAGATGACTTTAAAGCTTCTACGCTTTATGCTAAGAATCAGGTTATTGTAGTTAACGGCATCCTTTATCGGGCTAAGGCAGAGTTCACTTCACTCTCTGCATTTAGCATCGCGGATTGGGACCCTGTCTCTTATGACCCAGAAACGGCCGTAGACGCATGGCAAGCTAATACTGACTATAAAGCAGGGCAGTTAGCCACGTTAGTTGTAAATGGTCGTAACACGCTTTACAGAGCGCGTGCGGACATCACAGGCGGAGCGAGTTTCGTAGCGGCCGAATGGGACGACCTAACTCAAACGTCGACCTGGCGTGCTGAGTGGGATATCTCGAACATCTATGCTAAGGACGACGTTGTGTGGCGTGCAGGGCATCTGTACATTGCCAACGATGCGGTACCTATCAACACTGCATGGGTAGAAGGCATCACCGGCGCTACGTGGGCTCGTTTGTTGGCCGAACCAGTATCTAAGATTTCTGTTTGGGCAGCAACAACTTCTTATCAGAAAGATGAACTTCTAGCTTATGATGGCAACATCTATCGCGTGGAAGCCCCTTTCGTTTCTGGCGCGGCATTTACAAGCGGTGGCCTAGAACTGCTCACTGCCGGTAAGTATACTAAGATTCCTGCATTCCGTGGCTCTAACGCTTACGTCCAAGACGAGACGATTGCCGAAGGCGGAATCCTTTACCGCGCTAAAGCGGACTTTACGTCAGCTACAAACTTCAATGAAGCAGACTGGGAAGTTATCGCGGACATCGCGACAAACATCCTGTACGACTTCAGTATCACAGAAGACTATGTAGAAAATGAAATCGTTTTCCACTTAGACTCCATCTGGCGTGCTAAAGGCGACCTTGCTCCTGGGGCTTGGGACGAGTCCGATTGGGAATTCGTTTCTGAATACTCAACCTTTAAAGGTCAGCACGTCCAGGCACGTGACTACAAGCAAGACGACGTGGTAATCCACAACAACCGATTATATCGAGCAAACTCGAATATTGCGGGCGGCACTGCTTTCTTGGCAGGTAGCGGAGGTAACCAATGGACTGAGGTTTCCCCTCCAGGAAATCCTTCTGCTCCGGACTGGGCAAGCTCAACGGCATATGTTCAGCACAACTTGGTCATTCACGGTGGCGTCCTTTACCGCTGTAAGACAGACCATACTTCAGGAAGTAGCTTCCTCGCTTCTCAGTGGGAGACATTGAAAGCAGAAGCGGCTACCTTGGCTTCGACTTACCAGAATGGCCAAGAATATGTCCTTGACCAACTGATTGAATACAACGGCATTCTTTACCGCTGTATCGAAGCGATGCTAAGTTCACCTGTAACGTTCGAGCCAGACAAATGGCAAGGCGTATCGGTTTCTTATATCCGCATCTCTGCGCATAGTGCTTCAGCCGTTTATCGTGAAGGCGAATTGATTTGGCACGTGAATCGCTTGTATCGTGCTAAGGCAGGCATTAACCCAGGTCGTGCTTTCTCGCTTAACGATTGGGCTATCTCTGAAGATACCAACTACGTGAGCGAATATGCGGAGAATGAAGACTACCGCCTAGGCCAACTTATCCGTTCAGGTGTTGACTTATATATCGCAAACGAAGCTATTACCGGTGCTCCTGCTGTACTTGACGAAACTAAGTGGAACCGCGTAGGTTACCGCCCAACCGTTCAAGGCGCGCATGACGATACTCTTGCGTATAAGCAAGGGGACATTGTTCTTAAGGAAGGTACTTGGTACGAAGCTAACGCGGACATCGCGGCTAACGTGGCCCTGTCTGAAGGTACTACTGGCGCCACATGGAAACGTGTAACCGACAGCATGACCATCAAGGCACATGCGGCAGACGCAATCTTCTTCCCTGGTCAGCTCTGTCTACATGAAGAGCGCGTTTACACAGCGAAGGAATTCCTAACGGTAGACAGCTCTAATACGTTCCAGTCTATCAAGTGGGAGAAGCTAGGTTCGTTTGTTACTCTAGCGGAAGACTACCAAGACAACATCGAATACAAGAAAGACCAAATCATCTTCCATCTGAGCGTTCTTTACCGCCGCATTGCGACGGGCACCGATACGACTTGGACGGCAGCAAATTGGCAAGCTCTTTCTTTAACCGCGGCGATTCTAAACGACTTCGACAACACGAAGAACTATGCTGCCAACGAGGTCATCACTCGCGGAGGTGCTACCTACCGTGTTAAGACTCCTGGCATTAAAGCTAGCTTCGTAGAAAGCGATTGGGACCGCATCGATGAGCGTCCTACGTTCCGTGGCGAATGGGCTCAGGCTAATGCTTATAAAGCAGCTGACTTCGTAGTTCGCGACAGTATCCTTTACACTGCCAACGATGATGTTCCGGCAACAACTGCATTCTCAGTCGGCACTACCGGCGCTACATGGAAACGTCAGTCGCCAGAGAACATTGCAGTTGAGCACGTAGGTGCTGAGATTACGGTAACTGAGGGACAACCTGTTCTTCATCGCCGTGTTCTTTACTTCGCAACACAGACGGTAACCATTCCTGATGCGTTCGATGAAACCGGACTTAAGAAGATTGGTCGCTTCGGCATGACCGAGCAAACCTTCGAAGCCGATACTTGGTTCCCTGCTGGTTGGTTAATGCGTCAAGACGATAAAGTCTACAAAGCGAAAACGGACTTTACCACTGCGACGGCCTTTGATGTTGCTGACTGGGATTTAGTTTCTCATGACTACGCGTTCATTGTAGAGTTCAGCACAGCTGACACTTACTACAAAGACCAAATCGTTATCAACTCTAAACTACTTTACCGTGCTAATACAGACGTTACTGTAGCGGGTGCTTGGGACTCTGCCCAATGGGACCAATTGGCTCCAGCTGAAGCAGCTCGTATCGAGACTCATGACGAGACTAACGAATACCTACTTGACGACATTGTCTTCAAAGACGGTATGTTGTGGCGTGCTAACTCTGCCATCGCAGCGAACACTCCGTTCGTTAAGGGCACGAGTGCCAACGAATGGCGTGCAGCCATTGAACTTAAGATTAAAGCGTTTGCTGAAGGTGAGAACATTCTTACCGGAGAGTACCGCTTAATCGAAGGTCACTTAGCCGTAGCAAAAGAAGACATTGCATCTACTGCTGCGTACGCTACGGAAACTGAGAAGTGGAGTATTCTGTCTCACGACATCTCCAAACTTAAGATGGTTTACCCAGTCCGCGAATACTTACCGGCCGAGACAAACATCAACTATGATGGTTCAATCGACGGTTCAGTATCCGACACTTGGGTAGTCACGGGTAGCTTACCGACTACAGAACTTTCAGTGTTACTATGGTTAGGTTACGAGTTCCACTTAGGTGGCCTAGACGTAACTGACAAAGTAGTACAGAAGGGTTCGTCAGCACCGTATGGTTTCGGCATTAGCGGTGTCTGGATTGAGTCAGGCGACGTCCTAGAGGTACGATAA

Genome Context

Tertiary structure

QAX96801.1
ESMFold structure
Source ESMFold
pLDDT 64.1
Oligomeric state monomer