Genbank accession
QZA69124.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,59
Protein sequence
MSIATGQITLTDYNDAITLTGFITANSPRMQQYNPDNETFNPDWTKNPLILTASLYIMGDTNNIISSTDVQKVEWFDAANPGTPLTTGGPYSVNRETLTIKQNILSTKPAVDFICQVTYRDSTTMLDIITKMSISLSRVTNGSGITSASVWMPNGNIFKNNQIATLIAECDLWRGGAVDNTNVAYQWYNQDPSITTDQGGGLGWRKLTDSTNSGETGYTTRQLTVPASAVPSFEVYKCVVTDLDTASNTYNENFQGTAAFIDQSDPIQLSITSTGGDVFKNGVGSSDLTAHMFRAGDEIDEDGTEYTYKWYKYDKDGVLVTGFGGTNNYRTGKTITVNSDDVDTKATFVCETSAAVGQFTVYDITDVIVSNTEPANPVNGTIWLNTSGQPPYRFYMYKDGVWETTDYDSLESLDPEAYDKVRDAYNAITDLDMDNRLTRYERSVVRGELANIIGTYLNSTDAMPTIEEVDASGVGALYSIRQQAKDIGVSTSNTNYVRLGTAYTALRTYLSGLAIKPWDVDSDGTLDIVSADWDNAWKDYYLAYNFLGITVTNRQKEYSDLVGEGAVQDAIKAVSNSAQFKEAPIANPMNINAPIASLGLPSFQGRHVDVWSMSPDAQTNWALGGNRIKPITNPTFSSAGSVEIIGKFYGDGTNNDEFSWDKEGRPVKATRWVDVSLDNSFTWVFGVDGTGYKQVTVPAFSTIAPADMSVTAVKHDGSILTTIGDTLTAADQVMLKNADSILNITVADTDSGWGETYTPEPAEILAYFNGWKMCNGTFGQPYDGTGMKVWYPIGDPDLSRANNSNGTANPVPTDVSPTIKEQSINLYQIVYRNDTAVQEIIQFDGILSLLAGDNEVQITYPVDTPEITVGTIKYATNLATVTDNLKYWIPTLQQRVSKAEEVITDDAIINTVTQSTQYQLTLASKADTDALADYATNSDLDDLSGDIDGRINNAINAIDFSPYATKSELEQKATDITAKFQAAGGMNLIKNSIGFADLDFWGLFTAYPVETISTNELDILGFGSGFLFNPDGNNKGITQDINVIPGQPYTLSWYLNKRTSGPDSSYRFWIQIQENDVTTLQIADNSAKTTVGYESSYMTYTPQTDVIRVRFIGYANVDATLTGIMLTIGDVPLKWSLSTGEVYNTNIRMNINGIRVSQLDANRQEIGFTQITPEEFAGYYDTEGNGSFQKVFYLNGDETVTRKLRATDSIVMQDIQIINVDTADRKGWAYIPNIDTE
Physico‐chemical
properties
protein length:1237 AA
molecular weight: 135753,68470 Da
isoelectric point:4,29648
aromaticity:0,10105
hydropathy:-0,32264

Domains

Domains [InterPro]
DC_0835
STR
121–1224
G3DSA:2.60.120.260
STR
986–1150
QZA69124.1
1 1237
Architecture
STR
STR 121-1224 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QZA69124.1
1 1237
Domain Start End Length (AA) Confidence
N-terminal 1 643 643 0,8179
Central domain 644 988 346 0,0867
C-terminal 989 1237 248 0,0606
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-643
Central
644-988
C-terminal
989-1237

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage 010DV004
[NCBI]
2869562 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QZA69124.1 [NCBI]
Genbank nucleotide accession
MZ501260.1 [NCBI]
CDS location
range 99611 -> 103324
strand -
CDS
ATGTCTATTGCAACAGGACAGATTACATTAACGGATTACAATGACGCTATCACACTCACAGGATTCATCACAGCTAACAGCCCAAGAATGCAGCAGTACAACCCAGACAACGAAACGTTCAATCCAGACTGGACTAAGAATCCGTTAATTTTAACAGCATCTCTTTACATCATGGGAGATACAAATAACATCATCTCAAGCACAGATGTTCAAAAAGTTGAGTGGTTTGATGCGGCTAATCCGGGGACGCCCCTTACAACAGGAGGACCCTATAGTGTAAACCGTGAAACCCTCACTATTAAACAAAACATCCTTTCAACAAAGCCCGCTGTAGATTTCATATGTCAAGTTACATATCGAGACAGCACAACTATGCTGGATATCATCACAAAAATGAGCATCTCACTGAGCCGGGTAACTAACGGTAGTGGAATCACATCTGCTTCTGTGTGGATGCCCAATGGTAACATCTTCAAGAACAACCAGATCGCTACCCTTATTGCCGAATGTGATTTATGGAGAGGTGGGGCAGTAGACAATACAAACGTAGCTTATCAGTGGTACAACCAAGACCCGTCTATCACCACAGATCAGGGTGGCGGTCTAGGCTGGAGAAAGCTTACAGACAGTACCAACAGCGGGGAGACTGGATACACTACGAGACAGCTTACTGTCCCAGCAAGTGCCGTCCCTAGTTTTGAAGTATACAAGTGTGTTGTAACAGACCTAGACACTGCAAGTAATACATACAATGAGAACTTCCAAGGTACAGCTGCCTTCATTGACCAGTCCGACCCTATTCAGTTGTCTATTACATCCACAGGGGGAGATGTATTCAAAAACGGGGTAGGTTCCTCAGACCTGACGGCTCATATGTTCCGGGCAGGGGACGAGATAGACGAAGACGGTACAGAGTATACATACAAATGGTATAAGTACGATAAAGACGGGGTTCTTGTTACAGGGTTTGGCGGTACGAACAATTATAGGACAGGGAAAACGATTACAGTTAACAGCGATGACGTAGACACTAAGGCAACCTTCGTATGTGAAACATCCGCAGCGGTAGGTCAGTTCACTGTATATGACATCACAGATGTGATTGTCAGCAACACTGAGCCTGCTAATCCAGTTAACGGGACTATCTGGCTTAACACCTCCGGGCAGCCTCCGTACCGTTTCTATATGTACAAAGACGGTGTATGGGAAACGACAGATTATGACAGTCTAGAATCCCTAGACCCAGAGGCATATGACAAAGTACGAGATGCCTACAATGCAATCACCGATCTTGATATGGATAACCGCCTAACAAGATATGAGCGCAGCGTAGTCCGGGGAGAGCTCGCTAACATCATCGGCACGTACCTAAACAGTACAGACGCTATGCCTACTATTGAGGAGGTAGACGCAAGCGGTGTAGGTGCTCTGTACTCTATCAGGCAGCAGGCTAAAGATATCGGGGTTTCGACTTCAAACACTAACTATGTACGATTAGGAACAGCATACACAGCACTTCGCACATATCTATCAGGATTAGCCATAAAACCGTGGGATGTCGATTCAGACGGAACTTTAGACATTGTGAGTGCGGATTGGGATAACGCATGGAAAGACTACTATTTAGCGTATAACTTCCTCGGAATTACTGTTACAAACCGCCAAAAGGAGTATTCTGATCTTGTAGGGGAGGGTGCTGTGCAAGACGCAATTAAGGCAGTCAGTAACTCCGCTCAGTTCAAAGAAGCACCGATAGCCAACCCGATGAACATTAATGCACCCATCGCTAGTTTAGGTCTCCCTTCATTCCAAGGTCGTCATGTAGATGTATGGTCTATGTCTCCAGATGCACAGACTAACTGGGCATTGGGAGGTAATCGGATAAAGCCAATAACAAACCCTACATTTAGTTCCGCAGGTTCGGTAGAGATTATCGGTAAGTTCTACGGAGACGGTACTAACAACGATGAGTTCTCGTGGGATAAAGAGGGGCGCCCAGTAAAGGCAACTCGCTGGGTAGACGTATCTTTAGATAACTCCTTTACTTGGGTTTTTGGGGTAGACGGGACAGGATACAAACAGGTAACAGTACCTGCGTTCTCTACAATAGCTCCGGCAGACATGTCTGTAACAGCCGTCAAGCATGATGGAAGTATCTTAACCACTATCGGAGACACCCTTACAGCAGCAGATCAGGTTATGCTTAAGAACGCTGACAGTATCCTCAACATCACTGTAGCAGATACAGACAGCGGATGGGGAGAGACGTATACTCCAGAACCCGCAGAGATTCTCGCATATTTTAACGGCTGGAAGATGTGTAACGGCACTTTCGGGCAACCATACGATGGAACAGGTATGAAAGTATGGTACCCAATTGGAGACCCAGACCTCTCCCGTGCTAACAACTCTAACGGTACGGCTAACCCAGTACCTACAGATGTGTCACCTACTATAAAGGAACAGTCTATAAACTTGTATCAAATTGTGTACCGGAACGATACAGCTGTCCAAGAGATCATTCAGTTTGACGGTATTTTATCTCTGTTAGCAGGCGATAACGAAGTACAGATTACATACCCAGTAGACACTCCAGAGATTACAGTAGGTACCATTAAATATGCGACAAACCTAGCTACTGTGACAGACAATCTGAAATACTGGATTCCCACCCTTCAGCAGAGAGTATCCAAAGCCGAAGAGGTTATCACAGACGATGCTATCATCAACACAGTAACACAGTCTACACAATACCAGTTAACTTTAGCGAGTAAAGCCGACACAGATGCACTAGCGGACTACGCAACAAACTCCGACTTAGACGATTTGTCTGGGGACATTGATGGGAGAATTAACAATGCCATCAATGCTATCGACTTTAGTCCGTATGCAACCAAGTCTGAGCTGGAGCAGAAAGCTACAGACATCACAGCAAAGTTCCAAGCTGCGGGTGGTATGAACTTAATTAAAAACTCCATTGGCTTTGCTGACTTGGACTTCTGGGGCTTATTTACTGCGTACCCTGTAGAGACTATTAGCACCAACGAGTTAGATATACTTGGATTCGGGAGTGGGTTCCTGTTTAACCCGGATGGGAACAACAAAGGGATCACACAAGATATCAATGTTATTCCGGGGCAGCCATACACTCTATCTTGGTACCTAAATAAAAGAACAAGCGGACCAGATTCCTCATACAGGTTCTGGATTCAGATACAGGAGAACGATGTCACCACCCTCCAAATAGCGGATAACAGCGCTAAGACTACTGTAGGATATGAATCCAGTTATATGACCTATACGCCTCAAACGGATGTTATTCGTGTAAGGTTCATTGGATATGCGAATGTTGATGCAACCCTTACAGGTATTATGCTGACCATTGGAGATGTACCGTTAAAGTGGTCTCTCTCAACTGGAGAAGTGTACAACACGAATATCCGTATGAACATTAACGGTATCCGAGTATCTCAGCTAGACGCCAACCGTCAAGAAATTGGGTTCACTCAAATTACGCCGGAGGAGTTTGCAGGGTACTATGATACAGAAGGGAACGGCTCTTTCCAGAAGGTCTTCTACTTAAATGGGGATGAGACAGTTACTAGGAAGCTGCGGGCTACAGACTCCATTGTAATGCAGGATATCCAGATCATTAACGTAGACACAGCTGATCGGAAAGGGTGGGCGTACATTCCGAACATTGATACGGAGTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
a0345068e446eb6bbcb4fe4927b067ef0ce8434bb547832474b945c38c2e7e07
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6657
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Comparative Genomics of Bacillus subtilis Phages Related to phiNIT1 from Desert Soils of the Southwest United States Vill,A.C., Delesalle,V.A., Magness,L.H., Chaudhry,B.E., Lichty,K.B., Strine,M.S., Guffey,A.A., DeCurzio,J.M. and Krukonis,G.P. 2023 GenBank