Protein
View in Explore- Genbank accession
- QPX71656.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MSYQAEVLTEDNTIRDLTVRFNRLMLAVQDIDGDKVDTSIKDIEKLKNDVIYIKTNYIKQYPSLQEAKLSTDLEEGMVVQTMGYYSINDGGGSIYAIAKTSDNQIEDGGSIIKYNDTLHFHSLSTDPVNYRQFGAKGDGVTDDGLYIRRAHEYANSVGLPVHNPNGEFYFKSERYVPVRTNTNLGQTIIHVDESKTPATQGNIYVIMSQYGQSPLSSNELTAIKNSLKKGTRYVAELAKYAGSFVKVFDSNTKVGNRQSENPNSGWDMQDFFIIEEGGRIVGDITWDFSDVTSGLIRKLDKNYLTFEGGVFLLSGNLSSASTGDPTTGGIAIQRSRTIVKNQFVGLEDNASDKPTASDTGFYNLQSVYDVTLENIRVLPRKYVSVNGVALSTYGIGGSMALKCVFRNVTSEGVSSQWGVFGASLFKDVTIDNCVLNGLDIYFHAWEIKITNSKVGEKGISLTGGGKLTVENTTVYTSTLVNFRQDYGSTWEGDIRIKGCRLSVPSNAIASVLKFSPKSTDYGYKVYFGKTVAVEDVLLDYTGVTNSELAYLFNHNNQRSTGTSNVYMPATMTFKDIRVAGRSRGVRLLTTLTPLGFRSPSGNHSYGSSSESLVTNAYYKFEGIDTEDIVSEPQSVTDTHMYMPVSATALYGPEDLAPTIEIVNCRYLHLQPKCARVKMIIRDSEVRAIDAYDNGPSMGIYSLHDCDLAPNLSSGSTETYAYNMEGARVTLSGCHFYPVKYAGVENVSKYGLLYFFDITSNGLQLELKSNNSGNTLHKKIMDQITSVYSKNTLEMVMKGLDSNSMDNTKLMVRNIEKHTSAPAVGTWNVGDTIMNSTPGILTKRGWVCTTAGTPGTWQTLGEVEPEKPIYLSVIGPLSDTPLREGATETLVFNNIESDNWSMYNTSTGKITFPYEGQYLIDCGIRFSMSDQGTSDALVYQNFELNVYTGVVDEYSRAAFAQKRGFFRHTHNQSLNGSTILNFKKGDSIYIRMYVGGKMSFWKGYKELDVNAKYNYLHIRRIGPRIS
- Physico‐chemical
properties -
protein length: 1025 AA molecular weight: 113576,27920 Da isoelectric point: 5,94720 aromaticity: 0,10244 hydropathy: -0,33356
Domains
Domains [InterPro]
DC_0298
ATT
24–271
ATT
24–271
IPR012334
STR
113–225
STR
113–225
IPR011050
STR
128–537
STR
128–537
1
1025
Architecture
ATT 24-271 | STR 272-863 | RBD 870-1025
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1025
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 143 | 143 | 0,9901 |
| Central domain | 144 | 804 | 662 | 0,9788 |
| C-terminal | 805 | 1025 | 220 | 0,9637 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-143
1-143
Central
144-804
144-804
C-terminal
805-1025
805-1025
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage SP8 [NCBI] |
2770327 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Bacillus subtilis [NCBI] |
1423 | cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Bacillales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QPX71656.1
[NCBI]
Genbank nucleotide accession
MW001214.1
[NCBI]
CDS location
range 70708 -> 73785
strand +
strand +
CDS
ATGTCATATCAAGCAGAAGTCCTGACAGAGGATAATACTATACGTGACTTAACCGTGAGATTTAATAGGCTAATGTTGGCTGTTCAGGACATTGACGGCGACAAAGTTGATACCTCTATAAAAGATATAGAGAAACTAAAAAATGACGTTATTTATATAAAGACTAACTACATCAAACAGTACCCATCCTTACAGGAAGCTAAACTATCAACTGATCTGGAGGAAGGCATGGTCGTGCAGACTATGGGTTACTATTCTATCAATGACGGCGGTGGCTCTATATATGCAATCGCAAAAACGTCCGATAATCAGATTGAAGATGGAGGCTCTATCATCAAGTATAACGACACCCTTCATTTTCACTCCTTATCCACAGACCCTGTAAACTATAGACAGTTTGGTGCTAAGGGTGACGGTGTTACAGATGATGGTCTGTATATCAGGCGTGCCCATGAGTACGCAAACTCAGTGGGTCTTCCTGTTCACAACCCTAATGGGGAATTCTACTTTAAGTCAGAAAGATATGTACCTGTAAGGACAAACACTAACTTAGGGCAAACAATTATCCACGTAGACGAATCCAAGACACCTGCAACCCAAGGAAACATATATGTTATTATGTCACAGTATGGGCAAAGCCCTCTTTCTAGTAACGAGTTGACGGCGATCAAGAACAGTCTTAAAAAGGGTACTAGATATGTAGCAGAATTGGCAAAATATGCAGGGTCTTTTGTTAAAGTGTTTGATTCCAACACTAAGGTAGGTAACAGACAAAGTGAAAACCCTAACTCGGGATGGGACATGCAGGACTTCTTCATTATTGAAGAGGGAGGTAGGATTGTAGGAGATATCACATGGGACTTTAGTGATGTTACCTCAGGTCTTATAAGAAAGTTAGACAAGAACTACCTAACATTTGAAGGTGGGGTATTCCTACTCTCAGGAAACTTATCATCCGCTTCCACAGGTGATCCTACTACTGGAGGGATTGCCATTCAGAGGTCTAGGACTATAGTTAAGAATCAGTTTGTAGGTCTGGAGGACAACGCTTCAGACAAACCTACCGCATCTGATACTGGATTCTACAACTTGCAGTCTGTCTATGATGTGACACTAGAAAACATCAGGGTTTTGCCTAGGAAATATGTGTCTGTAAATGGCGTGGCTTTGTCAACCTACGGTATTGGAGGGTCCATGGCACTTAAATGCGTGTTCCGTAACGTAACTTCTGAAGGGGTCAGCTCTCAGTGGGGTGTATTTGGAGCAAGTTTATTTAAGGACGTGACCATTGACAACTGTGTACTAAATGGTCTTGACATATATTTCCACGCTTGGGAAATAAAGATTACTAACTCCAAGGTTGGAGAAAAGGGCATATCCTTAACAGGAGGGGGCAAGCTCACAGTTGAAAATACTACTGTGTACACTAGTACCCTTGTAAACTTCCGTCAGGACTACGGTTCTACTTGGGAAGGTGACATACGCATTAAAGGATGTAGGCTTTCAGTGCCATCCAATGCAATAGCATCAGTTCTTAAGTTCAGCCCTAAGTCTACAGACTATGGTTATAAGGTATATTTTGGTAAAACTGTAGCGGTGGAGGACGTATTACTGGACTACACTGGAGTAACAAATTCTGAGTTAGCCTATTTGTTCAATCACAATAACCAAAGGTCTACAGGTACAAGCAATGTATATATGCCAGCTACTATGACTTTCAAGGATATTAGGGTAGCGGGTAGGTCTAGAGGTGTACGGTTACTGACTACTCTGACCCCATTAGGGTTCAGGTCACCTTCCGGCAATCACTCTTACGGAAGCTCATCAGAAAGTCTTGTGACTAATGCGTATTACAAGTTTGAAGGGATAGATACAGAGGATATCGTATCTGAGCCACAATCAGTTACTGACACGCATATGTATATGCCTGTTAGTGCTACTGCACTTTATGGTCCAGAGGACCTTGCACCTACTATTGAAATAGTAAATTGCAGGTATCTACACCTCCAGCCAAAGTGTGCTAGGGTCAAGATGATCATAAGAGATAGTGAGGTTAGGGCTATTGATGCATATGACAATGGTCCAAGCATGGGTATATACTCATTGCATGACTGCGATCTTGCTCCTAACCTGAGTTCTGGCTCTACAGAAACTTATGCGTACAACATGGAGGGGGCTAGGGTCACTCTGTCAGGTTGCCACTTCTATCCAGTGAAATACGCAGGAGTTGAAAATGTGTCAAAGTATGGATTGTTGTATTTCTTTGACATTACATCCAACGGACTACAGCTAGAACTGAAAAGTAATAACAGTGGCAACACACTTCACAAAAAGATCATGGACCAGATAACAAGTGTGTATAGCAAGAACACCTTGGAAATGGTTATGAAGGGTCTGGATTCAAACAGCATGGATAACACAAAATTGATGGTCAGGAATATAGAAAAACACACTAGTGCACCAGCAGTAGGAACTTGGAATGTTGGAGACACTATTATGAACTCTACTCCAGGTATTCTAACAAAACGAGGATGGGTATGCACTACTGCGGGTACTCCGGGCACTTGGCAAACTCTAGGTGAGGTGGAGCCAGAGAAACCTATCTACCTGAGTGTTATTGGACCACTAAGTGACACCCCACTACGGGAAGGTGCTACAGAAACGCTAGTATTTAACAACATAGAGTCAGATAACTGGAGTATGTACAACACATCTACGGGGAAAATAACATTTCCTTATGAAGGTCAGTATCTAATAGATTGTGGTATTAGATTTAGTATGAGTGACCAAGGCACGTCAGATGCTCTGGTATATCAAAATTTTGAGCTAAATGTCTATACTGGAGTTGTTGACGAATACAGCAGAGCGGCATTTGCTCAAAAAAGGGGGTTTTTTAGACACACCCACAACCAATCTTTAAACGGGTCCACAATACTCAACTTTAAAAAAGGTGACTCCATATATATACGAATGTATGTTGGGGGCAAGATGTCCTTTTGGAAAGGATATAAAGAACTTGATGTAAACGCAAAGTACAACTATCTACACATTAGAAGAATAGGACCTAGAATCTCCTAG
Genome Context
Genome Context
Tertiary structure
PDB ID
cc66a62ea0487dbf3fa65fae81945cf09ce648dcc04f8505e90a10d8c0a03b09
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50