Genbank accession
YP_004327138.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
Protein sequence
MSFESLKNQHGTRLIQAVAIYPNACKYSTDEALANGECVASADYSDSYTGNITVSGGDLTFFGASSNSYLRIGDEIAKCTVVNSTTVNITARAQLGTTAEAITTSDSLRVVHGGEADGSCRGYPKRPDGKGCSNDDSFDRDINREFLITDTQLVSGEIYYNGLNSISHNPVILKPAEAMAKNASVTVSITDNEDGDQYSVPYPSQRNSNSTYLRKLIARTGGYLRNRKMIVYSGFTEGSSFDPVNCISREYIIDNFNIGKGDRVTIIGKDPLMLAEETKAKTHDVSAGVLLADVTNASTQITLKNFAVGEYGNDTDTGTAIIDNELIDYTVNNSVLGILDINARAVAGSEQKDHKINASVQKCLVLTDFNPVQAIIDRLQARTSIETRFYDDYTDVIATIPSSTGTAYVTKPESLKDFNNTLIQSWAENNISMYFDELAKKIKIKAVGDFSQQPVTLTDDDIILDSIEIKNKYDDQITRASIGFAPFDASKKTNEENSSIIFQSINLDVELSGTLEPQEAKTFYSKFLTDSDNDVSIAVGGVSRIANVNKKPPQEYSFMLDYEKYGSVSGGVVEEGEIINVTTELSIDDDGQPLSQNLQILSIKDDMEEKKIKVTAVTYQDLINEDDFDFIIDESKEDYVLSNDFAPPAGDYTIFIASNVTIGSTSTANFALDTGAQASGVTFTIINRGQILAAGGDGGDGRPALAPDPDDFPSRFESVSDAGFDGGDALNITVPTVIDITQGVIYSGGGGVPSGISIADSTVSPVYVEGGNGGSGGQGYVGGNGGAAGVAEVENTATDTGINGADGSRGGAGSLGGLSGGSWGEAGQAGEGLAAGGEAGFAIKSNSNSVTLIGDNEATIRGKRDF
Physico‐chemical
properties
protein length:866 AA
molecular weight: 91829,92650 Da
isoelectric point:4,35042
aromaticity:0,07390
hydropathy:-0,27737

Domains

Domains [InterPro]
DC_2014
ATT
18–121
DC_1555
ATT
105–629
YP_004327138.1
1 866
Architecture
ATT
RBD
ATT 18-629 | RBD 649-863 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_004327138.1
1 866
Domain Start End Length (AA) Confidence
N-terminal 1 93 93 0,3891
Central domain 94 292 200 0,1284
C-terminal 293 866 573 0,2160
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-93
Central
94-292
C-terminal
293-866

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudoalteromonas phage H105/1
[NCBI]
877240 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Pseudoalteromonas sp. H105
[NCBI]
1348393 Bacteria > Proteobacteria > Gammaproteobacteria > Alteromonadales > Pseudoalteromonadaceae > Pseudoalteromonas

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_004327138.1 [NCBI]
Genbank nucleotide accession
NC_015293.1 [NCBI]
CDS location
range 24416 -> 27016
strand +
CDS
ATGTCATTTGAAAGCCTAAAAAATCAACATGGTACGCGATTAATTCAAGCCGTGGCGATATACCCAAACGCGTGCAAATACTCCACTGATGAGGCTTTGGCAAATGGTGAGTGCGTTGCTAGCGCTGATTACAGTGATTCATACACTGGCAATATAACCGTTAGCGGTGGAGATTTAACATTCTTCGGCGCTTCATCTAACTCTTATTTACGAATAGGTGATGAGATAGCTAAATGCACAGTGGTTAATTCAACAACTGTTAATATAACGGCTAGAGCGCAATTAGGCACTACTGCCGAGGCTATAACAACTAGTGATAGTTTAAGGGTTGTTCATGGCGGCGAAGCTGATGGCTCTTGTCGAGGATACCCAAAAAGGCCAGATGGTAAAGGCTGCTCAAACGATGATAGTTTTGACCGTGACATTAATCGTGAGTTTTTAATAACTGACACTCAACTTGTATCTGGTGAAATTTATTATAACGGGCTTAACAGTATTAGCCACAATCCCGTCATTCTAAAGCCAGCCGAGGCAATGGCTAAAAACGCAAGTGTTACTGTCAGCATTACAGATAATGAAGATGGTGATCAATATTCTGTACCTTACCCATCTCAGCGCAATAGCAACTCAACATACCTTAGAAAGTTAATAGCTCGCACTGGCGGGTATTTGAGGAATAGAAAGATGATTGTTTATTCTGGATTCACAGAGGGCAGCTCTTTCGACCCAGTAAATTGCATATCGCGTGAATACATTATTGATAATTTCAATATAGGAAAAGGCGATCGGGTAACTATAATTGGCAAAGACCCACTAATGCTGGCAGAAGAAACCAAAGCAAAAACTCACGACGTAAGCGCAGGGGTTTTGCTTGCTGATGTGACAAATGCGTCAACACAGATAACACTAAAAAACTTTGCTGTAGGCGAGTACGGTAATGATACTGATACTGGCACTGCAATAATTGATAATGAGCTGATTGATTACACTGTAAACAACTCAGTGCTTGGCATTCTTGATATTAATGCACGCGCGGTAGCTGGTAGTGAGCAAAAAGACCACAAAATTAACGCATCTGTTCAAAAGTGCCTAGTGCTAACTGACTTCAACCCAGTTCAAGCTATCATTGACAGGCTGCAAGCTAGAACATCAATTGAGACTCGATTTTATGATGACTACACTGATGTTATAGCAACAATACCAAGCAGTACGGGTACAGCTTATGTAACTAAGCCTGAGAGCCTAAAGGATTTTAATAACACTCTGATACAATCTTGGGCTGAAAACAATATAAGTATGTACTTCGATGAGTTAGCAAAGAAAATAAAAATCAAAGCGGTAGGAGACTTTTCACAGCAACCAGTGACGTTAACTGATGACGATATAATCTTAGATAGCATTGAGATAAAGAATAAATATGACGATCAAATAACTCGCGCATCAATAGGCTTTGCTCCTTTTGACGCAAGCAAGAAAACCAATGAGGAAAACAGCTCGATTATATTTCAATCTATAAATCTTGATGTTGAATTGAGTGGCACGCTTGAGCCTCAAGAAGCAAAAACATTCTACTCCAAGTTTTTAACTGATAGCGATAATGATGTAAGTATTGCAGTAGGCGGTGTATCGCGCATTGCAAATGTAAATAAAAAACCACCTCAAGAATATAGCTTTATGCTTGACTATGAGAAGTACGGCTCAGTTAGCGGTGGGGTTGTTGAAGAAGGTGAAATAATAAACGTTACAACGGAGCTTTCAATTGATGATGATGGCCAGCCTCTATCGCAAAACCTGCAAATACTCAGCATTAAAGACGACATGGAAGAGAAGAAAATAAAGGTCACTGCGGTCACTTATCAAGACTTGATAAACGAAGATGACTTTGACTTCATTATCGATGAAAGCAAAGAAGATTACGTGTTAAGTAATGACTTCGCGCCGCCTGCTGGTGACTACACTATATTCATAGCGTCAAACGTGACAATTGGGTCAACATCAACAGCAAACTTCGCTCTTGACACTGGCGCTCAGGCAAGCGGTGTTACTTTTACTATAATAAATAGAGGTCAGATACTTGCGGCTGGCGGTGATGGCGGTGATGGCCGCCCTGCTTTAGCTCCAGACCCTGATGATTTCCCGTCTCGCTTTGAGTCAGTAAGTGATGCTGGCTTTGACGGCGGTGACGCTTTGAATATAACAGTACCGACAGTTATAGACATTACACAGGGCGTTATATACTCAGGTGGTGGCGGTGTACCAAGCGGCATTAGCATTGCTGACAGCACAGTATCACCCGTTTATGTTGAAGGCGGCAATGGTGGGTCAGGCGGTCAGGGTTATGTGGGTGGTAACGGTGGAGCTGCTGGAGTTGCGGAGGTTGAAAATACAGCAACGGACACGGGTATTAATGGCGCTGACGGGTCAAGGGGTGGCGCTGGTTCGCTTGGTGGGTTAAGTGGTGGCTCTTGGGGAGAGGCAGGCCAAGCAGGCGAGGGTTTAGCGGCTGGCGGTGAAGCAGGCTTTGCTATAAAATCAAATAGTAACAGCGTTACGCTTATCGGCGATAACGAAGCAACAATACGCGGAAAGAGAGATTTTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
004de16528c55c7f2eabf616cbdf475971f20287ae3f0531367f378e507c6e91
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7773
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50