Genbank accession
SBV38305.1 [GenBank]
Protein name
putative phage tail protein, contains intein domain
RBP type
TSP
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,55
Protein sequence
MAQEANINSVVPKDGSDGTTLKVNTTSGLIEYSSEEPTSELLISQVGFNKKTGELNFVRTNGELLTVTGFLTVAQLGEGPRGTKGVAGKNGKPGRLGRIGKEGKTGCEGPQGVKGIVGEAGLDGDDGELGITGLYGPVGSPGPRGAEGEPGIIGFQGPIGPTGPSCLIGPKGETGPTASGDAYYGSDIPPDNYFIWAVPYEDGVVVEAPVIEIADMKGSVSDQSANLSLVTTDSYGGSVTLVLSNFSGGVGPFKYNWQQVDGSLNSSDVSIDGSVTNKSLKITCYVVIDPGESFSYSGDVKLTITDTGNSNKKHVIDGITFRFTGSNSRTTDEEEEEGAPIVIGGGGCVHEDTPILLWSGEKVKAKHIKVGDTLQAFTCDSMLDESEPGWKDWKATQLSDVKTVPTIARVAMHTTYDKYWVINDDVKITEQHPLLIKSGDQWGWHGAESIKVGDTLFGKESEVQVESIKQINEPINVVIIDAEPYDNYFGGVSPVCIHNAVMVAKK
Physico‐chemical
properties
protein length:506 AA
molecular weight: 53386,93460 Da
isoelectric point:4,64758
aromaticity:0,06917
hydropathy:-0,28340

Domains

Domains [InterPro]
DC_0258
STR
3–451
PTHR24637
Unmapped
77–179
IPR008160
STR
79–133
IPR036844
STR
348–499
SBV38305.1
1 506
Architecture
STR
STR 3-500 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
SBV38305.1
1 506
Domain Start End Length (AA) Confidence
N-terminal 1 356 356 0,8861
Central domain 357 495 140 0,3355
C-terminal 496 506 10 0,9744
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-356
Central
357-495
C-terminal
496-506

Taxonomy

  Name Taxonomy ID Lineage
Phage Phage NCTB
[NCBI]
1857647 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
SBV38305.1 [NCBI]
Genbank nucleotide accession
LT598654 [NCBI]
CDS location
range 102158 -> 103678
strand -
CDS
ATGGCACAAGAAGCTAACATTAACAGTGTAGTACCTAAGGACGGAAGTGACGGTACTACACTGAAAGTAAATACAACGTCAGGTCTTATAGAATATTCATCGGAAGAACCTACAAGTGAGTTACTTATATCGCAAGTAGGTTTTAACAAAAAGACAGGCGAATTAAACTTTGTTCGAACTAACGGTGAACTATTAACCGTCACAGGGTTTTTAACTGTAGCTCAGCTAGGTGAAGGTCCTCGAGGTACTAAGGGAGTTGCAGGTAAAAACGGAAAGCCTGGAAGACTAGGTAGAATAGGAAAAGAAGGTAAAACAGGATGTGAAGGCCCACAAGGTGTCAAAGGAATTGTAGGTGAGGCAGGCTTAGACGGTGACGATGGTGAATTAGGCATCACGGGTCTATACGGTCCTGTCGGAAGCCCGGGACCGAGAGGTGCTGAAGGTGAACCCGGTATTATAGGATTCCAAGGACCTATAGGACCGACAGGACCAAGTTGTTTGATAGGTCCTAAAGGAGAAACAGGACCTACAGCAAGCGGTGACGCATACTACGGTTCTGATATTCCACCTGACAACTACTTTATCTGGGCTGTACCTTATGAAGACGGTGTTGTTGTAGAAGCACCTGTTATAGAAATAGCAGATATGAAAGGAAGTGTCAGCGACCAGAGTGCTAATCTGAGTTTGGTTACTACTGACAGCTATGGAGGTTCGGTAACTCTTGTACTGAGTAATTTCTCAGGGGGTGTCGGTCCTTTCAAGTATAATTGGCAACAGGTAGATGGTAGCCTCAACTCTAGTGACGTTAGCATAGACGGTTCTGTTACAAACAAAAGCCTCAAGATAACCTGCTATGTTGTTATTGATCCAGGTGAAAGCTTCTCGTACTCAGGTGATGTTAAACTCACTATAACTGATACAGGTAACTCCAATAAGAAACATGTTATCGACGGAATAACCTTTCGATTCACAGGTTCTAATTCTAGGACTACAGATGAAGAAGAGGAAGAGGGAGCACCTATCGTAATAGGAGGCGGAGGTTGTGTTCACGAAGATACACCAATCTTGCTCTGGTCTGGAGAGAAAGTCAAAGCCAAGCATATTAAAGTAGGTGATACGTTACAGGCATTTACCTGTGATTCAATGCTTGACGAATCTGAACCCGGTTGGAAGGACTGGAAAGCAACACAACTCAGTGATGTTAAGACAGTACCTACTATAGCCAGGGTTGCTATGCACACTACGTATGACAAGTATTGGGTTATCAACGATGATGTTAAAATTACCGAGCAGCACCCCTTGCTAATAAAGAGTGGTGATCAGTGGGGATGGCATGGTGCTGAATCCATTAAGGTAGGCGATACATTGTTCGGAAAAGAATCCGAAGTCCAGGTAGAGTCAATCAAACAAATAAACGAACCTATCAACGTTGTTATAATAGACGCAGAACCTTACGATAACTATTTCGGTGGTGTAAGTCCTGTGTGTATTCATAACGCAGTAATGGTGGCGAAAAAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b7bc41287f4e5a976e098886a905e22d03855e198f03631a3699fe7638645f82
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6584
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50