Genbank accession
YP_009809299.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MADPISFAVGTVATMGVSYLFPSEGPRIKDMKISASTYGAQIPNVFGVTRVPGNMIWSNKIRETKRKKMVGKGGFYNQYKYYCTFAMGFCYGPITMLRKLWADGKLIYDASGQSELETKHNYKLNVYTGSEEQYPDPIMEAIVGEGNTPAYRGLCYVVFDDMPLDDFGNRIPQLTAEVFNGEGVIVADGPDVVYQNPEPEENTGDNRPPVLMGYQINQIAVDSFGGYFYVLDKSSVYPVIRQVRISDGQECGRFSFMTPRTDSCYEFSYDSQTIDQIVGVAPGGELLCSISLNNNVAYAMIDLITGNLAVHGTTDPFGGRNCGFGTGTPPGNGSNYHAVSVNSAGQAQLGISGLFGDVSIYDCTSLLAFEGNPQNTPTGHSGFGSFPIVGAIGSTCHFYLPSVASENSNFITINRATRVADQVGFWSAGYIMPYQPQDPDYGYGHIIVRNAVYDATDGGVLVMFTVRKANGLEAYMFGKWSGVTYQQLWVKEVPIEAPGVYPRVGVAHQGQYAFVQPLNSHVPYPTVWIIDTMTGEWKAMSVSEQPSFVTYTDETESEVIVDQSAYTGYPIKDTQFIGYDMQYYDPIRQAVITLGPDGVDKIVKIGPEIGSTTLGAIVRSVLKRGGLSEQHMDMSRLESTHVWGYGWAAATDIKSILEELERVYLFDIVESEGKLVGVMRSAGSDEDFPGSTVVTIKQGALGSTSPEVADFWQETRIQEADLPAKINFTYMNLDADYQPATAYSKRISDPAPTMFSAQQVAIEINIVMKPSEAKTQANRALYAQWAERTMHKTILPWAYLRLDPADIISVEMDDGRSYRERLHHTEIGADFSIQSETYGQDSGAYDIVREGDGGGVPSQPIKAPGTITGFIINTPLLRDQDDTGGSSSRYYSALGNASGDGWRGGELWRAESLPNFDQIDTPINEAEWGYVSGTLPPPRHGHFALDWENKITIWPGVKWFELDSITDDELWAGGNAALVGDEVIQFRDVRENDDGSWTVWNLLRGRRGTEYATHTHKQSEKFLYLLNENSIAPEGEMIDTRGQKRYYKAVAYGRTIAETPLITVDYEPRDLMPYAPKDIRREFSSDGRIEVSWARRTRMGGNMQDYVGEVPVNEAAEKYEVYFYKTPFIGDLSRGGQVQPDYFHSAVVTEPHYSFLPNPAQFASNLDTLTVVVYQISATVGRGFPGTRDIEPWQDF
Physico‐chemical
properties
protein length:1196 AA
molecular weight: 132450,62430 Da
isoelectric point:4,77143
aromaticity:0,11538
hydropathy:-0,32876

Domains

Domains [InterPro]
DC_2254
STR
686–926
YP_009809299.1
1 1196
Architecture
ATT
ATT
STR
RBD
RBD
ATT 1-250 | ATT 651-822 | STR 823-926 | RBD 927-1022 | RBD 1025-1193 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_009809299.1
1 1196
Domain Start End Length (AA) Confidence
N-terminal 1 247 247 0,7647
Central domain 248 446 200 0,7019
C-terminal 447 1196 749 0,1331
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-247
Central
248-446
C-terminal
447-1196

Taxonomy

  Name Taxonomy ID Lineage
Phage Caulobacter phage CcrPW
[NCBI]
2283271 Uroviricota > Caudoviricetes > Jeanschmidtviridae > Colossusvirus PW >
Host Caulobacter crescentus CB15
[NCBI]
190650 Bacteria > Proteobacteria > Alphaproteobacteria > Caulobacterales > Caulobacteraceae > Caulobacter

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009809299.1 [NCBI]
Genbank nucleotide accession
NC_048046 [NCBI]
CDS location
range 96970 -> 100560
strand +
CDS
ATGGCTGATCCTATTTCGTTTGCCGTCGGGACGGTCGCCACCATGGGGGTGAGCTACCTGTTTCCGTCTGAAGGCCCGCGCATCAAGGACATGAAGATCAGTGCGTCGACGTACGGCGCACAGATTCCGAATGTCTTTGGAGTCACGCGCGTGCCCGGCAACATGATCTGGTCGAACAAGATTCGTGAGACCAAGCGCAAGAAGATGGTTGGGAAGGGCGGTTTCTACAACCAATATAAGTATTACTGCACGTTCGCGATGGGTTTCTGCTACGGCCCGATCACGATGCTTCGTAAGCTCTGGGCGGACGGTAAGCTCATCTATGATGCTAGCGGACAGTCTGAGCTTGAGACCAAGCATAACTATAAGCTGAACGTTTATACGGGTAGCGAGGAGCAATACCCGGACCCGATCATGGAAGCGATCGTCGGAGAAGGGAACACGCCAGCATATCGCGGCCTGTGCTATGTCGTGTTCGACGACATGCCTCTCGACGACTTCGGCAATCGCATTCCGCAGTTGACTGCTGAAGTCTTCAATGGCGAAGGCGTCATTGTCGCTGACGGCCCGGATGTTGTTTATCAGAATCCAGAGCCTGAAGAGAATACGGGAGATAATCGTCCTCCCGTGCTGATGGGCTATCAAATTAATCAAATCGCTGTAGATAGTTTTGGGGGCTATTTTTACGTTCTAGATAAGTCATCTGTCTATCCGGTAATTCGTCAAGTCCGTATTTCGGATGGACAGGAGTGCGGGCGTTTCAGCTTCATGACGCCTCGGACCGATTCCTGCTATGAGTTCAGCTACGACTCTCAGACTATTGATCAGATCGTAGGGGTAGCTCCGGGCGGCGAACTACTGTGCTCGATCAGTCTCAATAATAATGTCGCCTATGCCATGATCGATTTGATCACCGGTAATCTGGCAGTGCATGGTACGACCGATCCTTTTGGAGGACGCAATTGCGGCTTCGGAACGGGGACGCCGCCTGGGAACGGTTCGAACTATCACGCAGTATCCGTGAACAGTGCTGGCCAAGCTCAGTTGGGCATTAGCGGGCTGTTCGGCGACGTTTCTATTTACGACTGTACTTCACTTCTCGCGTTCGAAGGTAACCCGCAGAATACTCCTACGGGTCATTCGGGCTTTGGATCGTTTCCTATTGTCGGCGCAATCGGGAGCACGTGCCATTTTTATTTGCCGTCTGTAGCGTCCGAGAACAGTAATTTCATTACGATCAATCGAGCGACGCGCGTCGCCGATCAAGTGGGCTTCTGGTCTGCGGGCTATATTATGCCTTATCAGCCTCAAGACCCTGATTACGGTTACGGTCATATTATTGTGCGCAACGCCGTCTACGATGCCACGGATGGGGGAGTGCTCGTAATGTTTACCGTTCGCAAGGCGAACGGTCTCGAAGCCTATATGTTTGGGAAGTGGTCAGGGGTCACGTATCAACAGCTTTGGGTCAAGGAAGTCCCGATCGAGGCGCCGGGGGTGTATCCGCGCGTGGGCGTCGCACACCAGGGTCAATATGCTTTCGTGCAGCCCCTCAACTCCCATGTACCGTATCCTACGGTCTGGATCATCGACACGATGACGGGCGAATGGAAGGCGATGTCTGTCTCGGAGCAGCCGTCTTTCGTGACCTATACCGATGAAACGGAAAGCGAAGTGATCGTTGATCAGTCGGCCTACACGGGCTACCCGATCAAGGATACGCAGTTCATCGGCTACGATATGCAGTATTACGATCCGATCCGACAAGCGGTGATCACGCTTGGGCCGGACGGCGTTGATAAGATCGTCAAGATCGGTCCCGAGATCGGATCGACGACCCTGGGCGCGATTGTCCGCAGCGTGCTCAAGCGGGGAGGCCTGTCCGAGCAACACATGGACATGTCCCGCCTGGAGTCGACTCATGTCTGGGGCTATGGCTGGGCCGCTGCGACCGACATCAAGTCGATCCTGGAAGAGCTTGAACGCGTCTACCTCTTTGATATCGTCGAGAGCGAAGGCAAGCTGGTCGGCGTCATGCGTTCGGCAGGGTCCGACGAAGACTTCCCCGGCTCGACGGTCGTGACGATCAAGCAGGGCGCCCTGGGGTCCACGAGTCCCGAAGTCGCTGACTTCTGGCAAGAGACTCGCATCCAGGAAGCGGACCTGCCGGCGAAGATCAACTTCACCTACATGAACCTGGACGCGGACTATCAGCCCGCCACCGCTTACTCCAAGCGGATCAGTGATCCGGCGCCGACTATGTTTTCGGCGCAGCAGGTAGCAATCGAGATCAACATCGTCATGAAGCCTTCCGAGGCTAAGACCCAGGCTAATCGCGCGCTCTACGCTCAGTGGGCTGAACGTACGATGCACAAGACGATTCTGCCCTGGGCCTATTTGCGTCTGGACCCGGCCGACATCATTTCCGTCGAGATGGATGACGGACGTTCTTATCGCGAGCGCCTGCATCACACCGAAATCGGCGCGGACTTCTCAATTCAATCCGAGACGTACGGCCAGGACAGCGGTGCCTACGACATTGTCCGCGAGGGTGATGGTGGCGGTGTGCCGTCTCAGCCTATCAAGGCGCCCGGCACGATCACCGGCTTCATCATCAACACGCCGCTACTGCGGGACCAGGATGATACGGGTGGTTCGTCCAGCCGCTATTACTCGGCGCTCGGCAACGCGAGTGGCGATGGTTGGCGCGGCGGAGAACTGTGGCGTGCGGAATCGCTTCCCAACTTCGATCAGATTGACACGCCGATCAATGAGGCTGAGTGGGGATATGTTTCGGGTACGCTGCCCCCGCCGCGACACGGCCATTTTGCCCTGGATTGGGAAAACAAGATCACGATCTGGCCCGGCGTGAAGTGGTTCGAACTCGATTCGATCACGGATGACGAACTGTGGGCTGGCGGCAACGCGGCTCTCGTGGGCGACGAAGTGATCCAGTTCCGCGACGTTCGCGAGAACGATGATGGGTCCTGGACCGTCTGGAACTTGCTGCGCGGCCGGCGCGGGACGGAGTATGCGACCCACACGCATAAGCAAAGTGAAAAGTTCCTGTACCTGTTGAACGAGAATTCGATCGCTCCCGAGGGGGAAATGATCGACACGCGCGGACAGAAGCGCTACTATAAGGCCGTCGCTTATGGTCGGACGATCGCGGAGACGCCGCTGATCACCGTCGACTACGAGCCGCGCGATCTGATGCCGTATGCCCCTAAGGACATTCGGCGCGAATTCTCCTCCGACGGCCGCATCGAAGTTTCCTGGGCGCGCCGCACGCGCATGGGCGGCAACATGCAGGATTACGTGGGCGAGGTCCCTGTTAACGAGGCTGCCGAGAAGTACGAGGTCTATTTCTACAAGACCCCCTTCATCGGCGATCTGTCTCGCGGCGGCCAAGTTCAGCCGGACTATTTCCACTCGGCGGTAGTCACGGAGCCCCACTACAGCTTCCTGCCTAATCCGGCCCAATTTGCCAGTAATCTTGACACTTTGACGGTCGTGGTTTACCAAATTTCTGCGACGGTCGGTCGTGGATTCCCAGGCACTCGCGATATCGAACCCTGGCAGGACTTCTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
499b8f36c422c1a0c1336975411598acd9bf7110cb8d2dfe078bf679da622fa5
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6762
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Giant CbK-like Caulobacter bacteriophages have genetically divergent genomes Wilson,K. and Ely,B. 2015-10-13 GenBank