Genbank accession
YP_002875655.1 [GenBank]
Protein name
tail protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHLVINTNTGGWWLLDREAKNIVSEGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRRRRV
Physico‐chemical
properties
protein length:780 AA
molecular weight: 85607,94480 Da
isoelectric point:4,90222
aromaticity:0,08590
hydropathy:-0,21000

Domains

Domains [InterPro]
DC_0058
STR
1–780
IPR058003
TTP
9–780
YP_002875655.1
1 780
Architecture
STR
STR 1-780
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_002875655.1
1 780
Domain Start End Length (AA) Confidence
N-terminal 1 142 142 0,9910
Central domain 143 341 200 0,0592
C-terminal 342 780 438 0,2151
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-142
Central
143-341
C-terminal
342-780

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage VP93
[NCBI]
641832 Uroviricota > Caudoviricetes > Autographivirales > Maculvirus > Maculvirus MGD1
Host Vibrio parahaemolyticus
[NCBI]
670 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_002875655.1 [NCBI]
Genbank nucleotide accession
NC_012662 [NCBI]
CDS location
range 24741 -> 27083
strand +
CDS
ATGGCTCGACCATTTGAGGGTGCATTGAATGACCTGCTGCAAGGTGTGTCCCAACAAGTTCCACGTGAGCGTGTGGCTGGACAATGCTCTGCGCAAGTTAACATGCTGTCCGACCCAGTAACAGGTATCCGTCGTCGTCCCGGTAGTCTGTTCGTGAGTGTGCACGATTTCGGCCCGATTGGTGAGGGTGATGCACTGTACACGCAGTATCTCGAACGAGGTGCTGATGGACGACACTTAGTAATCAACACCAACACAGGCGGCTGGTGGCTCTTAGACCGTGAGGCTAAGAACATCGTGAGTGAGGGCAACTTGTCTTACCTCCTAGCGGCTGACCGTCGCAGTATCCAGACTACCAGTATGGGCGGTGTTACGTACATTCTTAACACCGAGAAGCGTCCGTCTGCAACGACTGACAACTCTGACAAGAAAGACCCGAAGACAACAGGCTTCTACTTCGTCAAGAGTGGTGCGTTCAGTAAAGAGTACGATATTTCCGTAGTGTGGTCTGAGGGTAGCCAAACTGTGACATACACCACGCCTGATGGTACAACCGCAGGTGACGCAGACCAATCTGTACCGGAAGCAATCGCACGTAAACTCGTGGAAGCTCTGATTGCAGTTGGTGTGGACTTCGCTGTGCGTGTTGGCCCGTACATTTACTTTGAACTAATCACAGGTACTGACCTTAAAATCACCAGTACGTCAGGTTCGCCTTACATTGGCTACTCAAACCAATCACAGGTAAACCTAGAGACTGACCTCCCTGCGCGTCTGCATCCGTCTGCTGATGGTGCGTTGTGTGCTGTAGGTCAATCAGAGCGTGCGCTTGTGTGGTATCGTTACAGTTCCGAAAAGGGTGTTTGGTTGGAATCTGGTGACTACAACTCTGTGACCGCTATTAGCGTGGATGTGCCCTATAAGATTGTCGATGACAACGTGGAGCAACATATCATGGAGGGGCGTCTCGCAGGTGATGACTTAACTAACCCTGCACCGACATTCCTTGAAGAACGCCGCATCACTGGTATTGGTACGTTCCAAGGTCGCTTAGTGCTTCTGTCTGGTGCGTACGTCTGTATGAGTGCCACTGGCGAACCAGACCGTTTCTTCCGCTCTACCGTGAGTTCCCTTGACCCAACAGACCGTATTGACATTGCATCCGGTTCGGCTCAGAACTCAGTGTTCCGCCAAGCGTTGCAGTTCAACAAGGACTTGATTCTACTTGGTGACAGCACACAGGCGGTAGTACCGTCCCTGCAACAGTTACTTGCACCTGATAACGCAAGTGTGGTGTTGACCTCAGATTTGGCCTGTAATGCGTTTGTAGCACCTGTTACAACCTCACAGACCCTGATGTACCCTGCACCTCGAAGCGAAGCATTCAGCGCAGTTCTGGAGCTTGTACCGTCACAGTACACCTCGTCTCAGTACGTATCTCAAGACGTTACGACTCACATCCCTCGTTACATTGAGGGTGAGGCTCGTTTCATGCAGAGTGCGAGTGCTGCGAACATCGTACTAATGGCAACTACTGGCGACAACCGTCAGGTGATTGCTCACGAGTACCACTTCACAAGTCAAGGTAAAGTGCACCAAGCATGGCACAAATGGGTGTTCCCGTACCGTGTCGCTAGTCTACACTTTGCGCGTGACCGTGTTGTACTGTTTGCCGCAGACGATGCTGGTAGCACTGACAAAATCACCATCTCGACCATCGACCCTAAGCAGGGTGGTGTGACGTTTGATGTTGACCGCTTACCGCACCTAGACTCGATGAGCATTGTGCCCGTCAATGATGGTAAGGGTATCGTGCCAATCTACATGCGTCCGTGGGTATCTGAGGGTAAGTTGACTGGTTCTGTTGCTACAGGTGCGTTAGCGTCTGAGGAAGTGGCTATTGATGTGGACGAGGTTTCATGGGAATTCACTGTAGAGCCGGGTTTCAAAGACTCGCAAATCTACTTAGGTTTCCGCTACGAATCGTTGTTTGCGCCAACGCCACCTATGCTGAAAGACCAGAACGATACTCTAATCAGTACTGCTCCGGTTCGACTGTTGCGTTATGAGTTAACAACTCGAAATACAGGTGAATTCGATGTACGCATCGTTGACCCTACTATCGGGCTAGACTACTCAAACAGCAAAACCAGCCTAGTGTTCGGCACTGACGACGTTCAGTTGAACCAAGCTCTAGTGTCTGACTTGTCACGTGTCCCTGTACCGTGTCGAAGCAATGCACAGTCCACCGAAATGTACTTGAGTACTGACGGTACACAGGATATGAACATTCTGGAAATTGAATACATCATTCGTTACAACCAACGCCGACGTCGCGTATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
1ff79a3ec3642ca77a123d9d32a6ddb09079c926da205210eb8a8fd7e32f5cb8
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,8933
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50