UniProt accession
A0A6B7SIZ1 [UniProt]
Protein name
Tail tubular protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MADCSKIPTLELVEDSKKGMSDIVSFANSSDNTYQSEFDNSTRTTIKGALNKLGGNNKGNYLDDPLLQDDVDYAYFSATQTKYKIAKGVLAPYQVNSTTYPDPETDPNLTVFDYATNSKLEGEITTAQNDIVGGKLFKGSNGDTVEVGDTVLSGTTHLRVLVGGEPAIVAMSPISVGLVSSISETGAVIGTESVSFFSPIDKNYKSVEAVISGVNGYPLIGDTITTSSRNGIKVTSRWEIRNSGDYIDGEEFESLNNGNIMVLVSEYTAKAFGCGGGVDDTDNLQRLFDSTRRGVAEIDIDCPVVLDTTKNTVQNTDPRPPLDDGETSSAAIVINHPVVCKSDFEISAKVGPTPVTTYHYLFSTNTDDGIHLELNLNGKRDEGAPSSSLGVQVNNDNVTTKNKGGNFGSSPLVLNGNARNDPLLNQKHDNHAYSNVGNSIFCRYVKNATAKNWTVRDVSEGFDLDKTCDGVNLDDWTVFGTRGDGADAAIEMNGAKNCTARNLFSVGFKTGAILNAKERYDEPGVYDRCENCSIDGGITISPTDGGVIVGNVTGDFTDTIDCSADDFQVFNATTGRPSYSVSGTNFSMKNAKSFGSEREGALIRGGTVNADGFESYESDRAGVDVLDGAKASLRNAHIENPNKSLGGHNGINAGNNTDVNVVGATVKGAAEYSIRKTGTGEFIFGNNDLSGALQDGLRYSSSANIVQMGDGKGSVQGGVQLGHGRRVFYASKSYTATSSLLMKEGDVIIHNDAFDAGDYYGRICYQVVGSIPQFRNFGKIE
Physico‐chemical
properties
protein length:781 AA
molecular weight: 83185,51610 Da
isoelectric point:4,71630
aromaticity:0,07682
hydropathy:-0,38886

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A6B7SIZ1
1 781
Domain Start End Length (AA) Confidence
N-terminal 1 281 281 0,9939
Central domain 282 715 435 0,9818
C-terminal 716 781 65 0,9465
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-281
Central
282-715
C-terminal
716-781

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage Seahorse
[NCBI]
2662136 Uroviricota > Caudoviricetes > Seahorsevirus >
Host Vibrio parahaemolyticus
[NCBI]
670 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QGF21011.1 [NCBI]
Genbank nucleotide accession
MN512538 [NCBI]
CDS location
range 25039 -> 27384
strand +
CDS
ATGGCAGATTGCTCAAAAATACCAACTTTAGAATTGGTTGAGGACTCAAAAAAAGGCATGAGTGACATTGTGTCTTTCGCAAACAGCAGCGATAACACATATCAATCAGAATTCGACAACTCAACCAGAACAACAATAAAAGGAGCTCTAAATAAACTTGGGGGAAATAACAAAGGCAATTACCTTGACGACCCATTACTTCAAGATGACGTTGATTATGCTTACTTTTCTGCTACTCAAACAAAATACAAAATTGCTAAGGGAGTTTTGGCTCCTTACCAAGTTAACTCAACAACTTATCCAGATCCAGAAACGGACCCAAACCTAACTGTATTTGACTATGCAACCAACTCAAAGTTAGAAGGTGAAATAACCACAGCCCAAAACGACATCGTAGGCGGCAAGCTATTTAAGGGTTCGAATGGTGATACTGTTGAGGTTGGTGACACAGTCCTATCGGGAACTACACACCTTCGAGTTTTGGTTGGTGGAGAGCCTGCTATTGTTGCAATGTCACCAATATCTGTCGGATTAGTTAGTTCAATAAGCGAAACCGGAGCAGTAATAGGCACTGAAAGTGTTAGCTTTTTCAGCCCCATAGATAAGAACTATAAAAGTGTTGAGGCGGTTATTTCTGGCGTTAACGGGTATCCTTTAATTGGCGATACTATAACCACAAGCTCCAGAAACGGAATTAAAGTTACCTCTAGATGGGAAATAAGAAATTCCGGTGATTACATTGATGGTGAAGAGTTTGAGTCTCTTAACAATGGTAATATCATGGTTCTTGTTTCAGAGTACACGGCAAAAGCATTTGGTTGTGGCGGTGGGGTTGATGACACTGATAATTTACAAAGACTTTTCGATTCAACCAGAAGAGGGGTAGCTGAAATTGATATTGATTGCCCTGTGGTCTTAGATACAACAAAAAACACCGTGCAAAACACGGACCCTAGGCCACCTTTAGATGACGGTGAAACTTCATCTGCTGCAATTGTCATTAACCACCCCGTTGTGTGTAAATCTGATTTTGAGATTAGCGCAAAAGTTGGCCCAACACCCGTGACAACTTATCACTACTTATTTAGCACTAATACAGATGATGGTATACACCTTGAGTTAAATCTTAATGGTAAGCGAGACGAAGGGGCTCCTTCATCATCTCTTGGGGTGCAGGTCAACAATGATAATGTAACAACTAAAAATAAAGGCGGAAACTTTGGTAGCTCACCACTCGTCCTCAACGGGAACGCAAGGAATGACCCGCTACTAAACCAAAAGCACGATAATCACGCCTATTCAAATGTCGGAAACTCTATTTTTTGTCGGTATGTGAAAAACGCCACAGCAAAAAATTGGACTGTTCGTGACGTTAGTGAGGGCTTTGATCTAGACAAAACTTGTGACGGCGTAAACCTAGATGATTGGACTGTATTTGGCACTAGGGGTGATGGAGCAGATGCCGCCATTGAAATGAATGGCGCCAAGAACTGTACGGCTAGAAATCTATTCAGTGTTGGCTTTAAAACAGGAGCGATACTTAACGCCAAAGAAAGATATGATGAGCCAGGTGTTTATGATAGGTGTGAAAATTGCTCAATTGATGGCGGGATAACCATATCACCTACTGATGGTGGGGTCATCGTCGGTAATGTTACAGGCGATTTTACTGACACAATTGACTGTTCTGCTGATGATTTTCAGGTGTTCAACGCTACTACAGGTCGCCCTTCTTATTCTGTATCTGGAACTAATTTTTCAATGAAGAATGCTAAATCATTCGGATCTGAAAGAGAGGGTGCGTTAATTCGGGGTGGCACAGTTAATGCGGATGGATTCGAATCTTACGAATCTGATAGAGCGGGGGTTGACGTCCTAGACGGGGCGAAAGCGTCACTCAGGAATGCACACATTGAGAACCCAAACAAATCCCTTGGTGGGCATAATGGAATAAATGCTGGAAACAACACGGATGTAAACGTTGTTGGCGCCACAGTTAAAGGAGCCGCTGAGTACAGCATCAGAAAAACCGGTACTGGTGAGTTTATATTTGGCAATAACGACCTTTCAGGAGCATTACAAGATGGACTTAGATACTCCTCTTCCGCCAATATTGTGCAGATGGGAGATGGAAAGGGTAGTGTGCAAGGTGGTGTGCAACTTGGTCATGGTAGAAGAGTGTTTTATGCCTCAAAAAGCTACACAGCAACAAGCTCATTACTAATGAAAGAGGGTGACGTTATAATCCATAACGACGCTTTTGATGCTGGAGATTACTATGGGCGAATTTGTTACCAAGTAGTAGGTAGTATTCCTCAGTTCAGAAACTTTGGTAAGATAGAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
edcfefad82984bc709effccbd358d8a904c550c62a77852b5a014ce8f4774cea
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7685
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
32047244 PubMed