GenBank accession: UOX39546.1 [GenBank]
Protein name: tail spike protein
RBP type
Type Evidence Probability
TF GenBank 1,00
TF Phold 1,00
TSP DepoScope 1,00
TSP RBPdetect 0,91
TSP RBPdetect2 0,95
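The RBP type calls above come from several evidence sources and use a decimal comma. The following is a minimal sketch, not part of the database itself, of how the rows could be parsed and grouped by type for downstream use. The names `ROWS`, `parse_decimal_comma`, and `group_by_type` are hypothetical helpers introduced only for illustration.

```python
from collections import defaultdict

# Rows copied from the record: (RBP type, evidence source, probability as printed).
ROWS = [
    ("TF", "GenBank", "1,00"),
    ("TF", "Phold", "1,00"),
    ("TSP", "DepoScope", "1,00"),
    ("TSP", "RBPdetect", "0,91"),
    ("TSP", "RBPdetect2", "0,95"),
]


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '0,91' to a float."""
    return float(text.replace(",", "."))


def group_by_type(rows):
    """Map each RBP type to a list of (evidence, probability) pairs."""
    grouped = defaultdict(list)
    for rbp_type, evidence, prob in rows:
        grouped[rbp_type].append((evidence, parse_decimal_comma(prob)))
    return dict(grouped)


if __name__ == "__main__":
    for rbp_type, calls in group_by_type(ROWS).items():
        print(rbp_type, calls)
```

The sketch only reorganizes the listed values; it does not attempt to combine the sources into a consensus call, since the record does not describe how that would be done.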
Protein sequence
MGSNNPIGSTAMTDLFFNAGNLDKALNSSSDMWRDRFGIERVTYSYMERNVTELINNISGPTGATKVGYKNPAPGGITKNIAQRLMAFLSTTDFDTPQNTILAALSSSAAVLDLEESFTIEVGPAVQFKTITAAIHQLVKMRPLFKNGQAKCKIKLLSGFVMAEQILVINGSDLNWIIIESDVPVRIDHTKITTKLSDEDDLIPIFGASNQSVLPSIKALFYYDSNTTSKDGVAVIEAAGVYLYPGSGVQKSRNGFKALYGGWGFCYMRGLVLSGGGGGAGNTTGVDFSNARNRGLHVAHGSVVGFPRSNFANSLGDYGVYCIWNSMADLYQSDASGAAGTAFCSRDGSIVNCRESCAARSKRGYHALHGGRINARSKQTPEAAMWVKDSARGCSEYGIIASGCSQIEGAEVDVSGCTGQAGVIASDSSSVSFQLGIANGCADKGIFASGAAHIQADGADASTNPIGMYAIGVATIAAQSASKQAKINDCGFGALVAGGGRIDATGIQALRCDRAVEARDTGSIDARGANLSSSKQRGVSCIDGGTVNVIDGIITDSLDRAITCRDGGRVSARGANCDRAGYLSVEVKAGGIVSFHSGIGDKLNVAINTVTTDGIIFK
Physico-chemical properties
protein length: 618 AA
molecular weight: 64461,79590 Da
isoelectric point: 7,03720
aromaticity: 0,06958
hydropathy: -0,03058
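The record does not state how these properties were computed. As a hedged illustration, the sketch below assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY); both are common definitions, but whether the database uses them is an assumption. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First residues of the protein sequence in this record, used as a short demo input.
    fragment = "MGSNNPIGSTAMTDLFFNAGNLDKALNSSSDMWRDRFGIERVTYSYMERNVTELINNISG"
    print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Running the same functions on the full 618-residue sequence would allow a comparison with the values listed above, under the stated assumptions.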

Domains [InterPro]

No domain annotations available.

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout: UOX39546.1, residues 1-618
Domain Start End Length (AA) Confidence
N-terminal 1 164 164 0,8247
Central domain 165 607 444 0,9755
C-terminal 608 618 10 0,4753
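As a consistency check on the table above, the sketch below recomputes each segment length from its start and end, assuming 1-based inclusive coordinates (an assumption; the record does not state its convention). Under that assumption the central and C-terminal segments come out as 443 and 11 residues, one off from the listed 444 and 10, although the listed lengths and the computed lengths both sum to 618. The names used here are hypothetical.

```python
# Rows copied from the table: (domain, start, end, listed length).
SEGMENTS = [
    ("N-terminal", 1, 164, 164),
    ("Central domain", 165, 607, 444),
    ("C-terminal", 608, 618, 10),
]
PROTEIN_LENGTH = 618


def inclusive_length(start: int, end: int) -> int:
    """Length of a segment with 1-based, inclusive boundaries."""
    return end - start + 1


def compare_lengths(segments):
    """Return (domain, listed length, recomputed length) for each segment."""
    return [(name, listed, inclusive_length(start, end))
            for name, start, end, listed in segments]


def covers_protein(segments, protein_length: int) -> bool:
    """True if the segments tile 1..protein_length without gaps or overlaps."""
    expected_start = 1
    for _, start, end, _ in segments:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for row in compare_lengths(SEGMENTS):
        print(row)
    print("tiles protein:", covers_protein(SEGMENTS, PROTEIN_LENGTH))
```

A one-residue offset of this kind typically points to a half-open or shifted boundary convention rather than to a different segmentation, but that interpretation is not confirmed by the record.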
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-164
Central: 165-607
C-terminal: 608-618

Taxonomy

Phage
Name: Aeromonas phage ZPAH34 [NCBI]
Taxonomy ID: 2924888
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Aeromonas hydrophila [NCBI]
Taxonomy ID: 644
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

GenBank protein accession: UOX39546.1 [NCBI]
GenBank nucleotide accession: OM810292.1 [NCBI]
CDS location: range 98095 -> 99951, strand -
CDS
ATGGGAAGCAATAATCCTATTGGCTCCACAGCCATGACAGATTTATTTTTTAATGCTGGTAATTTAGACAAAGCACTAAACTCGTCATCTGATATGTGGCGAGATCGGTTTGGAATAGAGCGCGTTACTTATTCCTATATGGAAAGGAACGTTACAGAATTAATTAATAATATTTCTGGCCCCACTGGAGCAACAAAAGTTGGGTATAAGAATCCAGCGCCTGGTGGAATTACAAAAAACATTGCTCAAAGATTAATGGCATTTTTATCCACTACTGATTTTGACACTCCTCAAAATACAATATTGGCTGCACTAAGTTCTTCGGCAGCCGTATTAGATTTAGAAGAAAGTTTTACTATTGAAGTTGGTCCAGCTGTTCAATTTAAAACAATAACAGCCGCCATCCATCAATTAGTAAAAATGAGACCGTTATTTAAAAATGGCCAAGCCAAATGTAAAATAAAACTTTTATCTGGTTTTGTCATGGCGGAACAAATCTTGGTTATCAATGGTTCCGATTTAAACTGGATCATTATAGAAAGTGATGTTCCCGTCAGGATTGATCACACAAAAATAACCACTAAATTATCCGATGAAGATGATTTAATCCCTATTTTTGGGGCAAGTAATCAGTCCGTTTTACCAAGCATAAAAGCCTTATTTTATTATGATAGCAATACTACCTCTAAAGACGGTGTCGCTGTTATTGAAGCCGCAGGTGTTTATCTTTATCCAGGTTCTGGAGTTCAAAAATCCAGGAACGGATTTAAAGCTCTCTATGGTGGTTGGGGTTTTTGCTATATGCGGGGGCTTGTTTTAAGCGGCGGTGGAGGTGGGGCAGGAAACACAACTGGGGTTGATTTCTCTAATGCCAGAAATAGAGGGCTTCATGTAGCCCATGGATCTGTCGTTGGATTTCCAAGAAGTAATTTTGCAAATTCATTAGGCGATTATGGTGTTTATTGTATTTGGAATTCAATGGCTGATCTTTATCAGTCTGATGCTTCAGGGGCCGCCGGTACGGCATTTTGTTCTAGGGACGGGTCCATAGTAAACTGTCGAGAGTCTTGCGCCGCCAGAAGTAAGCGAGGTTATCACGCCCTCCATGGTGGGAGAATTAATGCCCGTAGTAAACAAACACCTGAAGCAGCAATGTGGGTTAAAGATAGTGCAAGGGGGTGTTCCGAATACGGTATTATAGCTTCTGGTTGTTCCCAAATAGAAGGTGCAGAAGTTGACGTATCTGGGTGCACTGGTCAAGCTGGTGTTATCGCGAGTGATAGCTCTAGTGTTTCTTTCCAACTTGGAATAGCTAACGGTTGCGCCGATAAAGGAATATTTGCATCGGGTGCGGCGCATATTCAGGCAGATGGCGCTGACGCCTCGACTAACCCGATAGGTATGTACGCAATTGGTGTGGCTACGATAGCCGCCCAGAGCGCAAGCAAACAAGCTAAGATCAATGATTGCGGGTTTGGTGCACTTGTCGCTGGTGGCGGAAGAATAGACGCTACCGGGATTCAGGCCCTGCGATGTGACAGAGCTGTTGAAGCTAGAGATACCGGTTCTATAGACGCCAGAGGTGCGAATCTTTCAAGTTCTAAACAACGTGGAGTTTCTTGTATTGATGGCGGAACTGTTAATGTTATCGACGGAATTATAACAGACTCCTTAGATCGAGCAATAACATGTCGGGATGGCGGGCGAGTATCTGCTCGTGGCGCTAACTGTGATCGTGCAGGGTACCTTTCTGTAGAAGTAAAGGCAGGTGGAATTGTATCATTCCATTCTGGGATCGGCGACAAGTTAAACGTCGCGATAAACACAGTAACAACAGATGGAATTATATTTAAATAA
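The CDS coordinates can be cross-checked against the protein length. Assuming the usual GenBank convention of 1-based inclusive coordinates, the range 98095 to 99951 spans 1857 nucleotides, that is 619 codons, or 618 amino acids plus a stop codon, which agrees with the reported length of 618 AA and the terminal TAA of the CDS. The sketch below shows this arithmetic and how a minus-strand CDS could be extracted from a genome string; the function names and the toy genome are hypothetical.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the CDS, excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a CDS out of a genome; reverse-complement it on the minus strand."""
    sub = genome[start - 1:end]
    return reverse_complement(sub) if strand == "-" else sub


if __name__ == "__main__":
    print(cds_length(98095, 99951), protein_length(98095, 99951))
    # Toy genome (not real data) with a minus-strand feature at positions 2..7.
    print(extract_cds("TTTACATG", 2, 7, "-"))
```

Because the feature lies on the minus strand, the sequence printed in the record would correspond to the reverse complement of genome positions 98095 to 99951 of OM810292.1, under the coordinate assumption above.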

Tertiary structure

PDB ID: ffcc171c904d3fffdb9679c03fd316208d89eb090d91229a067e15f468879dc4
Source: ESMFold
Method: ESMFold
Resolution: 0,7733
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
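The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch that assumes pLDDT is on a 0-100 scale; the legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is an assumption made here. The function names are hypothetical.

```python
from collections import Counter


def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"  # assumption: exactly 90 falls here
    if score > 50:
        return "Low"  # assumption: exactly 70 falls here
    return "Very low"  # assumption: exactly 50 falls here


def band_counts(scores):
    """Count how many residues fall into each confidence band."""
    return Counter(plddt_band(s) for s in scores)


if __name__ == "__main__":
    # Made-up example scores, not taken from the model in this record.
    example_scores = [95.2, 88.0, 71.5, 64.3, 42.0]
    print(band_counts(example_scores))
```

Applied to the per-residue scores of a predicted model, `band_counts` gives a quick summary of how much of the structure falls in each band.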