UniProt accession
A0A172EK53 [UniProt]
Protein name
Tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,81
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MADYSQLPIENIWSTGGDMVAPTPAQQQGGWGIQSVPRQWWNWKWNLHDTNLAYLLQKGIPEWTSTQEYIANKSFCTRGGFVYKAVRTHTGSDPATVSANWVRAFADYTTASSALGGLTPREGGIPFFISATGASVFDSTAYGRGMLNVANAAGARNYISAQESSVVLSNLAAVTRAANAVPYFNTDTSMSTFNITAFGRGLVNAANAENARGFLGLANSAIITADPANRAHTIVYRDAAGNFNAGVITATLSGNATTANKLRTPVTINGVAFDGSSNIVLPGLDTSYAGTVARLHVNGANMTGTDKTTQIALRNKSDNDWISLAVVDDNILQFAFRSATNPIVQIGGEVILHTGNQFSLGPTLSDARTRLGLDRVSQGSSDTVVFASTLNNGPYLTVQPTSVGGFDGGSNNWMFRFDTNGNLTHGTVPVARITGLANSATIPAVTTAQANSIVQRDSTGSFSAIGINAYGPIIGYGSNIYSRASASGGNAHIGFQRADGSELGLIWGVPSSGAMSFRTSGGTTGMTLTGQDLLVAGRVNATTLNASGNVNATGNVNAGSATLNTAGNITGAAYGAYGSLTNWVDSVYAKKGEIPNDIARAGAAWDAVGQYILAGDLSGGSGGPGTTRAGSQLKPYSTISYTAGALPAAGTYRCMGAFAGGGNQITLWQRIS
Physico‐chemical
properties
protein length:672 AA
molecular weight: 69562,30050 Da
isoelectric point:8,66998
aromaticity:0,08631
hydropathy:-0,06786

Domains

Domains [InterPro]

No domain annotations available.

Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
A0A172EK53
1 672
Domain Start End Length (AA) Confidence
N-terminal 1 311 311 0,6024
Central domain 312 510 200 0,2523
C-terminal 511 672 161 0,8080
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-311
Central
312-510
C-terminal
511-672

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudomonas phage vB_PaeM_MAG1
[NCBI]
1639815 Uroviricota > Caudoviricetes > Vandenendeviridae > Pakpunavirus > Pakpunavirus MAG1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ALA12079.1 [NCBI]
Genbank nucleotide accession
KR052143 [NCBI]
CDS location
range 47056 -> 49074
strand -
CDS
ATGGCTGATTACAGTCAACTACCTATTGAAAATATTTGGTCCACAGGCGGGGATATGGTAGCCCCGACCCCTGCGCAGCAGCAAGGCGGATGGGGTATTCAGTCGGTTCCTCGCCAATGGTGGAACTGGAAATGGAACCTTCACGATACCAACCTAGCGTATCTGCTACAGAAGGGCATTCCAGAGTGGACTAGTACACAAGAGTACATTGCTAACAAATCGTTCTGCACTAGAGGCGGATTTGTCTACAAGGCTGTCCGTACCCATACAGGTAGCGACCCGGCCACCGTTAGTGCCAACTGGGTTCGAGCTTTTGCAGATTACACCACAGCTAGTTCAGCACTAGGGGGCCTGACTCCAAGGGAAGGAGGTATTCCTTTCTTCATTAGTGCTACAGGAGCTAGCGTGTTTGACTCTACGGCTTATGGCCGTGGCATGCTTAACGTGGCTAACGCAGCCGGAGCAAGAAACTACATCTCGGCTCAAGAGAGTTCCGTTGTTCTGTCTAACCTAGCTGCTGTAACTCGGGCTGCTAACGCGGTTCCATATTTCAACACCGACACTTCTATGTCGACGTTTAATATTACAGCGTTCGGAAGAGGACTTGTCAACGCGGCTAATGCTGAAAACGCAAGAGGTTTCCTAGGCCTAGCTAACTCTGCTATCATCACAGCCGATCCAGCTAACAGAGCCCACACTATCGTTTATCGAGACGCAGCAGGTAACTTCAACGCCGGAGTTATTACAGCCACTCTTTCTGGTAACGCTACTACAGCTAACAAGCTTAGAACTCCTGTTACAATTAACGGTGTGGCGTTTGACGGAAGTTCGAACATTGTCCTCCCCGGTCTCGACACAAGCTACGCTGGAACAGTAGCGAGGCTCCACGTTAATGGTGCCAACATGACTGGAACGGACAAAACGACTCAAATTGCTCTGAGAAACAAATCGGACAACGATTGGATCAGTCTTGCAGTAGTCGATGACAACATCCTTCAGTTTGCTTTCAGAAGTGCCACAAACCCAATCGTACAGATCGGTGGAGAGGTTATCCTCCATACAGGTAACCAATTCTCCCTCGGACCTACCCTGAGTGATGCTCGCACTCGTCTAGGACTTGACAGGGTTTCTCAAGGGTCTTCTGACACCGTTGTATTTGCCAGTACCCTTAACAATGGACCGTACTTGACTGTCCAGCCTACCAGTGTAGGCGGTTTTGATGGCGGAAGCAACAATTGGATGTTCCGTTTCGATACGAACGGTAACCTGACTCATGGAACTGTTCCGGTTGCAAGGATTACAGGGCTAGCCAACTCTGCTACCATTCCCGCCGTAACTACTGCTCAGGCTAACTCTATTGTCCAAAGAGACTCCACTGGAAGCTTTAGTGCAATCGGTATTAACGCCTACGGGCCTATTATCGGATACGGCTCCAATATCTACAGCCGAGCTTCTGCATCTGGAGGTAATGCCCATATTGGATTCCAAAGAGCAGACGGCTCCGAGCTTGGGCTAATCTGGGGTGTTCCTTCTAGCGGGGCTATGAGTTTCCGTACATCGGGTGGGACCACGGGTATGACCCTGACAGGTCAAGATTTGCTTGTCGCAGGGCGCGTTAACGCTACCACTTTGAACGCCTCTGGAAACGTGAATGCGACAGGTAACGTCAACGCCGGGAGTGCTACACTCAATACAGCAGGGAACATCACAGGGGCCGCCTACGGAGCGTATGGTTCTCTTACCAACTGGGTAGATTCTGTATATGCGAAGAAGGGAGAGATTCCTAACGATATCGCTAGAGCAGGCGCAGCTTGGGACGCAGTTGGACAGTACATCCTAGCTGGCGATCTATCTGGAGGGTCTGGTGGACCGGGGACAACTAGAGCTGGAAGCCAGTTGAAGCCTTATTCGACTATTAGCTACACAGCAGGGGCACTACCAGCAGCAGGCACTTACAGATGTATGGGAGCGTTTGCAGGTGGTGGTAACCAGATTACCCTGTGGCAGAGAATTTCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
619f4967dc13c4dbd97f5e1617aba72761c1c8d7e04ac42c2ca71847664f34e7
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6499
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50