Genbank accession
YP_654148.1 [GenBank]
Protein name
tail spike protein
RBP type
Type  Evidence    Probability
TSP   GenBank     1.00
TF    Phold       1.00
TSP   DepoScope   1.00
TSP   RBPdetect   0.91
TSP   RBPdetect2  0.94
Protein sequence
MIQRLGSSLVKFKSKIAGAIWRNLDDKLTEVVSLKDFGAKGDGKTNDQDAVNAAMASGKRIDGAGATYKVSSLPDMERFYNTRFVWERLAGQPLYYVSKGFINGELYKITDNPYYNAWPQDKAFVYENVIYAPYMGSDRHGVSRLHVSWVKSGDDGQTWSTPEWLTDLHPDYPTVNYHCMSMGVCRNRLFAMIETRTLAKNKLTNCALWDRPMSRSLHLTGGITKAANQQYATIHVPDHGLFVGDFVNFSNSAVTGVSGDMTVATVIDKDNFTVLTPNQQTSDLNNAGKSWHMGTSFHKSPWRKTDLGLIPSVTEVHSFATIDNNGFVMGYHQGDVAPREVGLFYFPDAFNSPSNYVRRQIPSEYEPDASEPCIKYYDGVLYLITRGTLGDRLGSSLHRSRDIGQTWESLRFPHNVHHTTLPFAKVGDDLIMFGSERAENEWEAGAPDDRYKASYPRTFYARLNVNNWNADDIEWVNITDQIYQGDIVNSSVGVGSVVVKDSYIYYIFGGENHFNPMTYGDNKGKDPFKGHGHPTDIYCYKMQIANDNRVSRKFTYGATPGQAIPTFMGTDGIRNIPAPLYFSDNIVTEDTKVGHLTLKASTSSNIRSEVQMEGEYGFIGKSVPKDNPTGQRLIICGGEETSSSSGAQITLHGSNSSKANRITYNGNEHLFQGAPIMPAVDNQFAAGGPSNRFTTIYLGSDPVTTSDADHKYSISSINTKVLKAWSRVGFKQYGLNSEAERDLDSIHFGVLAQDIVAAFEAEGLDAIKYGIVSFEEGRYGVRYSEVLILEAAYTRYRLDKLEEMYATNKIS
Physico-chemical properties
protein length: 811 AA
molecular weight: 90382.89800 Da
isoelectric point: 6.04388
aromaticity: 0.11344
hydropathy: -0.44957
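The aromaticity and hydropathy values above can be approximated directly from the protein sequence. The sketch below assumes aromaticity is the fraction of F, W and Y residues and that hydropathy is a GRAVY score on the Kyte-Doolittle scale; the page does not state which definitions it uses, and the function names are illustrative. Under the assumed aromaticity definition, the listed value would correspond to about 92 aromatic residues out of 811.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name a scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MIQRLGSSLVKF"  # first 12 residues of the protein sequence, as a toy input
    print(f"aromaticity={aromaticity(toy):.5f} gravy={gravy(toy):.5f}")
```

Passing the full 811-residue sequence from the record in place of the toy fragment gives values that can be compared with the listed ones.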

Domains [InterPro]
IPR024427  ENZ       214–296
IPR001724  Unmapped  288–311
Architecture
ATT 2-102 | STR 103-705 | RBD 706-777 | CHP 778-808
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
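The architecture line can be read mechanically as a list of labelled residue ranges. The following is a minimal sketch, assuming the `LABEL start-end | ...` layout shown above and 1-based inclusive coordinates; `parse_architecture` is a hypothetical helper name, not part of any database API.

```python
def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split 'ATT 2-102 | STR 103-705 | ...' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # tolerate a trailing separator
            continue
        label, span = part.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((label, start, end))
    return segments


ARCHITECTURE = "ATT 2-102 | STR 103-705 | RBD 706-777 | CHP 778-808"
segments = parse_architecture(ARCHITECTURE)

# Adjacent segments are contiguous when each one starts right after the previous ends.
contiguous = all(
    nxt[1] == cur[2] + 1 for cur, nxt in zip(segments, segments[1:])
)

if __name__ == "__main__":
    for label, start, end in segments:
        print(f"{label}: {start}-{end} ({end - start + 1} AA)")
    print("contiguous:", contiguous)
```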

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: an N-terminal domain, a central domain, and a C-terminal domain.

Domain Layout
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      48   48           0.9553
Central domain  49     279  232          0.7767
C-terminal      280    811  531          0.5776
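The lengths in the segmentation table can be cross-checked against the start and end coordinates. This sketch assumes 1-based inclusive coordinates, so a span covers `end - start + 1` residues. Under that assumption the central and C-terminal rows come out one residue off from the listed lengths (231 and 532 rather than 232 and 531), although the listed lengths do sum to 811. The table may therefore place the boundary at 280/281 or follow a different convention; the record itself does not say.

```python
# (name, start, end, listed length) as given in the segmentation table.
DOMAINS = [
    ("N-terminal", 1, 48, 48),
    ("Central domain", 49, 279, 232),
    ("C-terminal", 280, 811, 531),
]


def span_length(start: int, end: int) -> int:
    """Residue count of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


# Names of rows whose listed length differs from the computed span length.
mismatches = [
    name for name, start, end, listed in DOMAINS
    if span_length(start, end) != listed
]

if __name__ == "__main__":
    for name, start, end, listed in DOMAINS:
        print(f"{name}: computed {span_length(start, end)}, listed {listed}")
    print("rows to double-check:", mismatches)
```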
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring: N-terminal 1-48 | Central 49-279 | C-terminal 280-811

Taxonomy

       Name                           Taxonomy ID  Lineage
Phage  Escherichia phage K1-5 [NCBI]  2681604      Uroviricota > Caudoviricetes > Autographivirales > Molineuxvirinae > Vectrevirus
Host   No host information

Coding sequence (CDS)
Genbank protein accession
YP_654148.1 [NCBI]
Genbank nucleotide accession
NC_008152.1 [NCBI]
CDS location
range 41351 -> 43786
strand +
CDS
ATGATTCAAAGACTAGGTTCTTCATTAGTTAAATTCAAGAGTAAAATAGCAGGTGCAATCTGGCGTAACTTGGATGACAAGCTCACCGAGGTTGTATCGCTTAAAGATTTTGGAGCCAAAGGTGATGGTAAGACAAACGACCAAGATGCAGTAAATGCAGCGATGGCTTCAGGTAAGAGAATTGACGGTGCTGGTGCTACTTACAAAGTATCATCTTTACCTGATATGGAGCGATTCTATAACACCCGCTTCGTATGGGAACGTTTAGCAGGTCAACCTCTTTACTATGTGAGTAAAGGTTTTATCAATGGTGAACTATATAAAATCACGGATAACCCTTATTACAATGCTTGGCCTCAAGACAAAGCGTTTGTATATGAGAACGTGATATATGCACCTTACATGGGTAGTGACCGTCATGGTGTTAGTCGTCTGCATGTATCATGGGTTAAGTCTGGTGACGATGGTCAAACATGGTCTACTCCAGAGTGGTTAACTGATCTGCATCCAGATTACCCTACAGTGAACTATCATTGTATGAGTATGGGTGTATGTCGCAACCGTCTGTTTGCCATGATTGAAACACGTACTTTAGCCAAGAACAAACTAACCAATTGTGCATTGTGGGATCGCCCTATGTCTCGTAGTCTGCATCTTACTGGTGGTATCACTAAGGCTGCAAATCAGCAATATGCAACAATACATGTACCAGATCACGGACTATTCGTGGGCGATTTTGTTAACTTCTCTAATTCTGCGGTAACAGGTGTATCAGGTGATATGACTGTTGCAACGGTAATAGATAAGGACAACTTCACGGTTCTTACACCTAACCAGCAGACTTCAGATTTGAATAACGCTGGAAAGAGTTGGCACATGGGTACTTCTTTCCATAAGTCTCCATGGCGTAAGACAGATCTTGGTCTAATCCCTAGTGTCACAGAGGTGCATAGCTTTGCTACTATTGATAACAATGGCTTTGTTATGGGCTATCATCAAGGTGATGTAGCTCCACGAGAAGTTGGTCTTTTCTACTTCCCTGATGCTTTCAATAGCCCATCTAATTATGTTCGTCGTCAGATACCATCTGAGTATGAACCAGATGCGTCAGAGCCATGCATCAAGTACTATGACGGTGTATTATACCTTATCACTCGTGGCACTCTTGGTGACAGACTTGGAAGCTCTTTGCATCGTAGTAGAGATATAGGTCAGACTTGGGAGTCACTGAGATTTCCACATAATGTTCATCATACTACCCTACCTTTTGCTAAAGTAGGAGATGACCTTATTATGTTTGGTTCAGAACGTGCAGAAAATGAATGGGAAGCAGGTGCACCAGATGATCGTTACAAGGCATCTTATCCTCGTACCTTCTATGCACGATTGAATGTAAACAATTGGAATGCAGATGATATTGAATGGGTTAACATCACAGACCAAATCTATCAAGGTGACATTGTGAACTCTAGTGTAGGTGTAGGTTCGGTAGTAGTTAAAGACAGCTACATTTACTATATCTTTGGTGGCGAAAACCATTTCAACCCAATGACTTATGGTGACAACAAAGGTAAAGACCCATTTAAAGGTCATGGACACCCTACTGATATATACTGCTATAAGATGCAGATTGCAAATGACAATCGTGTATCTCGTAAGTTTACATATGGTGCAACTCCGGGTCAAGCTATACCTACTTTCATGGGTACTGATGGAATACGAAATATCCCTGCACCTTTGTATTTCTCAGATAACATTGTTACAGAGGATACTAAAGTTGGACACTTAACACTTAAAGCAAGCACAAGTTCCAATATACGATCTGAAGTGCAGATGGAAGGTGAATATGGCTTTATTGGCAAGTCTGTTCCAAAGGACAACCCAACTGGTCAACGTTTGATTATTTGTGGTGGAGAAGAGACTTCGTCCTCTTCAGGTGCACAGATAACTTTGCACGGCTCTAATTCAAGTAAGGCTAATCGTATCACTTATAACGGAAA
TGAGCACCTATTCCAAGGTGCACCAATCATGCCTGCTGTAGATAACCAGTTTGCTGCTGGTGGACCTAGTAACCGATTCACTACCATCTACCTAGGTAGTGACCCTGTTACAACTTCAGATGCTGACCACAAGTACAGTATCTCTAGTATTAATACCAAGGTGTTAAAGGCTTGGAGCAGGGTTGGTTTTAAACAGTATGGTTTGAATAGTGAAGCAGAGAGGGACCTTGATAGCATACACTTCGGTGTCTTGGCTCAGGATATTGTAGCTGCTTTTGAAGCTGAAGGGTTGGATGCCATTAAGTATGGAATTGTGTCCTTCGAAGAAGGTAGGTACGGTGTGAGGTATAGTGAAGTTCTAATACTAGAGGCTGCTTATACTCGTTATCGTTTAGACAAGTTAGAGGAGATGTATGCCACTAATAAAATCAGTTAA
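The CDS coordinates and the protein length can be reconciled with simple arithmetic. The sketch below assumes 1-based inclusive genome coordinates and a CDS that ends with a single stop codon (the sequence above ends in TAA); the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide count of a 1-based inclusive CDS range."""
    return end - start + 1


def encoded_residues(n_nucleotides: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded, excluding the stop codon if present."""
    if n_nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n_nucleotides // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    n_nt = cds_length(41351, 43786)
    print(n_nt, "nt ->", encoded_residues(n_nt), "AA")
```

With the range 41351 -> 43786 this gives 2436 nucleotides, that is 812 codons, or 811 residues once the stop codon is excluded, which agrees with the listed protein length.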

Tertiary structure

PDB ID
22935e239f07111823aa887e6e0b5e91e6cf9310695e03813877c3e5081243a4
Source ESMFold
Method ESMFold
Resolution 0.6676
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
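The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend does not say which band the exact boundary values (90, 70, 50) belong to, so assigning them to the lower band here is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score on the 0-100 scale."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # boundary value 90 falls here (assumed)
        return "High"
    if score > 50:  # boundary value 70 falls here (assumed)
        return "Low"
    return "Very low"  # includes the boundary value 50 (assumed)


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 32.1):
        print(score, plddt_band(score))
```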