GenBank accession
YP_009802475.1 [GenBank]
Protein name
tail protein
RBP type
TSP (evidence: DepoScope, probability: 1.00)
TSP (evidence: RBPdetect2, probability: 0.56)
Protein sequence
MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSVEQYYVVFTGQGIRVFDLNGKEYAVKGDLSYVKVGNPRDDLRMVTVADYTFIVNRNMVVRADTAPLYDLKENGDCLINVRGGQYGRTLAFTINGVRIAYKIHNGVGDGAEQAVQETDAQWLVKKLAGLARAHGSFKDWKFNEGPGFIHVIAPGNSQINSLSTEDGYANQLMNAVMHTSQSFSKLPLEAPNGYTVKIVGDTSKTSDQFYVQYDNVKKVWKEVAGWGVQKGLNGGTMPHALVRQSDGSFQMQVLPWAQRSCGDMDTNPTPSIVDQSINDVFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRASYTSLNRYYAVQDVSSVKSAEDMSAHVPSYIPNGVFSIRGSGTENFISVLSANAPSKIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTKNSIDFHDEPYRLYIDNKIKYVIPKGAYNDDTYTTTIKPVDIYGMKYWTGKFYIVASDGLVSWFDPPRGGWPNGVPVLSMSGNREGETIYVGLAINFRYVFSKFLIKKTADDGSTATEDIGRLQLRRAWVNYEDSGAFVVEVENTSRLFSYDMAGARLGSNALRAGGLNVGTGQFRFPVAGNAQLNEVRIISDHTTPLNVIGCGWEGNYLRRSSGI
Physico-chemical properties
protein length: 795 AA
molecular weight: 88218.43560 Da
isoelectric point: 6.90846
aromaticity: 0.11195
hydropathy: -0.24931
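
The aromaticity and hydropathy values above can be computed directly from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle value (GRAVY); the page does not state which definitions it uses, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)
```

Passing the 795-residue sequence listed under "Protein sequence" to these functions gives values to compare against the table, provided the page uses the same definitions.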

Domains

Domains [InterPro]
IPR058003: TTP, residues 1–795
Architecture
TTP 1–795

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain          Start  End  Length (AA)  Confidence
N-terminal          1  141          141      0.9961
Central domain    142  347          207      0.1091
C-terminal        348  795          447      0.2189
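
A quick consistency check on the segmentation is to confirm that the three domains tile the protein with no gaps or overlaps. The sketch below assumes the Start and End columns are 1-based and inclusive, which the page does not state; the helper names are illustrative. Under that assumption the central and C-terminal spans come out as 206 and 448 residues, one off from the tabulated 207 and 447, so the Length column may follow a different boundary convention.

```python
# Domain boundaries as listed in the table (assumed 1-based, inclusive).
DOMAINS = [
    ("N-terminal", 1, 141),
    ("Central domain", 142, 347),
    ("C-terminal", 348, 795),
]


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive span."""
    return end - start + 1


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length contiguously, in order."""
    expected_start = 1
    for _name, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


lengths = [inclusive_length(s, e) for _name, s, e in DOMAINS]
```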
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue, 1-141), Central (green, 142-347), C-terminal (pink, 348-795).

Taxonomy

Phage: Escherichia phage vB_EcoP_S523 [NCBI]
Taxonomy ID: 2233775
Lineage: Uroviricota > Caudoviricetes > Autographivirales > Studiervirinae > Berlinvirus
Host: Escherichia coli MC1061 [NCBI]
Taxonomy ID: 1211845
Lineage: Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Escherichia > Escherichia coli

Coding sequence (CDS)

GenBank protein accession
YP_009802475.1 [NCBI]
GenBank nucleotide accession
NC_047984 [NCBI]
CDS location
range: 20290 -> 22677
strand: - (minus)
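
The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative. Because the feature lies on the minus strand, the coding sequence is the reverse complement of the corresponding genome slice.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based inclusive range."""
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, optionally excluding a terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n // 3 - (1 if has_stop else 0)


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string, as needed for minus-strand features."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


nt = cds_length(20290, 22677)        # 2388 nucleotides
aa = encoded_residues(20290, 22677)  # 796 codons minus the stop = 795 residues
```

The result of 795 residues agrees with the protein length reported under the physico-chemical properties.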
CDS
ATGGCTCTTATTTCACAATCCGTCAAGAACCTGAAGGGCGGTATCAGTCAACAGCCGAACATCTTAAGGTTCCCCGAACAGGGTTCCGAACAGATTAACGGTTGGTCTTCGGAGACTGAGGGTCTTCAGAAGCGTCCACCTTTTGTCTTCACTAAGACCATTGGAGACCAGAATGCCCTTGGTGCCAAACCTCTCGTTCACCTAATCAACCGCGATAGTGTCGAACAGTATTACGTAGTGTTTACCGGACAGGGTATTCGTGTGTTCGACCTCAATGGTAAAGAGTATGCTGTGAAGGGTGACTTGTCCTACGTGAAGGTAGGGAACCCACGAGATGACTTAAGGATGGTCACTGTGGCTGACTATACGTTTATCGTAAACCGTAACATGGTGGTACGCGCTGACACTGCCCCTCTGTATGACCTTAAGGAGAATGGGGACTGCTTGATTAACGTCCGTGGCGGTCAGTATGGTCGTACATTGGCATTCACTATCAACGGTGTACGTATCGCATACAAGATTCACAACGGTGTTGGTGATGGTGCTGAACAGGCTGTACAGGAGACAGACGCACAGTGGCTCGTTAAGAAACTGGCTGGCCTCGCTCGTGCTCACGGTTCCTTTAAGGACTGGAAGTTCAACGAAGGGCCGGGGTTCATCCATGTGATTGCTCCGGGTAACAGCCAGATTAACTCACTGTCCACTGAAGATGGCTACGCCAACCAGTTGATGAACGCAGTGATGCACACCAGCCAGTCATTCAGTAAGTTGCCTCTTGAGGCTCCTAATGGGTACACAGTGAAGATTGTAGGTGACACCTCTAAGACTTCCGACCAGTTCTACGTTCAGTACGACAACGTGAAGAAGGTATGGAAAGAGGTGGCTGGTTGGGGCGTACAGAAGGGACTCAATGGTGGCACGATGCCTCACGCTCTCGTCCGTCAGTCTGATGGTTCATTCCAGATGCAGGTTCTACCGTGGGCACAGCGCTCATGTGGGGACATGGACACTAACCCTACTCCGTCTATTGTTGACCAGTCGATTAACGATGTGTTCTTCTTCCGTAACCGCTTAGGGTTCCTCGCTGGTGAGAACATTGTGATGTCCCGTACCTCCAAGTATTTCTCACTGTTCCCTGCCTCCGTGGCTAACCTGTCTGATGATGACCCAATCGACGTTGCCGTGTCTCACAACAGAATCTCAATCCTGAAGTACGCTGTGCCATTCTCCGAAGAGTTGCTCCTATGGTCAGACCAAGCACAGTTCGTGTTGTCTGCTCAAGGTATACTCTCACCGAAGTCAGTAGAATTGAACCTCACGACCGAGTTCGATGTGTCAGACCGAGCGAGACCTTTTGGCGTTGGGCGTGGTGTGTACTTTGCGTCACCTCGTGCTTCCTATACGTCACTTAACCGTTACTATGCGGTACAGGATGTTAGCTCCGTGAAGTCTGCTGAGGATATGAGTGCTCACGTTCCAAGTTACATTCCGAACGGTGTGTTCTCCATTCGTGGCTCCGGTACTGAGAACTTTATCTCCGTGCTCTCTGCGAACGCTCCGAGTAAAATCTTCCTGTACAAATTCCTGTACCTCAACGAAGAGATTGCTCAACAGTCGTGGTCACATTGGGAACTTGGAAGTAACGTAACGGTTCTGGCTTGTGACTCTATCGGCTCAACGATGTACCTTGTGTTGCGCAACCAGTCCCACACTTGGATGTGCCGAGCACACTTTACGAAGAACTCCATTGACTTCCATGATGAACCATATCGGCTGTACATCGACAACAAGATAAAGTATGTGATTCCTAAAGGTGCCTACAATGATGATACCTACACGACCACTATCAAGCCTGTGGACATCTACGGGATGAAGTATTGGACGGGTAAGTTCTACATCGTGGCCTCTGATGGTTTAGTCTCGTGGTTCGACCCTCCGCGTGGTGGTTGGCCTAATGGTGTCCCTGTGCTGTCAATGAGTGGGAACCGTGAGGGTGAGAC
AATCTACGTTGGCTTGGCTATAAACTTCCGTTATGTGTTCTCTAAGTTCCTCATTAAGAAGACCGCTGACGACGGGTCTACGGCTACCGAGGACATTGGTCGATTGCAGCTTCGTCGAGCATGGGTGAACTATGAGGACTCTGGTGCATTCGTTGTGGAAGTGGAGAACACCTCACGTCTGTTCAGCTACGATATGGCAGGTGCCCGTTTGGGTTCCAATGCGTTACGTGCTGGTGGACTTAACGTTGGTACAGGTCAGTTCCGATTCCCGGTTGCTGGCAACGCACAGTTGAATGAGGTCCGCATTATCTCTGACCACACCACACCACTGAACGTTATCGGTTGTGGCTGGGAGGGTAACTACCTTCGTCGTTCTTCTGGTATCTAA

Tertiary structure

PDB ID
2b37894896a23a3010800cf94d65516a620dd2fdc9abd044173ccbc302628a24
Source: ESMFold
Method: ESMFold
Resolution: 0.8982
Oligomeric state: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
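
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the legend does not say which band the exact boundary values 90, 70 and 50 fall into, so assigning them to the lower band is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) are assigned to the lower band;
    the legend leaves this unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"
```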