Genbank accession
YP_009211060.1 [GenBank]
Protein name
tail protein
RBP type
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 0,87
TF
Evidence RBPdetect
Probability 0,91
TF
Evidence RBPdetect2
Probability 1,00
Protein sequence
MAVGEIQISALPQASLPIELSDIFHLKQGIEDKRCTLEQLLDPHASLRNNPHGVTKVQVELGNVLNALQLVAANNLSDIVDVAEARANLQIMSSDEVNQLIQAHINDKNNPHNTTKSQVGLGNVQNWLASNAYNEDADKYATARAVNSLYRAVQASYPVGTIHLSMNAANPSTYLICGGTWELVSRGRALVGYDSASRPAGSTFGSQTAALTANNIPAHTHSVYVTGGGHTHGATVSINAFDYGTKGTTTFDYGTKTVSTFNYGNKTTSSAGFTQTTLINRGYDNGSLPGLSVVGADYDPRATLTTANNHTHSVAIGAHNHTVAIGAHSHSVAIGSHNHSATATVTGAGEHSHSGTTGSTGSGQAFNIEQPSFVLYVWQRTA
Physico‐chemical
properties
protein length:382 AA
molecular weight: 40212,76290 Da
isoelectric point:6,35013
aromaticity:0,06283
hydropathy:-0,25759

Domains

View on InterPro
YP_009211060.1
1 382 aa
ATT 154–262 · STR 263–284 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

YP_009211060.1
1 382 aa
Domain Start End Length (AA) Confidence
N-terminal 1 141 141 0,9909
Central domain 142 340 200 0,0282
C-terminal 341 382 41 0,9949
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Host
Escherichia coli str. K-12 substr. MG1655 [NCBI] · taxon 511145

Coding sequence (CDS)

Genbank protein accession
YP_009211060.1 [NCBI]
Genbank nucleotide accession
NC_028935 [NCBI]
CDS location
range 19345 -> 20493
strand +
CDS
ATGGCAGTAGGTGAAATTCAAATTAGTGCCTTGCCTCAAGCGTCGTTACCAATTGAGTTAAGTGATATCTTCCACCTTAAACAAGGTATTGAGGATAAGAGATGTACACTTGAGCAATTACTTGACCCCCACGCATCCCTGAGAAATAACCCTCACGGTGTCACAAAGGTTCAGGTTGAATTGGGTAACGTATTAAACGCTTTACAGCTTGTCGCTGCAAATAACTTATCAGATATTGTTGACGTAGCCGAAGCAAGAGCAAACTTGCAGATTATGTCTTCTGATGAAGTTAATCAACTTATTCAGGCTCACATCAATGATAAGAACAACCCTCACAACACAACTAAATCACAAGTTGGTTTGGGTAATGTTCAGAACTGGTTAGCCTCTAACGCATATAACGAAGATGCTGATAAGTATGCTACAGCAAGAGCTGTTAATAGTCTATACAGGGCTGTTCAGGCTTCATATCCAGTTGGTACTATCCACTTGTCTATGAATGCTGCTAACCCATCTACTTACCTGATTTGTGGAGGTACTTGGGAGCTAGTTTCAAGAGGTAGAGCACTTGTGGGTTATGATAGTGCTTCCAGACCAGCAGGTTCGACGTTTGGTTCACAGACTGCTGCACTGACTGCTAATAACATCCCTGCTCACACTCACTCAGTGTACGTTACAGGTGGTGGTCACACTCACGGTGCAACTGTCTCTATTAATGCTTTTGACTATGGTACGAAGGGAACTACAACGTTTGACTATGGAACCAAAACAGTAAGTACATTTAACTACGGCAACAAGACAACAAGTTCTGCTGGCTTCACACAGACCACACTTATCAATCGTGGGTATGATAATGGTAGCTTACCGGGACTATCAGTTGTAGGTGCAGATTATGACCCAAGAGCAACACTGACAACGGCTAATAACCATACTCACTCTGTAGCAATCGGTGCGCATAACCACACAGTTGCAATTGGTGCTCACAGCCATTCTGTAGCGATTGGTTCTCACAACCACTCAGCAACAGCAACAGTAACTGGAGCAGGTGAGCACTCCCACTCTGGTACAACTGGTTCAACTGGTTCTGGTCAAGCGTTCAATATTGAACAGCCATCATTCGTACTTTATGTATGGCAAAGAACTGCTTAA

Genome Context

Tertiary structure

YP_009211060.1
ESMFold structure
Source ESMFold
pLDDT 71.2
Oligomeric state monomer