GenBank accession
UIS66033.1 [GenBank]
Protein name
tail fiber protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TSP DepoScope 1.00
TF RBPdetect 0.88
TF RBPdetect2 0.94
Protein sequence
MAVTTKIIVQQILNIDDTKATASKFPRYTVTLGNSISSITANELVSSIDAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAEASEASATQSATSATQSATSANKSAESAAAAKTSETNSKTSEANAKTSETNAKTSETNANASAAVAKISETNAKTSETNAAQSAVAAKLSEGSAKVSETNAAQSAADSSGFRNEAEIFSGQAAASAADAKTSETNAKASETKAKASETNAAGSATSANQSVTTIQGLKSDVEQLKSDTQAIKNSAVTETTALKADVEQLKSDTQAIKNSAVTETTALKADVEQLKTDTQGIKDSAVSETTTLKDQAAASATQAGNSAVEAGQQASNAASSAQSASTYAGRAEVAAGKAEGIIGKSLLKENNLSDLFNVEISRKNIRVDRLEQYPNETVLFSPGRGKYLTISESSWGVYSTEEGNVGFIPLPISSGGTGAVNITAAKKNLEIASFKSADEQSLMYSPDTSKYAIFIKDNGDWGCLTVSDGERHPLAINAGGTGATTPDGVRHNLGLATNHIPVFLGVHLDGNNGENSGILYLRNKNAEGVQLSYSRIYNEILGGTAYATIQVTREGGDTNYYQFDESGNAINYNTIAIGRGIINSLGTNSLAIGDSRTGFKQGGNGVLQVFCNGTNVASFDHQNLHLNGLLNIWPIDNNANGIRVSGSRTGGGNALIGGQVSGGAFVDWRDRAAGLLVELPSDGAASNVFKAVRWGYDWVAGLDVVRFNTGACEARFNVRGAIYSFNEAGYASCVQWVSTSDIRLKANLKEIKSAREKVKSIKGYTYFKRSNLYEDEHSVYSEEAGVIAQDVQTVLPEAVYKVSDSEYLGVSYGGVTALLVNAFNEMSDQVDKQQEEIETLKSEIADLKAAVAALLNKPTTLES
Physico-chemical properties
protein length: 896 AA
molecular weight: 93599.69440 Da
isoelectric point: 5.06148
aromaticity: 0.05692
hydropathy: -0.33895
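The page does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle value (GRAVY), two of the properties could be reproduced as follows. The function names are illustrative, and the demo string is only the first 30 residues of the protein sequence above.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 30 residues of UIS66033.1; paste the full sequence to compare
    # against the values listed in the record.
    demo = "MAVTTKIIVQQILNIDDTKATASKFPRYTV"
    print(f"length: {len(demo)} AA")
    print(f"aromaticity: {aromaticity(demo):.5f}")
    print(f"hydropathy: {gravy(demo):.5f}")
```

Running both functions on the full 896-residue sequence is a quick way to test whether these assumed definitions match the listed figures.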

Domains

Domains [InterPro]
Domain Class Range (AA)
PTHR43049 Unmapped 65–896
DC_1202 STR 240–896
Coil Unmapped 247–267
Architecture
ATT 2-203 | STR 240-896
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
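The architecture string can be read as a list of labelled residue ranges. The sketch below parses a string in the form shown above and reports which residues of the 896 AA protein fall outside every segment. The helper names `parse_architecture` and `uncovered_ranges` are hypothetical, and the `CODE start-end | CODE start-end` grammar is inferred from this single record rather than from any documented format.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 2-203 | STR 240-896' into (code, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*([A-Z]+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered_ranges(segments, protein_length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= protein_length:
        gaps.append((position, protein_length))
    return gaps


if __name__ == "__main__":
    segs = parse_architecture("ATT 2-203 | STR 240-896")
    print(segs)
    print(uncovered_ranges(segs, 896))
```

For this record, the unassigned residues are position 1 and the stretch 204-239 between the ATT and STR segments.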

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
Domain Start End Length (AA) Confidence
N-terminal 1 414 414 0.9804
Central domain 415 620 207 0.2769
C-terminal 621 896 275 0.7963
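A quick arithmetic check on the table: with 1-based inclusive coordinates, a domain's length is end - start + 1. Under that assumption the N-terminal row agrees (414), while the central and C-terminal rows compute to 206 and 276 against the listed 207 and 275. Both sets of lengths sum to 896. The page does not state which coordinate convention it uses, so the sketch below only flags the difference and does not decide which column is right.

```python
# (name, start, end, listed_length) copied from the table above.
DOMAINS = [
    ("N-terminal", 1, 414, 414),
    ("Central domain", 415, 620, 207),
    ("C-terminal", 621, 896, 275),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def check_domains(domains, protein_length: int):
    """Compare listed lengths with computed ones and test full tiling."""
    rows = []
    for name, start, end, listed in domains:
        computed = inclusive_length(start, end)
        rows.append((name, computed, listed, computed == listed))
    # Domains tile the protein if each starts right after the previous one
    # ends, the first starts at 1, and the last ends at the protein length.
    contiguous = all(
        domains[i][2] + 1 == domains[i + 1][1] for i in range(len(domains) - 1)
    )
    tiles = contiguous and domains[0][1] == 1 and domains[-1][2] == protein_length
    return rows, tiles


if __name__ == "__main__":
    rows, tiles = check_domains(DOMAINS, 896)
    for name, computed, listed, ok in rows:
        print(f"{name}: computed {computed}, listed {listed}, match={ok}")
    print("domains tile the full protein:", tiles)
```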
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-414
Central 415-620
C-terminal 621-896

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage PNJ1902 [NCBI] 2880888 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli [NCBI] 562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
UIS66033.1 [NCBI]
GenBank nucleotide accession
OK254197.1 [NCBI]
CDS location
range 21149 -> 23839
strand +
CDS
ATGGCAGTAACTACTAAAATTATTGTGCAACAAATACTAAATATAGATGATACGAAAGCTACTGCTAGTAAATTTCCTAGATACACAGTAACTCTTGGAAATTCTATTAGCTCTATTACTGCTAATGAGTTAGTATCCTCTATAGATGCCGCTGCTAAGTCTGCCGCGGCTGCAAAAGATTCTGAAATAGCTGCTAAGACTTCAGAGCTTAATGCTAAGAATTCTGAACAGGAAGCTGCGATTTCCGCGGAAGCTTCTGAAGCATCGGCTACCCAATCAGCTACCTCTGCTACTCAGTCGGCAACGTCAGCCAATAAGTCTGCAGAATCCGCTGCCGCAGCTAAAACATCCGAGACTAACTCGAAGACTAGTGAAGCTAATGCAAAGACTAGCGAAACTAATGCAAAGACAAGTGAAACTAACGCTAACGCATCAGCTGCTGTTGCAAAAATTAGTGAGACTAATGCAAAGACAAGTGAGACTAATGCAGCTCAATCAGCTGTTGCTGCAAAACTTAGCGAGGGTAGTGCAAAGGTCAGTGAGACTAATGCAGCTCAATCAGCTGCTGATTCTAGCGGTTTTAGGAATGAGGCGGAAATATTCTCTGGGCAAGCTGCTGCATCAGCAGCTGATGCAAAAACCTCTGAAACTAATGCAAAAGCCTCTGAGACAAAAGCTAAGGCTAGTGAAACTAATGCTGCAGGGTCTGCAACTTCCGCCAACCAATCTGTAACTACTATTCAAGGACTTAAATCAGATGTTGAACAGTTAAAATCTGATACCCAAGCCATTAAAAATAGTGCTGTAACAGAGACAACAGCTTTAAAAGCAGATGTTGAACAGTTAAAATCTGATACCCAAGCCATTAAAAATAGTGCTGTAACAGAGACAACAGCTTTAAAAGCAGATGTTGAGCAATTAAAAACAGATACACAAGGTATTAAGGATAGCGCGGTATCTGAGACAACAACTTTAAAAGACCAAGCTGCTGCTTCTGCTACACAAGCGGGTAATAGTGCTGTTGAGGCTGGGCAACAAGCTAGCAATGCTGCTAGTAGCGCACAAAGCGCATCTACCTACGCTGGACGTGCAGAAGTGGCTGCTGGAAAAGCTGAAGGTATTATTGGTAAATCATTACTAAAAGAAAATAATCTTTCGGATCTTTTTAACGTAGAAATCTCCAGAAAAAATATTCGTGTAGACAGGTTAGAGCAGTATCCTAATGAAACTGTGTTATTCTCTCCCGGAAGAGGAAAATACCTAACAATAAGTGAATCGTCATGGGGTGTTTATTCAACGGAAGAAGGTAATGTTGGATTTATTCCACTTCCTATATCTTCAGGTGGTACTGGCGCCGTTAATATAACTGCCGCTAAGAAAAATCTGGAGATTGCATCATTCAAAAGTGCAGACGAACAAAGTTTAATGTACTCCCCTGATACTTCCAAATATGCAATTTTTATAAAGGATAATGGCGATTGGGGTTGCCTTACAGTGTCTGACGGAGAAAGACATCCTTTAGCTATCAACGCTGGCGGTACAGGTGCAACAACGCCAGATGGAGTAAGGCACAATTTAGGGCTTGCGACAAATCATATTCCTGTGTTCTTGGGTGTACATCTTGATGGAAACAACGGTGAAAACTCCGGCATTCTTTACCTTAGAAACAAGAACGCAGAAGGTGTGCAACTTTCATACTCAAGGATCTACAATGAAATTCTAGGTGGTACTGCTTATGCAACAATACAGGTAACAAGAGAAGGTGGAGACACAAATTATTATCAATTTGACGAAAGCGGAAACGCCATAAATTACAATACAATAGCAATTGGTAGAGGCATTATAAATTCACTTGGGACTAATTCGTTAGCTATAGGTGATAGCCGCACTGGATTTAAGCAAGGTGGTAACGGAGTATTGCAGGTTTTTTGTAATGGTACAAATGTTGCATCATTTGATCATCAAAACCTTCATCTTAACGGTCTTTTAAACATATGGCC
TATTGATAATAATGCAAATGGCATTAGAGTTTCAGGATCTAGAACTGGTGGCGGCAATGCTCTAATTGGCGGTCAAGTATCTGGAGGTGCTTTTGTTGATTGGAGGGACCGAGCGGCTGGTCTTCTTGTTGAGTTGCCAAGCGATGGTGCAGCATCAAACGTCTTTAAGGCCGTTCGGTGGGGTTATGATTGGGTTGCTGGTCTTGATGTTGTCAGATTTAACACTGGAGCTTGCGAGGCTCGTTTTAATGTAAGGGGTGCGATTTACTCATTTAATGAAGCTGGTTACGCCTCATGTGTTCAATGGGTCAGTACGTCAGATATTAGACTGAAAGCAAACTTGAAGGAGATTAAGAGCGCTAGAGAAAAAGTGAAGTCAATCAAGGGTTACACTTACTTTAAGCGCAGTAATCTTTATGAAGATGAACATTCTGTATATTCGGAGGAGGCTGGTGTAATCGCTCAAGATGTGCAAACAGTTCTGCCGGAAGCGGTTTACAAGGTTTCAGATTCTGAATATTTAGGTGTTAGCTATGGCGGGGTTACTGCTCTTTTGGTTAACGCATTCAACGAAATGAGCGATCAAGTTGACAAGCAACAAGAGGAGATCGAAACACTGAAATCTGAAATTGCAGATCTTAAGGCGGCGGTAGCGGCGTTACTCAACAAACCAACAACGCTGGAAAGTTAA
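The CDS coordinates and the protein length can be cross-checked with simple arithmetic. Assuming the range is 1-based and inclusive and that the CDS includes the stop codon, 21149 -> 23839 spans 2691 nt, which equals 3 × (896 + 1). The sketch below encodes that check along with a start/stop codon test. The function names are illustrative, and the demo joins only the first and last 12 nt of the sequence above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus a stop codon."""
    return 3 * (protein_length + (1 if include_stop else 0))


def has_start_and_stop(cds: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    cds = cds.upper()
    return len(cds) % 3 == 0 and cds.startswith("ATG") and cds[-3:] in STOP_CODONS


if __name__ == "__main__":
    observed = cds_length(21149, 23839)
    print("observed CDS length:", observed)
    print("expected for 896 AA + stop:", expected_cds_length(896))
    # First and last 12 nt of the CDS above, joined only for the demo.
    print(has_start_and_stop("ATGGCAGTAACT" + "CTGGAAAGTTAA"))
```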

Genome Context

Tertiary structure

PDB ID
86344cce4c4a9a0909b8769555fe107d051c1a0010e0dc133e018dd7ece7c123
Source ESMFold
Method ESMFold
Resolution 0.5916
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
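The confidence bands above translate directly into a small classifier. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong; the sketch below assumes they fall into the lower band. `plddt_band` is a hypothetical helper name, not part of ESMFold.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band,
    since the legend does not specify them.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 42.0):
        print(value, "->", plddt_band(value))
```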