Genbank accession
YP_001595467.1 [GenBank]
Protein name
putative tail fiber protein
RBP type
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,94
Protein sequence
MAKGYSLDALVNEHYIKSRTGLGNISIDTITGMDSVGVYYQSRNDYATLANKYPPGANAGTLEVLPHNANVGRVMQRYTNFSNKRMWVRSQNGTVSDANFDEWTEFVNMNNIYNAIYPIGIVVKFDNATNPNNNFTGTVWEQIIDGRVARAATGPEAGTADGQIGSIAGSDTANIAVTNLPGHTHGMQNHTHGIASHSHTMAHTHTINHDHGAVTSSSSGAHTHSVSGTAASAGAHQHTEGSPFTGDVNFGTTTSTSKDNISDWLYSPSTRYPLTSSSGAHTHSVSGTAASAGAHTHSVDLPNFTGTSGGSSTGNTGGTALTTGGPSNNTTTSTGDGTAFDVRNASHYYAFWKRVA
Physico‐chemical
properties
protein length:356 AA
molecular weight: 37352,93110 Da
isoelectric point:6,37469
aromaticity:0,07865
hydropathy:-0,48343

Domains

Domains [InterPro]
DC_0905
STR
4–356
cd19958
STR
18–95
YP_001595467.1
1 356
Architecture
STR
ATT
STR
STR 4-112 | ATT 113-239 | STR 240-356
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage phiEcoM-GJ1
[NCBI]
451705 Uroviricota > Caudoviricetes > Chaseviridae > Carltongylesvirus > Carltongylesvirus GJ1
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_001595467.1 [NCBI]
Genbank nucleotide accession
NC_010106 [NCBI]
CDS location
range 49866 -> 50936
strand +
CDS
ATGGCTAAAGGTTATTCTTTAGATGCGTTGGTGAATGAGCACTACATAAAGTCTCGTACCGGCCTGGGTAATATAAGTATAGACACTATCACGGGAATGGATAGTGTTGGAGTATATTACCAATCACGCAACGATTATGCCACCCTTGCAAATAAATATCCTCCAGGTGCTAATGCAGGTACTCTTGAAGTGTTGCCCCATAATGCCAACGTTGGCAGGGTAATGCAACGTTATACAAACTTTAGTAATAAGCGTATGTGGGTGAGAAGTCAGAATGGCACGGTATCGGATGCTAACTTTGACGAGTGGACTGAGTTCGTTAACATGAACAACATCTACAACGCAATCTATCCGATTGGCATTGTCGTCAAGTTTGACAACGCCACCAACCCGAACAACAACTTCACCGGAACTGTGTGGGAACAAATCATTGATGGTCGTGTTGCCCGTGCTGCAACTGGTCCGGAAGCTGGGACGGCTGATGGTCAGATTGGTTCAATAGCTGGGTCCGATACAGCCAACATTGCTGTAACCAACCTACCAGGCCACACACACGGGATGCAGAACCACACCCACGGCATCGCATCCCACTCCCACACAATGGCCCACACCCACACCATCAACCACGACCACGGTGCGGTAACATCTAGCTCTTCTGGTGCTCACACGCACAGCGTTAGTGGCACTGCCGCCTCTGCTGGTGCTCACCAACATACAGAAGGCTCTCCATTTACAGGAGATGTCAACTTTGGCACTACCACAAGCACCAGCAAGGACAACATATCTGATTGGCTCTACAGCCCGTCTACCAGGTATCCGCTAACTTCGTCCTCTGGTGCACACACCCACTCTGTGAGTGGTACAGCCGCCTCTGCTGGTGCTCATACTCACTCTGTTGACTTACCTAACTTCACAGGTACAAGTGGAGGCTCCAGCACAGGTAACACAGGCGGTACGGCACTGACTACTGGTGGTCCTAGCAACAACACTACTACCTCAACTGGTGACGGCACAGCGTTCGATGTTCGCAACGCCTCACATTACTATGCCTTCTGGAAACGGGTGGCTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
65a9cf262b1a02ef576698838d0c3d0e030ae5711dc25e7be7414e4be8b0c31b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7129
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50