Genbank accession
XHG84469.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,91
TF
Evidence RBPdetect2
Probability 0,86
Protein sequence
MSTSVSKPASVIKLGAVKALVEGPPGKPGTNGVGEPGPEGKSAFEIWLAQSGNSGKTLDDFFEAYRGNGLNNRGQFVMGQSYKMNDYVVAAGSSADSSIFFCKSVDSFTAESQPRSDSKNWTELSAPAGANGKPIELHRGTTHLQWRVVGDAEWIDLIALADLMVKGDKGDDGKSFTVDASGPTADRDQYDSEAEGFSFLDTTTGYLYILTGTAGVWSDPIPFKGDKGDNGANIELQKTATYIQWRPEGTTQWFNLVALADLQGANGKNIELQASSTAIQWRVVGSTAWTDLVLLSTLKGTAGADGATWLSGNTGPTNSAGKVGDFWLNTATGEISKKTGATAWTLQLTLPTGSSAGGSTWLLMTSDPTAAQGSSGQWALNSATGTIWNKGGDTWNKVMGIPAFATQEQAVAATDYTLAMSPARVREYMESFGLTAKFTTTSTNLNTSVRGEFFNYNSDTLNHPGTGGYGRGLTIPSGDGYATQLAIENDSKLMFVRYQTAGSWGTWASIGGGGGSTTFATNEQAVAGTDLTVAMSPGRTREYLESLGLGAKYTNVTANLNTAVKFQPWSWDNTSTNTPVASSYGRGFTLPSGDGYVTQIGIVNDTGKMYIRYQSGASTWSAWAAVGGSSSGGGGGTTVYASVQTKQQGINSGAFNALGMSFGSAGVPPGDSTTAVLYRVTGQLFTQGPTDAASVINMHFLDGTVGASMFRMSVRSADATGAIKQTTVTQASADMSITVFGDGIATFEGVMQLSYVNPTGIDLTGKLASGTWANILPGSHLTFTPLGKTISS
Physico‐chemical
properties
protein length:792 AA
molecular weight: 82723,63620 Da
isoelectric point:5,05216
aromaticity:0,09470
hydropathy:-0,22323

Domains

Domains [InterPro]
DC_1480
ATT
396–435
cd19958
STR
551–626
XHG84469.1
1 792
Architecture
ATT
STR
STR
ATT 396-435 | STR 461-510 | STR 516-792
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Erwinia phage Fifi011
[NCBI]
3349640 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XHG84469.1 [NCBI]
Genbank nucleotide accession
PQ051109.1 [NCBI]
CDS location
range 40171 -> 42549
strand -
CDS
ATGAGCACTTCCGTATCAAAGCCGGCCAGCGTCATTAAGCTGGGTGCTGTCAAAGCCCTGGTCGAAGGCCCACCGGGCAAGCCAGGCACTAACGGAGTAGGCGAACCAGGCCCAGAAGGTAAATCTGCCTTCGAGATTTGGTTGGCTCAATCCGGCAACTCTGGCAAAACCCTGGATGACTTCTTTGAAGCCTACCGCGGCAACGGTCTCAATAACCGTGGCCAGTTTGTCATGGGCCAAAGCTACAAGATGAACGACTACGTGGTTGCTGCTGGCAGCTCCGCAGACTCGTCTATCTTTTTCTGCAAATCAGTGGACTCGTTCACGGCTGAATCTCAGCCACGCAGTGATTCCAAAAACTGGACAGAGCTTTCGGCGCCAGCCGGAGCCAATGGCAAACCTATCGAGCTGCACCGTGGTACTACTCACCTGCAGTGGCGAGTGGTTGGTGACGCCGAGTGGATTGACCTGATTGCCCTGGCTGACCTGATGGTCAAAGGTGACAAGGGTGACGACGGTAAATCGTTCACTGTTGACGCCAGCGGCCCGACTGCAGACCGTGACCAGTATGATAGTGAAGCGGAAGGTTTTTCCTTCCTGGACACCACGACTGGTTACCTCTACATCCTGACCGGCACAGCCGGTGTGTGGTCTGACCCTATCCCATTCAAAGGGGATAAAGGTGACAACGGTGCCAACATCGAACTGCAGAAGACAGCGACGTATATCCAGTGGCGCCCTGAGGGAACAACTCAGTGGTTCAATCTGGTAGCGCTTGCTGACCTGCAGGGAGCCAATGGCAAGAACATTGAGCTGCAGGCTTCGAGCACTGCAATTCAGTGGCGAGTAGTGGGAAGCACAGCCTGGACTGACCTGGTTCTGCTGTCTACGCTGAAAGGCACCGCCGGTGCCGACGGAGCAACATGGTTGTCTGGTAACACCGGGCCAACCAACTCTGCCGGTAAAGTCGGTGACTTCTGGCTGAACACAGCAACAGGCGAAATCAGTAAGAAGACCGGAGCGACAGCCTGGACACTTCAACTGACATTGCCGACTGGTTCCTCTGCTGGTGGTTCCACCTGGCTGCTTATGACTTCTGACCCGACTGCCGCTCAAGGTAGCTCTGGCCAGTGGGCACTGAACAGTGCAACGGGAACCATCTGGAACAAGGGTGGCGACACCTGGAACAAAGTAATGGGCATCCCGGCGTTCGCTACTCAAGAGCAAGCCGTGGCCGCTACCGATTATACCCTGGCTATGTCACCGGCCCGAGTCCGTGAATACATGGAATCGTTTGGCCTGACTGCCAAGTTCACCACAACCAGCACCAACCTTAATACGTCGGTGCGAGGTGAGTTCTTCAACTACAACTCCGATACCCTGAATCATCCGGGCACCGGTGGTTATGGCCGAGGACTCACTATTCCATCGGGTGATGGATACGCAACGCAGTTGGCTATCGAAAACGACAGCAAGCTGATGTTCGTGCGTTACCAGACAGCAGGTTCCTGGGGAACATGGGCATCAATCGGTGGCGGCGGTGGCAGCACTACCTTCGCAACCAATGAGCAGGCTGTGGCCGGTACTGACCTGACGGTGGCTATGTCACCGGGACGCACCAGAGAATACCTGGAGAGCCTGGGTCTGGGAGCCAAGTACACCAACGTGACTGCAAACCTGAACACTGCTGTGAAGTTCCAGCCGTGGTCGTGGGATAACACTTCCACGAATACGCCGGTAGCTTCCAGTTATGGACGTGGTTTCACGCTTCCTTCTGGTGATGGGTATGTCACTCAAATCGGTATCGTCAACGACACCGGTAAGATGTATATCCGCTACCAGAGCGGGGCAAGCACCTGGTCTGCCTGGGCTGCAGTCGGAGGTTCATCTTCCGGCGGTGGCGGCGGGACTACCGTATACGCCAGCGTCCAGACCAAGCAACAGGGGATTAACTCAGGAGCATTCAATGCCCTGGGAATGTCCTTTGGTTCGGCCGGCGTACCGCCGGGGGATAGCACAACGGCTGTGCTGTATCGTGTAACAGGACAGTTGTTCACCCAGGGGCCAACTGATGCGGCCTCTGTGATTAACATGCACTTCCTGGATGGCACCGTGGGCGCCAGCATGTTCCGTATGTCTGTGCGGAGCGCTGACGCTACTGGAGCCATCAAACAGACCACGGTTACTCAAGCCTCAGCGGATATGTCTATCACGGTCTTTGGTGATGGCATTGCCACGTTTGAGGGCGTCATGCAGCTCAGTTATGTCAACCCAACCGGGATTGATTTAACCGGCAAGCTGGCGTCTGGCACTTGGGCCAACATTCTGCCAGGGTCTCACCTGACGTTTACCCCGCTGGGCAAAACGATTAGCAGTTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
49569d6cf889106229654b950ddab7e83956d992c3f8f108b06918aaf614172d
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7469
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50