Genbank accession
UZV40566.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,89
TF
Evidence RBPdetect2
Probability 0,92
Protein sequence
MAEIKTGILLRRNLKKHFVNDAKPTQGEIVLATDTNEIGMLVNDEIQWTPIQGVVNTVAGKQGDVILNKKDVGLENVDNTADIDKPISNSTKLEFQRHYTAENPHNITKKTLGLENVDNTADIDKPVSNLTQIELNKKISWDDARKQAGGKDPVFTDTTYTIKDGELSEFNFNSYYKNFIDTFNTNSRVLPSTQALIANGRTITLRRADGSSESIETQDTLYDDSELRALIEQAKIDLHINIQDNLESDSTQDALSANQGKVLKGLIDEIKKVINITDDDFRNLQDIINYIEENREKFDDLTIANIKGLQAALDSKLNRDDSTYIAPNSALLESHPASDFVLNTNYNAKLIEIQDSLNSINSQIKLFETQAGVDSKINQAIRDLNFTEIIQSINEQITRLQGSLDDIDLDAITENLQKVQQDLTQRISQLETNTSKKLEEFEAIVNNFDMSEIQTSINNFKDQINQNIDSIQGVVDSVSESLSNIENNIQTSLENKVSKDELATEVQTINDNIADLSNIANEAKWQDNFYGKVDRRRQALWSIISISKDSFKGSYKKPNTYNYWEAKYKNLKYINDNFDSLETISGAPTYDVVTDQVIRLTFNDISQASFLIGSPKNKIQIAKIIAVNANTKKAVFIYGYPSFMTDDLSKIMVTIEKDSSFGNYLSRANKTDPETFQKIVTAQEFDLPDDTDDYSYFEASYEISGNTITLTIPENIFLEFYGNMTSVETTVTGASAAQVWSATLKSTDYEGKKLFSNGVLIGKDLTGAILTIYDDKSTYINDFDEARTNYNDFQIIREEYEDIMYTRDFLPGTIGPGEDDLEW
Physico‐chemical
properties
protein length:823 AA
molecular weight: 92764,71120 Da
isoelectric point:4,48615
aromaticity:0,08141
hydropathy:-0,53159

Domains

View on InterPro
UZV40566.1
1 823 aa
STR 242–269 · STR 349–522 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Taxonomy

Coding sequence (CDS)

Genbank protein accession
UZV40566.1 [NCBI]
Genbank nucleotide accession
OP654737.1 [NCBI]
CDS location
range 123052 -> 125523
strand +
CDS
ATGGCTGAAATAAAAACTGGTATTTTATTAAGACGTAATCTTAAAAAACATTTTGTAAATGATGCAAAACCAACACAAGGTGAAATCGTTCTTGCTACTGATACAAATGAAATTGGTATGCTTGTAAACGATGAAATACAATGGACTCCTATTCAAGGCGTCGTTAACACAGTTGCTGGTAAACAAGGTGACGTCATACTAAACAAAAAAGATGTAGGCCTTGAAAATGTTGATAACACTGCTGACATTGATAAGCCAATTTCTAACTCTACAAAATTAGAATTTCAAAGGCATTATACAGCCGAAAACCCACACAATATAACAAAGAAAACACTTGGTTTAGAAAACGTTGATAACACTGCTGATATAGATAAACCTGTATCTAACTTAACTCAAATTGAGTTAAATAAGAAAATTTCTTGGGATGATGCTAGAAAGCAAGCCGGTGGGAAAGACCCAGTTTTTACAGATACTACTTATACCATTAAAGATGGTGAACTTTCAGAATTTAACTTTAATTCGTATTATAAGAATTTCATAGATACTTTTAATACTAACTCTAGGGTCTTACCAAGTACACAAGCTCTTATAGCTAATGGTAGGACAATAACCTTAAGAAGAGCTGACGGCTCGAGTGAAAGTATAGAAACACAAGATACTTTATATGATGATTCTGAACTTAGAGCATTAATAGAACAAGCTAAAATAGATTTACATATAAACATACAAGATAATTTAGAATCAGATTCTACTCAAGATGCTTTGAGTGCTAATCAAGGCAAAGTTCTTAAAGGTTTAATAGACGAAATAAAGAAGGTAATTAATATCACAGATGATGATTTTAGAAATCTTCAAGATATTATTAATTATATAGAAGAAAACCGCGAAAAATTTGATGATTTAACAATTGCTAACATCAAAGGCTTACAAGCTGCTTTAGATTCTAAATTAAATAGAGATGATTCTACATATATTGCTCCAAATTCTGCATTATTAGAATCTCACCCAGCTAGTGATTTTGTATTAAATACAAATTATAATGCTAAGTTAATAGAAATACAAGATTCTTTAAATTCTATCAATTCACAAATTAAATTGTTTGAAACTCAAGCGGGTGTGGATTCTAAAATAAACCAAGCAATCAGAGATTTAAATTTCACTGAAATCATACAGAGTATCAATGAACAAATCACGAGACTACAAGGCTCTTTAGACGATATTGATTTGGATGCTATAACAGAAAATCTCCAAAAAGTTCAACAAGATCTAACTCAACGAATATCACAATTAGAAACTAACACATCTAAAAAATTAGAAGAATTTGAAGCTATTGTTAATAATTTTGATATGTCTGAAATACAAACCAGTATTAATAATTTTAAAGACCAAATCAATCAAAATATTGATAGTATACAAGGTGTTGTAGATTCTGTTTCTGAATCATTATCAAATATTGAAAATAATATCCAAACCAGTTTAGAAAATAAAGTTTCAAAAGATGAATTAGCTACTGAGGTTCAAACTATTAATGATAATATTGCTGATTTATCAAACATAGCAAATGAAGCAAAATGGCAAGATAATTTTTATGGTAAAGTTGATAGAAGACGCCAAGCTTTATGGTCAATTATATCAATTTCAAAAGACTCTTTCAAAGGTTCGTATAAAAAACCAAATACCTATAATTACTGGGAAGCAAAATATAAAAATCTTAAGTATATCAATGATAATTTTGATAGTTTAGAAACGATATCTGGTGCACCTACATATGACGTGGTTACTGATCAGGTTATTAGATTAACTTTTAATGATATTTCACAAGCATCATTTTTAATTGGAAGTCCTAAGAATAAAATTCAAATAGCTAAAATTATAGCAGTAAATGCTAATACTAAAAAAGCTGTGTTCATATACGGGTATCCTAGTTTTATGACAGACGATTTATCAAAAATTATGGTAACCATAGAAAAAGATTCAAGTTTTGGTAATTATCTATCAAGAGCTAATAAAACAGATCCAGAAACTTTTCAAAAAATTGTAACTGCACAAGAATTTGATTTGCCGGATGATACTGATGATTATTCATATTTTGAAGCTAGTTATGAAATATCTGGAAATACAATAACACTAACTATACCAGAAAACATATTTTTAGAATTCTATGGTAATATGACATCAGTAGAAACAACCGTTACGGGTGCTAGTGCTGCACAAGTTTGGTCAGCGACTTTAAAATCCACTGATTATGAAGGTAAGAAATTATTCTCAAACGGGGTGTTGATAGGAAAAGATTTGACTGGCGCAATCTTAACTATATATGATGATAAATCAACATACATAAATGACTTCGATGAAGCTAGAACTAATTATAATGATTTTCAAATAATACGGGAAGAATATGAGGATATTATGTATACAAGAGATTTTTTACCAGGAACTATAGGACCAGGCGAAGACGACCTAGAATGGTAA

Genome Context

Tertiary structure

UZV40566.1
ESMFold structure
Source ESMFold
pLDDT 47.5
Oligomeric state monomer