UniProt accession
Q76YB1 [UniProt]
Protein name
Gp35 hinge long tail fiber proximal connector
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MAELVIARVTEQGIQARDLSETFSGDNELIIKGQQGNAGGTCVATLNGVSLSPSQTKGLNCIHLTEDFKPTLMSFDFSVVTDTDRFTTWVNSLATGIILLMSHTENVTNDKLNTYFDQIGSVGWKYYWNPDKGTNRSSYVAIIDCPLKKIMTEQFMGHGKTTMQAQLCIVFDTFADIGVTGYGDMIAWEESEYFTKTPGYAVKELIKLSNLKDLNIHPGETIELTGELWASEQASVDKVKSSIFLQYYTDNAWISSHSINSSGIETWTKGTVTSVVPANCNRIFIRLYRYPNVPLSTAEIGIRDVMVKLRKPDVKKDRTNGTFGQWGFITANANAADSSANWVASSKGQDLITHSYEELDQEWLPQFNQDGWFDIPDWMPGSRYEISQTISYDYPLDTSGDGNAWMRSGMGGVTDGLEDRSIKVGIRTVKHARGEGHFMSYGGAYVDFDARIEYGRSYTYKIAVDGANVKFYLDGVEQTPTGTAVAFRQITHFSVGAEKRGNVHMHPIRGSVHELSMVDKTRTTGTNDRFYNFVRPGQLDDNRTVPGKSRYSGVEYFGDNLTPDNSGALFYCGAGSKLVPGGLSDESEAVYVTVLDAGGNELQSGVELNKRYLCDVVQLGTPRADWVSSPSQYHIRNPSKGSQWIVETEGKNVNTMKVKIECVTSPKINGTVTNVDWVDINKQIARFNGTNQCADIPTWKGPGEFQVKFNFERKTPLTSSYILEGRTTTDVDTKFGGMYINTTDQVIPYGGLVIASINGMPFVAGSKVIPGVDYIVCGYIKEGSQISRIGSRYNDVEYLQGYVWDLSFDGTGDDRYYKNYNVSRYDAFNNYIKDELGKTQSLYSKAYVRDFTLFSTAIKQNEKSYKFNAATTPSGASIAIDVPKGAMIRVDFDIDTETPSELRCSKHTSGSAPLIKSLPAGRNKGSVCYVVTDPDPGIYIRSIAPKLNELVKIHKLNVSRVYTDAIITNANGSSYKLEEREQPVMKTINRTRWIPDYRVGHMHIQNAWKPDPKDWAIRIKFKVGALKDVMNPLISGLKNDTSTINIDQYNSVESVRVFSYDAARKINTGVTLNAGFKVGDIADVYVNVLGNKVTISANGKSITGDWIPDGTESVSFIGAREGAGLRYNNDIYLVELIDSSTKLCNSRRYDLSYYSTVKPTHTVIENELGAIKYMRIGEIDATNTIYGWDGTKGEESQIGRMKDSWFIDGHQMDLFTTINTSGYSAYLHFVGNKRPFNSVDVELYQIDNNNNEQLFGVYNANQAINSEGYGIPENADVKTWFSDLNTTKKVIFRPKGVGALLAFVDAKSWVAI
Physico‐chemical
properties
protein length:1312 AA
molecular weight: 145860,00780 Da
isoelectric point:5,80749
aromaticity:0,10595
hydropathy:-0,37591

Domains

Domains [InterPro]
DC_0912
STR
1–1312
PS52031
LEC
27–186
Q76YB1
1 1312
Architecture
STR
STR 1-1312
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage Aeh1
[NCBI]
2880362 Uroviricota > Caudoviricetes > Pantevenvirales > Cinqassovirus > Cinqassovirus aeh1
Host Aeromonas hydrophila
[NCBI]
644 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AAQ17984.1 [NCBI]
Genbank nucleotide accession
AY266303 [NCBI]
CDS location
range 213031 -> 216969
strand +
CDS
ATGGCTGAATTAGTTATTGCCCGTGTTACTGAACAGGGTATACAAGCGCGTGATCTGTCAGAAACATTTTCTGGCGATAACGAGTTGATCATCAAGGGTCAACAAGGCAACGCAGGGGGAACTTGCGTTGCCACCCTAAATGGAGTCAGTCTTAGCCCTTCTCAAACGAAGGGCTTAAACTGCATTCATTTAACAGAAGATTTTAAGCCAACATTGATGTCGTTTGATTTCAGTGTTGTAACAGATACCGATAGATTCACAACTTGGGTAAACAGTTTGGCTACTGGTATTATTCTTTTAATGAGTCACACGGAAAACGTGACTAACGACAAGCTTAACACTTATTTCGATCAAATCGGAAGCGTTGGTTGGAAATATTACTGGAATCCAGATAAAGGAACTAATCGTAGTTCTTATGTCGCTATTATTGATTGTCCTCTGAAAAAGATCATGACTGAACAGTTTATGGGACATGGTAAAACTACAATGCAAGCCCAGTTGTGTATTGTGTTTGATACATTCGCTGACATTGGTGTTACTGGATATGGCGACATGATTGCTTGGGAAGAGAGTGAATATTTTACCAAAACGCCAGGATATGCGGTAAAAGAACTAATTAAACTGTCAAACTTGAAAGATCTTAATATACATCCAGGTGAAACAATCGAATTGACTGGAGAACTTTGGGCATCAGAGCAGGCATCCGTCGATAAGGTAAAATCATCGATATTCCTACAATATTATACCGACAATGCTTGGATTAGTAGTCATTCTATTAATTCAAGTGGAATCGAAACCTGGACAAAAGGAACTGTTACATCCGTTGTGCCTGCGAATTGCAATCGTATATTCATTAGATTATATCGATATCCAAATGTGCCATTGAGTACAGCAGAAATTGGTATTCGTGATGTTATGGTGAAATTGAGAAAGCCCGATGTTAAGAAAGATAGAACAAACGGAACATTCGGACAATGGGGTTTTATTACTGCCAATGCCAATGCCGCTGATTCTTCTGCTAACTGGGTAGCGAGTTCCAAAGGACAGGATTTGATTACTCATTCGTATGAGGAATTAGATCAAGAGTGGTTGCCTCAGTTTAACCAAGACGGTTGGTTTGATATTCCGGATTGGATGCCAGGATCTCGTTATGAAATTAGTCAAACAATTTCGTATGATTATCCACTCGATACGAGCGGAGATGGCAATGCGTGGATGCGTTCTGGTATGGGCGGTGTAACGGACGGATTAGAAGATCGCAGCATTAAAGTGGGTATTCGAACAGTCAAACATGCTAGAGGCGAAGGACATTTTATGTCTTATGGCGGCGCTTATGTTGATTTCGATGCAAGAATTGAATATGGCAGATCGTATACTTACAAAATTGCGGTAGACGGTGCTAATGTCAAATTCTATTTGGACGGAGTAGAACAGACTCCAACTGGAACAGCAGTCGCATTTAGACAAATAACTCATTTCTCGGTTGGTGCAGAGAAACGCGGAAACGTACATATGCATCCAATTCGTGGCTCAGTTCATGAATTAAGTATGGTCGACAAAACAAGAACAACCGGAACAAATGATCGTTTTTATAACTTTGTTCGTCCAGGACAATTAGACGACAACAGAACAGTACCCGGAAAATCTCGTTATTCTGGAGTCGAATACTTCGGTGATAATCTAACTCCAGATAATAGCGGTGCGTTGTTCTATTGTGGTGCTGGCTCTAAGCTTGTGCCAGGCGGTCTAAGCGACGAAAGTGAGGCTGTCTATGTCACTGTATTAGACGCGGGTGGAAATGAACTCCAGAGCGGTGTGGAGCTTAATAAGCGATATCTATGTGATGTCGTTCAACTCGGTACTCCTAGAGCTGATTGGGTATCTTCTCCGAGTCAATATCATATTCGAAATCCTAGCAAGGGGTCGCAGTGGATTGTGGAGACAGAAGGAAAGAACGTCAATACGATGAAAGTCAAAATCGAATGTGTTACTTCTCCTAAGATCAACGGCACTGTTACTAATGTTGATTGGGTGGATATCAATAAGCAGATTGCTAGATTCAACGGCACAAATCAATGCGCTGATATTCCTACATGGAAAGGTCCCGGTGAGTTTCAAGTCAAGTTCAATTTTGAAAGAAAAACTCCATTGACGTCCAGTTATATTCTAGAAGGAAGAACAACAACGGATGTTGATACCAAGTTTGGCGGTATGTATATCAACACAACAGATCAAGTTATTCCTTATGGTGGTTTAGTTATCGCTAGTATTAACGGAATGCCGTTTGTTGCTGGTAGCAAAGTTATTCCTGGTGTTGATTACATCGTGTGTGGATATATCAAAGAAGGCAGTCAGATTAGCAGAATCGGTAGTAGATATAACGATGTTGAGTATCTACAGGGATATGTGTGGGATCTATCATTCGACGGAACCGGCGATGATCGATATTACAAGAATTATAATGTGAGTCGATATGATGCTTTCAATAACTACATCAAAGACGAACTTGGAAAAACTCAATCGCTTTATTCCAAGGCATACGTTAGAGATTTCACCTTGTTTAGTACCGCGATCAAACAGAATGAGAAGTCATACAAATTCAACGCGGCAACAACACCAAGCGGTGCATCTATCGCAATAGATGTTCCTAAAGGTGCGATGATTCGAGTTGACTTTGATATTGATACCGAGACTCCCAGTGAACTAAGATGCTCAAAACACACAAGCGGTTCTGCTCCGTTGATTAAATCGTTGCCTGCTGGTCGCAATAAAGGATCAGTGTGTTACGTTGTGACTGACCCAGATCCGGGGATTTACATTAGATCTATCGCTCCTAAGTTGAATGAACTCGTTAAGATTCATAAATTGAACGTTAGTCGTGTTTATACCGACGCAATCATTACTAACGCCAACGGTTCTAGTTATAAGCTAGAAGAACGTGAACAGCCGGTAATGAAAACAATCAACCGAACTAGATGGATTCCTGATTATAGAGTTGGACACATGCATATTCAAAATGCGTGGAAACCAGATCCAAAAGATTGGGCGATTCGTATCAAGTTTAAAGTTGGTGCGTTGAAGGACGTTATGAATCCACTTATTAGTGGCTTGAAGAATGATACCAGCACAATCAATATAGATCAATACAACTCGGTAGAAAGTGTTCGTGTGTTTAGCTATGACGCAGCAAGAAAAATTAACACCGGCGTGACGTTGAATGCTGGATTCAAAGTCGGAGATATCGCTGATGTGTATGTTAATGTTCTTGGTAACAAAGTGACTATTTCGGCGAATGGAAAATCAATAACAGGAGATTGGATTCCAGACGGCACAGAAAGCGTTAGTTTCATCGGTGCAAGAGAAGGCGCTGGATTGAGATACAACAACGATATATATCTTGTTGAGTTGATTGACAGTAGCACTAAACTTTGCAATAGCAGACGTTACGATTTGTCATATTATTCAACAGTTAAACCAACTCATACCGTAATAGAAAATGAGCTTGGTGCTATTAAATATATGAGAATTGGAGAAATAGATGCGACTAATACAATATACGGTTGGGACGGAACTAAAGGAGAAGAGTCTCAAATTGGACGTATGAAAGACTCATGGTTTATCGACGGGCATCAAATGGATCTTTTTACTACTATAAATACAAGTGGGTATAGTGCTTATTTGCATTTTGTTGGCAATAAGAGACCATTTAATTCTGTTGATGTTGAGTTATATCAAATCGACAATAATAATAACGAGCAGTTGTTTGGAGTATACAACGCGAATCAGGCGATCAACTCGGAAGGATATGGAATTCCGGAAAATGCTGACGTCAAAACTTGGTTTAGTGATTTGAACACTACCAAGAAAGTTATTTTCAGACCCAAAGGCGTCGGTGCATTGCTTGCATTTGTCGATGCGAAATCTTGGGTCGCTATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
b621a7003f795bce22bf11e3988d8025533eb048a40d916f93704dbb5690de25
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,6584
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50