UniProt accession
Q38394 [UniProt]
Protein name
Long tail fiber protein Gp37
RBP type
TF
Evidence UniProt/Swiss
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,80
TSP
Evidence RBPdetect2
Probability 0,81
Protein sequence
MATLKQIQFKRSKIAGTRPAASVLAEGELAINLKDSTIFTKDDSGNIIDLSISAGGNISGNVTIDGTLRVNGPINNFGNFSTSGQITAGSSISAQVFRALQGSFYSRASTGETANAHLWFENADGTERGVIYARPQTTTDGEIRLRVRQGTGSTTNSEFYFRSINGGEFQANRILASDSLVTKRIAVDTVIHDAKAFGQYDSHSLVNYVYPGTGETNGVNYLRKVRAKAAGTMWHEICTAQTGQADEMSWWTGNTPQSKQYGIRNDGRIAGRNSLALGTFTTNFPSSDYGNVGVMGDKYLVLGDTVTGLSYKKTGVFDLVGGGYSVASITPDSFRSTRKGIFGRSEDQGATWIMPGTNAALLSVQTQADTNNAGDGQTHIGYNAGGKMNHYFRGTGQMNINTQQGMEINPGILKLVTGSNNVQFYADGNISSIQPVKLDNELFLNSSNNTAGLKFGAPSKVDGTRAIQWNGGTREGQNKNYVIIKAWGNSFNATGDRSRETVFQVSDSQGYYFYAHRKAPTGDETIGRIEAQFAGELNAKSINAVENFKVNGLSTLVGGVTMSNGLNLTGGSSITGQVKIGGTYDALRIWNSRYGAIFRRSETSLHIIPTNENEGENGAINNLRPFSIELGTGTVSMLHDVHLGNSGSSTGLLQVSNSLKTIKMICPVTINERNAALTLDSPSSSSANYLQGSKAGTKSWYVGLGGAGNDLSLYSQSYGHGLVISDNFVSISKPLKVGNAQLGTDGNITGGSGNFANLNTTLNRKVNSGFITYGATSGWYKFATVTMPQSTSTAFFKIVGGSGFNSGLFTQCNIAEIVLRTGNERPADLNAVLYTRTIGAAFKNIAVNNVSGDTYDIYVYAGTYCNQLACEWACTENATISVIGINSSTQSPVDDLPDTAVNGQVANVLNNLVDSGKGKRYEAESEIAINSQTGIRIRSNADKTGSVATMLRNDGGSFYILFTDKNDTDGAATVNGEWNSKRPFAINLTTGEVMMNNGIAVRSAALFYNSINVKDNGSINFDKSGANPRNMRIFHAGDASRGNRIEIADETNYIAYFEKAPGGANRFVVNNATVSGVNQMNSFGVNTSNALGGNSITFGDTDTGIKQNGDGLLDIYANNAQVFRFQNGDLYSYKNINAPNVYIRSDIRLKSNFKPIENALDKVEKLNGVIYDKAEYIGGEAIETEAGIVAQTLQDVLPEAVRETEDSKGNKILTVSSQAQIALLVEAVKTLSARVKELESKLM
Physico‐chemical
properties
protein length:1243 AA
molecular weight: 132989,13680 Da
isoelectric point:7,18288
aromaticity:0,08367
hydropathy:-0,31681

Domains

View on InterPro
Q38394
1 1243 aa
ATT 493–628 · ATT 674–750 · CHP 1145–1242 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

Q38394
1 1243 aa
Domain Start End Length (AA) Confidence
N-terminal 1 934 934 0,0261
Central domain 935 1133 200 0,3665
C-terminal 1134 1243 109 0,9953
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Host No host information

Coding sequence (CDS)

Genbank protein accession
CAA25250.1 [NCBI]
Genbank nucleotide accession
X00613 [NCBI]
CDS location
range 1 -> 162
strand +
CDS
GAATTCTATTTCCGTTCTATAAACGGCGGTGAATTTCAAGCCAATCGTATTTTAGCATCAGATTCGTTAGTAACTAAACGCATTGCGGTTGATACTGTTATTCACGATGCCAAAGCGTTTGGACAATATGATTCTCACTCTTTGGTTAACTATGTTTATCCT

Genbank protein accession
CAA28445.1 [NCBI]
Genbank nucleotide accession
X04747 [NCBI]
CDS location
range 1 -> 3732
strand +
CDS
ATGGCTACTTTAAAACAAATACAATTTAAAAGAAGCAAAATCGCAGGAACACGTCCTGCTGCTTCAGTATTAGCCGAAGGTGAATTGGCTATAAACTTAAAAGATAGCACAATTTTTACTAAAGATGATTCAGGAAATATCATTGATTTAAGCATTTCCGCTGGCGGTAATATCAGTGGAAATGTAACTATTGATGGAACTTTACGCGTCAATGGACCAATAAACAATTTTGGAAATTTTTCCACAAGTGGCCAGATTACCGCTGGTAGTAGTATTTCAGCTCAAGTTTTTAGAGCATTGCAAGGTTCGTTTTATTCAAGAGCTTCAACCGGTGAAACAGCAAATGCCCATTTATGGTTTGAAAATGCCGATGGCACTGAACGTGGCGTTATATATGCTCGTCCTCAAACTACAACTGACGGTGAAATACGCCTTAGGGTTAGACAAGGAACAGGAAGCACTACCAACAGTGAATTCTATTTCCGTTCTATAAACGGCGGTGAATTTCAAGCCAATCGTATTTTAGCATCAGATTCGTTAGTAACTAAACGCATTGCGGTTGATACTGTTATTCACGATGCCAAAGCGTTTGGACAATATGATTCTCACTCTTTGGTTAACTATGTTTATCCTGGAACCGGTGAAACAAATGGTGTAAACTATCTTCGTAAAGTTCGCGCTAAAGCTGCCGGAACCATGTGGCATGAAATCTGTACAGCTCAGACTGGACAAGCTGATGAAATGTCATGGTGGACTGGTAATACTCCTCAGTCTAAACAATATGGTATTCGTAATGACGGACGAATTGCTGGACGTAATAGCCTTGCATTAGGTACATTCACTACAAATTTCCCGTCTAGTGATTATGGTAATGTCGGTGTAATGGGCGATAAATATCTTGTTCTCGGTGACACTGTAACTGGCTTGTCATACAAAAAAACTGGTGTATTTGATCTAGTTGGCGGTGGATATTCTGTTGCATCTATTACTCCAGACAGTTTCCGTAGTACTCGTAAAGGTATATTTGGTCGTTCCGAGGACCAAGGCGCAACTTGGATAATGCCTGGTACAAATGCTGCTCTCTTGTCCGTTCAAACACAAGCTGATACTAACAATGCTGGAGACGGACAAACCCATATCGGGTACAATGCTGGCGGTAAAATGAACCACTATTTCCGTGGTACAGGACAGATGAATATCAATACCCAACAAGGTATGGAAATTAACCCGGGTATTTTGAAATTGGTAACTGGCTCTAATAATGTACAGTTTTATGCTGACGGCAATATTTCTTCTATCCAACCTGTTAAATTAGACAACGAATTATTTTTAAATAGTTCTAATAATACCGCAGGACTTAAATTTGGCGCCCCTAGCAAAGTTGATGGAACAAGAGCTATCCAATGGAACGGTGGTACTCGCGAAGGACAGAATAAAAACTATGTGATTATTAAAGCATGGGGTAACTCATTTAATGCCACTGGTGATAGATCTCGCGAAACGGTTTTCCAAGTATCAGATAGTCAAGGATATTATTTTTATGCTCATCGTAAAGCTCCAACCGGCGACGAAACTATTGGACGTATTGAAGCTCAGTTTGCTGGAGAACTTAATGCTAAAAGTATTAATGCCGTCGAAAATTTTAAAGTTAATGGATTAAGCACTTTAGTCGGCGGAGTTACAATGAGCAATGGGCTTAATTTAACCGGCGGTTCTTCTATTACTGGACAAGTTAAAATAGGCGGAACATATGATGCGTTAAGAATTTGGAACTCTCGTTATGGCGCCATTTTCCGTCGCTCAGAAACATCATTACATATTATCCCAACTAATGAAAATGAAGGGGAAAACGGTGCAATAAACAACCTTCGTCCGTTTAGTATTGAGTTAGGCACCGGCACAGTTTCTATGTTACATGATGTTCATTTAGGAAATTCCGGATCTTCTACAGGATTATTACAAGTAAGTAATAGTCTTAAAACTATTAAAATGATATGTCCAGTGACTATTAATGAACGCAATGCAGCGCTTACCCTGGATTCTCCTTCATCTTCTTCTGCTAATTATTTACAGGGTTCTAAAGCTGGAACTAAATCGTGGTATGTTGGTCTTGGTGGCGCTGGAAATGATTTATCGCTTTATAGCCAATCTTATGGACATGGTCTTGTTATAAGCGATAATTTCGTGTCAATTAGTAAACCTCTTAAAGTTGGAAATGCACAACTAGGAACTGACGGTAATATTACTGGTGGTTCTGGTAATTTTGCTAACTTGAATACCACGTTAAATCGTAAAGTTAATTCTGGATTTATTACTTATGGAGCAACCTCTGGATGGTATAAGTTTGCAACAGTAACAATGCCACAATCCACTTCGACAGCCTTCTTTAAAATAGTTGGAGGTTCTGGATTTAATAGTGGATTATTCACCCAATGTAATATTGCTGAAATTGTTTTACGTACTGGTAATGAAAGACCTGCTGACTTAAATGCTGTATTATACACAAGAACAATTGGAGCTGCATTTAAAAATATTGCAGTTAATAACGTCTCTGGAGATACATATGACATCTATGTTTATGCTGGAACATATTGTAATCAATTAGCTTGTGAATGGGCATGTACTGAAAACGCTACTATTAGTGTTATTGGTATTAACTCATCTACACAATCACCTGTAGATGATCTTCCAGATACAGCAGTTAACGGTCAAGTTGCTAATGTTCTTAATAACTTGGTTGATAGCGGTAAAGGTAAGCGTTATGAAGCCGAGTCTGAAATAGCTATTAATAGCCAAACCGGTATTCGTATCAGAAGCAATGCCGATAAAACTGGTTCTGTAGCTACAATGTTACGAAATGATGGCGGTAGTTTTTATATTCTGTTTACAGATAAAAATGACACCGATGGCGCAGCAACTGTTAATGGTGAGTGGAATAGTAAACGTCCTTTCGCAATTAACTTAACAACCGGCGAAGTGATGATGAATAATGGCATAGCTGTTCGCAGCGCTGCTTTATTCTATAATAGCATAAACGTCAAAGATAATGGTTCTATTAACTTTGATAAATCAGGTGCTAACCCGAGAAATATGAGAATATTTCATGCTGGCGATGCTTCTCGCGGTAACCGTATTGAAATTGCCGATGAAACTAATTATATTGCTTACTTTGAAAAAGCTCCAGGTGGAGCTAACCGCTTTGTAGTAAATAATGCTACTGTATCTGGTGTTAATCAAATGAATTCATTTGGTGTCAATACATCAAATGCTCTCGGTGGAAACAGTATAACATTTGGTGATACCGATACTGGTATTAAGCAAAATGGTGATGGATTATTAGACATATATGCGAACAACGCACAAGTATTCCGTTTCCAAAACGGCGATTTGTACTCATACAAAAATATAAATGCTCCAAACGTTTATATTCGTTCTGATATTCGTTTAAAATCCAACTTTAAGCCTATCGAAAATGCACTTGATAAAGTTGAAAAACTTAACGGTGTCATTTATGATAAAGCTGAATACATCGGTGGAGAGGCAATTGAAACTGAAGCGGGTATTGTGGCTCAAACGCTACAAGACGTTTTACCAGAAGCCGTCCGTGAAACAGAAGACAGCAAGGGTAATAAAATACTCACTGTTTCTTCTCAAGCCCAGATTGCTCTTCTGGTTGAAGCTGTGAAAACGCTTTCTGCTCGTGTAAAAGAACTTGAATCTAAACTTATGTAA

Genome Context

Gene Ontology

Description Category Evidence (source)
GO:0098024 virus tail, fiber Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0046718 symbiont entry into host cell Biological Process IEA:UniProtKB-KW (UniProt)
GO:0019062 virion attachment to host cell Biological Process IEA:UniProtKB-KW (UniProt)

Tertiary structure

Q38394
ESMFold structure
Source ESMFold
pLDDT 51.7
Oligomeric state monomer

Literature

Title Authors Date PMID Source
6092843 PubMed