Protein

UniProt accession
A0A220IHB5 [UniProt]
Protein name
Tail spike TSP1/Gp66 N-terminal domain-containing protein
RBP type
TSP

evidence: UniProt/TrEMBL

probability: 1,0000

Protein sequence
MSSGCGDVLSLADLQTAKKHQIFEAEVITGKSGGVATGADIDYATNAVTGQTQKTLPAVLRYAGFSPVSWDFSTGGTLTVNGRDKVVYDPVSKTWYSYAGALPVTVPAGFNPVGNADWKPQTDPDLRNELTSTSGASLVGGSVYVVDTFESAKLVENVNSKTLRTLGHHTEGKGPAEYVKTGTTGTPSSGTEALFYDATGHGWRLKTDSVDYYMFGAELDGATDDTLPVVAAHTYANANKIPIVQNGGNAFVTPNDSSRQIVFNTDAAFTGGFKFSSNSITGRGFVVRSVADEITLSQSDVVISEFVATRMKIPSLSGYANCYAAIQSTETDLNRKRDTGPALQLKRQPVVIGKDGTLDAPLWTTFSSISAIKIQSILLPTITIDGFRLRSEGALTNANPFGVQRNNVVIKNFYYEKGTTSATIPVQSLLSVEFCYNVTIEGISCDPLSVGIVDYNYVINTWTSSKISIRNMTSFDGWAQLDGNYCRNVSISDSIIDRVGSHFRCYDYTFKNLTARRGRCIAVSGGGLLHVENIKVYAESSTVSDEQYTVVALRGDYGAEWDGAVVVKNVIFDFSNFYSSSTGITASIVSAFIDSTVGAHDFMRTVVMPKSISIKDCTFIINTMPGSYLRALGVGCTSAIVSSVIYPSNIDIENITCDNSSGTNKFKVRGVEWNTLQRSEIRRHDVNIRVKGVVNIDPATYGETILDDANSTNTILTTGTNTRIYMTVRDCPWQSLRAEGNYIRTKVYGGSITYFTGTAGTSNRISFYGTEFNGTRFRGAIKVTIQDCIFNNYVDYTSASVPIGNGQSVDTNTTFIIGTATEFGATITGTVVTASTAKTGYSQSGYFQ
Physico‐chemical
properties
protein length:848 AA
molecular weight: 91426,13200 Da
isoelectric point:5,84012
aromaticity:0,09906
hydropathy:-0,11521

Domains

Domains [InterPro]
A0A220IHB5
1 848
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage ST20
[NCBI]
2569975 Uroviricota > Caudoviricetes > Guernseyvirinae >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ASH99372.1 [NCBI]
Genbank nucleotide accession
MF153391 [NCBI]
CDS location
range 19675 -> 22221
strand -
CDS
ATGTCAAGCGGATGCGGTGACGTTTTAAGCCTGGCGGATTTACAAACCGCCAAGAAGCACCAGATTTTCGAAGCCGAGGTTATCACCGGTAAATCCGGCGGGGTGGCTACTGGCGCTGATATTGATTACGCGACCAATGCTGTTACGGGTCAGACGCAGAAGACGCTCCCGGCGGTACTGCGCTATGCTGGGTTTTCTCCAGTATCGTGGGATTTCTCCACGGGCGGTACGTTGACAGTTAATGGCCGCGACAAAGTGGTGTATGACCCTGTAAGTAAAACGTGGTACTCGTATGCGGGTGCACTTCCTGTTACAGTTCCAGCCGGTTTTAACCCTGTGGGAAACGCTGACTGGAAACCGCAAACTGACCCCGATTTACGCAACGAGCTAACATCCACATCTGGTGCGTCGTTAGTAGGAGGGTCTGTATATGTCGTAGATACTTTTGAGTCGGCTAAACTTGTAGAAAACGTTAACTCTAAAACATTACGCACTTTAGGACACCACACAGAAGGTAAAGGCCCGGCGGAGTACGTTAAGACCGGAACTACAGGCACTCCGTCTAGCGGAACAGAAGCTTTATTTTACGATGCGACAGGTCACGGGTGGCGTCTAAAAACAGACAGTGTTGATTATTACATGTTTGGTGCGGAGCTTGACGGGGCAACAGACGATACATTACCGGTGGTAGCGGCGCACACCTACGCTAACGCTAATAAAATTCCTATTGTGCAAAATGGTGGTAATGCTTTCGTCACCCCAAACGACTCCTCCCGGCAGATTGTATTTAACACTGATGCCGCGTTTACCGGTGGGTTTAAGTTCAGTTCAAATTCAATCACTGGCCGTGGGTTTGTCGTACGCTCTGTTGCTGATGAGATAACCCTCTCACAGAGTGATGTTGTTATTTCTGAGTTCGTAGCAACACGGATGAAGATTCCAAGTCTGTCCGGTTACGCAAACTGTTACGCAGCTATTCAGTCTACGGAAACCGACCTAAATCGTAAACGTGATACTGGCCCTGCGTTGCAACTTAAACGTCAACCGGTAGTAATTGGCAAGGACGGAACTCTTGACGCGCCACTGTGGACGACTTTCAGCTCAATATCTGCGATTAAAATTCAAAGTATCCTTCTACCGACTATAACCATTGATGGTTTCCGGTTGCGGTCTGAGGGTGCGTTAACCAATGCCAACCCCTTCGGCGTTCAAAGGAACAACGTTGTCATAAAAAATTTCTACTATGAGAAAGGCACGACATCAGCAACAATACCGGTTCAATCCCTACTCAGCGTAGAATTTTGTTATAACGTAACTATAGAGGGTATCTCATGTGACCCGTTGAGTGTAGGTATCGTTGATTACAACTACGTTATAAACACATGGACAAGTTCTAAAATTTCCATCAGAAATATGACATCTTTTGATGGTTGGGCGCAGCTTGACGGCAACTATTGCCGAAATGTTTCTATTAGTGATTCCATAATCGACCGTGTAGGCTCTCACTTTAGGTGCTACGACTACACTTTCAAAAACCTCACGGCACGAAGGGGGCGCTGCATCGCTGTTTCCGGAGGAGGATTACTCCATGTGGAAAACATCAAGGTTTATGCGGAATCATCCACGGTATCCGATGAACAGTACACTGTCGTAGCCCTTCGAGGTGACTACGGTGCTGAGTGGGACGGAGCCGTCGTAGTTAAGAATGTTATTTTCGACTTCAGTAACTTCTACTCATCGTCTACTGGCATAACGGCATCTATAGTGTCGGCATTTATCGATTCCACCGTGGGTGCACACGATTTTATGCGTACAGTCGTTATGCCCAAGAGCATATCTATTAAAGATTGCACTTTCATCATAAACACCATGCCTGGAAGTTATCTGCGTGCATTAGGGGTTGGTTGTACCTCTGCAATAGTCTCTAGTGTTATTTACCCAAGTAATATAGATATAGAAAACATAACGTGTGATAACAGTTCTGGTACAAATAAATTTAAGGTCCGCGGGGTGGAATGGAACACTTTACAGCGTTCTGAGATACGGCGACATGACGTGAACATCCGCGTGAAAGGTGTTGTAAATATAGACCCGGCGACATACGGAGAGACCATACTTGATGACGCTAACAGCACAAATACCATCCTTACTACCGGAACAAACACCAGAATCTATATGACTGTTCGTGACTGCCCGTGGCAGAGCTTGCGTGCGGAAGGTAACTATATACGCACAAAAGTTTATGGAGGGAGCATTACTTACTTCACAGGCACGGCGGGTACATCAAACCGTATATCTTTTTATGGAACTGAGTTCAACGGAACAAGATTTCGCGGTGCGATAAAGGTTACTATCCAGGACTGTATTTTCAACAATTATGTTGATTACACATCCGCATCTGTTCCCATAGGGAACGGGCAGAGTGTTGATACCAACACGACTTTTATCATCGGAACTGCTACAGAGTTTGGAGCTACCATAACTGGCACTGTTGTTACGGCTTCAACAGCAAAAACTGGATACTCGCAATCTGGGTATTTCCAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0044423 virion component Cellular Component IEA:UniProtKB-KW (UniProt)
GO:0051701 biological process involved in interaction with host Biological Process IEA:UniProtKB-ARBA (UniProt)
GO:0019058 viral life cycle Biological Process IEA:UniProtKB-ARBA (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.

Literature

No literature entries available.