GenBank accession
CAB5194888.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence RBPdetect
Probability 0.51
TF
Evidence RBPdetect2
Probability 0.98
Protein sequence
MPNTDNEPQIPKNQSSITDSRTGLVSRDWYRFFLNLLNKANEGNNAGLGTVTSVSVSGGTTGLTTSGSPITTSGTITIAGTLDVDNGGTGATTSAAALSNLGAYPASNPNGYNSGTVTSVGATVPAFLSVSGSPVTTTGTLAITYSGTGLPVANGGTGATIASTARSNLSAAQSGANIDITSIALTTGTITTAPSGNDDIVNKFYADSLINGVNFHAACNYATTAALAANTYNNGSSGVGATLTAVAVGTLTVDGYTFVVGDVGKRILVKNEVTGANNGVYTLTQAGTALLPYILTRATDYDTSGTGSNEIDQGDLVLILAGTANANTSWVQQTALPITVGTTALVFVQFAAVQTYSAGTGLSLASNTFSIANTGTAGTYGSASVVPVFVTNAQGQVTSVTNTAIAIAGSAVTGNITGNAANVTGTVAVANGGTGLTTTPANGALDIGNGTGFTRTTLTPSTGITVTNASGSITIANSLPMTYPGAGIPNSTGTAWGTSYTTTGSGTVVALATSPSFTTPILGTPTSGNFSTGTFTWPTFNQNTTGTASNVTGTVAVANGGTGLTTAPTNGQIDIGSTGVGFVRTTLTAGSGISVTNAAGSVTITNTSPSSGGTVTSVTGTAPVSVATGTTTPVISLAASYGDTQNPYASKTANYVLAAPNGAAGVPTFRAIVAADIPTLNQNTTGSAATLTTGRTISISGDLTYTSPSFDGSANVTAAGTLATVNATVGSFTNASLTVNGKGLVTAVSSGTAPVTSVTGTAPVVSSGGATPAISLAASYGDTQNPYASKTAKFVLAAPNAAAGVPTFRAIVASDIPTLNQNTTGTASNVTGTVAIANGGTGQTTAVAAFDALSPATTKGDLIVSNGTDNVRQAVGTDTYVLIADSTQTSGVKWGPVSVVYFTSSESTAAPNATVPVDALTVANASANVDVVLAAKGTGATLAQIPDSLTSGGNKRGTYATDWQKSRSSASYVASGIYSTVSGGADNRTTGDYGVAVGGLANATTGNYSISGGRSCTASGTYSIAMGYSNSTPTAGSGNVALGYNTTCSGSYGLTAGYNADARVRYSVFAMSSAAIASTQPTQTTIQTVGKETTTATPATLTTLSSGTTYFYQNRMDINSAYAFKIMVVANVTGGGNTKAWELTGCIKRGASGAPTIVGAVTKTIIAADTGTSAWDVSVVIDTDAFSVQATGAAATTIRWAASIYCSEVQF
Physico-chemical properties
protein length: 1211 AA
molecular weight: 119192.65310 Da
isoelectric point: 4.90023
aromaticity: 0.06193
hydropathy: 0.14162
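
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the common conventions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names and the toy peptide are hypothetical and chosen only for illustration. Under the aromaticity assumption, the reported 0.06193 would correspond to about 75 aromatic residues out of 1211 (0.06193 × 1211 ≈ 75).

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MFWYAG"  # hypothetical toy peptide, not taken from the record
    print(aromaticity(toy), gravy(toy))
```

Passing the full protein sequence from this record to both functions would show whether the listed values follow these conventions.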

Domains

CAB5194888.1 (1–1211 aa)
STR 963–1065

Legend: ATT Attachment Domain · STR Structural Domain · RBD Receptor-Binding Domain · CBM Carbohydrate-Binding Module · LEC Lectin-like Domain · ENZ Enzymatic Domain · CHP Intramolecular Chaperone · LNK Linker/Spacer Domain · TAS Tail-Associated Structural · TTP Tail Tubular Protein · UNK Uncharacterized Domain · Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

CAB5194888.1 (1–1211 aa)
Domain          Start  End   Length (AA)  Confidence
N-terminal      1      523   523          0.6763
Central domain  524    722   200          0.3680
C-terminal      723    1211  488          0.4529

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.
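
As a quick sanity check on the segmentation table, the sketch below recomputes each domain length from its start and end, assuming 1-based inclusive coordinates. That convention is an assumption, since the record does not state it, and the helper names are hypothetical. Under it, the central and C-terminal domains come out as 199 and 489 residues rather than the listed 200 and 488. Both sets of lengths sum to 1211, so the difference is a one-residue shift at the central/C-terminal boundary.

```python
# Rows copied from the segmentation table: (name, start, end, listed_length).
DOMAINS = [
    ("N-terminal", 1, 523, 523),
    ("Central domain", 524, 722, 200),
    ("C-terminal", 723, 1211, 488),
]
PROTEIN_LENGTH = 1211


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def length_mismatches(domains):
    """Return (name, listed, computed) for rows whose listed length differs."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in domains
        if listed != inclusive_length(start, end)
    ]


def is_contiguous(domains, protein_length: int) -> bool:
    """True if the domains tile 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _, start, end, _ in domains:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, listed, computed in length_mismatches(DOMAINS):
        print(f"{name}: listed {listed}, computed {computed}")
    print("contiguous:", is_contiguous(DOMAINS, PROTEIN_LENGTH))
```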

Taxonomy

Phage
uncultured Caudovirales phage [NCBI] · taxon 2100421
Host No host information

Coding sequence (CDS)

GenBank protein accession
CAB5194888.1 [NCBI]
GenBank nucleotide accession
LR798219 [NCBI]
CDS location
range 29445 -> 33080
strand +
CDS
ATGCCTAATACCGATAACGAACCGCAGATACCCAAAAACCAATCCTCGATTACTGATTCTAGGACAGGGTTGGTTTCGCGTGATTGGTATCGCTTTTTTTTAAATCTGCTTAATAAGGCCAACGAAGGCAACAATGCGGGGTTAGGCACCGTAACGTCTGTTAGCGTGTCAGGCGGAACGACAGGTTTAACAACATCAGGCAGCCCAATTACAACCTCGGGCACCATTACTATCGCGGGCACATTAGATGTAGACAACGGCGGTACGGGGGCTACTACATCGGCTGCCGCGTTAAGTAACCTTGGCGCGTATCCAGCGTCTAACCCTAATGGGTACAATTCAGGTACGGTTACCAGCGTAGGAGCTACGGTTCCCGCATTTCTGTCGGTGTCTGGTAGCCCTGTTACAACTACAGGCACGCTAGCCATTACGTATTCGGGCACTGGTTTGCCGGTAGCTAATGGCGGTACTGGCGCCACCATTGCATCAACGGCTAGGTCTAATTTAAGTGCAGCGCAAAGCGGGGCAAATATTGACATTACATCAATCGCGTTAACTACTGGCACCATTACTACGGCCCCAAGTGGAAACGACGACATAGTTAATAAGTTTTATGCTGATTCGTTAATTAACGGCGTCAATTTTCACGCTGCGTGTAATTACGCTACTACGGCAGCTTTAGCGGCTAATACTTACAACAATGGGTCAAGCGGAGTAGGCGCTACGTTAACAGCAGTAGCTGTAGGCACTTTAACTGTTGATGGGTATACTTTTGTCGTCGGCGACGTAGGCAAACGTATATTAGTCAAAAATGAGGTAACGGGTGCCAACAATGGCGTCTATACGCTAACTCAAGCGGGTACTGCATTACTGCCGTACATTCTTACCCGCGCAACAGATTACGATACTAGCGGAACGGGATCAAATGAAATTGATCAAGGTGATTTGGTACTTATTTTGGCGGGTACGGCTAATGCCAACACGTCATGGGTTCAACAAACCGCATTACCTATTACTGTTGGCACTACGGCGCTTGTTTTTGTGCAGTTTGCGGCAGTTCAAACGTATTCCGCCGGTACAGGGTTAAGCCTTGCCAGTAACACGTTTTCGATTGCCAACACCGGCACAGCAGGTACGTATGGTTCCGCATCGGTGGTGCCGGTGTTTGTTACCAACGCTCAAGGTCAAGTTACTAGCGTAACTAATACGGCGATAGCTATTGCGGGCAGCGCGGTAACGGGCAATATAACGGGTAATGCAGCCAATGTAACCGGCACAGTAGCCGTAGCCAACGGCGGTACAGGTCTTACCACCACTCCAGCCAACGGCGCGTTAGACATTGGTAACGGTACGGGGTTTACTCGTACAACGCTAACTCCAAGCACGGGCATAACAGTTACCAACGCATCAGGATCAATTACGATCGCCAATTCGTTGCCTATGACGTACCCTGGAGCGGGCATCCCTAACTCGACCGGCACGGCATGGGGCACGTCGTACACAACTACAGGCTCTGGTACTGTGGTGGCATTAGCCACATCCCCTAGCTTTACTACGCCAATATTAGGCACACCCACGTCGGGCAACTTTAGTACCGGCACGTTTACGTGGCCAACTTTTAACCAGAACACCACGGGTACAGCTAGTAACGTCACAGGTACGGTTGCGGTGGCCAACGGCGGTACGGGATTAACTACAGCGCCAACTAACGGTCAGATTGATATTGGCAGCACAGGTGTTGGATTTGTCCGAACAACATTAACGGCGGGTAGCGGCATTTCGGTAACCAATGCTGCAGGTTCGGTCACCATTACCAATACAAGCCCTTCTAGCGGCGGTACAGTTACGTCTGTAACCGGCACGGCGCCAGTGTCTGTAGCTACGGGTACGACTACCCCTGTCATTAGTTTGGCGGCTAGCTATGGCGACACTCAAAATCCATATGCCAGTAAAACGGCTAATTATGTGTTGGCGGCGCCTAATGGCGCGGCGGGCGTACC
CACGTTTAGAGCAATAGTCGCAGCAGACATACCAACATTAAACCAAAACACAACTGGTTCTGCGGCCACGCTGACTACGGGCAGAACTATTTCTATCTCGGGCGATTTAACGTATACCAGCCCTAGTTTTGATGGTTCGGCCAACGTAACTGCTGCGGGTACATTGGCTACCGTTAACGCTACTGTAGGCAGTTTTACCAATGCTTCACTTACCGTTAACGGTAAAGGGTTAGTTACCGCAGTTTCTAGCGGCACCGCGCCGGTTACCTCGGTAACAGGCACGGCGCCAGTTGTGTCGTCAGGCGGCGCAACGCCTGCTATTAGTTTGGCGGCCAGTTACGGCGATACCCAAAACCCGTACGCTAGCAAAACAGCTAAATTTGTTTTAGCTGCTCCCAACGCGGCAGCGGGCGTGCCTACGTTTAGGGCAATTGTTGCTAGCGACATACCAACGCTAAACCAAAACACTACCGGAACTGCTAGCAATGTTACCGGCACGGTAGCTATTGCCAACGGCGGCACGGGGCAAACAACTGCAGTAGCCGCTTTTGATGCTTTATCGCCTGCAACTACTAAAGGCGATTTAATTGTTAGCAACGGCACGGACAATGTCCGGCAAGCCGTGGGCACAGACACTTACGTTTTAATTGCCGATTCAACACAAACATCAGGCGTTAAATGGGGGCCAGTTAGCGTTGTTTATTTTACGTCTTCTGAAAGCACTGCCGCACCTAATGCTACCGTTCCTGTTGACGCGTTAACGGTAGCTAACGCAAGCGCAAACGTAGACGTTGTGCTTGCTGCTAAAGGTACGGGCGCAACGCTTGCTCAAATACCCGATTCATTAACTTCGGGCGGTAACAAGCGGGGCACGTACGCTACAGATTGGCAAAAATCTAGGTCTTCCGCAAGTTATGTCGCTTCTGGCATATATTCCACCGTTTCAGGGGGAGCCGACAACCGAACAACGGGCGATTACGGCGTAGCTGTCGGAGGTTTGGCTAATGCAACAACAGGAAATTATTCTATTTCTGGAGGCCGCAGTTGTACCGCTAGCGGAACCTATTCAATAGCAATGGGGTATTCTAATTCCACACCAACGGCTGGTAGTGGAAATGTAGCTTTAGGATATAACACTACATGTAGTGGAAGTTATGGGTTAACCGCAGGGTACAATGCTGACGCTCGAGTCAGGTACAGCGTTTTTGCAATGAGCAGCGCGGCTATTGCGTCTACTCAACCAACACAAACTACAATTCAAACGGTAGGAAAAGAGACTACTACTGCAACGCCAGCAACCTTAACTACGCTTAGTTCAGGCACGACTTACTTTTATCAAAACCGCATGGACATTAATTCCGCCTATGCGTTTAAAATTATGGTGGTAGCTAACGTAACGGGCGGCGGGAATACTAAAGCGTGGGAGTTAACGGGTTGTATTAAACGCGGAGCATCTGGTGCCCCAACTATTGTGGGCGCGGTCACAAAGACTATTATTGCGGCGGACACCGGAACATCTGCTTGGGATGTCAGCGTAGTTATCGATACAGACGCTTTTTCAGTTCAAGCCACGGGCGCGGCGGCCACAACCATTCGTTGGGCGGCGAGCATATATTGCTCTGAAGTGCAATTCTAA
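
The CDS range and the protein length can be cross-checked with simple arithmetic. A minimal sketch, assuming 1-based inclusive nucleotide coordinates and a single terminal stop codon (the sequence above ends in TAA). The function name is hypothetical.

```python
def protein_length_from_cds(start: int, end: int, has_stop: bool = True) -> int:
    """Amino-acid count implied by a CDS range.

    Assumes 1-based inclusive coordinates and, when has_stop is True,
    one terminal stop codon that is not translated.
    """
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    # Range taken from the record: 29445 -> 33080 on the + strand.
    print(protein_length_from_cds(29445, 33080))
```

For this record the range spans 3636 nt, or 1212 codons, which gives 1211 residues once the stop codon is excluded. That matches the listed protein length.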

Tertiary structure

CAB5194888.1
ColabFold structure
Source ColabFold
pLDDT 27.5
Oligomeric state monomer