Genbank accession
CAH1073607.1 [GenBank]
Protein name
phage protein
RBP type
TSP
Evidence DepoScope
Probability 1.00
Protein sequence
MRQWILKNNLSSGELSPLLWTRTDIQQYANGAKKLLNALPLVEGGAKKRPGTKFRSIFAGALRLIPFIANSENTYLLILGVSFLKVYNPRTYAVVYETVTPYNTAQKVREVQYAHTKYRMYFVQGDTPVQRLLCSADFTNWQFAAFTFGVNPNDELGSTPNVALTPSGTEVGKVISLTASSFPNWSNTETYLTGDRVIHTSKTWRATIDNKGIEPTATTSEWEEVTNEAANVFTPSNVGSIIEINGGQVKITQYVDPSRVNGEVLVKLTSAVQAIAKSWVLKSIAFSAEAGYPKAVCFFKQRLVFANTKTSPNQMWFSRVGDDGNFLETTQDADAFSIASSSAQSDNILHLSQRGGVVALTGGAEFLINSQGPLTPASAQIDEHTSYGVQANVKPCRVGNELLFVQRGGERLRAMSYRYEVDGLVSPELSQIAPHIPENHAGIKELTFQQTPNSIVWIVMGDGAVSSITLNRDQEMNAWSQHDFGGQVLSICALPTGLGEDQCFMLTIRNGSTVLEEFSESAQSDCEFDINVTNGVGSILNLDIQVLDNPLVNFNNADGYFYSTYTISGTNINLSNTDLTQTVHLGQPFKTEIDLLPPDFSQVPTTAMFHKIQVHEMAIFLNASVGGYINGQELSTKYYNQSAFVNLPYTGYVVDSFVGWQSLHELEVKITHDKPMPLHMQSISMLVSINEK
Physico-chemical properties
protein length: 692 aa
molecular weight: 76233.01340 Da
isoelectric point: 5.34977
aromaticity: 0.09682
hydropathy: -0.16055
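
The record does not say how these values were computed. As a rough sketch, assuming aromaticity is the fraction of Phe, Trp and Tyr residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), both can be recomputed from the protein sequence with a few lines of Python. The function names below are illustrative, not part of any tool named on this page. Under that assumption, the listed aromaticity would correspond to about 67 aromatic residues out of 692.

```python
# Kyte-Doolittle hydropathy scale (standard published values per residue).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MFWYAK"  # hypothetical toy peptide, not the record's sequence
    print(f"aromaticity: {aromaticity(toy):.5f}")
    print(f"hydropathy:  {gravy(toy):.5f}")
```

To compare against the values listed above, pass the full protein sequence from this record instead of the toy peptide.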

Domains

ATT 166–228 · TTP 295–520

ATT: Attachment Domain
STR: Structural Domain
RBD: Receptor-Binding Domain
CBM: Carbohydrate-Binding Module
LEC: Lectin-like Domain
ENZ: Enzymatic Domain
CHP: Intramolecular Chaperone
LNK: Linker/Spacer Domain
TAS: Tail-Associated Structural
TTP: Tail Tubular Protein
UNK: Uncharacterized Domain
Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

Domain          Start  End  Length (AA)  Confidence
N-terminal          1  166          166      0.9864
Central domain    167  365          200      0.0327
C-terminal        366  692          326      0.2396
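
The relationship between the Start, End and Length columns can be checked with a short script. This is only a sketch: it assumes 1-based inclusive coordinates, which the page does not state, and the `Segment` type and helper names are made up for illustration. Under that assumption the listed lengths of the central and C-terminal segments each differ by one residue from `end - start + 1`, while the three listed lengths still sum to 692, so the source may be using a different boundary convention.

```python
from typing import List, NamedTuple, Tuple


class Segment(NamedTuple):
    name: str
    start: int
    end: int
    listed_length: int


# Values copied from the segmentation table in this record.
SEGMENTS = [
    Segment("N-terminal", 1, 166, 166),
    Segment("Central domain", 167, 365, 200),
    Segment("C-terminal", 366, 692, 326),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (an assumption)."""
    return end - start + 1


def length_mismatches(segments: List[Segment]) -> List[Tuple[str, int, int]]:
    """Return (name, listed, computed) for rows whose lengths disagree."""
    return [
        (s.name, s.listed_length, inclusive_length(s.start, s.end))
        for s in segments
        if s.listed_length != inclusive_length(s.start, s.end)
    ]


if __name__ == "__main__":
    for name, listed, computed in length_mismatches(SEGMENTS):
        print(f"{name}: listed {listed}, computed {computed}")
    print("sum of listed lengths:", sum(s.listed_length for s in SEGMENTS))
```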


Taxonomy

Phage
Acinetobacter phage MD-2021a [NCBI] · taxon 2899278
Host No host information

Coding sequence (CDS)

Genbank protein accession
CAH1073607.1 [NCBI]
Genbank nucleotide accession
CAKLQF020000002 [NCBI]
CDS location
range 247245 -> 249323
strand -
CDS
ATGAGACAGTGGATCCTAAAAAATAACCTGAGTTCTGGTGAATTAAGCCCGTTACTTTGGACGCGCACAGACATTCAGCAATACGCAAACGGTGCAAAAAAATTACTTAATGCATTGCCTTTGGTTGAAGGTGGAGCAAAGAAAAGACCAGGCACAAAGTTCCGTTCTATTTTTGCAGGTGCATTACGTTTAATTCCGTTTATTGCAAACTCAGAAAACACCTATTTGCTTATCCTTGGTGTGTCTTTCCTCAAGGTTTACAACCCAAGAACGTATGCAGTTGTTTATGAAACTGTGACACCTTACAACACGGCTCAAAAAGTACGTGAAGTACAGTACGCACACACTAAATACCGCATGTATTTCGTTCAAGGTGATACACCTGTACAGCGTTTGCTGTGTTCTGCCGACTTTACTAACTGGCAATTTGCGGCTTTTACCTTTGGTGTGAACCCTAATGATGAGTTAGGTAGTACTCCAAACGTAGCATTGACACCATCCGGTACAGAAGTTGGAAAAGTTATTTCCTTAACTGCTTCATCATTCCCAAACTGGTCAAATACTGAGACTTACTTAACAGGTGATCGTGTAATTCACACTAGTAAGACTTGGCGTGCAACGATTGACAATAAAGGGATTGAGCCTACTGCAACTACTTCGGAATGGGAAGAAGTGACAAATGAAGCAGCTAACGTTTTTACACCTTCAAATGTAGGCTCAATTATTGAAATTAATGGTGGTCAAGTAAAAATAACTCAATATGTAGACCCTTCTCGTGTAAATGGTGAAGTTTTAGTAAAACTAACTTCTGCTGTTCAAGCTATTGCAAAGTCTTGGGTTTTAAAAAGTATCGCATTTAGTGCTGAGGCAGGCTATCCAAAGGCAGTGTGCTTCTTTAAACAGCGCTTAGTATTTGCCAATACGAAAACAAGCCCTAATCAGATGTGGTTTAGTCGCGTTGGTGACGATGGTAATTTCTTAGAGACAACTCAAGATGCGGATGCTTTTAGTATTGCTTCAAGCTCAGCTCAATCTGACAATATTTTGCACCTGTCACAGCGTGGTGGTGTAGTAGCATTAACTGGTGGTGCTGAGTTCTTAATTAACTCTCAAGGCCCTTTAACACCAGCTTCAGCACAGATTGATGAGCACACTTCTTATGGTGTTCAGGCGAATGTTAAGCCTTGCCGTGTGGGCAATGAGCTTCTCTTTGTACAACGTGGTGGTGAGCGTTTACGTGCAATGTCATACCGTTATGAAGTTGATGGACTTGTCTCGCCTGAATTGTCACAAATTGCCCCACACATACCTGAAAACCATGCAGGAATAAAAGAGTTAACCTTCCAGCAAACACCAAACTCTATTGTATGGATTGTTATGGGTGATGGTGCAGTCTCAAGTATCACACTAAACCGTGATCAGGAAATGAATGCTTGGTCTCAGCATGATTTTGGTGGTCAGGTTTTATCTATCTGCGCCTTGCCAACAGGCTTAGGTGAAGACCAATGTTTCATGCTGACTATTCGTAATGGCTCTACAGTCTTGGAAGAGTTTAGCGAGTCTGCACAGAGTGATTGTGAATTTGATATCAACGTAACTAATGGTGTTGGTTCTATTTTAAATCTTGATATTCAGGTTTTAGATAATCCACTGGTTAATTTTAATAATGCGGATGGATATTTCTATTCAACTTATACGATTAGTGGCACCAACATTAATCTATCTAACACTGATCTAACCCAAACTGTACACCTTGGCCAACCGTTTAAAACTGAAATCGACCTATTGCCACCAGACTTTAGCCAAGTACCAACAACTGCTATGTTTCATAAGATTCAGGTTCATGAAATGGCTATATTTTTGAATGCGTCAGTTGGTGGATATATCAATGGTCAAGAGTTATCTACCAAGTATTACAACCAATCGGCGTTCGTAAATTTGCCTTACACTGGCTATGTGGTCGATTCATTTGTTGGTTGGCAATCATTACATGAACTTGA
GGTCAAGATAACACACGACAAACCTATGCCTTTACACATGCAAAGTATCTCTATGTTGGTATCAATTAATGAGAAATGA
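
As a quick consistency check, the CDS location can be related to the protein length. The sketch below assumes 1-based inclusive nucleotide coordinates and a CDS that includes its stop codon (the sequence above ends in TGA); the function names are illustrative only. With the range 247245 -> 249323 this gives 2079 nt, that is 693 codons, or 692 amino acids plus the stop codon, which matches the listed protein length.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(nt_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    nt = cds_length(247245, 249323)
    print("CDS length (nt):", nt)
    print("expected protein length (aa):", expected_protein_length(nt))
```

The strand field only indicates which strand of the contig the range refers to; it does not change the length arithmetic.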

Tertiary structure

Source ESMFold
pLDDT 87.6
Oligomeric state monomer