Genbank accession
ANS02388.1 [GenBank]
Protein name
tail fiber protein host specificity
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,53
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MGNYCSRCRNFMGGKYTLIINVLNKDLVPVTFIDNDIPGLPSYYKDTLIDYLSLGTASFEFTILKSKNNIIQDYSRFFNDETCFSFEKNGKQYAVFPAGSDGFYETDTEITYKCLSLDRELSLEYVDKFDNSSTHTLQWYIDYFELISNNQIEIGRNDVADYTRVIKYDSQDTKLNRLLSLINNFDAEFEFITKLTNNGAVDKIILNIVKKRDDSGKGGIGAIRDDVELVYGKNVKGIERTYNFEFFNASKVIGKDGTNWNSSEFSYINSDGVEEFYKRKNDDTAFAPLSAQKYPAHLRKDSSDIWLRKNFETEYTTPAQMWGYIVQQFKSYAYPQITYKIKTNSNLVSQALDGKLPIQIGDTVTIEDDNFSNEQGDFGLILRARATEIKSSDSNPETNEITFENFVELQNDLSDDLMTQVNQLVDAATPFRAELSTTNGTQFKNGTGSTTLSAHIFKGSATTETVADSYEWSKDGTVVANAQTITVDASGVVDKAVYSFKATVGGKVVASQSVTITNVNDGTNGRSVTNVSQKWRLTTTTATPTQAWSDAGWLTTQPTTTATNKYLWSITRTTFNLAPLTQDVIEQKAVYGDKGDKGDTGNDGIAGKDGVGIKTTVITYAISTSGTIAPNTGWTSSVPSLVKGQYLWTKTVWNYSDGTSESGYTVTYIAKDGNNGNDGIAGKDGVGLINTTLRYAKSKDGVNKPEGRIVASFTDKFIPARSIIDNLIMTGKQVHLEQGKTYILSAETNGIFTNVHNVEQSNNATIWIVNPSFSTWDIISDTNTAIGTKYTHNRPTGDYEIRINSYEIDNSIWVKNIVFEDGTWTPDIPVANPGEYLWTRTTWFYSDNTNETGFSVAKMGEQGPKGDPGKDGIAGKDGVGIKTTVITYAISTNETTAPATGWTSSVPSLVKGQYLWTKTVWTYTDNSSETGYSVTYISKDGNNGTNGIAGKDGVGIKTTTITYAGSTNGTTAPNTGWTSTVPTVAAGNYLWTKTVWAYTDNTSETGYSVAMMGVKGERGKQIFKSSYESVPHNNFHYWSDLSPAPSIDNPPKIGDTVITPSGNILQIDTVNVGGGGGGGTFGVGDVLGNIKGPSGSNGDPGKVVSDTEPTTRFKGLTWKYSGTTDLTASDGTVIKPNVEYYYNGTHWVINYFSVNNFAAESITSDKIDGKNLTITDGEFISKTTNGPVTTSTEIKDNHISISKTDGTVNTRNDIALDSEQGLAQKFTNINTGFYRTAGINYQGPFTSDSDGNFAQLTPQGTKLSTDVPWTKLSLMNNFNGNIEYAIINGTVYISASGVGVPAMTAGQWKQAAQLPTGSSAIPIRANRIAAGDSGDGLSWALLSNQAGGIFIRCSGNKSPTANLFNATLPYPIG
Physico‐chemical
properties
protein length:1373 AA
molecular weight: 149544,56520 Da
isoelectric point:5,02664
aromaticity:0,10488
hydropathy:-0,42702

Domains

Domains [InterPro]
DC_0690
STR
6–723
IPR010572
ENZ
228–407
ANS02388.1
1 1373
Architecture
STR
STR
STR 6-723 | STR 780-1349 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
ANS02388.1
1 1373
Domain Start End Length (AA) Confidence
N-terminal 1 702 702 0,9254
Central domain 703 1259 558 0,2576
C-terminal 1260 1373 113 0,9756
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-702
Central
703-1259
C-terminal
1260-1373

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactococcus phage 28201
[NCBI]
1871678 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Lactococcus lactis
[NCBI]
1358 cellular organisms > Bacteria > Bacillati > Bacillota > Bacilli > Lactobacillales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ANS02388.1 [NCBI]
Genbank nucleotide accession
KX456206.1 [NCBI]
CDS location
range 28372 -> 32493
strand +
CDS
TTGGGTAACTATTGCTCCAGATGTAGAAATTTCATGGGAGGAAAATATACTTTGATTATTAATGTTTTAAATAAAGACTTAGTTCCAGTAACTTTCATTGATAATGATATTCCCGGTTTACCAAGTTATTACAAAGATACTTTGATTGATTATCTAAGTCTAGGTACAGCATCATTTGAATTTACGATATTAAAATCCAAAAACAACATCATTCAAGACTACTCCAGATTTTTTAATGATGAGACATGCTTCTCTTTCGAGAAAAATGGTAAACAATATGCAGTTTTTCCTGCTGGTTCAGATGGGTTCTATGAAACAGACACAGAAATAACGTATAAATGTTTATCACTAGACCGTGAATTGTCTTTAGAATATGTTGATAAGTTTGATAATTCATCAACTCATACTCTGCAATGGTATATCGATTATTTTGAATTAATTTCAAACAATCAAATTGAAATTGGAAGGAATGATGTTGCTGATTATACAAGAGTCATAAAATATGATTCCCAAGATACAAAACTAAATCGCCTTTTATCTTTAATTAACAACTTTGATGCCGAGTTTGAGTTTATTACAAAACTAACCAATAATGGCGCGGTTGATAAAATAATTTTAAATATCGTTAAAAAACGCGATGATTCAGGCAAAGGTGGTATAGGGGCAATCAGGGATGATGTTGAGCTTGTATATGGAAAGAACGTTAAGGGGATTGAGAGAACTTATAATTTTGAGTTCTTTAATGCGTCTAAAGTTATAGGTAAAGATGGAACTAATTGGAATTCAAGTGAATTTTCCTATATTAATTCAGATGGAGTTGAGGAGTTTTATAAAAGAAAAAATGACGATACAGCATTTGCTCCACTTTCTGCTCAGAAATATCCAGCTCATCTTAGAAAAGACTCTTCAGATATATGGCTTAGGAAAAATTTTGAAACCGAGTATACTACTCCTGCTCAAATGTGGGGATATATTGTTCAACAATTTAAGTCATACGCTTATCCTCAGATTACTTATAAAATCAAAACAAATAGCAATTTAGTATCGCAAGCTCTCGATGGAAAACTTCCTATTCAGATTGGTGATACAGTAACCATTGAAGATGATAATTTTTCAAATGAGCAAGGTGATTTCGGATTAATCTTAAGAGCGAGAGCAACTGAAATTAAATCATCCGATAGTAATCCAGAAACAAACGAAATCACCTTTGAAAATTTCGTTGAATTGCAAAATGATTTATCAGATGACCTAATGACACAAGTTAATCAGTTGGTCGATGCAGCTACTCCGTTTCGAGCAGAACTTAGCACAACTAATGGCACACAGTTCAAAAATGGTACTGGCTCAACAACTTTATCAGCTCATATTTTCAAAGGCTCTGCAACAACTGAAACAGTCGCAGACAGCTACGAATGGTCGAAAGATGGAACGGTTGTTGCGAACGCTCAGACTATCACAGTTGATGCCAGCGGAGTTGTGGATAAAGCAGTTTATAGTTTTAAAGCAACGGTTGGCGGTAAAGTAGTCGCAAGTCAGTCGGTCACTATCACTAATGTGAATGATGGAACAAATGGACGTTCTGTTACAAACGTTTCTCAAAAGTGGCGTTTGACAACGACTACTGCAACACCAACGCAAGCTTGGTCAGACGCAGGTTGGCTCACTACTCAACCAACAACGACAGCTACTAATAAATATCTATGGTCTATCACTCGAACAACTTTCAATTTAGCACCTTTAACGCAAGATGTTATTGAACAAAAAGCAGTTTATGGTGATAAAGGCGATAAGGGAGATACTGGAAATGACGGAATAGCAGGTAAGGACGGTGTTGGAATAAAAACCACTGTTATCACTTACGCTATTTCAACAAGCGGAACGATAGCACCAAACACTGGCTGGACAAGTTCAGTTCCCAGTCTTGTAAAAGGGCAATATCTCTGGACGAAAACAGTTTGGAATTACTCAGACGGAACGAGTGAATCGGGATATACAGTAACTTATATTGCAAAAGACGGAAATAACGGTAATGACGGAATTGCTGGTAAAGATGGCGTTGGGCTTATTAATACCACGCTACGTTATGCAAAGTCAAAAGACGGTGTAAATAAACCTGAAGGTCGTATTGTAGCTTCTTTCACGGATAAGTTCATACCAGCTCGCTCAATCATCGATAACCTCATCATGACTGGTAAACAGGTTCACCTCGAACAAGGTAAGACTTATATCTTATCTGCTGAAACGAACGGCATATTTACAAATGTTCATAATGTGGAACAAAGTAACAACGCTACAATCTGGATTGTTAATCCAAGTTTTAGCACGTGGGATATTATTTCCGACACCAACACAGCTATCGGTACGAAATACACGCACAATCGTCCGACTGGTGATTACGAAATTCGTATTAACAGTTATGAAATAGATAATTCAATTTGGGTTAAAAACATTGTATTTGAAGACGGTACTTGGACTCCTGACATACCAGTGGCCAATCCCGGCGAATACCTCTGGACGAGAACGACATGGTTCTATTCAGATAACACAAATGAAACTGGTTTTTCCGTTGCGAAAATGGGAGAACAAGGACCTAAAGGAGACCCTGGTAAAGACGGAATAGCAGGTAAGGACGGTGTTGGTATTAAAACCACCGTTATTACTTACGCTATTTCAACAAATGAAACAACAGCACCGGCAACTGGCTGGACAAGTTCAGTTCCCAGTCTTGTAAAAGGGCAATATCTCTGGACGAAAACAGTTTGGACATACACGGATAACTCATCTGAAACAGGTTATTCAGTCACTTATATTTCTAAGGATGGTAACAACGGTACTAATGGAATTGCTGGCAAGGACGGCGTTGGTATTAAAACTACGACCATTACATACGCAGGTTCAACAAATGGAACGACAGCACCAAATACTGGTTGGACTTCCACAGTTCCAACAGTTGCAGCAGGTAACTACCTATGGACTAAGACTGTTTGGGCTTATACGGATAATACCAGCGAAACAGGATATTCCGTTGCAATGATGGGGGTTAAAGGTGAACGTGGAAAGCAGATTTTTAAAAGTAGTTACGAGTCTGTACCACATAATAATTTTCATTATTGGTCTGATTTAAGCCCAGCACCGTCCATTGATAATCCTCCAAAAATTGGTGATACCGTAATCACTCCATCTGGCAACATTTTACAAATTGATACTGTAAACGTTGGCGGCGGAGGCGGTGGTGGAACTTTTGGAGTTGGCGATGTACTTGGAAACATCAAAGGACCTTCTGGTAGTAACGGCGACCCAGGTAAAGTTGTTTCTGATACTGAGCCAACGACTCGATTTAAAGGCTTGACTTGGAAGTATTCAGGCACCACTGACCTTACAGCTAGTGATGGAACGGTCATTAAGCCAAATGTTGAGTATTACTATAATGGCACTCATTGGGTGATTAACTATTTTAGCGTCAATAACTTTGCGGCTGAATCGATAACATCAGATAAAATTGATGGTAAAAATTTAACAATTACTGATGGTGAGTTCATTAGCAAAACAACTAATGGTCCAGTTACAACCTCTACTGAAATTAAAGATAATCATATTTCGATTTCAAAGACAGACGGAACTGTTAATACCAGAAATGATATAGCGCTTGATTCTGAACAAGGACTAGCTCAGAAATTTACGAACATTAATACAGGATTCTACAGAACAGCTGGGATTAATTATCAAGGGCCATTCACAAGTGACTCAGATGGAAATTTTGCTCAACTCACACCTCAAGGCACAAAGTTATCTACTGATGTTCCTTGGACCAAGCTTAGTTTGATGAATAATTTTAATGGAAATATTGAGTATGCAATTATCAATGGGACTGTCTATATATCAGCGTCAGGAGTTGGCGTACCAGCAATGACTGCTGGTCAATGGAAGCAAGCGGCTCAATTGCCAACAGGAAGTTCAGCAATTCCAATTAGAGCAAATCGAATTGCAGCAGGAGATAGTGGAGATGGTCTAAGTTGGGCATTACTTTCTAATCAGGCTGGAGGAATATTCATTCGATGCAGTGGTAATAAATCGCCAACAGCTAACTTATTTAATGCCACATTACCATATCCTATCGGATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
ba574277ed2730b07506ab7ee51cb6639f4a1c4ec8e8edf27a85550f39ea1f5e
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7428
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Genome sequence of eight prophages isolated from Lactococcus lactis dairy strains Oliveira,J. and van Sinderen,D. 2016-12-29 GenBank