GenBank accession
YP_003858558.1 [GenBank]
Protein name
long tail fiber protein distal subunit
RBP type
Type  Evidence        Probability
TF    UniProt/TrEMBL  1.00
TSP   DepoScope       1.00
TSP   RBPdetect       0.73
TSP   RBPdetect2      0.57
Protein sequence
MADLKLGTTLGGAGIWSASNLPLLPSGDRLTYKGWRVYTENDRPTAEDINALSTINGGTVAKNTTFNQSVTVGINLTANTLYSKSYVDITTGGAAPGILFKRSDVTGTPSTEQGIMQITGVNASGVALATLNINARADGGNRIYLNAYKNGTTNTSFVVDSANQQVSVELGSFRVAGSSTFAAVSATTLNVSGAITANTVTPTDWTNHDNRYITGIPRSMQGNSFGAAVVTEKNDVLSVGGNVIDGPYGNTTYSGQVSTYRRTLNSGVSLVQTYYDTNSSWIRSGSGSPGAWSWSAGDANGWRRIYDTATPPTPAEVGAVNKAGDTMSGVLTLAPISQPLKTQGGGVFANDGNVYINKGGFAGWIDDLFFKNTGGVVTGTINQDGLNSNVYCTSALSITATGSKNYLRRFRGGNGDQIWHETVQGGTYRLATGDTDSANLLTIESSGNMILNGSQGGSSTLFLDADSNSVVWYRMINGTEKAVTFASNDGVYHIRTNNVRSNTWDFRAGMILNLGICSGSDYALIRGTPDGGGLDQWRERSSGLQIDMPNGSTSAYNIWKATVWDATHVAAMDVHIPNNDGAQARVRLIHQSGAYYHFDGSGQFTASGNGNFNDVYIRSDERLKSNLSKIESALDKVDLLEGVIYDKADHVGGEPTSREAGLIAQQLREVLPEAVKTGEDTERNEILTVSPTAVIALLVNAIKELREEIRELKSR
Physico-chemical properties
protein length: 715 AA
molecular weight: 76034.12500 Da
isoelectric point: 5.44565
aromaticity: 0.07832
hydropathy: -0.29273
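
The record does not say how aromaticity and hydropathy were computed. A minimal sketch, assuming the conventional definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), is given here. The function names are illustrative and not part of the source database.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First seven residues of the protein sequence above, as a small demo.
    prefix = "MADLKLG"
    print(len(prefix), aromaticity(prefix), round(gravy(prefix), 5))
```

Passing the full protein sequence to these functions would show whether the listed values follow the same conventions.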

Domains

YP_003858558.1 (1–715 aa)
CHP 619–715

Domain legend
ATT Attachment Domain
STR Structural Domain
RBD Receptor-Binding Domain
CBM Carbohydrate-Binding Module
LEC Lectin-like Domain
ENZ Enzymatic Domain
CHP Intramolecular Chaperone
LNK Linker/Spacer Domain
TAS Tail-Associated Structural
TTP Tail Tubular Protein
UNK Uncharacterized Domain
Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

YP_003858558.1 (1–715 aa)
Domain          Start  End  Length (AA)  Confidence
N-terminal      1      323  323          0.1689
Central domain  324    522  200          0.2521
C-terminal      523    715  192          0.9253
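
The Start/End and Length columns can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, which the record does not state. Under that assumption the central and C-terminal rows come out one residue off the listed lengths (199 vs 200, 193 vs 192), although the listed lengths still sum to 715. This may reflect a different boundary convention in the segmentation tool rather than an error. The helper names are illustrative.

```python
# (name, start, end, listed_length) copied from the segmentation table.
SEGMENTS = [
    ("N-terminal", 1, 323, 323),
    ("Central domain", 324, 522, 200),
    ("C-terminal", 523, 715, 192),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1


def is_contiguous(segments, protein_length: int) -> bool:
    """True if the ranges tile 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end, _listed in segments:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


def length_mismatches(segments):
    """Rows whose listed length differs from end - start + 1."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in segments
        if listed != inclusive_length(start, end)
    ]


if __name__ == "__main__":
    print("contiguous:", is_contiguous(SEGMENTS, 715))
    for name, listed, computed in length_mismatches(SEGMENTS):
        print(f"{name}: listed {listed}, computed {computed}")
```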


Taxonomy

Coding sequence (CDS)

GenBank protein accession
YP_003858558.1 [NCBI]
GenBank nucleotide accession
NC_014467 [NCBI]
CDS location
range 167611 -> 169758
strand +
CDS
ATGGCTGATTTGAAATTAGGCACAACGCTGGGTGGTGCGGGTATCTGGAGTGCGTCAAACCTCCCTCTGTTGCCCTCTGGCGACAGATTGACGTATAAAGGGTGGAGAGTATACACCGAAAACGATCGTCCAACAGCGGAGGATATAAACGCCCTCTCGACGATCAACGGCGGTACTGTTGCGAAGAACACCACGTTCAACCAAAGCGTGACCGTAGGTATTAACTTAACAGCGAACACGCTGTATTCAAAAAGTTATGTCGATATCACCACGGGCGGGGCTGCTCCGGGTATATTGTTTAAACGTTCTGATGTGACGGGAACACCTTCAACCGAACAGGGCATTATGCAAATCACTGGGGTTAATGCTTCCGGCGTTGCGCTGGCAACTCTGAATATTAACGCCCGTGCTGACGGTGGTAACAGGATCTATCTGAACGCATACAAAAACGGAACAACAAATACTTCCTTTGTTGTTGATTCTGCGAATCAACAGGTATCCGTTGAACTAGGTTCTTTCCGTGTAGCCGGATCATCGACCTTCGCTGCGGTGAGCGCAACCACGTTAAACGTAAGCGGGGCAATTACTGCTAATACGGTAACTCCTACTGATTGGACAAACCACGATAACCGTTATATCACGGGTATTCCTCGCAGTATGCAGGGAAACAGTTTTGGTGCTGCTGTTGTAACCGAAAAGAACGATGTTCTGTCAGTTGGTGGTAACGTGATCGATGGGCCTTATGGTAACACTACATATTCTGGTCAGGTTTCGACATATCGACGCACGTTGAATAGTGGTGTATCACTGGTTCAAACTTACTATGACACAAATTCATCATGGATTCGAAGCGGTTCTGGTTCTCCTGGTGCTTGGTCTTGGTCTGCTGGCGACGCTAATGGTTGGCGACGGATTTATGATACTGCTACTCCGCCAACTCCGGCAGAAGTAGGGGCAGTAAACAAAGCTGGAGATACCATGAGTGGTGTTTTAACACTGGCACCTATTTCTCAACCGCTTAAAACGCAAGGCGGTGGTGTTTTTGCGAATGACGGAAACGTGTATATCAATAAAGGAGGATTTGCTGGATGGATTGATGATTTGTTCTTTAAAAATACAGGAGGTGTTGTAACAGGGACAATCAACCAGGACGGCCTTAATTCTAATGTTTATTGTACTTCTGCATTGAGCATAACCGCAACAGGTTCGAAGAATTATCTTCGTCGTTTCCGTGGTGGTAATGGCGACCAGATCTGGCATGAAACTGTTCAGGGCGGCACGTATCGTTTAGCAACAGGTGATACAGACTCTGCAAATTTACTTACTATTGAAAGTTCTGGCAACATGATTCTTAACGGTTCTCAAGGTGGTTCATCAACCCTATTCCTTGATGCTGACAGTAACAGTGTAGTTTGGTATCGTATGATTAATGGTACGGAAAAGGCAGTAACCTTTGCAAGTAATGACGGTGTATACCATATCAGAACAAATAACGTCAGATCAAATACTTGGGATTTTAGAGCGGGTATGATTCTGAACCTGGGCATTTGTTCTGGTTCAGATTATGCATTGATTCGTGGTACTCCTGATGGTGGCGGTTTGGATCAGTGGCGTGAGCGTTCCTCTGGTTTACAGATCGATATGCCTAACGGTTCTACCTCAGCATATAACATCTGGAAAGCTACCGTATGGGACGCAACTCATGTTGCTGCTATGGATGTGCATATTCCAAATAATGATGGCGCACAAGCTCGCGTCCGATTGATTCACCAAAGCGGTGCATATTATCACTTTGATGGTAGCGGTCAATTTACTGCATCTGGTAATGGCAACTTCAACGATGTTTATATTCGTTCTGATGAAAGACTGAAAAGTAATCTCAGTAAGATCGAATCTGCATTGGATAAAGTCGATTTGTTGGAAGGTGTGATCTATGATAAAGCGGATCATGTCGGCGGTGAACCTACGTCCCGCGAAGCTGGTTTGATTGCACAACAACT
GCGCGAAGTATTACCGGAAGCGGTTAAAACTGGTGAAGACACCGAAAGGAATGAAATCTTAACGGTTTCTCCGACTGCGGTTATTGCCTTGCTTGTCAACGCAATTAAAGAACTCCGCGAGGAAATCAGGGAACTTAAATCCCGCTAA
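
The CDS location, the CDS sequence and the protein length can be checked against each other. The sketch below assumes inclusive genome coordinates and the standard codon assignments; the helper names are illustrative. The range 167611 to 169758 spans 2148 nt, which matches 715 codons plus one stop codon. The first seven codons translate to MADLKLG and the last four to KSR followed by a stop, in agreement with the protein sequence above. To translate the full CDS, join the two displayed sequence lines into one string first.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of an inclusive coordinate range."""
    return end - start + 1


def expected_cds_length(n_residues: int) -> int:
    """Codons for every residue plus one stop codon."""
    return 3 * (n_residues + 1)


if __name__ == "__main__":
    print(cds_length(167611, 169758), expected_cds_length(715))
    print(translate("ATGGCTGATTTGAAATTAGGC"))  # first seven codons of the CDS
    print(translate("AAATCCCGCTAA"))           # last four codons of the CDS
```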

Genome Context

Tertiary structure

YP_003858558.1
ESMFold structure
Source ESMFold
pLDDT 60.0
Oligomeric state monomer