GenBank accession
WDQ26480.1 [GenBank]
Protein name
short tail fiber protein
RBP type
Type Evidence Probability
TF GenBank 1.00
TSP DepoScope 1.00
TF RBPdetect 0.89
TF RBPdetect2 0.99
Protein sequence
MSNNTINHVSDKSIYVTFDPTGTDWPDTITNVQDALEKIGSWARTDTGLPIATTSVRGIAQIATEADIDAGTDNTKIVTPKLLAYRMQNPKASQTVWGYTKYSTDAESTTVTNDASSITPRSLNYVFNNRKGTESVWGSSKIATTAQAVAGTDNTVTMTPLKVKQAIASLVPVQSSATESSQGLVQLATVAQVQAGTIREGYAISPYTFIRLTATESNLGVIRIASQAEANAGTDDTKAITAKKLINTRATGSQFGVVKLATTVGYVANTALSSNANVLPSDRSAVINGSLYENSAIHNNKYQTWTDLDWHFPVGAIVMTGFQTDHGSLYICDGRSLNKNNYPLLFERIGYTFGGGGDWFNIPDCRGVAVRGHDRGRGLNPNRGYGTYEGDMLGWHEHPLQLIYQNGGNIPKWQAVYELKSAEKNDQSARVFDSSITKATGVGGEETRMKNIALNYVIRVL
Physico-chemical properties
protein length: 461 AA
molecular weight: 49830.98930 Da
isoelectric point: 7.05965
aromaticity: 0.08026
hydropathy: -0.32820
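
As a rough illustration of how two of these values could be reproduced from the protein sequence, the sketch below assumes that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). Both definitions are assumptions, since the page does not state which method was used. The function names are hypothetical, and molecular weight and isoelectric point are left out.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    counts = Counter(seq)
    return (counts["F"] + counts["W"] + counts["Y"]) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MSNNTINHVS"  # first ten residues of the protein sequence above
    print(len(toy), aromaticity(toy), round(gravy(toy), 5))
```

Passing the full 461-residue sequence instead of the toy fragment gives values that can be compared against the table above.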

Domains

WDQ26480.1 (1–461 aa)
Domain Start End
STR (Structural Domain) 242 300
ATT (Attachment Domain) 301 368
STR (Structural Domain) 369 460

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

WDQ26480.1 (1–461 aa)
Domain Start End Length (AA) Confidence
N-terminal 1 275 275 0.8761
Central domain 276 450 176 0.0259
C-terminal 451 461 10 0.9965
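
The segment boundaries can be used programmatically, for example to find which segment a residue falls in or to check that the segments cover the whole protein without gaps or overlaps. The sketch below assumes 1-based, inclusive coordinates; `SEGMENTS`, `segment_at` and `tiles_protein` are hypothetical names introduced for illustration.

```python
# Segment boundaries as listed in the table (assumed 1-based, inclusive).
SEGMENTS = [
    ("N-terminal", 1, 275),
    ("Central domain", 276, 450),
    ("C-terminal", 451, 461),
]


def segment_at(position, segments=SEGMENTS):
    """Return the name of the segment containing a residue position, or None."""
    for name, start, end in segments:
        if start <= position <= end:
            return name
    return None


def tiles_protein(segments, protein_length):
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    print(segment_at(300))
    print(tiles_protein(SEGMENTS, 461))
```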


Taxonomy

Coding sequence (CDS)

GenBank protein accession
WDQ26480.1 [NCBI]
GenBank nucleotide accession
OQ267591.1 [NCBI]
CDS location
range 124323 -> 125708
strand +
CDS
ATGAGTAACAATACAATCAACCACGTAAGTGATAAATCCATTTACGTGACATTTGACCCAACAGGTACTGATTGGCCTGATACCATAACCAACGTACAAGATGCGTTGGAAAAAATAGGTAGTTGGGCGCGTACTGATACTGGGCTTCCTATCGCAACAACTTCTGTTCGTGGTATTGCTCAGATCGCAACCGAAGCTGATATTGACGCTGGCACGGATAACACTAAGATTGTTACTCCGAAACTGTTAGCATACCGTATGCAGAACCCTAAAGCATCGCAAACCGTATGGGGTTATACGAAGTATTCGACTGATGCGGAATCTACAACCGTAACTAACGATGCATCGTCTATTACTCCGCGATCGTTGAACTATGTGTTCAATAACCGCAAAGGTACAGAATCGGTTTGGGGTTCTTCTAAAATCGCTACCACTGCACAGGCGGTTGCTGGTACAGATAACACTGTAACTATGACTCCGCTTAAAGTCAAGCAAGCGATCGCGTCTCTGGTTCCGGTTCAGTCAAGTGCGACCGAAAGCTCGCAAGGTCTGGTACAACTGGCAACAGTTGCACAGGTTCAGGCTGGTACGATCCGTGAAGGGTATGCAATTTCACCTTATACGTTTATTCGTTTAACTGCAACTGAAAGCAACCTGGGCGTTATTCGCATCGCATCACAGGCAGAAGCAAACGCAGGTACTGATGACACCAAAGCGATTACTGCGAAAAAATTAATCAATACCCGTGCAACTGGATCCCAGTTCGGTGTTGTCAAATTAGCAACAACTGTTGGTTATGTGGCAAACACTGCACTTTCTTCTAATGCTAATGTATTGCCTAGCGATCGTAGTGCGGTAATTAATGGTTCTCTTTATGAGAATAGCGCAATACATAACAACAAATATCAGACGTGGACAGATCTTGATTGGCATTTCCCAGTAGGTGCTATTGTCATGACTGGTTTCCAGACTGACCACGGTAGTTTGTATATTTGTGATGGACGTTCACTGAATAAAAATAATTACCCGTTACTGTTTGAGCGTATAGGTTATACATTTGGTGGTGGCGGTGATTGGTTTAACATTCCAGACTGTCGAGGCGTTGCAGTACGTGGTCATGACCGTGGGCGTGGACTAAACCCTAATCGTGGGTATGGTACATATGAAGGCGATATGTTGGGCTGGCACGAACACCCATTACAGCTTATCTACCAGAACGGCGGTAATATTCCGAAATGGCAAGCAGTTTATGAACTTAAAAGCGCCGAGAAAAACGATCAGAGTGCTCGCGTGTTTGATTCTTCCATAACTAAAGCTACTGGTGTGGGCGGAGAAGAAACCCGCATGAAAAACATCGCATTAAACTACGTAATTCGCGTATTATAA
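
The CDS coordinates and the protein length can be cross-checked with simple arithmetic: an inclusive range 124323 to 125708 spans 1386 nt, i.e. 462 codons, which is 461 residues plus one stop codon. The sketch below shows that check and a minimal translation using the standard codon table. The helper names are hypothetical, and the choice of the standard table is an assumption (the bacterial table 11 differs only in its alternative start codons).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n_nt = cds_length(124323, 125708)
    print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon
    # First ten codons of the CDS above, with a stop codon appended for illustration.
    print(translate("ATGAGTAACAATACAATCAACCACGTAAGT" + "TAA"))
```

Translating the full CDS above this way should reproduce the protein sequence listed earlier in the record.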

Genome Context

Tertiary structure

WDQ26480.1
ESMFold structure
Source ESMFold
pLDDT 72.5
Oligomeric state monomer