GenBank accession
QPX48018.1 [GenBank]
Protein name
tail fiber protein host specificity
RBP type
TF | Evidence Phold | Probability 1.00
TF | Evidence RBPdetect2 | Probability 0.63
Protein sequence
MSASVTKAGPYYSSGSISFSSLRTNFKESDSGSISASELRRNTTTTNTNPTVPDATENSSISTASNLAISQFRNSIKYYYITQTGTDVNFDIDAQSWNSNLNKNIRKWMYINGTCGSNSISSTAVDFNATAYNLTVDVSGGIYGAAGSGGTAATISGGSGGTALSVNSSGGNNIVVFVRDSANIYGGGGGGEKGATGATGSPGTCTDSYTASQCGSCPECPSPYVNGSCWGGGECGRRQVCNWWGNCWWEASQWTQYRTCTRTYGVSGGTGGAGGDGGTGRGYNNAGSLAGAAGAAGGSNNGCGSTNGATGETGGSGGDWGSSGVNTANSGNGGPSGRAISGSNYSVSGTINSSTIKGLYLP
Physico-chemical properties
protein length: 362 AA
molecular weight: 36162.30100 Da
isoelectric point: 6.27681
aromaticity: 0.08564
hydropathy: -0.42099
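
The composition-based values in this block can be approximated directly from the protein sequence. The sketch below assumes the common definitions, which this page does not confirm: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative, and the demo string is only the first 20 residues of the sequence listed above.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues; assumed definition."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Demo on the first 20 residues only; pass the full sequence to compare
# against the values reported on this page.
demo = "MSASVTKAGPYYSSGSISFS"
print("length:", len(demo))
print("aromaticity:", round(aromaticity(demo), 5))
print("hydropathy:", round(gravy(demo), 5))
```

Molecular weight and isoelectric point need residue masses and pKa tables, so they are left out of this sketch.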

Domains

Domains [InterPro]
DC_1762 | ATT | 56–212
IPR007932 | RBD | 83–293
QPX48018.1 | 1–362
Architecture
ATT 56-212 | RBD 213-361 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
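
The architecture line can be read as a list of labelled, 1-based inclusive residue ranges. A minimal sketch for parsing it and listing the residues left unmapped follows, assuming the protein length of 362 AA given on this page; the helper names `parse_architecture` and `unmapped_ranges` are hypothetical.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 56-212 | RBD 213-361 |' into (label, start, end) tuples."""
    pattern = r"([A-Z]+)\s+(\d+)\s*[-–]\s*(\d+)"
    return [(label, int(start), int(end))
            for label, start, end in re.findall(pattern, text)]


def unmapped_ranges(segments, protein_length):
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= protein_length:
        gaps.append((position, protein_length))
    return gaps


segments = parse_architecture("ATT 56-212 | RBD 213-361 |")
for label, start, end in segments:
    print(label, start, end, "length", end - start + 1)
print("unmapped:", unmapped_ranges(segments, 362))
```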

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Synechococcus phage S-SRM01 [NCBI] | 2781608 | Uroviricota > Caudoviricetes > Pantevenvirales > Serangoonvirus > Serangoonvirus essarone
Host: Synechococcus sp. [NCBI] | 1131 | cellular organisms > Bacteria > Bacillati > Cyanobacteriota/Melainabacteria group > Cyanobacteriota > Cyanophyceae

Coding sequence (CDS)

GenBank protein accession
QPX48018.1 [NCBI]
GenBank nucleotide accession
MW015081.1 [NCBI]
CDS location
range 43173 -> 44261
strand +
CDS
ATGTCAGCATCTGTAACAAAAGCAGGACCATATTACTCTTCTGGGTCTATTTCTTTTAGTTCCTTACGAACTAACTTTAAGGAATCTGATTCTGGGTCTATTAGTGCTTCGGAGTTGAGAAGAAATACAACAACTACAAATACAAATCCAACTGTTCCAGATGCAACAGAAAATAGCAGTATTTCTACTGCCTCTAATTTAGCAATATCTCAATTTAGAAATTCAATCAAGTATTATTATATCACACAAACTGGAACTGATGTTAATTTTGATATTGATGCTCAATCTTGGAACAGTAATTTAAACAAAAACATCCGAAAGTGGATGTATATAAATGGAACTTGTGGATCTAACTCAATATCTTCAACTGCCGTAGATTTTAACGCAACTGCATATAACTTAACTGTTGATGTTTCTGGTGGCATTTATGGTGCTGCTGGTTCTGGAGGAACTGCTGCCACAATTAGTGGTGGTAGTGGAGGAACTGCTCTCTCTGTAAATTCTTCTGGGGGAAATAATATTGTCGTATTTGTAAGAGACAGTGCAAATATCTATGGTGGTGGTGGAGGAGGAGAAAAAGGTGCTACAGGTGCTACTGGAAGTCCAGGAACTTGCACAGATTCTTATACTGCATCACAGTGTGGTAGTTGTCCGGAGTGTCCATCACCTTATGTAAATGGTTCTTGCTGGGGTGGTGGAGAATGTGGAAGAAGACAAGTTTGTAACTGGTGGGGAAACTGTTGGTGGGAAGCAAGTCAATGGACTCAATATAGAACTTGCACCAGAACTTATGGAGTTTCTGGAGGAACTGGTGGTGCTGGTGGTGATGGAGGCACTGGTAGAGGATACAATAATGCAGGGTCTCTAGCAGGTGCAGCAGGGGCAGCAGGAGGTTCTAACAATGGTTGTGGGTCTACGAATGGTGCTACTGGAGAGACTGGTGGAAGCGGTGGAGATTGGGGTTCTAGTGGTGTAAATACTGCAAACAGTGGAAATGGAGGCCCTTCTGGAAGAGCAATTTCAGGGTCTAACTATTCTGTTTCTGGAACAATTAACTCATCAACAATTAAGGGATTATACCTACCATAA
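
The CDS coordinates and the protein length can be cross-checked: the range 43173 to 44261 spans 1089 nt, which is 363 codons including the stop codon, consistent with the 362 AA reported above. The sketch below does that arithmetic and translates a CDS with the standard codon assignments (the amino acid assignments of the bacterial table 11 are the same). The function names are illustrative, and the demo input is only the first 30 nt of the CDS plus its stop codon.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def protein_length_from_range(start: int, end: int) -> int:
    """Protein length for a 1-based inclusive CDS range that ends in a stop codon."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_nt // 3 - 1  # subtract the stop codon


def translate(cds: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(protein_length_from_range(43173, 44261))
print(translate("ATGTCAGCATCTGTAACAAAAGCAGGACCATAA"))
```

Because the record is on the + strand, no reverse complement is needed before translating.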

Genome Context

Tertiary structure

PDB ID
26cdf5fac4bcff52e63094d4f44b37f6c9b6f79161c33dff8021a80abe81d1bd
Source ESMFold
Method ESMFold
Resolution 0.7051
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
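
The confidence bands in the legend can be applied to per-residue pLDDT values with a small helper. This is a sketch with a hypothetical function name; the legend uses strict inequalities, so assigning the exact boundary values 90, 70 and 50 to the lower band is an assumption made here.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"  # includes the boundary value 50 (assumption)


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```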