UniProt accession
A0A2I6PF78 [UniProt]
Protein name
Long tail fiber proximal connector
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1.00
Protein sequence
MVEFGQDYVGAQTFAENNALTYKLTIKASNKNTQGESCVLINDNQITTECDTGINVWVINNIGKLDNTMCFDVSTESGITAFITFLKEQTSGIICLASSNELDSTQVLADYMKSIGSASWNNFMISKIKTVSYAAVYRPELKSIVLESIQYSDGIKEDELLELETIFDSNNSIGITGFPGSIVYDYKEYTSIEQDYKKWPTNLLNNKLSDYGLKPGDWVSLSASIFGDKELKDDGGWTRIDCRWVFGNAWKQSFYLESTLKGNYPVQSMVNPDIWESKTVYSQIPEGVDGFVIIASRYNSDLGHSAVKNVAFGRAAEPIIEKSDRQIGINGIRNSFVKEEEQKVGSLLSLLNLKDKSDTVSSINFKEI
Physico-chemical properties
protein length: 368 AA
molecular weight: 41033.58950 Da
isoelectric point: 4.67276
aromaticity: 0.10326
hydropathy: -0.27500
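
The sequence-derived values above can be recomputed from the protein sequence. The record does not say which definitions or tools produced them, so the sketch below rests on assumptions: aromaticity is taken as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are illustrative. The listed aromaticity of 0.10326 equals 38/368, which is at least consistent with a simple residue-count definition.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence listed in this record.
    fragment = "MVEFGQDYVG"
    print(len(fragment), aromaticity(fragment), gravy(fragment))
```

Running both functions on the full 368-residue sequence should reproduce the listed aromaticity and hydropathy if the database used the same definitions. Molecular weight and isoelectric point need residue masses and pKa tables and are left out here.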

Domains [InterPro]
DC_0912 | STR | 1–368
PS52031 | LEC | 18–182
Architecture
STR 1–368
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
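
Domain coordinates such as 18–182 are presumably 1-based and inclusive, as is usual for protein annotations; the record does not state the convention, so treat this as an assumption. Under it, the LEC domain spans 182 - 18 + 1 = 165 residues. A minimal sketch for extracting a domain from a sequence string, with `domain_slice` as a hypothetical helper name:

```python
def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


if __name__ == "__main__":
    toy = "ABCDEFGHIJ"  # toy sequence, not the real protein
    print(domain_slice(toy, 2, 4))  # BCD
```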

Taxonomy

  | Name | Taxonomy ID | Lineage
Phage | Proteus phage phiP4-3 [NCBI] | 2065203 | Uroviricota > Caudoviricetes > Pantevenvirales > Bragavirus > Bragavirus p43
Host | Proteus penneri [NCBI] | 102862 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Genbank protein accession
AUM58376.1 [NCBI]
Genbank nucleotide accession
MG696114 [NCBI]
CDS location
range 9985 -> 11091
strand -
CDS
ATGGTTGAATTCGGCCAAGATTATGTAGGTGCTCAAACGTTTGCAGAGAACAATGCACTTACATATAAGTTAACTATTAAGGCCAGTAATAAAAACACACAGGGTGAATCCTGTGTGTTAATTAACGATAATCAAATAACAACTGAATGCGATACTGGTATTAATGTTTGGGTCATTAATAATATAGGTAAACTTGATAATACTATGTGTTTTGATGTATCAACAGAATCTGGTATAACAGCATTTATAACATTTTTGAAAGAACAAACTTCTGGAATAATTTGTTTAGCTTCATCTAACGAATTAGATTCAACTCAAGTCTTAGCTGATTATATGAAATCTATAGGTTCTGCTTCATGGAATAATTTTATGATTTCTAAAATCAAAACAGTGTCTTATGCAGCAGTCTATCGCCCAGAATTAAAATCAATTGTTCTGGAATCTATACAATATTCTGACGGAATTAAAGAAGATGAATTGCTTGAACTAGAAACCATTTTTGATTCTAATAATTCTATAGGAATAACAGGATTTCCTGGTTCAATCGTATATGATTATAAAGAATACACGTCAATTGAACAAGATTATAAAAAATGGCCTACTAATCTTTTAAATAACAAGCTTTCTGATTATGGTCTTAAACCTGGCGATTGGGTTTCTTTGAGTGCTTCTATATTTGGAGATAAAGAATTAAAAGATGACGGTGGATGGACTAGAATAGATTGTAGATGGGTATTTGGTAATGCATGGAAACAATCATTTTATTTAGAAAGTACATTGAAAGGGAATTATCCTGTTCAGTCTATGGTAAATCCAGATATCTGGGAATCTAAAACAGTTTATTCTCAAATACCAGAAGGTGTAGATGGATTTGTTATAATAGCTTCAAGATATAATTCAGACTTAGGCCATTCTGCTGTAAAAAATGTTGCATTCGGAAGAGCTGCAGAACCTATAATAGAAAAATCTGATAGACAAATAGGTATAAATGGTATTAGAAATTCTTTTGTAAAAGAGGAAGAACAAAAAGTAGGATCGTTATTAAGTTTGTTAAATCTTAAAGACAAATCCGACACAGTTTCTTCTATAAACTTTAAAGAAATCTAA
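
The CDS coordinates can be cross-checked against the protein length. Assuming the range is 1-based and inclusive (the record does not state the convention), 9985 to 11091 covers 1107 nt, i.e. 369 codons, which matches 368 residues plus one stop codon. The listed CDS starts with ATG and ends with TAA, so it appears to be given in coding orientation already. Reverse-complementing would only be needed when cutting the minus-strand range out of the genome record. The function names below are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, optionally discounting the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - (1 if has_stop else 0)


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


if __name__ == "__main__":
    print(cds_length(9985, 11091))        # 1107
    print(encoded_residues(9985, 11091))  # 368
```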

Tertiary structure

PDB ID
2f65304fc5833adce586c12b0ec9f6f2745db966199436f1762c2f6c018158f7
Source ESMFold
Method ESMFold
Resolution 0.6361
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
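
The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. The legend uses strict inequalities and so leaves the values exactly 90, 70 and 50 unassigned; the sketch below assumes they fall into the lower band. `plddt_class` is a hypothetical helper name.

```python
def plddt_class(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) drop into the lower band.
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 83.0, 61.5, 42.0):
        print(score, plddt_class(score))
```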