GenBank accession
WNO23963.1 [GenBank]
Protein name
long-tail fiber proximal subunit
RBP type
TF | Evidence: UniProt/TrEMBL | Probability: 1.00
TF | Evidence: RBPdetect | Probability: 0.61
TF | Evidence: RBPdetect2 | Probability: 0.94
Protein sequence
MADILKPAFRATSGLDAAGEKVINVAKADYNVLDDGVNVEFFIDENTIQAYDETRGYKKGFAVIHDQRIWVAQRDIAAPAGTFVPGYWTATRTDPKWITVASPTRQLASGEYIAVDSAASFTTFTLPPNPTDGDTIVIKDIGGRVGYNEIKLQSSSAPGGGNQKIVRFGNQFTETLITKPFSYNMIIFANRLWHFWEAANEERGIRVEPNTAQFQSQAGDNVLRRYTSGAVIKFTLPKYANQGDMIKTVDIDGLGSKFHLIVETFDASSSLGKLGQHSMEFRTSGDGFFVYNSTEKLWYVWDGDRQTRLRVIRDDVELLANESVIVFGPNNTTPQTINITLPTGVAQGDVVKIALNYLRKAQTVNIKAAVGDKIASSVQLLQFPKRSEYPPDTEWVLNDVLTFNGNLSYTPVIELSYIEDSTTGGKYWVVAQNVPTVERVDSKDDLTRARLGVIALASQTQANVDHENNPEKELAITPQTLANRVATESRRGIARIATTAQVNQNTTFAFQDDLIISPKKLNERTATETRRGVAEIATQVETDAGVDDTTIITPKKLQTRQGTESLSGIVKYVSTTGTTPADSRATVGTNVYNKNTTTLVISPKALDQYKADQNNQGAVYLATQAEVNAGATNPGFSNSVVTPETLGARRSTDTNHGLIEIATQAETDTGTDYTRAVTPKTLNDRNATQTLTGIARIGTQVEFDAGVLDNVISTPLKVKTRFNDTARTSVSAASGLIESGTLWNHYTLDIREASNTQRGTARLATQTEVNTGTDDKTIITPLKLQAKKATENAEGIIQLATQAEVIAGTISNKAFSPKHYKYVVQQEKSWEATSARRGYVKLTTGTATWEGDDTNGSVANLAKFEDDGFAISPLQMNTALTHYLPIKGKAFDSDKLDNLDSSQFVRRDIDQIVEGTLTLRKNIRVDGQLATGGTGEFGGSLAANSTFTLRNTGTATRIIFEKGPQTGTNPVQSMSIRVWGNQYGGGSDTSRSTVFEVGDETSNHFYSQRNKDGNIAFSINGTVMPINVNASGLMNVNGVATFGRSVTAQGEFITYSANAFRAISGQYGFFIRNDNSNIYFMLTNANDQTGGFNGLRPLAISNTSGQVTIGESLIIAKGATINLGGLTVNSRIRSQGTKPADLYSRKPNADNTGFWSVDVNDSATYNQFPGYFEMVEKVNEVTGLPYLERGAEIKSPGTLTQFGNTLNSLYQDWITYPANANARTTRWTRTWQQSKNAWSGFVQVFDGGNPPQPSDIGALPADNASMSNLTIRDWLRIGNVRIVPDPVTRSVKFEWIDTP
Physico-chemical properties
protein length: 1299 AA
molecular weight: 141651.99420 Da
isoelectric point: 5.78635
aromaticity: 0.08391
hydropathy: -0.38722
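The record does not say how aromaticity and hydropathy are computed. A minimal sketch, assuming aromaticity is the fraction of F, W, and Y residues and hydropathy is the mean Kyte-Doolittle value (GRAVY); the function names `aromaticity` and `gravy` are hypothetical, and the demo input is only the first 20 residues of the sequence above, so its values are not expected to match the full-length figures:

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only as a small demo input.
fragment = "MADILKPAFRATSGLDAAGE"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Running the same two functions on the full 1299-residue sequence is the way to check whether these assumed definitions reproduce the values listed in the record.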

Domains [InterPro]
DC_1986 | ATT | 11–126
IPR048390 | ATT | 987–1100
WNO23963.1 | 1–1299
Architecture
ATT 11-126 | STR 349-986 | ATT 987-1100 | STR 1101-1146 | ATT 1147-1245 | STR 1246-1279
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
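The architecture line can be read as a list of labelled, 1-based inclusive residue ranges. A minimal sketch of parsing it and listing the residues not covered by any segment, assuming the 1299 AA length given in the record; `parse_architecture` and `unmapped_regions` are hypothetical helper names, not part of any database tooling:

```python
import re


def parse_architecture(text):
    """Return (label, start, end) tuples from 'ATT 11-126 | STR 349-986 | ...'."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between start and end.
        m = re.fullmatch(r"\s*([A-Z]{3})\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if m:  # empty pieces (e.g. after a trailing '|') are skipped
            segments.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return segments


def unmapped_regions(segments, length):
    """Return inclusive (start, end) ranges not covered by any segment."""
    gaps, pos = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= length:
        gaps.append((pos, length))
    return gaps


line = ("ATT 11-126 | STR 349-986 | ATT 987-1100 | "
        "STR 1101-1146 | ATT 1147-1245 | STR 1246-1279 |")
segments = parse_architecture(line)
print(unmapped_regions(segments, 1299))
```

The gaps it reports correspond to the "Unmapped" entry in the legend, under the assumption that every residue outside a listed segment is unmapped.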

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Escherichia phage fDHCT2 [NCBI] | 3075956 | Uroviricota > Caudoviricetes >
Host: Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

No CDS data available.

Genome Context

Tertiary structure

PDB ID: f4ff13c1eda152e8a0a1aeb0825155aafd3a026579fcafb5734e26b6d71dc5ba
Source: ESMFold
Method: ESMFold
Resolution: 0.5657
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
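The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. A minimal sketch; `plddt_band` is a hypothetical helper, and because the legend uses strict inequalities, placing the exact boundary values (90, 70, 50) in the lower band is an assumption made here:

```python
def plddt_band(score):
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: exact boundary values (90, 70, 50) fall into the lower band,
    since the legend leaves them unassigned.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Illustrative scores only, not values taken from this model.
for s in (95.2, 81.0, 63.5, 42.1):
    print(s, plddt_band(s))
```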