GenBank accession
URC10612.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF (Evidence: GenBank; Probability: 1.00)
TF (Evidence: RBPdetect; Probability: 0.75)
Protein sequence
MAENKIYAVLTDRGAQLEAAALASGVPVVLNKFVIGDANGNDDVTPDPARTALIHETYRGDIKSAENSGNQVIFTLYVPPDTGGYTIREVGILTDKGELYSVARSPDILKPTDSNGALISITYKYTLAVSSTSTVNVVVYDNYVTPEEADKKYLQISKNLSEIPANGETAQQAARKNIGIDGDVAYRDKGNAFTHPNTFKDDVIFEDVLSSKKQININKPGTDLTLFSFAGLSDVSGDYLDFSTRGENNQKVFYRLRGVHYDGLVLLCGGQTYKYWNEANLKPVKTINGESPDENGNIDISLSPDPSPEIGVVGAVGTYALVAINHAPQILPGSIWAGGDLYYASINNTGSYNTDIATKPFYDTPLIGSWRAMAYSPGGVGDIGTACFLAIRIA
Physico-chemical properties
protein length: 394 AA
molecular weight: 42363.79310 Da
isoelectric point: 4.75887
aromaticity: 0.09137
hydropathy: -0.21091
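
The aromaticity and hydropathy values above can be recomputed from the protein sequence. The following is a minimal sketch, assuming that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions were used, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 20 residues of the protein sequence in this record.
    fragment = "MAENKIYAVLTDRGAQLEAA"
    print(len(fragment), aromaticity(fragment), round(gravy(fragment), 5))
```

Passing the full 394-residue sequence instead of the fragment gives values that can be compared with the ones listed in the record.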

Domains [InterPro]

DC_1371: STR, 1–249
IPR051934: Unmapped, 3–145
IPR022225: ATT, 4–137
URC10612.1: 1–394
Architecture
ATT 1-137 | STR 138-249 | RBD 299-394
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
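
The architecture line can be read by a program to recover the segment boundaries and to find residues that no segment covers. The following is a minimal sketch, assuming the `LABEL start-end | LABEL start-end` layout shown above. The helper names `parse_architecture` and `unannotated_gaps` are hypothetical.

```python
import re


def parse_architecture(text):
    """Parse 'ATT 1-137 | STR 138-249' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between the coordinates.
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def unannotated_gaps(segments, length):
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    cursor = 1  # first residue not yet covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


if __name__ == "__main__":
    segs = parse_architecture("ATT 1-137 | STR 138-249 | RBD 299-394")
    print(segs)
    print(unannotated_gaps(segs, 394))
```

For this record, the only uncovered stretch is residues 250 to 298, between the STR and RBD segments.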

Taxonomy

Phage
  Name: Escherichia phage vB_EcoM-705R4 [NCBI]
  Taxonomy ID: 2946092
  Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
  Name: Escherichia coli [NCBI]
  Taxonomy ID: 562
  Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

GenBank protein accession
URC10612.1 [NCBI]
GenBank nucleotide accession
ON470624.1 [NCBI]
CDS location
range 8044 -> 9228
strand -
CDS
ATGGCTGAAAATAAAATTTACGCAGTACTGACAGACCGTGGCGCACAACTTGAAGCCGCTGCGCTGGCATCGGGTGTACCCGTGGTGCTGAATAAATTCGTTATTGGCGACGCAAACGGAAACGACGACGTAACACCGGACCCGGCACGAACTGCATTAATTCACGAGACGTATCGTGGGGATATTAAATCCGCAGAAAATAGCGGTAACCAGGTTATCTTTACACTTTACGTACCGCCAGACACTGGGGGCTATACTATCCGTGAGGTGGGGATATTAACAGATAAAGGCGAGCTGTACTCTGTGGCGCGCTCGCCGGACATTTTAAAACCTACGGACAGCAACGGTGCGCTGATTTCAATCACGTATAAATACACACTCGCGGTGTCCAGCACATCTACGGTTAATGTTGTAGTATATGACAACTACGTAACTCCCGAGGAAGCTGACAAAAAATATTTGCAGATAAGTAAAAACTTATCTGAAATTCCCGCAAATGGTGAAACTGCTCAGCAGGCTGCGCGGAAAAATATTGGTATAGATGGGGACGTTGCATATAGAGATAAGGGTAATGCATTTACTCATCCAAATACTTTTAAAGATGATGTGATTTTTGAAGACGTTTTATCCTCTAAAAAACAAATAAATATAAACAAACCGGGCACTGATTTGACGTTGTTTTCATTTGCTGGTTTATCAGATGTGTCGGGCGATTATTTAGATTTTTCGACCCGTGGAGAAAACAATCAGAAGGTTTTTTATAGATTACGCGGAGTTCACTATGATGGCCTGGTGTTACTGTGCGGCGGGCAGACGTATAAATACTGGAATGAGGCTAATCTTAAACCGGTTAAAACGATTAACGGGGAGTCGCCGGATGAAAATGGCAACATAGATATAAGTCTTTCACCGGACCCATCACCGGAAATAGGTGTTGTTGGCGCGGTTGGTACTTATGCGCTTGTCGCTATTAATCATGCACCACAAATACTCCCGGGCTCCATATGGGCTGGAGGGGATTTATATTACGCTAGTATAAATAATACCGGTTCATATAACACGGATATTGCCACAAAACCATTTTATGACACACCACTTATTGGCTCATGGCGAGCCATGGCGTATTCACCCGGTGGAGTCGGTGATATTGGGACTGCCTGTTTTTTAGCTATTAGGATTGCGTAA
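
The CDS coordinates and the protein length can be cross-checked with a little arithmetic and a translation step. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the standard codon assignments apply. The listed CDS starts with ATG and ends with TAA, so it appears to be given in coding orientation already. `reverse_complement` is included only for working from the minus-strand nucleotide record. All function names are illustrative.

```python
from itertools import product

# Standard codon assignments, built in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(8044, 9228)
    print(n, n // 3 - 1)  # nucleotides, codons excluding the stop
    print(translate("ATGGCTGAAAATAAAATTTACGCAGTA"))  # first 9 codons of the CDS
```

The range 8044 to 9228 spans 1185 nucleotides, that is 395 codons. This matches the 394-residue protein plus one stop codon.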


Tertiary structure

PDB ID
5f8f0262834d1419ca12e33a6c0d84bb3b5bebc0d3bd7a7bed97985de18fe8b8
Source: ESMFold
Method: ESMFold
Resolution: 0.7808
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
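
The confidence bands above map directly onto a small lookup function. The following is a minimal sketch, assuming pLDDT scores on a 0 to 100 scale. The legend uses strict inequalities and does not say where the exact values 90, 70 and 50 belong, so assigning them to the lower band is an assumption. The name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its confidence band."""
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```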