UniProt accession
A0AAU7GWC0 [UniProt]
Protein name
Tail fiber assembly
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
Protein sequence
MARCDCAGLGVEGCMCALQAGDNVTVTGTGQALDPWVVSGIPPAPYTEGQAIDIVANAISVDVSSDAGNQLVLGGDGGLFVPAPAASGGALGMVVYLTVAGGTWDKATYPDANWLRVRVIGGGGGGAGATSAASQAIARGGGAGGNYGESWIDVSTLGASTTVTVGAGGSAGAAANGAGGTGGTSSFGTAVIALGGGGAPNTAVSAAAGVSPSGDPQGLGTHQVYTLGERGQRGIWHSATVGMAGGGGSGGCGFGTGGRGGISGGAANGQTGSGPGGGGGGGHSQNAAVAAGGAGGPGAVIVEMFT
Physico‐chemical
properties
protein length:306 AA
molecular weight: 28036,43440 Da
isoelectric point:4,52941
aromaticity:0,04575
hydropathy:0,21111

Domains

Domains [InterPro]
DC_1198
ATT
1–100
IPR049304
STR
102–303
A0AAU7GWC0
1 306
Architecture
ATT
STR
ATT 1-100 | STR 101-303 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage Geonosis
[NCBI]
3158856 Uroviricota > Caudoviricetes >
Host Streptomyces griseus
[NCBI]
1911 cellular organisms > Bacteria > Bacillati > Actinomycetota > Actinomycetes > Kitasatosporales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XBM94990.1 [NCBI]
Genbank nucleotide accession
PP750866 [NCBI]
CDS location
range 24234 -> 25154
strand +
CDS
ATGGCACGCTGCGACTGCGCAGGGCTCGGCGTCGAAGGGTGCATGTGCGCGCTTCAGGCGGGCGACAACGTCACGGTGACCGGGACCGGACAGGCACTGGACCCGTGGGTGGTGAGCGGCATCCCGCCCGCGCCGTACACCGAGGGCCAGGCCATCGACATCGTGGCCAACGCGATCAGCGTGGACGTGTCCTCGGACGCCGGCAATCAGCTGGTGCTGGGCGGTGACGGTGGGTTGTTCGTTCCGGCCCCTGCCGCGTCCGGCGGCGCGCTCGGCATGGTGGTGTACCTGACGGTTGCGGGCGGTACGTGGGACAAGGCGACGTACCCGGACGCCAACTGGTTGCGGGTGCGGGTCATCGGCGGTGGCGGCGGTGGCGCGGGTGCGACGTCCGCAGCATCGCAGGCCATCGCGCGCGGTGGTGGCGCGGGCGGGAACTACGGGGAATCCTGGATTGACGTGTCCACCCTCGGGGCGTCCACCACGGTCACAGTCGGCGCGGGCGGCAGCGCGGGTGCGGCGGCCAACGGCGCCGGTGGCACCGGGGGAACGTCCAGCTTCGGCACGGCGGTCATCGCGCTCGGTGGCGGCGGCGCCCCGAACACCGCTGTGTCCGCAGCGGCTGGCGTCAGCCCGAGCGGTGACCCGCAGGGCCTCGGCACCCACCAGGTGTACACCCTCGGGGAACGGGGACAGCGCGGTATCTGGCACTCGGCCACGGTCGGAATGGCTGGCGGTGGCGGGTCCGGTGGCTGCGGTTTCGGGACCGGTGGACGTGGCGGAATCTCTGGCGGTGCGGCCAACGGGCAGACCGGATCCGGTCCCGGCGGCGGTGGCGGTGGCGGCCACTCACAGAATGCGGCTGTTGCCGCAGGGGGTGCCGGTGGCCCCGGTGCGGTCATCGTGGAAATGTTCACCTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
95bbdc9e305c72795756c5d8d4939016059a789b5a8b4f1077ec079a405c9174
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7971
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50