Protein
- UniProt accession
- A0A2U9DT70 [UniProt]
- Protein name
- Tail fiber
- RBP type
- TF
- Protein sequence
-
MKTMANQKDLMKLSVNELIALGSQSGLTFHAGMKKSHMVQQLSASAASGWLDTNAELMGGSFEDDSLITESLGDSSMISDAAHIAQVLSSAGYTEAFHAAMNGPTHHVEAVHAYMERLGVNTDDVWMHMPKPNPNLPQGSFNMLNAYMRDTLQGHQDIMPEIPGHYAGDIMAEYSTNRGNIADSYNYMAHMYVDREQYSNHEQYARDISRVAKRLEQELPAAFKEVGYTSASNVGNKSAPHISYMDALPQIGSHSIVSGIAHPRAPLNASGLPLGSYGSGINPEYSLTASLTGGTGWSEPSKALYSSISEAVKSAAKVYSGSKGMMNTRHEVSDRDMIMDSATRYTDLEEARSGYENLRSELADEPAYSASSVRAILENTYDKMDAEPVRIPKMKGASQRIEKDSQPTSTEFTAQLGEATNWNSARMRREEIAIANMDSAGFRDVSDYGTGRSESPVRFHDLEQGSQEWLDFRKNYDITGSTVGTFLGNNPYTRPWAGMIDKIGLSRSKEHSDFTKKMFADGHRTEEEARARVGKEFGFDIKQTGAITNDNYPAFMYSPDGLIGDDAIWEHKNPQRAGKFADLLKGDHPDYMDQIQFGMHVSGRSRALFSQTINDETRSQWIEKDEGWYERNRNRLDSVLGRLDAGREFVRNNAGLDREELIAGARSAMTGDGIWKDIRQRSTRGYSAVAGTPEDPFIGSRSSYNPTASYSDYQPNFVMHEQNFPATTGNGDTGNDSMALSVKKGILAAQEENRQKGIGADADFNGKADSMGWDRERFDAANGGGYSGGGGGRGGNFTSGGNYYDDYGRMGGSLAAGIAGGSIGSATNGVMQALMATPAGRMAAVGIGAIQIGNEAAEYMNDFIGNSLDAGVINPNEYSSMSQGLEMLGLNSQQAARLNQTAHSAYNTMLNGDPSAAVNIVRGSRGLLTIGDIRATGGDPVALARIMQERGKERGWSQARIAGAAEMAGLNGMARAYDRTEYSHEQAGSVVEAGRNSDFAEGMAQSEMLQGSRARMLPSYNVPQSVLSNGASVFEAGSDALDSVRTGFNNTREFAGSVYDFIAQEESGGREYNKDGSRVTSRTGARGIMQILPSTARDPGFGIRPSDGTPEDDARVGREYYDAMYKRYGDHEKAMAAYTDGPGTVDKAVDKNGLDWLTAVPVQAQNRVKAFREWSKSSQSLEEGATGFTRNGMSYGQTQTVVNVKIDAKVNNQTASATVAIPGGQTVTQQMNMNNGAQQRR
- Physico-chemical properties
- protein length: 1241 AA; molecular weight: 134569.99270 Da; isoelectric point: 5.35562; aromaticity: 0.07816; hydropathy: -0.61821
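The page does not say how these values were computed. As a rough sketch, assuming aromaticity is the relative frequency of F, W and Y and hydropathy is the grand average of Kyte-Doolittle values (GRAVY), both can be derived from a sequence as shown here. The function names are illustrative, and the short fragment is just the first ten residues of the protein sequence above.

```python
# Kyte-Doolittle hydropathy scale (an assumption: the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


fragment = "MKTMANQKDL"  # first ten residues of the sequence above
print(len(fragment), aromaticity(fragment), round(gravy(fragment), 3))
```

Running the same two functions on the full 1241-residue sequence should reproduce the reported values only if the page uses these definitions.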
Domains
Domains [InterPro]
DC_0169: ATT, 4–423
Coil: Unmapped, 345–365
IPR011335: Unmapped, 458–637
IPR019080: STR, 468–604
Sequence axis: 1–1241
Architecture
ATT 4-423 | STR 450-1055 | STR 1057-1152 | RBD 1153-1241
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
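The architecture string above can be read mechanically. The following is a minimal sketch, assuming each segment has the form `LABEL start-end` with 1-based inclusive coordinates. The helper names `parse_architecture` and `uncovered` are hypothetical. It lists the segments and the residue ranges that no segment covers.

```python
import re

ARCHITECTURE = "ATT 4-423 | STR 450-1055 | STR 1057-1152 | RBD 1153-1241"
PROTEIN_LENGTH = 1241  # from the physico-chemical properties


def parse_architecture(text):
    """Split 'LABEL start-end | ...' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments, length):
    """Return the 1-based inclusive ranges not covered by any segment."""
    gaps = []
    position = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


segments = parse_architecture(ARCHITECTURE)
gaps = uncovered(segments, PROTEIN_LENGTH)
print(segments)
print(gaps)
```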
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage ST32 [NCBI] | 2005048 | Uroviricota > Caudoviricetes > Chaseviridae > Carltongylesvirus > Carltongylesvirus ST32 |
| Host | Escherichia coli H21 [NCBI] | 3060862 | Pseudomonadota > Gammaproteobacteria > Enterobacterales > Enterobacteriaceae > Escherichia > Escherichia coli |
Coding sequence (CDS)
GenBank protein accession
AWP47871.1
[NCBI]
GenBank nucleotide accession
MF044458
[NCBI]
CDS location
range 38383 -> 42108
strand +
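As a quick consistency check, the CDS range can be compared with the reported protein length of 1241 AA. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon. The function name `cds_length` is illustrative.

```python
CDS_START, CDS_END = 38383, 42108  # from the "CDS location" field


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


nucleotides = cds_length(CDS_START, CDS_END)
codons, remainder = divmod(nucleotides, 3)
residues = codons - 1  # assumption: the last codon is a stop codon

print(nucleotides, codons, remainder, residues)
```

Under these assumptions the range spans 3726 nucleotides, that is 1242 codons with no remainder, which leaves 1241 residues once the stop codon is excluded.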
CDS
ATGAAGACAATGGCTAATCAAAAAGATTTAATGAAGTTGAGCGTTAATGAGTTAATTGCATTAGGCTCTCAAAGCGGTCTGACTTTCCATGCTGGTATGAAGAAGTCGCACATGGTTCAACAACTTAGTGCGAGCGCTGCTTCGGGATGGCTGGATACAAATGCAGAATTAATGGGTGGGTCGTTTGAAGATGACAGTTTAATTACTGAATCTTTGGGCGACTCTTCCATGATTTCAGACGCTGCCCACATTGCACAAGTGCTTTCAAGTGCTGGCTACACAGAAGCATTTCATGCAGCAATGAATGGCCCAACACACCATGTAGAAGCTGTTCACGCATACATGGAAAGGCTTGGCGTCAACACAGATGATGTGTGGATGCACATGCCAAAGCCAAATCCGAACTTACCGCAAGGTAGCTTCAACATGCTTAATGCCTATATGCGAGATACTTTGCAAGGGCATCAAGATATTATGCCAGAAATCCCAGGCCACTATGCTGGGGATATTATGGCTGAATACTCCACCAACCGTGGAAACATTGCCGACTCATACAACTACATGGCTCACATGTATGTTGACCGTGAGCAGTACAGCAATCATGAACAATACGCAAGAGACATTAGTCGAGTTGCCAAACGTCTGGAACAAGAGTTACCCGCAGCCTTTAAGGAAGTTGGGTATACCTCGGCCTCTAATGTAGGAAATAAATCGGCCCCACATATTTCTTACATGGATGCACTTCCTCAGATTGGCTCTCATTCAATTGTTTCTGGCATTGCACATCCTCGTGCGCCTCTTAACGCATCTGGACTTCCTCTTGGTTCATATGGTTCTGGCATAAATCCAGAATACTCCCTCACTGCATCATTAACTGGTGGCACAGGTTGGAGCGAACCAAGCAAGGCTTTATATTCCTCTATTAGTGAAGCTGTTAAATCTGCTGCCAAAGTTTATTCTGGCTCCAAAGGGATGATGAATACTCGTCACGAAGTATCTGACCGTGACATGATTATGGACAGCGCAACTCGCTATACAGATTTGGAAGAAGCCAGAAGCGGGTATGAAAACCTTCGTTCTGAACTTGCAGATGAACCAGCTTATTCTGCATCGAGTGTTCGAGCTATTCTCGAAAACACTTACGACAAAATGGATGCCGAACCTGTTCGCATTCCAAAGATGAAGGGAGCTTCACAGCGAATCGAAAAAGATTCGCAACCAACATCTACAGAATTTACAGCACAACTTGGAGAGGCCACCAATTGGAACTCTGCAAGAATGCGTCGTGAGGAAATTGCGATTGCAAATATGGATAGTGCTGGATTCCGTGATGTATCAGACTATGGCACAGGACGTTCCGAGTCTCCCGTCAGATTCCACGACTTAGAACAGGGTAGCCAAGAGTGGTTAGACTTCCGTAAAAATTATGACATAACTGGCTCAACTGTTGGAACTTTCTTAGGAAACAACCCGTACACCCGACCTTGGGCGGGAATGATTGACAAGATTGGTCTATCTCGTAGTAAAGAGCACAGCGATTTCACCAAGAAGATGTTCGCTGATGGTCACAGGACGGAAGAAGAAGCACGCGCTCGTGTTGGTAAGGAATTTGGATTCGATATCAAACAAACTGGCGCTATAACAAATGACAACTATCCCGCCTTCATGTATTCTCCTGATGGTCTTATTGGCGATGATGCCATTTGGGAGCATAAGAATCCGCAAAGGGCTGGTAAGTTTGCTGACCTTCTTAAAGGCGACCATCCAGATTACATGGACCAAATTCAATTTGGTATGCACGTTTCTGGCCGTTCTCGTGCTTTGTTCTCACAAACCATCAACGACGAAACACGTTCGCAATGGATTGAGAAGGACGAAGGATGGTACGAGAGAAACCGAAATCGCCTTGATTCTGTGCTTGGCCGCCTTGATGCTGGGCGAGAGTTTGTTAGAAACAATGCTGGACTAGACCGAGAAGAACTGATTGCTGGTGCAAGAAG
TGCAATGACTGGTGACGGAATCTGGAAGGATATTCGTCAAAGGTCAACTCGTGGCTATTCTGCCGTTGCTGGAACTCCAGAAGACCCATTCATTGGTTCACGTTCTTCATACAATCCAACGGCATCTTACTCAGACTACCAGCCAAACTTTGTAATGCACGAGCAAAACTTTCCAGCAACCACAGGAAATGGGGATACTGGAAACGACTCGATGGCATTGTCTGTTAAGAAAGGTATCCTTGCTGCTCAGGAAGAAAACAGGCAGAAGGGTATCGGTGCAGATGCAGACTTTAACGGCAAAGCCGATTCAATGGGTTGGGACCGCGAAAGATTTGATGCTGCCAATGGCGGTGGATATTCTGGCGGTGGCGGTGGTCGTGGCGGTAACTTCACAAGTGGTGGCAACTACTATGATGACTACGGTCGCATGGGTGGTTCACTAGCTGCTGGTATTGCTGGTGGCAGTATTGGTTCAGCAACCAACGGAGTTATGCAAGCATTGATGGCAACTCCTGCCGGACGTATGGCTGCTGTGGGCATTGGTGCTATTCAGATTGGCAATGAAGCTGCTGAATACATGAATGACTTTATCGGCAATTCGCTTGATGCTGGTGTGATAAATCCTAATGAATACTCTTCTATGTCGCAAGGCTTGGAGATGTTAGGACTAAACTCCCAACAAGCGGCACGTCTGAATCAAACCGCACATAGTGCCTACAACACCATGCTTAACGGCGACCCCAGCGCCGCTGTGAACATCGTTCGCGGCAGTAGGGGATTGCTCACCATAGGTGATATTCGTGCGACAGGCGGCGACCCTGTGGCCCTGGCTCGCATCATGCAGGAAAGGGGCAAGGAGCGTGGCTGGAGTCAGGCCCGTATCGCAGGTGCTGCCGAAATGGCTGGCCTGAATGGGATGGCGCGAGCCTACGACCGCACTGAATACAGTCATGAGCAAGCTGGTTCGGTGGTAGAAGCGGGTAGAAACTCTGACTTTGCCGAAGGTATGGCCCAATCGGAAATGTTGCAGGGAAGCCGCGCACGGATGCTGCCGAGCTATAACGTGCCACAAAGTGTCTTATCTAACGGTGCTTCCGTATTTGAGGCTGGCAGTGATGCACTTGATAGTGTACGCACTGGATTCAACAATACGCGAGAATTTGCTGGAAGTGTTTACGACTTTATCGCCCAAGAAGAATCTGGCGGTCGTGAATATAACAAGGATGGTTCCCGTGTAACAAGTCGTACTGGTGCTCGTGGGATAATGCAGATTCTTCCTTCAACAGCCCGTGACCCAGGATTTGGAATCAGGCCATCTGATGGAACTCCGGAAGATGATGCGCGAGTTGGTCGTGAATACTATGACGCAATGTACAAGCGTTATGGCGACCACGAAAAGGCAATGGCTGCTTACACCGATGGTCCTGGCACTGTTGACAAAGCTGTTGATAAGAATGGGTTGGATTGGCTAACTGCTGTTCCTGTACAAGCACAGAATCGCGTCAAAGCATTCCGTGAATGGTCCAAGTCTTCTCAGTCTTTGGAAGAAGGTGCTACAGGGTTTACTCGTAATGGAATGTCATACGGTCAAACCCAAACTGTTGTTAATGTTAAGATTGATGCTAAGGTCAACAACCAGACTGCCTCTGCTACAGTGGCAATTCCTGGTGGCCAGACTGTAACTCAACAGATGAACATGAACAACGGTGCGCAACAAAGAAGATAA
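The relationship between the CDS and the protein sequence can be illustrated with a small translation routine. This is a sketch only: it assumes the standard amino acid assignments (which the bacterial, archaeal and plant plastid code, NCBI table 11, shares), and the `translate` helper is hypothetical. It strips whitespace first, since the CDS appears wrapped over two lines on this page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGAAGACAATGGCTAATCAAAAAGATTTA"))  # MKTMANQKDL
```

The output matches the first ten residues of the protein sequence listed earlier on the page.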
Genome Context
Tertiary structure
PDB ID
6a700da945d3ed3d760e1f145d2462228c7c4073841a574f3630bbbf0d84691b
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
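The confidence bands above map directly onto a per-residue classifier. A minimal sketch follows. The page uses strict inequalities and so leaves the exact values 90, 70 and 50 unassigned; placing each boundary value in the lower band is an assumption made here, and the function name `confidence_band` is illustrative.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used on this page."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in the lowest band


for score in (95.2, 80.0, 60.5, 30.0):
    print(score, confidence_band(score))
```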