Protein
- Genbank accession
- QGF19902.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTSPTSPTSP
- Protein sequence
-
MLHAYAGYPVYNGQIAKFVTVQGHSMAVYDAYGAQQFYFPNVLKYDPDQLRQQLEDPDGANKYPKLQIARWRDSYDVRGWGAIGDGVHDDTSALSELLSVATGGEKIDGRGLTFKVSTLPDVSRFKNARFLFERIPGQPLFYASEDFIQGELFKITDTPWYNAWTQDKTFVYDNVIYAPFMAGDRHGVNNLHVAWVRSGDDGKTWTTPEWLTDLHENYPTVNYHCMSMGVVRNRLFAVIETRTVSGNKLQVAELWDRPMSRSLRVYGGITKAANQQVAYIRITDHGLFAGDFVNFSNSGVTGVTGNMTVTTVIDKNTFTVTTQNTQDVDQNNEGRYWSFGTSFHSSPWRKTSLGTIPSFVDGSTPVTEIHSFATISDNSFAVGYHNGDIGPRELGILYFSDAFGSPGSFVRRRIPAEYEANASEPCVKYYDGILYLTTRGTLSTQPGSSLHRSSDLGTSWNSLRFPNNVHHSNLPFAKVGDELIIFGSERAFGEWEGGEPDNRYAGNYPRTFMTRVNVNEWSLDNVEWVNVTDQIYQGGIVNSAVGVGSVCIKDNWLYYIFGGEDFLNPWSIGDNNRKYPYVHDGHPADLYCFRVKIKQEEFVSRDFAYGATPNRTLPTFMSTSGVRTVPVPVDFTDDVAVQSLTVHAGTSGQVRAEVKLEGNYAIIAKKVPSDDVTAQRLIVSGGETTSSADGAMITLHGSRSSTPRRAVYNALEHLFENGDVKPYLDNVNALGGPGNRFSTVYLGSNPVVTSDGTLKTEPVSPDEALLDAWGDVRYIAYKWLNAVAIKGEEGARIHHGVIAQQLRDVLISHGLMEEESTTCRYAFLCYDDYPAVYDDVITGQREMPLTDNDGSIIVDEDDNPVMVIEDIIERVEITPAGSRWGVRPDLLFYIEAAWQRREIERIKARLDLIEGKH
- Physico-chemical properties
- protein length: 917 AA
- molecular weight: 102354.81930 Da
- isoelectric point: 5.17550
- aromaticity: 0.11450
- hydropathy: -0.36161
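The length, aromaticity and hydropathy figures can be recomputed from the protein sequence. The following is a minimal sketch, assuming that aromaticity means the fraction of F, W and Y residues and that hydropathy means the mean Kyte-Doolittle score (GRAVY). The record does not state which definitions were used, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumption: the record's "hydropathy"
# is the mean of these per-residue values, i.e. the GRAVY score).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")  # assumption: aromaticity counts Phe, Trp and Tyr


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[res] for res in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect the sequence-derived properties in one dictionary."""
    seq = seq.strip().upper()
    return {
        "length": len(seq),
        "aromaticity": aromaticity(seq),
        "hydropathy": gravy(seq),
    }


if __name__ == "__main__":
    # First 20 residues of the protein sequence, used only as a short demo.
    print(summarize("MLHAYAGYPVYNGQIAKFVT"))
```

Passing the full 917-residue sequence to `summarize` gives values that can be compared with the ones listed in the record. Molecular weight and isoelectric point need residue masses and pKa tables and are left out of this sketch.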
Domains
Domains [InterPro]
| InterPro ID | Type | Range |
|---|---|---|
| IPR036730 | ATT | 2–45 |
| IPR009093 | ATT | 5–49 |
| IPR036730 | ATT | 5–45 |
| IPR024429 | ENZ | 82–148 |
| IPR001724 | Unmapped | 150–174 |
| IPR023366 | STR | 263–343 |
| IPR001724 | Unmapped | 334–357 |
Sequence axis: 1–917
Architecture
ATT 2-49 | STR 50-105 | ATT 106-146 | STR 147-753 | CHP 754-766 | RBD 767-819 | CHP 820-886 | RBD 887-916 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
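The architecture line is a pipe-delimited list of `LABEL start-end` segments. The following is a minimal parsing sketch, assuming 1-based inclusive coordinates; the function and variable names are illustrative.

```python
import re

# Architecture string copied from the record.
ARCHITECTURE = (
    "ATT 2-49 | STR 50-105 | ATT 106-146 | STR 147-753 | "
    "CHP 754-766 | RBD 767-819 | CHP 820-886 | RBD 887-916 |"
)

# One segment looks like "ATT 2-49": a label, then start-end.
SEGMENT = re.compile(r"^(\w+)\s+(\d+)-(\d+)$")


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Split the string on '|' and return (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        part = part.strip()
        if not part:  # skip the empty piece after the trailing '|'
            continue
        match = SEGMENT.match(part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        label, start, end = match.groups()
        segments.append((label, int(start), int(end)))
    return segments


segments = parse_architecture(ARCHITECTURE)

# Number of residues covered, assuming inclusive coordinates.
covered = sum(end - start + 1 for _, start, end in segments)

if __name__ == "__main__":
    for label, start, end in segments:
        print(f"{label}\t{start}-{end}\t{end - start + 1} AA")
    print("covered residues:", covered)
```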
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout (residues 1–917)
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 104 | 104 | 0.8843 |
| Central domain | 105 | 420 | 316 | 0.9076 |
| C-terminal | 421 | 917 | 497 | 0.5330 |
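With 1-based inclusive coordinates, a domain's length is End - Start + 1. The following is a minimal sketch that recomputes the lengths from the Start and End columns and checks that the three domains tile the 917-residue sequence without gaps or overlaps. The inclusive-coordinate convention is an assumption, and the helper names are illustrative.

```python
SEQ_LEN = 917

# (name, start, end) taken from the Start and End columns of the table.
DOMAINS = [
    ("N-terminal", 1, 104),
    ("Central domain", 105, 420),
    ("C-terminal", 421, 917),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def check_tiling(domains, seq_len):
    """Verify the domains cover 1..seq_len contiguously; return their lengths."""
    expected_start = 1
    lengths = []
    for name, start, end in domains:
        if start != expected_start:
            raise ValueError(f"{name} starts at {start}, expected {expected_start}")
        lengths.append((name, inclusive_length(start, end)))
        expected_start = end + 1
    if expected_start != seq_len + 1:
        raise ValueError("domains do not reach the end of the sequence")
    return lengths


lengths = check_tiling(DOMAINS, SEQ_LEN)

if __name__ == "__main__":
    for name, length in lengths:
        print(f"{name}: {length} AA")
```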
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal: 1-104
Central: 105-420
C-terminal: 421-917
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage phiv205-1 [NCBI] | 2663260 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Escherichia coli [NCBI] | 562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Genbank protein accession: QGF19902.1 [NCBI]
Genbank nucleotide accession: MN340231.1 [NCBI]
CDS location: range 3572 -> 6325, strand -
CDS
TTGCTGCACGCTTACGCTGGATATCCGGTATATAACGGACAGATTGCCAAATTCGTTACCGTGCAAGGCCATTCTATGGCTGTTTATGATGCGTATGGTGCACAGCAGTTCTATTTTCCCAATGTGCTGAAGTATGACCCTGATCAACTACGGCAGCAATTAGAAGACCCAGATGGAGCGAATAAATACCCAAAACTTCAGATAGCAAGATGGAGAGACAGTTATGATGTAAGAGGTTGGGGGGCTATTGGTGATGGTGTTCATGATGATACATCAGCTCTATCAGAATTACTTTCTGTTGCAACAGGTGGTGAAAAGATAGATGGGCGAGGGCTTACTTTTAAAGTATCAACTCTTCCAGATGTCAGTCGATTTAAAAATGCTCGTTTTTTATTTGAGAGAATACCGGGTCAGCCTCTTTTTTATGCTTCTGAAGATTTTATCCAGGGAGAGTTATTTAAAATTACAGATACACCGTGGTACAACGCCTGGACGCAGGATAAAACGTTTGTATATGACAATGTCATCTATGCGCCTTTTATGGCTGGAGACCGCCATGGTGTAAATAACCTCCATGTTGCATGGGTTCGCTCAGGAGATGACGGGAAGACCTGGACAACGCCGGAATGGCTTACAGATTTACATGAAAACTATCCCACAGTTAACTATCACTGCATGAGTATGGGGGTTGTCAGAAATCGCCTTTTTGCTGTAATTGAGACGCGGACCGTGAGCGGAAATAAACTGCAGGTTGCAGAGTTGTGGGATCGCCCAATGAGTCGCAGCCTTCGCGTTTATGGTGGTATAACGAAAGCAGCAAATCAGCAAGTCGCTTATATTCGCATTACTGATCACGGATTATTTGCTGGTGATTTTGTCAACTTCTCAAACTCTGGTGTTACAGGTGTTACCGGGAATATGACGGTGACTACTGTTATTGATAAAAATACTTTTACAGTTACGACGCAAAATACCCAGGATGTGGATCAGAATAACGAGGGTAGATACTGGAGTTTTGGTACATCATTTCACTCGTCACCATGGAGAAAAACCAGTCTTGGAACTATTCCTTCTTTTGTTGACGGAAGCACTCCTGTTACTGAGATTCACAGTTTTGCGACGATTAGCGATAACAGTTTTGCTGTTGGCTACCATAATGGTGATATTGGTCCACGCGAGCTTGGGATACTCTATTTCTCTGATGCTTTCGGTTCTCCTGGTAGCTTTGTTCGCAGACGCATACCTGCAGAATATGAGGCGAATGCATCTGAGCCATGTGTAAAATATTATGATGGCATTCTGTATCTGACGACCAGGGGGACATTAAGTACTCAACCCGGTAGTTCATTGCACAGAAGCTCTGATTTAGGTACATCATGGAATTCTCTTCGCTTCCCAAATAATGTTCATCACTCAAACCTTCCTTTTGCCAAAGTTGGCGATGAGCTGATTATTTTTGGCAGTGAGCGCGCATTTGGTGAGTGGGAAGGAGGAGAACCTGATAACCGTTATGCAGGAAACTATCCAAGAACATTTATGACCAGAGTTAACGTCAATGAGTGGAGTCTGGATAATGTAGAGTGGGTTAATGTTACTGATCAGATTTATCAGGGCGGAATAGTTAACTCTGCGGTTGGTGTTGGTTCAGTTTGTATCAAAGACAACTGGCTGTACTACATTTTCGGTGGGGAAGACTTTCTAAACCCATGGAGCATAGGGGATAACAACAGAAAATATCCTTATGTTCACGATGGTCACCCGGCTGATTTGTATTGTTTCAGGGTGAAAATTAAACAGGAAGAATTTGTTTCAAGGGATTTTGCCTACGGAGCCACTCCTAACAGAACGCTTCCTACTTTTATGTCGACGTCAGGCGTGAGGACGGTTCCTGTACCTGTTGATTTCACAGATGATGTTGCCGTCCAGTCACTGACTGTCCATGCAGGTACATCAGGACAAGTTCGCGCGGAAGTCAAACTTGAGGGTAATTACGCCATTAT
TGCGAAGAAAGTACCGTCTGATGATGTTACCGCTCAGAGATTAATCGTTAGCGGCGGTGAAACAACGTCTTCAGCAGATGGTGCAATGATAACGTTGCATGGTTCCAGAAGCAGTACTCCACGTCGCGCGGTATATAACGCACTCGAACATCTTTTTGAGAACGGAGATGTTAAACCTTATCTTGATAATGTAAATGCTCTTGGTGGTCCGGGAAACAGGTTCTCGACAGTTTATCTTGGCTCCAATCCTGTGGTTACCAGTGACGGAACATTAAAGACAGAGCCGGTCTCTCCTGACGAAGCATTGCTGGATGCCTGGGGTGACGTCAGGTATATCGCTTATAAATGGCTGAACGCTGTCGCTATAAAGGGGGAAGAAGGGGCGAGGATACATCATGGTGTAATCGCGCAGCAACTTCGTGATGTTCTTATTTCTCACGGACTCATGGAAGAAGAAAGCACAACATGCCGCTATGCCTTTCTTTGCTATGACGATTATCCCGCAGTATATGATGACGTCATTACTGGCCAAAGGGAAATGCCGCTGACTGATAATGACGGGAGCATCATTGTTGATGAGGATGATAATCCAGTGATGGTAATAGAAGACATCATTGAGCGCGTTGAAATAACGCCAGCAGGATCTAGATGGGGGGTCAGACCTGATCTCTTATTCTATATCGAGGCGGCATGGCAGCGCAGAGAAATAGAAAGAATAAAAGCTAGGTTAGACTTAATAGAAGGGAAGCACTAA
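The CDS lies on the minus strand, so it corresponds to the reverse complement of positions 3572 to 6325 of the nucleotide record, and it begins with TTG rather than ATG. The following is a minimal sketch of extraction and translation, assuming 1-based inclusive coordinates and assuming that an initial ATG, GTG or TTG is read as methionine, as is common for bacterial and phage genes. The function names are illustrative.

```python
from itertools import product

# Standard genetic code in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these initiation codons are translated as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive range; flip it for the minus strand."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


def translate(cds: str) -> str:
    """Translate a CDS in coding orientation, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS in this record, plus a stop codon.
    print(translate("TTGCTGCACGCTTACGCTGGATATTAA"))
    # The range spans 6325 - 3572 + 1 = 2754 nt, i.e. 917 codons plus a stop.
    print((6325 - 3572 + 1) // 3 - 1)
```

The CDS shown in this record is already in coding orientation, so `translate` can be applied to it directly; `extract_cds` is only needed when starting from the full nucleotide record.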
Genome Context
Tertiary structure
PDB ID
93d81d557c702b6b86a32554e2e28454b9b49e16f77acffabffd7d5bda666b49
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
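The confidence bands can be applied to per-residue pLDDT scores with a small helper. The following is a minimal sketch; the legend does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, and the function names are illustrative.

```python
from collections import Counter


def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 falls in "High"
        return "High"
    if score > 50:  # assumption: exactly 70 falls in "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in "Very low"


def band_fractions(scores: list[float]) -> dict[str, float]:
    """Fraction of residues in each band for a list of per-residue scores."""
    counts = Counter(plddt_band(s) for s in scores)
    return {band: n / len(scores) for band, n in counts.items()}


if __name__ == "__main__":
    # Hypothetical scores for illustration only, not taken from this model.
    print(band_fractions([95.2, 88.0, 71.5, 64.3, 42.0]))
```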