Genbank accession
QGF19902.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TSP
Evidence RBPdetect2
Probability 0,93
Protein sequence
MLHAYAGYPVYNGQIAKFVTVQGHSMAVYDAYGAQQFYFPNVLKYDPDQLRQQLEDPDGANKYPKLQIARWRDSYDVRGWGAIGDGVHDDTSALSELLSVATGGEKIDGRGLTFKVSTLPDVSRFKNARFLFERIPGQPLFYASEDFIQGELFKITDTPWYNAWTQDKTFVYDNVIYAPFMAGDRHGVNNLHVAWVRSGDDGKTWTTPEWLTDLHENYPTVNYHCMSMGVVRNRLFAVIETRTVSGNKLQVAELWDRPMSRSLRVYGGITKAANQQVAYIRITDHGLFAGDFVNFSNSGVTGVTGNMTVTTVIDKNTFTVTTQNTQDVDQNNEGRYWSFGTSFHSSPWRKTSLGTIPSFVDGSTPVTEIHSFATISDNSFAVGYHNGDIGPRELGILYFSDAFGSPGSFVRRRIPAEYEANASEPCVKYYDGILYLTTRGTLSTQPGSSLHRSSDLGTSWNSLRFPNNVHHSNLPFAKVGDELIIFGSERAFGEWEGGEPDNRYAGNYPRTFMTRVNVNEWSLDNVEWVNVTDQIYQGGIVNSAVGVGSVCIKDNWLYYIFGGEDFLNPWSIGDNNRKYPYVHDGHPADLYCFRVKIKQEEFVSRDFAYGATPNRTLPTFMSTSGVRTVPVPVDFTDDVAVQSLTVHAGTSGQVRAEVKLEGNYAIIAKKVPSDDVTAQRLIVSGGETTSSADGAMITLHGSRSSTPRRAVYNALEHLFENGDVKPYLDNVNALGGPGNRFSTVYLGSNPVVTSDGTLKTEPVSPDEALLDAWGDVRYIAYKWLNAVAIKGEEGARIHHGVIAQQLRDVLISHGLMEEESTTCRYAFLCYDDYPAVYDDVITGQREMPLTDNDGSIIVDEDDNPVMVIEDIIERVEITPAGSRWGVRPDLLFYIEAAWQRREIERIKARLDLIEGKH
Physico‐chemical
properties
protein length:917 AA
molecular weight: 102354,81930 Da
isoelectric point:5,17550
aromaticity:0,11450
hydropathy:-0,36161

Domains

Domains [InterPro]
IPR023366
STR
263–343
IPR001724
Unmapped
334–357
QGF19902.1
1 917
Architecture
ATT
STR
ATT
STR
CHP
RBD
CHP
RBD
ATT 2-49 | STR 50-105 | ATT 106-146 | STR 147-753 | CHP 754-766 | RBD 767-819 | CHP 820-886 | RBD 887-916 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
QGF19902.1
1 917
Domain Start End Length (AA) Confidence
N-terminal 1 104 104 0,8843
Central domain 105 420 317 0,9076
C-terminal 421 917 496 0,5330
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-104
Central
105-420
C-terminal
421-917

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage phiv205-1
[NCBI]
2663260 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Escherichia coli
[NCBI]
562 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
QGF19902.1 [NCBI]
Genbank nucleotide accession
MN340231.1 [NCBI]
CDS location
range 3572 -> 6325
strand -
CDS
TTGCTGCACGCTTACGCTGGATATCCGGTATATAACGGACAGATTGCCAAATTCGTTACCGTGCAAGGCCATTCTATGGCTGTTTATGATGCGTATGGTGCACAGCAGTTCTATTTTCCCAATGTGCTGAAGTATGACCCTGATCAACTACGGCAGCAATTAGAAGACCCAGATGGAGCGAATAAATACCCAAAACTTCAGATAGCAAGATGGAGAGACAGTTATGATGTAAGAGGTTGGGGGGCTATTGGTGATGGTGTTCATGATGATACATCAGCTCTATCAGAATTACTTTCTGTTGCAACAGGTGGTGAAAAGATAGATGGGCGAGGGCTTACTTTTAAAGTATCAACTCTTCCAGATGTCAGTCGATTTAAAAATGCTCGTTTTTTATTTGAGAGAATACCGGGTCAGCCTCTTTTTTATGCTTCTGAAGATTTTATCCAGGGAGAGTTATTTAAAATTACAGATACACCGTGGTACAACGCCTGGACGCAGGATAAAACGTTTGTATATGACAATGTCATCTATGCGCCTTTTATGGCTGGAGACCGCCATGGTGTAAATAACCTCCATGTTGCATGGGTTCGCTCAGGAGATGACGGGAAGACCTGGACAACGCCGGAATGGCTTACAGATTTACATGAAAACTATCCCACAGTTAACTATCACTGCATGAGTATGGGGGTTGTCAGAAATCGCCTTTTTGCTGTAATTGAGACGCGGACCGTGAGCGGAAATAAACTGCAGGTTGCAGAGTTGTGGGATCGCCCAATGAGTCGCAGCCTTCGCGTTTATGGTGGTATAACGAAAGCAGCAAATCAGCAAGTCGCTTATATTCGCATTACTGATCACGGATTATTTGCTGGTGATTTTGTCAACTTCTCAAACTCTGGTGTTACAGGTGTTACCGGGAATATGACGGTGACTACTGTTATTGATAAAAATACTTTTACAGTTACGACGCAAAATACCCAGGATGTGGATCAGAATAACGAGGGTAGATACTGGAGTTTTGGTACATCATTTCACTCGTCACCATGGAGAAAAACCAGTCTTGGAACTATTCCTTCTTTTGTTGACGGAAGCACTCCTGTTACTGAGATTCACAGTTTTGCGACGATTAGCGATAACAGTTTTGCTGTTGGCTACCATAATGGTGATATTGGTCCACGCGAGCTTGGGATACTCTATTTCTCTGATGCTTTCGGTTCTCCTGGTAGCTTTGTTCGCAGACGCATACCTGCAGAATATGAGGCGAATGCATCTGAGCCATGTGTAAAATATTATGATGGCATTCTGTATCTGACGACCAGGGGGACATTAAGTACTCAACCCGGTAGTTCATTGCACAGAAGCTCTGATTTAGGTACATCATGGAATTCTCTTCGCTTCCCAAATAATGTTCATCACTCAAACCTTCCTTTTGCCAAAGTTGGCGATGAGCTGATTATTTTTGGCAGTGAGCGCGCATTTGGTGAGTGGGAAGGAGGAGAACCTGATAACCGTTATGCAGGAAACTATCCAAGAACATTTATGACCAGAGTTAACGTCAATGAGTGGAGTCTGGATAATGTAGAGTGGGTTAATGTTACTGATCAGATTTATCAGGGCGGAATAGTTAACTCTGCGGTTGGTGTTGGTTCAGTTTGTATCAAAGACAACTGGCTGTACTACATTTTCGGTGGGGAAGACTTTCTAAACCCATGGAGCATAGGGGATAACAACAGAAAATATCCTTATGTTCACGATGGTCACCCGGCTGATTTGTATTGTTTCAGGGTGAAAATTAAACAGGAAGAATTTGTTTCAAGGGATTTTGCCTACGGAGCCACTCCTAACAGAACGCTTCCTACTTTTATGTCGACGTCAGGCGTGAGGACGGTTCCTGTACCTGTTGATTTCACAGATGATGTTGCCGTCCAGTCACTGACTGTCCATGCAGGTACATCAGGACAAGTTCGCGCGGAAGTCAAACTTGAGGGTAATTACGCCATTATTGCGAAGAAAGTACCGTCTGATGATGTTACCGCTCAGAGATTAATCGTTAGCGGCGGTGAAACAACGTCTTCAGCAGATGGTGCAATGATAACGTTGCATGGTTCCAGAAGCAGTACTCCACGTCGCGCGGTATATAACGCACTCGAACATCTTTTTGAGAACGGAGATGTTAAACCTTATCTTGATAATGTAAATGCTCTTGGTGGTCCGGGAAACAGGTTCTCGACAGTTTATCTTGGCTCCAATCCTGTGGTTACCAGTGACGGAACATTAAAGACAGAGCCGGTCTCTCCTGACGAAGCATTGCTGGATGCCTGGGGTGACGTCAGGTATATCGCTTATAAATGGCTGAACGCTGTCGCTATAAAGGGGGAAGAAGGGGCGAGGATACATCATGGTGTAATCGCGCAGCAACTTCGTGATGTTCTTATTTCTCACGGACTCATGGAAGAAGAAAGCACAACATGCCGCTATGCCTTTCTTTGCTATGACGATTATCCCGCAGTATATGATGACGTCATTACTGGCCAAAGGGAAATGCCGCTGACTGATAATGACGGGAGCATCATTGTTGATGAGGATGATAATCCAGTGATGGTAATAGAAGACATCATTGAGCGCGTTGAAATAACGCCAGCAGGATCTAGATGGGGGGTCAGACCTGATCTCTTATTCTATATCGAGGCGGCATGGCAGCGCAGAGAAATAGAAAGAATAAAAGCTAGGTTAGACTTAATAGAAGGGAAGCACTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
93d81d557c702b6b86a32554e2e28454b9b49e16f77acffabffd7d5bda666b49
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6529
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50