Genbank accession
WQA18413.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,82
Protein sequence
MTTKVIFTFHSPDGSPQANEKFTVRLTRPGMSDAEHCVVIPETYEMVTDAKGEFTMDLESSTSAYRVTAIGDEDEYEDDPCSQYTFTFYVPDSADPVYVQELILMPPPTNLPWDEEAMNKITQAVVDARNARDDAEESADRAEAQVGLAAAQVTLAKAEVTKATAQADRSKAEADRATTQATNAANSATAAANSATQANTQANRAKTEADRSKSEADRARDLADAMAEKVEGGSLPPLVGMNETFTYEGTDPYRWTISGPATAVSDGSVMRLTKTDGSGSRAFLRQAVTFPDSHWIVYMRVKTQTGTAAQNRSAQIRFIAADNKDCTVYFNVNVNGIVEPNTIHMQGTEGSSRNAATMFTGLNTESWMDLAVKYDAVNRHIELFRRMPNGTWQKGGGRLMVDAIKPAFIEVSSMPVAPLNWWLDLDFISVCKPNLICYGDSIAAGQNEYGVTRGSNSYNNNRNWAGTWFGKVPLYATNRNNLVLVQGVEGRRTWQYLSQLSEISNSGVKVVFIHASTNDVNDATMTMEKRTSDTQAIIDQLHAVGAQVVLFNSMQGTKAYNDASTTTVKLRDYTDQWWNTELPKVNGLAQTLDIARLIAKDGYMDPALGASDGLHLTDASAQKIADKLGQFFSNSSDTNGFASLDSPAFTGIPTVPTQTPFLPYGKQIANTEYVITFIQDWTSNYGYGDLSMRNRTGSQMEAGGVRSGYYYVPGDSSNPLPGNVYAFVHHMSYDTNKGWELWNHCYTDRVYMRYSNNAGVWQTPVEVVTEKWMERNSFMTPRVSAFNRLPVASSAGFEGVVPLQTNSSGAWRNVNAGVAGLGLLGATTAGNALNYIGGMPKAPTNDRANSNLNNLPDECGFYGLGPAPYSNIPPGVDAINPVGSTVYHQVYDANTATQIFIPRTSDICYFRRKAGGTWSPWVRYLTDLQLVGTTTDSSAGVPNGAIMQINGSAAVNVGVALRFADGTQIVRALLQLDYGAVDILQRQFTFPMAFVGKPVVTATLEQGTVADINNMPLQALGPVMVASIYAGNCNVRVMRSQGYTAGGFAAGSKMLCSVIAMGFWK
Physico‐chemical
properties
protein length:1065 AA
molecular weight: 115793,17280 Da
isoelectric point:5,19471
aromaticity:0,09296
hydropathy:-0,29671

Domains

Domains [InterPro]
DC_0123
ATT
1–581
IPR051532
Unmapped
209–632
SSF52266
STR
432–630
cd00229
ENZ
435–631
IPR013830
ENZ
437–620
WQA18413.1
1 1065
Architecture
ATT
STR
STR
ATT 1-581 | STR 582-637 | STR 714-1065
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WQA18413.1
1 1065
Domain Start End Length (AA) Confidence
N-terminal 1 234 234 0,9762
Central domain 235 481 248 0,5640
C-terminal 482 1065 583 0,7828
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-234
Central
235-481
C-terminal
482-1065

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudomonas phage vB_Pae_TUMS_P11
[NCBI]
3067296 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Pseudomonas aeruginosa
[NCBI]
287 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Pseudomonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WQA18413.1 [NCBI]
Genbank nucleotide accession
OR424355.1 [NCBI]
CDS location
range 39792 -> 42989
strand +
CDS
ATGACTACCAAGGTGATCTTCACCTTCCATAGCCCGGACGGTAGCCCACAGGCGAACGAGAAGTTCACCGTGCGACTTACCCGTCCTGGCATGAGCGATGCAGAGCACTGCGTCGTAATTCCCGAAACCTACGAGATGGTGACCGACGCCAAGGGCGAGTTCACCATGGACCTGGAATCGTCCACCTCTGCCTACCGCGTCACTGCTATAGGTGATGAAGACGAGTACGAAGACGATCCCTGCTCGCAGTACACCTTCACCTTCTACGTGCCGGACTCTGCTGATCCGGTCTACGTGCAGGAACTGATCCTGATGCCCCCGCCCACGAATCTTCCGTGGGACGAGGAAGCCATGAACAAGATCACCCAGGCGGTGGTCGATGCTCGCAACGCACGAGACGATGCCGAAGAGTCGGCTGACCGAGCAGAGGCTCAGGTTGGTTTGGCTGCTGCGCAAGTGACCCTTGCAAAGGCAGAGGTGACCAAGGCTACCGCTCAGGCTGACCGCTCCAAGGCGGAAGCTGACCGTGCCACTACCCAGGCCACGAACGCAGCCAACTCGGCTACTGCTGCGGCGAACTCTGCCACCCAGGCCAACACTCAAGCCAACCGTGCGAAGACGGAAGCCGACCGCTCCAAGAGTGAGGCTGACCGTGCACGGGATCTGGCTGATGCCATGGCTGAGAAGGTCGAGGGTGGATCGCTGCCTCCGCTGGTGGGCATGAACGAAACCTTCACCTACGAAGGTACCGACCCGTACCGCTGGACCATCTCCGGTCCTGCTACGGCTGTATCGGACGGTAGCGTCATGCGTCTCACCAAGACCGATGGCAGCGGCTCTCGTGCTTTCCTGCGGCAGGCAGTGACCTTCCCGGACAGTCACTGGATCGTCTACATGCGTGTGAAGACCCAGACCGGTACCGCTGCTCAGAATCGATCTGCACAGATCCGCTTCATCGCTGCTGACAACAAGGACTGCACGGTCTACTTCAACGTCAACGTGAACGGTATTGTCGAACCCAACACCATCCACATGCAGGGCACCGAGGGTAGTTCCCGTAACGCTGCCACCATGTTCACTGGTTTGAATACCGAGTCCTGGATGGATCTCGCTGTGAAGTACGATGCAGTCAACCGCCACATCGAACTTTTCCGCCGTATGCCCAATGGCACCTGGCAGAAGGGTGGCGGTCGTCTGATGGTTGATGCTATCAAGCCGGCCTTCATCGAAGTCTCGTCCATGCCGGTAGCACCTCTGAACTGGTGGCTGGATCTGGACTTCATCTCGGTGTGCAAGCCCAACCTCATCTGCTACGGCGACAGCATCGCCGCAGGCCAGAACGAGTACGGTGTGACTCGCGGGAGTAACTCCTACAACAACAATCGCAACTGGGCTGGTACTTGGTTTGGTAAGGTTCCGCTGTATGCCACGAATCGGAACAACCTCGTGCTTGTGCAGGGTGTGGAAGGTCGCCGTACTTGGCAGTACCTCAGCCAGCTGTCGGAGATCTCCAACTCTGGCGTCAAGGTAGTCTTCATTCATGCCAGCACGAATGACGTGAACGATGCCACCATGACCATGGAGAAGCGCACCTCTGATACCCAAGCCATCATCGACCAGCTTCATGCTGTCGGTGCCCAGGTGGTGCTGTTCAACTCCATGCAGGGCACGAAGGCGTACAACGATGCTTCGACCACTACCGTCAAGCTTCGGGACTACACTGATCAGTGGTGGAACACTGAACTACCCAAGGTCAATGGTTTGGCCCAGACGCTGGACATTGCTCGACTCATCGCCAAGGACGGGTACATGGACCCTGCCCTCGGTGCGAGTGATGGCCTGCACCTGACCGATGCCTCGGCACAGAAGATCGCCGACAAGCTTGGCCAGTTCTTCTCCAACTCCAGCGATACCAATGGCTTCGCCTCGTTGGATAGCCCAGCCTTCACTGGTATCCCGACTGTCCCCACGCAGACTCCTTTCCTGCCTTACGGCAAGCAGATCGCCAACACCGAGTACGTCATTACCTTCATCCAGGACTGGACCAGTAACTATGGTTACGGTGACCTGTCGATGCGCAATCGGACCGGTTCCCAAATGGAAGCCGGTGGGGTGCGTAGCGGTTATTACTATGTTCCTGGGGATTCTTCTAACCCCTTGCCGGGGAATGTTTATGCATTCGTGCACCACATGTCCTACGACACCAACAAGGGTTGGGAACTCTGGAACCACTGCTACACCGACCGTGTGTACATGCGTTATTCGAACAACGCAGGTGTATGGCAGACGCCGGTAGAGGTGGTTACTGAGAAGTGGATGGAGCGTAATAGCTTCATGACCCCACGGGTTTCTGCATTCAACCGTCTGCCCGTGGCTTCTTCTGCTGGTTTTGAAGGGGTTGTTCCCTTGCAGACCAACAGTAGTGGGGCTTGGCGTAACGTGAATGCTGGTGTAGCTGGGTTGGGCCTTCTCGGTGCAACTACGGCTGGTAATGCACTGAACTACATTGGTGGTATGCCCAAAGCCCCAACCAATGATCGGGCCAACAGTAACCTCAACAATTTGCCTGATGAGTGTGGTTTCTACGGTTTGGGACCAGCACCGTATTCCAATATCCCTCCGGGGGTTGATGCGATTAACCCAGTGGGTTCTACCGTCTACCATCAGGTTTACGACGCCAACACCGCTACGCAAATTTTCATTCCGCGTACTTCGGACATCTGCTATTTCCGCCGTAAGGCAGGGGGAACCTGGAGTCCATGGGTTCGTTACCTGACTGATTTGCAGCTGGTAGGCACCACTACTGACAGCAGTGCCGGTGTTCCGAATGGTGCCATCATGCAGATCAATGGGAGTGCTGCTGTAAACGTAGGTGTTGCCCTCCGCTTTGCAGACGGTACTCAAATTGTTCGGGCGTTACTCCAATTGGATTACGGTGCAGTAGATATCCTTCAGCGTCAGTTCACTTTCCCCATGGCTTTTGTGGGTAAGCCGGTGGTAACCGCCACCCTGGAACAAGGTACTGTTGCTGACATCAACAACATGCCTTTGCAGGCTCTTGGCCCTGTTATGGTTGCCAGTATCTATGCAGGCAACTGCAACGTGCGTGTCATGCGGTCTCAGGGGTATACTGCTGGTGGATTCGCAGCAGGCTCTAAAATGCTGTGCTCTGTCATCGCAATGGGGTTCTGGAAATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
a1df68a91ed230377550b2c21d2607b34606caeec2c769b3ebf8e9e70782988b
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6657
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50