GenBank accession
WBF78722.1 [GenBank]
Protein name
long tail fiber protein
RBP type  Evidence    Probability
TF        GenBank     1.00
TSP       DepoScope   0.61
TSP       RBPdetect   0.65
TF        RBPdetect2  0.95
Protein sequence
MALDPNINRIKFLRSSTAGAKPTTAAIQPGEIAINLADRTLYSTDGNAIIDIGFGLGGSVNGPINATGQISTNDFLYSKYGFSVNSETADGRGISLYGNGYTASNGLPSYGLAMAATSIYGTFGAVSGSHATYLTTNSGTNRGWIFNYNGTTNVASISGTGIATFARVDAPLNGNANTATKLQSARNINGVLFDGTSDINTPAITDVVSFDNRTVKPSDVRNKAMGVYFTSKAGLNGAADTNYGDFLSLSTYQDGTGGKVNGLYFNKLSREILHYQTDLNSNSWGTPKTIAYTDSSITGNAASATRLQTARTINGTAFDGTANINVNATYSEFIPDGADLNDYKTPGLYYCPTDAGAATQLNLPFSNAYSLFVERHAGIKQTITQYATNKTFIRKFYNGYWDNWRQLAFLDPSDQKFTENITLEKASSDVAINIKTTTDNTSELILSNRNKTASVSLVSDGTFLLWDSNRQTSFMSFTPDGVQSTLNSKLLINTPASLNVTGGETFAVTGGESSPIRFKINRAGAITTNVPETGAAIVHATSYNWYNTEWQIGNIRGGSTNSLGFGITKFNDTLVWRHDGNTMTNYGNISNTGSISTQGDISSNGNISNTGSISTQGNISSNGNLSTAGSISGDSLITRTGRIGAPTSTYHYLDIGRDGTDITTVGQYGGSFRVVDTAIGKNTFSVDPNTAIFAGRIITKPGTFYQNITDLGNATAAITVPDTVAPKDVTGYVPFIHGSVQTNGSGYRTNVSIGALRGSNTWSSSGAYIAIGGNDNYTTEDFRFISGGYIGTSGGTLNILGTLNAPTLTSTTGSFTTVNTTNINAQGSIVMSAAPGGSGRSGMYTGNGDGASFSTCNIDIGSHWGLGFKDNLGNRNIIFDTRAGNASFKGSIRIGASFADETQLLPTSNQLQILTSGGQARNISTGGVLASDSYADFNKVPTNGIYSKGDIKTAVWMYASTFTGPTGSGDGRFDGNANTATRLQTARTFQITGGITTNAVSFDGQQNVTLTASNVDGSKVTGVVPEAIKAQTLAVTPQKNAKLVASWKGTILQSMTPTLTIVDANTLRVRLADDNPSNRLAVLRFNVKIGTVYHLAFSDTMLPINGTVTFVQTGLTWVEVDLNSPNHGLSGSGNGVNVMAITYSAYGCYFEGTISQIIGTKGPQDSQWAYVLKLNSPTTDATYNLSGSSQDAIWVADKNIWYLNPAQPVITAGAMISPDRLNFFAADTDTATRMRSNMVTAQIWDIV
Physico-chemical properties
protein length: 1247 AA
molecular weight: 131662.89060 Da
isoelectric point: 6.25316
aromaticity: 0.09142
hydropathy: -0.21291
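
Properties like these can be recomputed from the protein sequence. The sketch below is a minimal illustration, not the method used by this page: it assumes aromaticity is the fraction of F, W, and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are hypothetical, and results may differ from the listed values if other definitions were used.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input: the first 20 residues of the protein sequence above.
seq = "MALDPNINRIKFLRSSTAGA"
print(len(seq), round(aromaticity(seq), 5), round(gravy(seq), 5))
```

Passing the full 1247-residue sequence instead of the 20-residue prefix gives values that can be compared with the ones listed here.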

Domains

Domains [InterPro]
ID       Type  Range
DC_0113  STR   1–80
cd19958  STR   334–408

Architecture (WBF78722.1, residues 1–1247)
STR 1-80 | STR 83-1247
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation


This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout (WBF78722.1, residues 1–1247)
Domain          Start  End   Length (AA)  Confidence
N-terminal      1      40    40           0.2400
Central domain  41     239   200          0.4230
C-terminal      240    1247  1007         0.5523
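
The Start, End, and Length columns can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates, which the page does not state; `span_length` and `find_mismatches` are hypothetical helper names. Under that assumption the central and C-terminal rows each differ by one from the listed length, while the listed lengths still sum to 1247, so the boundaries may follow a different convention.

```python
# Domain rows as listed in the table: (name, start, end, listed_length).
DOMAINS = [
    ("N-terminal", 1, 40, 40),
    ("Central domain", 41, 239, 200),
    ("C-terminal", 240, 1247, 1007),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def find_mismatches(domains):
    """Return (name, computed, listed) for rows where the lengths disagree."""
    return [
        (name, span_length(start, end), listed)
        for name, start, end, listed in domains
        if span_length(start, end) != listed
    ]


mismatches = find_mismatches(DOMAINS)
for name, computed, listed in mismatches:
    print(f"{name}: computed {computed}, listed {listed}")
# Central domain: computed 199, listed 200
# C-terminal: computed 1008, listed 1007
```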
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal 1-40
Central 41-239
C-terminal 240-1247

Taxonomy

       Name                                     Taxonomy ID  Lineage
Phage  Acinetobacter phage vB_AbaM_DLP2 [NCBI]  3003715      Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host   Acinetobacter baumannii AB5075 [NCBI]    1116234      Pseudomonadota > Gammaproteobacteria > Moraxellales > Moraxellaceae > Acinetobacter > Acinetobacter baumannii

Coding sequence (CDS)

GenBank protein accession
WBF78722.1 [NCBI]
GenBank nucleotide accession
OP946502.1 [NCBI]
CDS location
range 155234 -> 158977
strand +
CDS
ATGGCATTAGATCCAAATATTAACAGAATTAAATTTTTACGATCTTCTACTGCTGGAGCTAAACCTACTACTGCAGCGATTCAACCAGGCGAAATTGCTATCAATTTGGCAGATAGAACTCTTTACTCAACCGATGGTAATGCTATTATTGATATCGGTTTTGGCTTGGGTGGAAGTGTCAATGGGCCAATAAATGCAACAGGGCAAATTAGTACCAATGACTTTTTATATTCAAAGTATGGATTTTCTGTAAATTCAGAAACTGCAGATGGACGCGGTATTTCATTATATGGTAATGGATATACCGCATCAAACGGTCTGCCGTCATACGGGCTTGCGATGGCTGCTACAAGTATATATGGTACATTTGGTGCTGTGTCAGGTTCGCACGCAACTTACCTTACTACAAACTCAGGTACTAACCGTGGTTGGATTTTTAACTATAACGGGACTACCAATGTAGCTTCAATTTCGGGAACCGGCATTGCGACATTTGCTCGTGTAGATGCTCCGTTAAACGGTAATGCTAATACAGCAACCAAACTTCAATCAGCTAGAAACATTAATGGCGTGTTGTTCGACGGCACATCAGATATTAATACACCAGCTATCACTGACGTTGTTTCATTTGATAATAGAACTGTTAAACCTTCTGATGTTAGAAATAAAGCCATGGGTGTTTATTTTACTTCAAAGGCAGGTTTAAATGGCGCGGCTGATACCAATTATGGTGATTTTTTATCTCTAAGCACTTATCAAGATGGCACTGGAGGAAAGGTAAACGGTTTATATTTCAATAAATTATCTCGAGAAATATTACATTACCAGACAGATTTAAATTCAAATTCGTGGGGAACTCCGAAAACGATTGCGTATACCGATAGTTCTATTACCGGCAATGCTGCATCAGCAACACGCTTACAGACAGCAAGAACTATCAATGGTACTGCATTTGATGGTACTGCGAATATCAATGTCAATGCTACATATTCGGAATTTATTCCTGATGGTGCGGATTTAAATGATTATAAAACTCCAGGGCTATATTATTGTCCTACTGATGCTGGTGCAGCGACTCAATTAAATTTACCGTTTAGTAATGCGTATTCGTTGTTTGTTGAGAGACATGCTGGAATTAAACAGACTATTACCCAATATGCGACAAATAAAACTTTTATTCGTAAATTTTATAATGGTTATTGGGATAATTGGCGTCAATTAGCTTTCTTAGACCCAAGTGATCAGAAATTTACAGAAAATATCACTTTGGAAAAAGCTTCTTCTGACGTAGCAATCAACATTAAAACTACTACTGATAATACTTCTGAACTTATTTTAAGCAATAGAAATAAAACAGCGTCTGTAAGTCTCGTATCAGACGGTACTTTTTTATTATGGGATTCAAATAGACAAACATCATTCATGTCATTTACACCAGATGGTGTACAATCGACATTAAATTCTAAGTTGTTGATTAATACTCCAGCATCGCTAAATGTTACAGGCGGAGAAACATTTGCTGTTACGGGTGGAGAAAGTTCCCCAATCAGATTTAAAATTAACCGCGCTGGAGCTATTACCACAAATGTTCCTGAGACTGGCGCCGCTATTGTCCATGCTACTTCATACAACTGGTATAACACTGAATGGCAAATTGGTAACATTCGTGGCGGTTCTACTAACTCTCTTGGTTTTGGTATTACTAAATTCAACGATACATTAGTATGGCGCCATGATGGTAATACTATGACCAACTACGGTAATATTAGTAATACGGGTTCGATCAGTACACAAGGAGATATTTCATCTAATGGTAACATTAGTAATACAGGTTCGATCAGTACGCAAGGAAATATTTCATCTAATGGTAACTTAAGTACTGCAGGTTCAATCAGCGGAGATAGTTTAATCACTAGAACTGGTCGTATCGGTGCGCCAACATCTACGTATCACTATCTCGATATCGGTAGAGACGGTACAGATATTACTACAGTCGGTCA
ATACGGGGGTTCATTTAGAGTTGTTGATACAGCTATCGGCAAAAATACATTCAGTGTTGACCCAAATACGGCTATTTTCGCTGGAAGAATTATCACGAAGCCTGGTACATTTTATCAAAATATAACAGACTTGGGTAATGCGACTGCAGCTATTACAGTTCCAGATACTGTTGCTCCAAAAGATGTCACTGGATATGTTCCGTTCATCCATGGATCTGTCCAAACGAATGGCTCTGGTTATAGAACTAACGTATCTATTGGTGCTTTGAGAGGTTCTAATACTTGGTCATCTTCGGGTGCTTATATCGCTATCGGTGGAAACGATAACTATACGACAGAAGATTTTAGATTCATTTCTGGTGGTTATATTGGTACAAGTGGCGGTACTTTAAATATCTTAGGTACTTTAAATGCACCTACTTTAACATCTACAACTGGATCTTTCACTACCGTTAATACAACAAATATTAACGCCCAAGGCAGTATTGTGATGAGTGCTGCGCCTGGTGGATCTGGGCGAAGCGGTATGTATACAGGAAATGGTGACGGTGCGTCCTTCTCTACCTGTAATATAGATATAGGTTCTCATTGGGGCTTAGGATTCAAAGATAATTTAGGAAACAGAAATATTATCTTTGATACCCGTGCAGGTAATGCGTCGTTTAAAGGATCTATTAGAATCGGTGCATCTTTTGCTGATGAGACTCAATTATTACCTACATCAAACCAACTTCAAATATTAACTTCAGGCGGTCAAGCTAGAAATATTTCGACTGGTGGTGTATTGGCTTCTGATTCGTACGCTGATTTTAATAAGGTTCCGACAAATGGCATCTACTCAAAAGGTGATATTAAAACCGCCGTATGGATGTATGCATCTACATTCACTGGTCCTACAGGATCAGGCGACGGTCGTTTTGATGGTAACGCAAACACAGCAACTCGTTTGCAAACTGCAAGAACTTTCCAAATTACTGGAGGCATCACAACAAATGCAGTATCATTTGACGGACAACAAAACGTTACATTGACTGCATCTAATGTTGATGGTTCAAAGGTTACAGGCGTGGTTCCTGAGGCGATTAAAGCGCAAACTCTTGCAGTGACACCGCAAAAAAATGCGAAACTCGTTGCATCATGGAAGGGTACCATATTGCAATCTATGACGCCTACACTAACTATTGTTGATGCAAATACACTTCGTGTTAGATTAGCAGACGATAATCCGAGTAACAGATTGGCAGTATTGAGATTTAATGTGAAAATCGGCACAGTGTATCATCTCGCATTTAGTGATACTATGTTGCCGATCAATGGCACAGTAACTTTCGTCCAAACTGGGTTAACATGGGTTGAAGTTGATCTTAATTCACCTAACCACGGGCTATCAGGTTCAGGCAATGGTGTAAATGTTATGGCGATAACATATTCTGCATATGGTTGTTATTTTGAGGGCACAATTAGCCAAATCATTGGCACAAAAGGGCCACAAGACAGTCAATGGGCATATGTATTGAAGCTCAATTCTCCGACAACTGATGCTACATATAATCTTAGCGGATCTTCGCAAGATGCAATATGGGTTGCAGATAAGAATATTTGGTATCTTAATCCTGCGCAGCCGGTGATAACTGCAGGCGCTATGATTTCTCCTGATAGATTGAATTTCTTCGCTGCTGATACAGATACTGCAACAAGAATGCGTTCTAATATGGTAACTGCACAAATCTGGGACATTGTATAA
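
The CDS coordinates can be checked against the protein length with simple arithmetic. The sketch below assumes the range is 1-based and inclusive and that the CDS ends with a single stop codon (the sequence above ends in TAA); the function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def protein_length(cds_nt: int, has_stop: bool = True) -> int:
    """Number of encoded residues for a CDS of cds_nt nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


nt = cds_length(155234, 158977)
print(nt, protein_length(nt))  # 3744 1247
```

The result, 3744 nt or 1248 codons including the stop, agrees with the listed protein length of 1247 AA.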

Genome Context


Tertiary structure

PDB ID
9f22e5f2972abf5f29d79389b37ee730c0c906fe415b10a668c0ac3ad5ee25d2
Source ESMFold
Method ESMFold
Resolution 0.4615
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
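
The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a sketch only: the legend uses strict inequalities, so where scores of exactly 90, 70, or 50 belong is an assumption (here, the lower band), and `plddt_band` is a hypothetical name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Boundary values (exactly 90, 70, 50) fall into the lower band by assumption.
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```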