Genbank accession
WDQ26538.1 [GenBank]
Protein name
long-tail fiber proximal subunit
RBP type
TSP
Evidence RBPdetect
Probability 0,55
Protein sequence
MVAKSFRARSGLDAAGEKVINVGKADRNTLSDGVNVDFFNEFNGIQQYDPTRGYSQDMAIIYARRIWYAKQNIASPAGAFDESKWIATRNDPKWVYSNVTTPDGSIIESGSYIMADGRFTELLYLLPDNPTEGDVITFKDCGGLVGVNSILVKSNTRQIRLRTVQSAQYRLTHPYMIATFIYNGNVWRVAETLDNRDSEIVNATGTGSFQLQSGMTVFRNSATGKITLQLPKYANDGDVITTYDADKMNSINVAVLQIYPGSGHTITDGASTGVTSVSSQKSGFGMFIFDAQNSQWKVYDADNRVRLRRIYSDLNAVPNDYVFVTANPSGTIPNVTVTLPTDVADGDRVYVSLYMMGKNQNCTIKVKDGTTDKIRTNKNMMQFPQRKDYPPNDWFSVTSLAFNAGSDYLPYIEFSYLKATKEWVVANYRPIVERVDATNRSRTGVIALAAQAEVNKNLEDNPNDETAITPMTLANKTATETRRGIARLATTAEVNKLSTDTYLDDVIVTPKKLNERTATETRRGLAEIATQAETNGSTDDITIVTPKKLHNRIASPTLTGILALVATGGAPNTNTDRSQAGTGVYDHSDYQKAVTPKTLREYKATQLQSGVVWLASETEVINGTVASSNVPTVVTPEMLHKKTSTDGRIGLIEIATQTETNAGTDYTRAVTPKTLNDRAATETLTGIIAIATTAEVSAGTVTNKAIVPSKLKGYLDDTSHITVATADGLTQSGTIWTTVNIGIQSATETQRGTLRVATQSETNAGTLDTVFVTPKKLHAKKATESAEGIIQVATAAETTAGTVANKAVSPKNLKNTIQVDTSWQATDLVRGTVKLSKGLGTWSGNDVAGSTLPDDGYASVGVAVSPYELNLTLKHYLPIGAKAVDADKLDNLDSSQFIRRDVNQTVNGALTLTKATTVQADINSTADASFRVMNVSGDLNVGDGSSMGKLRLNGGSSNDWSIQASSASGRIAMISTGNTGTVHLSVYNDTRGVVANVKFQAPEIQAISKVTLGNDTVITAAGSVLSMGTKNKTTKILTSDAGNIVAEESANSYKVFTEKNAQTLLNPTYVRKAGDTMSGRLTVSNSSIIIAGQAAWSTLDAVTEASRGNWTAEITASAQYNLLPGYAVPVLEPDPINPEIMIVTRYTYVKAPGTLTQFGNGTAFTYQIWAPRPTSGTGVNALAQSFWIRQMNPITGKFDEWGRMYTSNNPPTAGEIGATSAVGTTVKNMTVTDWIKVGNVKIYPDPVTQTVKFEWVA
Physico‐chemical
properties
protein length:1257 AA
molecular weight: 135338,10950 Da
isoelectric point:6,81297
aromaticity:0,07080
hydropathy:-0,27709

Domains

Domains [InterPro]
WDQ26538.1
1 1257
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage phi_KPN_S3
[NCBI]
3028472 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WDQ26538.1 [NCBI]
Genbank nucleotide accession
OQ267591 [NCBI]
CDS location
range 166939 -> 170712
strand +
CDS
ATGGTAGCTAAATCATTCCGCGCACGAAGTGGCCTTGATGCTGCTGGTGAAAAAGTTATCAACGTTGGGAAGGCTGATCGTAATACGCTGAGTGACGGCGTTAACGTTGACTTTTTCAATGAATTTAACGGAATCCAGCAGTATGACCCGACTCGTGGTTATTCTCAAGACATGGCGATCATTTATGCGCGTCGTATCTGGTACGCAAAACAGAATATTGCTTCCCCTGCTGGGGCTTTCGACGAATCCAAATGGATCGCAACTCGTAATGACCCGAAATGGGTTTATAGTAACGTAACCACTCCAGACGGTAGCATTATTGAATCTGGTAGTTACATTATGGCTGATGGACGCTTTACTGAACTTCTGTACCTGTTGCCGGACAACCCGACAGAGGGCGACGTAATCACGTTTAAAGATTGCGGTGGTTTAGTAGGTGTGAACAGTATCCTTGTTAAGTCTAACACCCGTCAGATCCGTTTACGTACTGTACAATCAGCACAGTATAGACTGACTCACCCGTACATGATCGCAACGTTTATCTATAACGGTAACGTATGGCGTGTTGCTGAAACTCTGGATAACCGCGATTCTGAGATCGTGAACGCAACCGGAACAGGTTCATTCCAGTTACAATCTGGCATGACAGTTTTCCGTAACAGTGCTACAGGTAAAATCACTCTCCAGTTGCCGAAATATGCAAATGATGGTGATGTGATCACAACCTATGACGCTGATAAAATGAACTCTATCAACGTAGCCGTTTTGCAGATTTATCCAGGTAGTGGACACACTATTACTGATGGTGCAAGTACTGGTGTTACTTCGGTAAGTAGCCAGAAATCCGGCTTTGGTATGTTCATCTTTGATGCACAGAATAGCCAATGGAAAGTATATGATGCTGATAACCGTGTTCGTCTGCGTCGTATCTACAGTGATTTAAACGCGGTTCCTAATGATTACGTTTTTGTTACTGCAAACCCTTCTGGGACTATACCAAACGTTACTGTTACCCTTCCAACTGATGTTGCTGATGGTGATCGCGTTTACGTATCCCTTTATATGATGGGTAAAAACCAGAACTGTACAATCAAAGTTAAAGATGGTACAACGGACAAGATCCGCACCAACAAAAATATGATGCAGTTCCCGCAGCGCAAAGATTATCCGCCTAATGATTGGTTTAGTGTAACTTCATTAGCTTTTAATGCTGGCAGTGATTACCTCCCGTATATCGAGTTTTCTTATCTGAAAGCTACGAAAGAATGGGTTGTTGCTAATTATCGTCCAATCGTTGAACGTGTTGATGCAACTAACAGATCGCGTACTGGTGTTATTGCTCTGGCTGCACAGGCAGAAGTAAACAAAAACTTGGAAGATAACCCTAACGACGAAACCGCTATTACTCCGATGACGTTAGCAAATAAAACTGCTACTGAAACCCGTCGTGGTATTGCGCGTTTAGCGACAACTGCTGAGGTTAACAAACTTTCAACCGATACCTATCTGGATGATGTGATTGTTACTCCTAAGAAGCTGAACGAAAGAACAGCAACTGAAACCCGTCGTGGGCTGGCAGAAATCGCAACTCAGGCAGAAACCAACGGAAGCACCGATGATATTACTATTGTAACCCCTAAAAAGTTGCATAACCGTATTGCATCGCCGACCTTGACTGGTATCCTTGCCCTTGTTGCTACTGGTGGTGCTCCTAACACCAACACGGATCGTTCTCAGGCTGGTACTGGGGTTTATGACCATTCAGATTATCAGAAAGCGGTAACGCCTAAAACTCTTCGTGAGTATAAAGCGACTCAGTTACAATCTGGTGTTGTATGGCTGGCTTCTGAAACCGAAGTTATTAACGGTACTGTTGCAAGTTCAAACGTTCCGACCGTTGTTACTCCGGAAATGCTGCACAAGAAAACCTCTACTGATGGTCGTATTGGTTTGATCGAGATTGCAACTCAGACAGAAACAAACGCTGGTACTGATTACACGCGAGCGGTAACGCCTAAAACGCTTAATGATCGTGCTGCAACGGAAACGCTGACAGGTATCATTGCAATTGCAACCACTGCCGAAGTATCAGCAGGTACTGTAACGAATAAAGCGATCGTACCGTCTAAACTGAAAGGTTATCTGGACGATACAAGCCATATTACTGTTGCTACTGCTGATGGGTTAACTCAATCTGGTACTATCTGGACTACGGTTAACATCGGTATTCAATCAGCAACTGAAACTCAACGTGGTACTTTACGCGTTGCTACACAGTCTGAGACGAACGCAGGGACATTAGATACAGTATTTGTCACCCCTAAGAAGTTACACGCTAAGAAAGCGACTGAGAGCGCAGAAGGTATCATTCAGGTGGCTACGGCTGCTGAAACTACCGCTGGCACCGTTGCAAACAAGGCTGTTTCTCCGAAGAACTTGAAAAATACAATTCAGGTTGATACTTCATGGCAAGCTACCGATCTGGTACGTGGTACTGTGAAGCTGTCTAAGGGGCTTGGTACTTGGTCGGGTAATGATGTGGCTGGTTCTACTCTTCCGGATGATGGTTACGCCTCTGTAGGTGTTGCTGTTTCTCCTTATGAATTGAACCTGACGCTGAAACATTATCTGCCGATCGGTGCTAAAGCGGTTGATGCTGATAAACTGGATAACCTGGATTCTTCCCAGTTCATTCGTCGTGATGTTAATCAGACGGTTAACGGGGCATTGACTCTAACGAAAGCAACCACTGTACAGGCTGACATTAACTCAACAGCCGACGCAAGTTTCCGGGTAATGAATGTTAGTGGAGATCTGAACGTTGGTGATGGTTCATCGATGGGTAAACTTCGTTTGAATGGTGGTTCATCTAACGACTGGTCAATTCAAGCAAGTTCAGCCTCTGGGCGTATTGCAATGATCTCAACGGGTAATACTGGCACAGTTCATCTTTCTGTGTATAATGATACTCGCGGCGTTGTAGCTAACGTTAAATTCCAGGCTCCTGAAATTCAGGCGATTAGCAAAGTGACTCTGGGTAATGATACAGTGATCACTGCTGCTGGTTCTGTTCTCTCTATGGGTACTAAAAATAAGACAACTAAGATCCTGACTTCTGATGCTGGCAACATCGTAGCGGAAGAATCTGCGAACTCTTATAAAGTCTTTACTGAGAAGAACGCGCAAACCTTGCTGAATCCTACGTATGTACGTAAAGCAGGTGATACGATGTCTGGGCGCTTGACGGTAAGCAACAGTTCAATTATCATTGCAGGTCAAGCGGCTTGGTCAACGCTGGATGCAGTAACAGAAGCATCCAGGGGCAACTGGACGGCTGAGATCACAGCATCGGCACAATACAACTTGCTTCCTGGTTATGCGGTTCCGGTTCTTGAACCAGATCCGATTAATCCAGAAATCATGATTGTGACCCGTTATACCTATGTTAAAGCACCGGGTACTTTAACGCAGTTTGGTAACGGAACTGCATTCACTTATCAGATTTGGGCACCTCGTCCGACCTCTGGTACTGGTGTTAATGCGCTGGCGCAATCCTTCTGGATCCGTCAAATGAACCCGATCACGGGTAAATTTGATGAGTGGGGCCGCATGTATACCAGCAACAACCCGCCTACTGCTGGTGAGATTGGTGCAACGTCTGCTGTTGGTACTACGGTTAAAAACATGACTGTTACCGATTGGATCAAAGTTGGTAATGTTAAGATTTACCCAGATCCGGTTACTCAGACAGTTAAATTTGAGTGGGTGGCATAA

Tertiary structure

PDB ID
998561f8dcf7f8b7d6748d6525c73247e1e482e3720b3f7e2b207d1e6b72023c
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5522
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50