GenBank accession
XSD86093.1 [GenBank]
Protein name
central tail fiber J
RBP type  Evidence    Probability
TF        Phold       1.00
TSP       RBPdetect   0.84
TF        RBPdetect2  0.97
Protein sequence
MAKNMITGSKGGSSKPHTPKELEDNLISINKIKILLAVSDGECDPDFSLKNLYLDDVVVQNEDGSFNYENVKAEFRPGTQDQDYIQGFTDTASEVTVARDLTTKTPFSISVTNKNLSAIRIKILMPRGVTSEDDGDLVGVRVEYAVDMAVDGGSFNQVLSDVIDGKTTSGYDRSRRIDLPSFNQQVILRVRRVTPDSTTAKVTDLIRLQSYAEVVDAKFRYPLTGLVYVEFDSELFPNQIPRISTKKRWKIINVPSNYDPIDRTYSGSWNGTFKKAWSNNPAWVLYDIITSQRYGLDQRELGIPIDKWSLYDVGRYCDQMVPDGKGGQEPRYLCDVVIQSQVEAFQLVTDICSIFRGMTFWNGESLSIVVDKPREPSYIFTNDNVVNGEFSYTFASEKSMYTSCNVTFDDEQNMYQQDVEPVFDTEAALRFGHNPTSITAIGCTRRSEANRRGRWILKTNLRSTTVNFATGLEGMIPTIGDVVAISDNFWSSNLTLNLSGRVMEVSGLQVFVPFKVDARAGDFIMINKPDGSPVKRTISRVSGDGKTLELNVGFGFDVKPDTVFAIERTDIALQKYVVTEITKGDSDEEFTYRVTAVQYDPNKYDEIDYGVNIDDRPTSIVDPDRLQAPKNVRLSSYSRVIQGVSVETMHVSWDKVEYASMYEMQWRKGNGNWHNTPQTANKETEVEGIYAGNYSVRVRAVSAGGSASPWSAIVNATLTGKVGEPGAPINLTASNDEVFGIRIKWGMPEGTEDTAYIELQQSETGKAESATLLSLVPYPQREYWHSILPAGYTNYYRVRSVDKIGNVSAWTNFVKGQSSIDLDDIVGDILDDILKSEGLKDLIEGAIDQSEKIQDAVESAKDSANKIKNQAQAIIENALATDTNLRWTRVENGKRKAEIGESFEMIATETEARIEAINKLRTEFDAGISAEITKVTKLISDETQARATQVNELKTEFTTEIGKTNAKVQQAQDAVTTETEARAQAIQNLDAKLTKQINDAKVELNANISRVDQAITDEAGARAEAVESLKAQYKKDISEAIKGAETEFNASIDRVNQAIANEEEARAQAVTSLDAKLTKQIGDTKKELGASIDRVDRAVTTEQQARAEAISGLDAKMTKLVGDTKTEINANVSRVDQAVASEAEARANADSALSTRIGDTQAALTQKMDSWVNAEQAGVMYGVNLGLKYKGKEYKAGMNLMLVGEGDNAKSQFLFSADRFAIIPSLSRGDLKTLPFVVENDQVFMQSTLIKDGTITNAKIGNEIRSNNFVDGSQGWRVGKDGSSQFNNVIVRGAVYANDGYFNGTVYANRIEGDVMIAESDTIPLKSVERFGSGDYEIFRINGENFDRQIDTNLIVWCSCSQRNHFRLIVQTPGKGDVEYYYLDTGNEGGGRAFALRGFFIPAAGRGQQNRIIVRVQESRDSTIKTYTPWVEREMGQKYNDVPNSSINNRNFIREKTYIAAYRAGRRIIA
Physico-chemical properties
protein length: 1470 AA
molecular weight: 163148.48650 Da
isoelectric point: 5.02908
aromaticity: 0.08231
hydropathy: -0.44415
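
The length, aromaticity, and hydropathy values above can in principle be recomputed from the protein sequence. The sketch below assumes that aromaticity is the fraction of F, W, and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY); the record does not state which definitions were used, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the record does not name
# the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp, or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence in this record, used only
# as a short demonstration input.
prefix = "MAKNMITGSK"
print(len(prefix), aromaticity(prefix), round(gravy(prefix), 3))
```

Passing the full 1470-residue sequence to the same functions would give values comparable with the ones listed, provided the assumed definitions match those used for the record.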

Domains

XSD86093.1 (1–1470 aa)
ATT 92–217 · ATT 341–495 · STR 627–722 · STR 724–816 · RBD 1163–1300

Legend: ATT Attachment Domain · STR Structural Domain · RBD Receptor-Binding Domain · CBM Carbohydrate-Binding Module · LEC Lectin-like Domain · ENZ Enzymatic Domain · CHP Intramolecular Chaperone · LNK Linker/Spacer Domain · TAS Tail-Associated Structural · TTP Tail Tubular Protein · UNK Uncharacterized Domain · Unmapped
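
The domain annotation above can be turned into structured data to see how much of the protein is covered. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_domains` and the constant names are hypothetical.

```python
import re

# Domain line as listed in this record (three-letter code, start, end).
DOMAIN_LINE = "ATT 92–217 · ATT 341–495 · STR 627–722 · STR 724–816 · RBD 1163–1300"
PROTEIN_LENGTH = 1470


def parse_domains(line: str) -> list[tuple[str, int, int]]:
    """Extract (code, start, end) tuples; accepts an en dash or a hyphen."""
    pattern = re.compile(r"([A-Z]{3})\s+(\d+)[–-](\d+)")
    return [(code, int(start), int(end)) for code, start, end in pattern.findall(line)]


domains = parse_domains(DOMAIN_LINE)

# Assumption: coordinates are 1-based inclusive, so length = end - start + 1.
annotated = sum(end - start + 1 for _, start, end in domains)

for code, start, end in domains:
    print(f"{code}\t{start}\t{end}\t{end - start + 1}")
print(f"annotated residues: {annotated} of {PROTEIN_LENGTH}")
```

Under that assumption, the five annotated domains cover 608 of the 1470 residues; the rest corresponds to the "Unmapped" category in the legend.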

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

XSD86093.1 (1–1470 aa)
Domain          Start  End   Length (AA)  Confidence
N-terminal      1      1411  1411         0.8818
Central domain  1412   1459  49           0.5507
C-terminal      1460   1470  10           0.6606


Taxonomy

Phage
Klebsiella phage vB_Kpl_K50_evo13 [NCBI] · taxon 3412884

Coding sequence (CDS)

GenBank protein accession
XSD86093.1 [NCBI]
GenBank nucleotide accession
PQ569652.1 [NCBI]
CDS location
range 16651 -> 21063
strand +
CDS
ATGGCAAAAAATATGATAACCGGGAGTAAAGGCGGTTCCTCAAAGCCTCATACTCCAAAAGAGTTGGAGGATAACCTGATTTCAATTAACAAAATCAAGATCTTGCTCGCGGTTTCAGATGGCGAGTGCGACCCTGATTTTTCGTTAAAGAATCTATATCTTGACGACGTTGTTGTTCAGAACGAGGACGGCTCATTTAACTATGAGAATGTCAAGGCTGAATTTAGACCAGGCACGCAGGATCAGGACTACATCCAAGGCTTTACTGACACTGCAAGCGAAGTGACCGTTGCGCGAGACCTGACAACAAAGACGCCGTTTAGCATTTCAGTAACGAACAAAAACCTGTCCGCGATCCGCATCAAGATTCTGATGCCGCGAGGCGTTACCAGTGAAGATGATGGCGATCTGGTTGGTGTTCGCGTTGAGTATGCGGTTGATATGGCGGTTGATGGCGGATCGTTTAATCAAGTGTTAAGCGACGTTATCGACGGTAAGACGACAAGCGGTTATGACCGCAGCCGTCGAATCGACTTACCAAGCTTTAACCAGCAGGTAATTTTACGAGTTCGTCGTGTTACTCCTGACAGCACAACAGCGAAGGTTACGGATCTAATCCGATTGCAGAGCTACGCCGAAGTCGTCGATGCGAAGTTCCGTTATCCTCTGACTGGTCTTGTTTATGTTGAGTTCGACTCTGAATTGTTCCCAAACCAGATCCCTCGAATCTCAACAAAGAAACGGTGGAAGATAATCAACGTCCCAAGCAACTATGATCCTATTGACAGGACTTATAGCGGATCTTGGAACGGTACGTTTAAAAAGGCGTGGAGCAATAACCCTGCATGGGTTCTTTATGACATTATCACCAGTCAGCGCTATGGATTAGATCAGCGTGAGTTGGGGATCCCTATCGACAAATGGAGCCTTTACGATGTAGGGCGCTATTGCGATCAGATGGTTCCAGATGGAAAGGGCGGGCAGGAGCCGCGCTATCTGTGCGATGTTGTGATTCAAAGCCAGGTTGAGGCGTTCCAGCTTGTGACTGACATTTGCTCAATCTTCCGGGGAATGACTTTCTGGAATGGCGAAAGCCTTTCGATTGTGGTTGATAAGCCGCGCGAGCCGTCTTACATCTTCACAAACGACAACGTTGTTAACGGCGAGTTTTCTTACACGTTCGCCAGCGAAAAGAGCATGTACACATCATGTAACGTGACCTTTGACGATGAGCAGAACATGTATCAACAAGACGTTGAGCCAGTTTTCGACACGGAGGCCGCTTTACGGTTTGGACACAATCCAACAAGCATAACCGCTATCGGGTGCACACGACGCAGCGAGGCAAACCGCCGGGGCCGCTGGATCCTGAAAACCAACTTGCGCAGCACAACGGTTAACTTCGCAACCGGACTTGAAGGTATGATCCCGACAATCGGCGACGTTGTGGCAATCTCGGACAACTTCTGGAGCAGCAACCTAACTCTCAACCTATCCGGTCGAGTGATGGAGGTTAGCGGTCTGCAAGTTTTCGTGCCGTTTAAGGTTGACGCACGCGCTGGTGACTTCATCATGATCAACAAGCCTGACGGCTCGCCAGTTAAGCGCACCATATCGCGCGTAAGCGGCGACGGTAAGACTTTGGAGCTTAACGTAGGGTTTGGCTTTGACGTAAAACCGGATACCGTATTTGCGATTGAGCGCACTGATATAGCGCTTCAAAAATACGTTGTAACGGAGATAACGAAAGGCGATAGCGATGAGGAGTTTACCTATCGTGTAACGGCTGTTCAGTATGACCCGAACAAGTACGATGAGATCGACTATGGCGTAAACATTGATGATCGACCAACGTCGATTGTTGACCCTGACAGACTGCAAGCGCCTAAAAACGTTCGCTTGTCGTCTTACTCTCGCGTTATCCAGGGCGTAAGCGTGGAAACAATGCACGTTAGCTGGGATAAGGTCGAATATGCGTCAATGTACGAAATGCAGTGGCG
AAAAGGCAATGGCAACTGGCACAACACGCCGCAGACGGCGAACAAAGAAACGGAAGTTGAAGGCATTTATGCAGGCAACTACTCAGTGCGAGTTCGCGCAGTATCCGCGGGCGGGAGTGCGTCTCCGTGGTCAGCGATTGTTAACGCTACTCTGACCGGGAAAGTTGGCGAGCCTGGCGCGCCTATTAACCTGACTGCATCTAATGATGAAGTTTTTGGCATTAGAATCAAATGGGGTATGCCGGAAGGCACGGAGGACACCGCATACATTGAGTTGCAGCAATCTGAAACCGGAAAGGCTGAATCAGCAACTTTGTTGAGTTTGGTTCCGTATCCGCAGCGCGAATACTGGCACAGCATTTTGCCTGCTGGTTATACCAACTACTATCGCGTAAGGTCGGTGGATAAGATCGGCAACGTGTCGGCCTGGACTAACTTTGTTAAAGGTCAATCCTCAATCGATCTTGACGATATCGTTGGCGACATTTTAGACGACATTCTTAAAAGCGAAGGCTTAAAAGATTTGATCGAGGGAGCCATTGATCAATCAGAAAAGATTCAAGATGCGGTTGAATCAGCCAAAGACTCGGCCAACAAGATTAAGAATCAGGCGCAAGCGATTATCGAAAACGCTCTTGCAACCGATACTAATCTGCGATGGACACGAGTAGAAAACGGCAAGCGAAAAGCCGAGATCGGGGAGTCATTCGAAATGATAGCCACCGAGACGGAAGCCAGGATTGAAGCCATAAACAAGCTTAGGACTGAATTCGACGCCGGAATCAGCGCGGAGATAACCAAAGTAACGAAGCTTATCTCTGATGAGACTCAGGCGCGAGCGACTCAGGTTAACGAGCTTAAAACAGAGTTCACGACCGAGATCGGAAAGACCAACGCCAAAGTGCAGCAGGCTCAGGATGCGGTAACAACCGAGACGGAAGCAAGGGCGCAAGCAATCCAGAATCTTGACGCGAAGCTAACCAAGCAGATTAATGATGCGAAAGTTGAGCTAAACGCCAATATCAGCAGGGTTGACCAGGCGATTACGGATGAAGCCGGGGCGCGAGCGGAGGCGGTAGAATCGCTTAAAGCTCAGTATAAAAAGGACATTAGCGAAGCCATAAAAGGCGCGGAGACTGAATTTAACGCGAGCATTGATAGAGTCAACCAAGCCATAGCCAACGAGGAAGAGGCGCGTGCTCAGGCCGTAACAAGCCTTGACGCGAAGCTAACGAAACAGATTGGCGACACGAAAAAAGAGCTTGGTGCATCAATCGACCGGGTAGATCGGGCAGTCACTACCGAGCAGCAAGCGCGAGCGGAAGCGATTTCTGGTCTTGACGCGAAAATGACAAAACTTGTCGGAGATACAAAGACTGAAATTAACGCTAATGTCAGTCGAGTTGATCAGGCGGTTGCAAGTGAGGCGGAGGCGAGAGCTAACGCTGACTCGGCGTTAAGCACAAGGATCGGAGATACTCAGGCTGCATTAACTCAGAAAATGGATTCATGGGTTAACGCGGAGCAAGCTGGCGTTATGTACGGCGTAAACCTTGGCCTCAAGTATAAGGGCAAGGAGTACAAGGCCGGGATGAACTTAATGCTTGTTGGTGAGGGTGACAACGCAAAATCTCAGTTCCTGTTTAGCGCTGATCGGTTTGCTATCATTCCATCGTTAAGCCGTGGCGATCTTAAAACGCTGCCGTTTGTTGTTGAAAATGATCAGGTGTTCATGCAATCGACGCTGATTAAAGACGGCACGATCACAAACGCGAAGATTGGAAATGAGATCCGCTCAAACAATTTTGTTGATGGTTCTCAAGGCTGGCGTGTTGGCAAGGATGGCAGTTCGCAGTTCAATAACGTTATCGTTCGCGGAGCGGTTTACGCTAATGATGGCTACTTTAACGGCACCGTTTACGCTAACCGCATCGAGGGCGACGTGATGATCGCAGAGTCTGACACGATACCGCTCAAGAGCGTTGAGCGCTTTGGAAGCG
GCGACTATGAGATTTTCAGAATTAACGGGGAGAACTTTGATCGACAAATCGACACAAACCTTATCGTCTGGTGTTCTTGCTCTCAGCGAAACCACTTCCGCTTGATTGTGCAAACTCCAGGTAAGGGCGATGTTGAATATTATTATCTTGACACTGGTAACGAAGGTGGTGGTAGGGCGTTTGCTTTGCGCGGATTCTTCATTCCGGCAGCGGGCAGAGGGCAGCAAAACAGGATCATTGTTAGGGTTCAGGAAAGCCGAGACTCAACAATCAAGACTTACACACCTTGGGTTGAAAGGGAAATGGGCCAGAAGTACAACGATGTTCCTAACTCCTCCATTAACAACCGAAACTTCATTCGTGAAAAAACCTATATTGCAGCATACCGAGCAGGTCGCCGCATTATAGCGTAA
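
The CDS coordinates and the protein length can be cross-checked with a little arithmetic, and the nucleotide sequence can be translated to confirm that it encodes the protein listed above. The sketch below assumes the range is 1-based and inclusive and uses the standard codon assignments (shared by the bacterial translation table 11); the function names are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive range (assumed convention)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


nt = cds_length(16651, 21063)
codons = nt // 3
print(nt, codons, codons - 1)        # 4413 1471 1470 (last codon is the stop)

print(translate("ATGGCAAAAAATATG"))  # MAKNM, the first five residues
print(translate("ATTATAGCGTAA"))     # IIA*, the last three residues plus stop
```

The 4413 nucleotides correspond to 1471 codons, that is 1470 amino acids plus the terminal TAA stop codon, which agrees with the protein length given in the record.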


Tertiary structure

XSD86093.1
ESMFold structure
Source ESMFold
pLDDT 67.5
Oligomeric state monomer