Genbank accession
XPK41561.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,90
TF
Evidence Phold
Probability 1,00
Protein sequence
MALYPIKSLGAVGVIADQAPTDLAPNAFTNAINARFVEQRVFKTGGNAPLSYVDEDKDLTPLSFVSMPFDYYSAGNSFLVVGTNKKLYKLTDESLTDISRKVATVTKKASASIKIYPVVSQIVPKESTISMNFNQTKNLEVSLLPADANNTDLVWEVSNSTYGSIAVDPNDSKLATLTSFEREGNLVVTISTADGSVVAQIAVNIIDGDSGIFLSQDTVTIRKGGTTTLTAVTGKTPVTWSSSNASIVSVTPNANSLTAVITANGEGNVTITADNGTKTASCEVVSIPQIDSISLSQSDVIVSRGSQYILTATLSPADAPNQNITWTSSNPNIATVSGTSTQGTINALLAGFTEITATTEEGNRVAVCTVRVDLAGRAMRTSAMAFAAPASEPIESQEEEVVTSPESEETVYFADPVSGIDTSGMYEGNNFYDYSNVNDIEGFARASLLETPLSSVTLDVVSASLDVGEEIVITATASPEGEYSYQWSVDKTGYVSTTSVTGKSIKLVALRKGEINVTCTVSQMTQKDYDAFDDYPWYHAVISNCAVATTHYETPQVKEFESEYFVDLPGWGEQTVVDSDGNPSVKKFNWKCERVRSFNNRLFALNMREANASGVTTNYPLRLRWSNFANENKAPTLWDDFAYDRVVSSDLASNIVGQTQALENGYAGYIDLADSNGSLIDILPLKDYLFVYTEFETYIGSPTNNTYQPLMFKKLFNDSGILAPECVVEVEGSHFVVTQNDVILHNGATKKSIASNRVKNMLINEICLVNPLATRVHLHQDKKEVWILYVGPGEPKESFACTKAAVWNYEFDTWSFRTIPYAQCIGLVDPPVLERGPIWSDFQEITWDDQSIKELVWRKDATNFRQRVTIVGSFLRGFYQVDVGALDYFYDRLNDKIIEKPLEMRLERTGIDFDNITNEWNQKHINRFRPQTTGSGTYTFEAGGSQFSNEYGHPHTSKTYTIGVDRHVSVRLNHPYLFYNVIDNDVNSNAAINGLTIEFAVGGRR
Physico‐chemical
properties
protein length:1005 AA
molecular weight: 110265,71350 Da
isoelectric point:4,72948
aromaticity:0,09552
hydropathy:-0,20915

Domains

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage JacquesWildberger_Bas92
[NCBI]
3398418 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XPK41561.1 [NCBI]
Genbank nucleotide accession
PQ850610.1 [NCBI]
CDS location
range 14202 -> 17219
strand +
CDS
ATGGCGCTGTATCCAATAAAGTCACTAGGTGCTGTAGGCGTTATCGCTGATCAGGCTCCGACAGACTTAGCTCCTAATGCTTTCACTAATGCTATAAATGCTCGATTTGTTGAGCAGAGAGTGTTTAAGACGGGGGGCAATGCCCCTCTTTCTTATGTAGATGAAGATAAGGATTTAACCCCTCTGTCTTTCGTGTCTATGCCTTTCGATTATTATAGCGCAGGTAATAGCTTTCTTGTTGTAGGTACGAATAAGAAGTTATATAAACTGACAGATGAAAGCTTGACTGACATTAGCCGTAAGGTTGCTACAGTCACTAAGAAGGCTTCTGCCTCAATTAAGATCTATCCAGTTGTTTCTCAAATCGTTCCTAAAGAATCAACTATTTCAATGAACTTCAATCAGACTAAGAATCTAGAAGTTTCTCTTCTTCCTGCTGATGCTAACAATACCGACCTTGTTTGGGAGGTTAGTAATTCAACCTATGGAAGTATTGCAGTAGACCCTAATGATTCTAAACTTGCTACTCTAACATCTTTTGAGAGAGAGGGGAATCTTGTAGTAACCATCTCTACTGCTGATGGTTCTGTAGTGGCTCAGATTGCTGTGAACATTATAGATGGTGATTCGGGAATCTTCTTGAGTCAGGACACTGTTACTATCCGTAAAGGAGGGACTACCACTCTTACGGCTGTTACTGGTAAAACCCCTGTTACTTGGTCTAGCAGTAATGCTTCTATTGTATCCGTAACCCCTAACGCTAATTCTCTAACTGCTGTTATTACTGCTAATGGTGAGGGCAACGTAACAATCACTGCCGATAATGGAACGAAGACTGCTTCTTGTGAGGTTGTTTCTATACCTCAGATTGATAGCATCTCTTTAAGCCAGTCGGATGTTATTGTAAGCAGAGGTTCTCAGTACATCTTAACTGCTACCCTTTCTCCTGCTGATGCCCCTAATCAAAACATTACTTGGACCTCTTCTAATCCAAATATTGCAACAGTATCAGGGACTAGTACACAAGGGACGATCAATGCCTTACTTGCTGGATTTACTGAGATTACGGCTACCACTGAAGAAGGTAACAGAGTTGCTGTCTGTACTGTACGAGTAGATTTAGCTGGAAGGGCGATGAGAACAAGTGCTATGGCATTTGCTGCACCTGCATCAGAACCAATTGAATCACAAGAGGAAGAAGTAGTAACTTCTCCTGAAAGTGAAGAGACGGTTTATTTTGCTGATCCTGTGTCTGGTATCGATACGTCAGGGATGTATGAAGGTAATAACTTCTACGACTACTCTAACGTGAACGACATTGAAGGTTTTGCAAGAGCTTCTTTACTAGAAACCCCTTTGTCATCCGTAACCTTGGATGTTGTCAGTGCTTCTCTTGATGTTGGTGAGGAAATAGTTATTACAGCTACAGCTTCTCCAGAAGGTGAGTATTCCTATCAGTGGTCTGTCGATAAGACTGGTTATGTTTCTACAACTTCAGTTACTGGTAAATCTATCAAACTGGTTGCTCTTCGTAAAGGAGAGATTAACGTAACATGTACTGTCAGTCAGATGACTCAGAAAGATTACGATGCTTTTGATGACTACCCTTGGTATCACGCTGTTATCTCTAACTGTGCAGTTGCTACAACTCACTATGAAACTCCTCAGGTAAAAGAGTTCGAATCAGAATACTTTGTAGACCTTCCGGGGTGGGGTGAGCAAACAGTTGTTGATAGTGATGGAAACCCTTCAGTCAAAAAGTTTAACTGGAAATGTGAGAGAGTAAGATCTTTTAACAACCGTCTTTTTGCTCTGAATATGAGAGAGGCTAATGCTTCTGGTGTTACCACTAACTACCCTCTTCGTCTTCGTTGGTCTAATTTTGCCAACGAGAACAAAGCTCCTACTTTATGGGATGACTTTGCCTATGATCGAGTTGTGTCTTCGGACTTGGCTTCTAACATCGTAGGACAGACTCAGGCTTTAGAAAATGGATATGCTGGTTACATCGACTTAGCTGACTCTAACGGCAGCTTGATTGATATCCTACCTCTTAAAGATTACTTGTTCGTTTATACAGAGTTTGAAACGTACATTGGTTCTCCAACTAATAACACGTACCAACCTCTGATGTTCAAGAAACTCTTTAACGATTCCGGTATCCTTGCTCCTGAGTGTGTAGTAGAAGTTGAAGGTAGTCATTTCGTAGTTACTCAGAACGATGTAATCTTACACAATGGTGCAACTAAGAAGTCAATCGCTTCTAACCGTGTTAAGAATATGCTAATTAATGAGATATGTCTGGTTAACCCTTTAGCTACTCGAGTACATTTGCATCAGGATAAGAAAGAAGTTTGGATTCTGTACGTAGGTCCGGGAGAACCAAAAGAAAGCTTCGCTTGTACTAAAGCTGCTGTATGGAACTACGAGTTTGATACTTGGTCCTTCCGTACTATTCCTTATGCTCAGTGTATCGGTCTAGTTGACCCTCCTGTTTTAGAGAGAGGTCCAATCTGGTCTGACTTCCAAGAGATCACTTGGGATGACCAATCTATCAAAGAACTCGTTTGGAGAAAGGATGCAACAAACTTTAGACAGAGGGTTACGATAGTAGGTTCGTTCTTGAGGGGGTTCTATCAAGTGGATGTAGGTGCTCTTGATTATTTCTATGATAGGCTTAATGATAAGATTATAGAAAAACCACTGGAGATGAGGTTAGAGAGAACTGGCATTGATTTTGATAACATCACTAATGAGTGGAATCAAAAGCATATCAATAGGTTCAGACCTCAGACGACAGGTTCTGGTACGTATACTTTTGAGGCGGGTGGTAGTCAATTCTCTAACGAGTATGGTCATCCACATACATCCAAGACGTATACGATTGGTGTTGATAGGCATGTCTCGGTGAGACTGAACCATCCATACCTTTTCTATAATGTTATAGATAATGATGTTAACAGCAATGCTGCTATCAATGGATTAACGATAGAGTTTGCTGTTGGCGGGCGGAGATAA

Tertiary structure

PDB ID
7f22096f31e8f59f7cb1211645b67b751cac8ec8460bff30cc55fb064f0d31e6
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7768
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequences of Escherichia coli phages Huey, Dewey, and Louie Maffei,E., Willi,L. and Harms,A. GenBank