GenBank accession
WHL25945.1 [GenBank]
Protein name
tail fiber
RBP type
TSP: evidence RBPdetect, probability 0.90
TF: evidence RBPdetect2, probability 0.72
Protein sequence
MATLKQIQFKRSKTAGQRPAASVLAEGELAINLKDKTIFTKDDSGSVIELGLKYGGTINGSLEVTENITGTLIGNSSTATKLQTPRKINGISFDGSKDITLTPSDINVNSTTFIKNNGELPTDANLDTYGPIEEYLGVWSKSTSTNAQPANKFPEENAVGVLEVFVAGQFAGTQRYTVRSGNVYIRSLSAKWNGVDGPWGVWRNVQASTRPLSQTIDLDSLGELEHCGLWRNSSSAIASFDRHYPEEGSAAQGFLEIFEGGLYTRTQRYTTRMGMVYTRCLAAAWDASAPKWEEWKQVGHGTPATFYDGDLNDFKTPGLYNILGTDAVINCPTGEGLPTVIVGLLEVKQRASGGAIFQRFTTAGTGATTRDRIFERAYTGGAWGAWSEVYTSYSLPITLGMGGIKAQLAELDWQTFDFVPGSMFSVPLNKIKNMPANMDWGTINGNLVMFSVGPSEHTGTGRTVQVWRGTVSQANYRYFVVRIAGNPGSRTNTCCRVVLEDGSHTWTAQQNFRGLLNITAAVNLGANQKISLAPGAYIQAPASGSGSNTYANQNTTIAPLYQAIDDSNKNQFAPIVKQKNTVTNITMASGMDIASSEYRIVAQGDLSATGTTATELATWRFLPSGRFMSQSRVYAGAAFLNTDGNIAGSIWKKYNDATNLDAALNTRLGKGGDTMTGRLTINAPNDSIVLSTTASNSLHIRGDIDGTGNWYIGKGGADNSLAFYSYASQAAVHITNNGEIALNPQNTAMVNVNRDRVHINGSGWIARQPGDWGNQWRVEAPLFVDHGYVGQDSYYPILKARSVITNQGYSTAVDFGMRRIPSQWGQAIIRVGSTEASPDAGHPQAVFEFHHDGFFYTPGNGSFSDVYIRSDSRLKINKEELEYGAVEKVCRLKVYTYDKVKSIKDRSVIKREVGIIAQDLEKELPEAVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELKTPLFTKIARKISKYFKF
Physico-chemical properties
protein length: 982 AA
molecular weight: 106848.54660 Da
isoelectric point: 7.58752
aromaticity: 0.09369
hydropathy: -0.30652
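
The aromaticity value listed above is consistent with the common definition of aromaticity as the fraction of phenylalanine (F), tryptophan (W), and tyrosine (Y) residues in the sequence. The record does not state which method was used, so the definition and the function name `aromaticity` below are assumptions; this is a minimal sketch, not the database's own implementation.

```python
def aromaticity(seq: str) -> float:
    """Return the fraction of residues that are F, W, or Y.

    Assumed definition: relative frequency of aromatic residues.
    The record itself does not say how its value was computed.
    """
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)


# Usage on the first residues of the protein sequence in this record.
print(round(aromaticity("MATLKQIQFKRSKTAGQ"), 5))
```

Passing the full protein sequence from this record to `aromaticity` would allow a direct comparison with the reported value.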

Taxonomy

Phage: Shigella phage S2_01 [NCBI]
Taxonomy ID: 2972506
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: No host information

Coding sequence (CDS)

GenBank protein accession
WHL25945.1 [NCBI]
GenBank nucleotide accession
OP137232.1 [NCBI]
CDS location
range 65993 -> 68941
strand -
CDS
ATGGCTACTTTAAAACAAATACAATTTAAAAGAAGCAAAACTGCAGGTCAACGTCCTGCTGCTTCAGTATTAGCCGAAGGTGAATTGGCTATAAACTTAAAAGATAAAACAATTTTCACAAAAGATGACTCAGGTAGTGTTATAGAATTAGGTTTAAAATATGGAGGAACTATAAATGGGTCTTTAGAGGTTACAGAAAATATAACTGGAACTTTAATTGGAAATTCTAGCACAGCTACTAAATTGCAAACACCTAGGAAAATTAATGGTATATCTTTTGATGGGTCAAAGGACATTACCTTAACTCCATCTGACATAAATGTCAATAGCACAACATTTATAAAAAATAATGGCGAATTACCTACTGATGCTAATTTAGATACGTATGGGCCCATTGAAGAATATCTTGGTGTTTGGTCGAAATCTACTTCAACAAATGCGCAACCAGCAAATAAATTCCCAGAAGAAAATGCAGTAGGTGTACTAGAAGTGTTTGTGGCCGGCCAATTTGCTGGCACTCAGCGTTATACTGTAAGATCTGGTAACGTCTATATTCGTTCCTTATCTGCTAAATGGAATGGCGTCGATGGTCCATGGGGTGTGTGGCGTAATGTTCAAGCGTCAACTCGTCCACTTTCACAAACGATTGACCTTGATAGCTTGGGAGAATTAGAACATTGTGGCTTATGGCGAAACAGTTCAAGCGCAATCGCATCATTTGATCGCCATTATCCAGAAGAAGGATCAGCCGCACAAGGATTTTTAGAAATATTTGAAGGTGGTTTATACACGAGAACGCAGCGATATACTACCCGCATGGGTATGGTTTATACTCGTTGTCTCGCTGCTGCATGGGATGCTAGTGCACCTAAGTGGGAGGAATGGAAGCAGGTTGGTCATGGCACACCAGCGACTTTCTATGATGGAGATCTGAATGATTTTAAAACTCCCGGGTTATATAATATTTTAGGCACTGATGCCGTTATTAACTGTCCTACAGGTGAAGGTTTGCCGACTGTTATTGTTGGTTTGCTGGAAGTTAAACAGCGTGCTTCTGGCGGTGCTATTTTCCAACGTTTCACTACCGCAGGAACGGGTGCAACTACTCGCGATCGTATTTTTGAGCGTGCATATACTGGTGGTGCGTGGGGTGCATGGAGCGAAGTATATACATCTTATTCTCTGCCAATTACTTTGGGTATGGGTGGTATTAAAGCTCAATTAGCGGAGTTAGATTGGCAAACCTTTGATTTTGTTCCTGGTAGTATGTTTAGCGTTCCTTTGAACAAAATAAAGAACATGCCAGCAAATATGGATTGGGGGACGATTAACGGAAACTTGGTTATGTTTTCCGTTGGTCCTAGCGAACACACTGGCACAGGGCGTACTGTTCAGGTTTGGCGCGGTACTGTATCTCAGGCGAACTATCGTTATTTCGTTGTTCGTATCGCTGGTAATCCAGGAAGTAGGACTAATACTTGTTGTCGTGTTGTTCTTGAAGACGGATCACATACTTGGACTGCTCAACAAAACTTTAGGGGATTGCTGAATATCACTGCTGCTGTTAATCTTGGTGCTAATCAGAAAATTTCACTTGCTCCAGGAGCATATATTCAAGCCCCTGCTAGCGGTTCTGGTTCTAATACTTACGCAAATCAGAATACTACCATTGCGCCATTATATCAGGCTATTGACGATTCAAATAAAAACCAGTTTGCGCCAATTGTTAAACAGAAAAACACTGTAACAAATATTACTATGGCTTCTGGTATGGATATTGCTAGTTCAGAATATCGTATCGTTGCTCAGGGTGATTTATCCGCTACTGGAACTACAGCCACTGAATTAGCTACATGGCGTTTCTTGCCGTCTGGCCGATTCATGTCACAAAGCCGAGTTTATGCTGGCGCAGCATTCTTGAACACTGATGGTAACATTGCTGGTTCAATCTGGAAGAAATACAACGATGCAACCAATTTAGATGCTGCCTTGAATACTCG
CCTAGGTAAAGGCGGTGATACGATGACAGGTCGGTTAACAATCAATGCACCTAATGATTCTATTGTATTATCAACAACTGCTAGTAATTCTTTGCATATTCGCGGTGACATAGACGGGACTGGTAACTGGTATATTGGCAAGGGTGGTGCTGATAATTCGCTAGCATTTTATAGCTATGCTTCTCAGGCGGCAGTACATATCACAAACAATGGTGAGATTGCGTTAAACCCGCAAAATACCGCAATGGTTAACGTTAACCGTGACCGTGTACACATTAACGGTTCTGGATGGATTGCTAGACAACCGGGTGATTGGGGCAACCAATGGCGAGTAGAAGCTCCATTATTCGTTGATCATGGTTATGTTGGTCAAGATAGTTATTATCCTATTCTTAAAGCAAGAAGCGTTATAACCAATCAAGGATATAGCACCGCTGTTGACTTTGGTATGCGTCGTATTCCATCACAGTGGGGGCAAGCAATCATTCGTGTCGGATCCACGGAGGCTTCTCCTGATGCTGGACACCCACAAGCTGTGTTTGAATTCCATCATGATGGATTCTTTTATACACCAGGAAATGGTAGCTTTAGCGATGTGTATATTCGTTCTGACTCCCGTCTCAAGATTAATAAAGAAGAATTAGAATATGGAGCAGTCGAAAAAGTTTGCCGACTGAAAGTTTATACATACGATAAGGTTAAGTCTATTAAAGACCGTAGTGTTATTAAACGTGAAGTTGGTATTATTGCTCAGGACCTTGAAAAGGAATTACCGGAAGCTGTATCTAAAGTTGAAGTTGATGGATCTGATGTTCTGACAATTTCTAACTCTGCTGTGAATGCTCTTTTAATTAAGGCCATTCAGGAAATGAGTGAAGAAATTAAAGAATTGAAAACGCCTCTCTTTACTAAAATTGCTCGCAAAATTAGTAAATATTTTAAATTCTAA
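
The CDS coordinates and the protein length reported in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the usual GenBank convention) and a CDS that ends with a single stop codon; the function name `cds_protein_length` is hypothetical.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Protein length (in amino acids) implied by CDS coordinates.

    Assumes 1-based, inclusive coordinates and, when has_stop is True,
    that the last codon is a stop codon that is not translated.
    The strand does not affect the length.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError(f"CDS length {length_nt} is not a multiple of 3")
    codons = length_nt // 3
    return codons - 1 if has_stop else codons


# Range 65993 -> 68941 spans 2949 nt, i.e. 983 codons including the stop.
print(cds_protein_length(65993, 68941))  # 982
```

The result agrees with the protein length of 982 AA given under the physico-chemical properties.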

Tertiary structure

PDB ID
eec46a8747572f88b8ab112cc7f6bcdcc370dce3f350a2772785b7e8fbde282d
Source: ESMFold
Method: ESMFold
Resolution: 0.6579
Oligomeric state: monomer
Model confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
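
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch under stated assumptions: the legend uses strict inequalities and leaves the exact values 90, 70, and 50 unassigned, so the code places them in the lower band; the function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the
    lower band, because the legend does not assign them.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Usage with a few illustrative scores (not taken from this model).
for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```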