GenBank accession
XFD07155.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1.00
Protein sequence
MASGSFNTSTSNQYVQGTVTWGSTPNTGGNYSDVWVEWRFSRTNSGYETYGNGTFGIYVDGQQSVNTLRFSFTQNSRTLVVSGNFRVNHNSDGTKNLRIGVSGYTDVISINEGVVYVDLDRIPRASSISSNISWTAAIEGLPLSISRASGSFTHSLTLQIKNNVNNNWVSVASRYNIGDYTTIYFDKNEMTIIYREMAQWENAEVWIKLDTFNGGTYIGSSEKYGRVYGATPATPVVSDFNIGTKSVDVTLDYFYDTFNYTLEFTFGSFKKTFPNMVKFNKMEFTDAEVIQMYQQVPNQQSAQANVYASTKYNGIELNDNVPKDQNKKITLRVVNSEPQYDGGFTYLDSNSTTATLTGNNQYIIQSKSNLQVKLPVAKKAKPTNYSTITRYEVAVNGAVKSVNFSDTADLTVDFGTVNVSTNTSIVVSAVDSRGLKKSVSSVILVLPYSLPTYSFSADRVNNFETTTKLNVTGSASPLNVSNVNKNRIVSAKYKTKPVGGAYGSESDLPITGTFPAFISNNASVELNNTQAWEVSLTITDVLGSVTTVSTVAVGTPILFIDTKKKSIGVNKFPTGTKTFEVAGDWAIDGAITLQSNQWFSQGKYALHANNSDFMGVNTIYFSSPVNTQYQGLNFLRPGKTAGSMNINDYSTFGLLDYSMRMNNQPIFYQFAGTSNLRLAGELHSAYTNGSYMDVYGNIKGQTSAQGGETWGVVDSQNRTKFVVPVGKNGGNNSYKSYGGNHRFEKDDKFIVEFYSDGVNNCANFGGGIFKWQSNQNRFELRNCNDTGWGSIALDTLDATTVKTVNYVNTSSRELKTEIQPLEEDALQIILDSEVCSYMMKANPELGRRVGLIAEDSHELVQELGGKGVNGYTMNSLSWRAIQQLDAKINAILARQYKTL
Physico-chemical properties
protein length: 899 AA
molecular weight: 99000.88280 Da
isoelectric point: 6.65456
aromaticity: 0.11235
hydropathy: -0.35929
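The record does not state how these values were computed. As a minimal sketch, the length, aromaticity, and hydropathy figures can be approximated from the protein sequence under two assumptions that are not confirmed by the source: aromaticity is taken as the fraction of F, W, and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). The function names below are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: aromatic residues are Phe, Trp, and Tyr.
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y."""
    return sum(1 for aa in seq if aa in AROMATIC) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect the sequence-derived properties in one dictionary."""
    seq = "".join(seq.split()).upper()  # tolerate line breaks in pasted text
    return {
        "length": len(seq),
        "aromaticity": round(aromaticity(seq), 5),
        "hydropathy": round(gravy(seq), 5),
    }


if __name__ == "__main__":
    # Only the first residues of the record's sequence, for illustration.
    print(summarize("MASGSFNTSTSNQYVQGTVTWGS"))
```

Passing the full 899-residue sequence to `summarize` gives values that can be compared with the ones listed above. Molecular weight and isoelectric point need residue masses and pKa tables and are left out of this sketch.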

Domains

Domains [InterPro]
XFD07155.1: residues 1-899 (domain diagram; sources shown: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, other)

Taxonomy

Phage
Name: Bacillus phage Atlee [NCBI]
Taxonomy ID: 3289874
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
No host information

Coding sequence (CDS)

GenBank protein accession
XFD07155.1 [NCBI]
GenBank nucleotide accession
PQ217617.1 [NCBI]
CDS location
range 60704 -> 63403
strand +
CDS
ATGGCATCAGGTTCATTTAATACAAGCACAAGCAACCAGTATGTGCAGGGTACTGTAACTTGGGGCAGTACCCCCAACACTGGGGGTAACTATAGTGATGTATGGGTAGAGTGGCGATTCTCTCGTACAAACTCTGGCTATGAAACATATGGTAACGGTACATTCGGAATATATGTAGATGGTCAACAATCCGTAAATACACTTAGATTTAGCTTCACACAAAACTCTAGGACACTAGTAGTAAGTGGTAACTTCCGAGTAAACCATAACTCTGATGGAACAAAGAATTTACGTATAGGTGTATCTGGGTATACAGACGTAATATCAATTAACGAGGGCGTTGTGTACGTAGACTTAGACCGTATACCACGGGCAAGTAGCATATCATCTAATATAAGCTGGACAGCAGCTATAGAAGGTCTACCTTTATCTATTAGCCGGGCTTCTGGTTCCTTCACACACTCTCTTACTTTACAAATTAAGAACAACGTAAATAATAACTGGGTAAGTGTAGCTTCCAGATATAATATCGGAGATTACACTACGATATACTTCGACAAAAACGAAATGACAATCATATACAGAGAGATGGCTCAATGGGAAAATGCCGAAGTTTGGATAAAGCTGGATACGTTTAATGGTGGAACCTACATAGGCTCGTCAGAGAAATATGGTAGAGTTTATGGTGCTACTCCAGCTACTCCGGTTGTATCTGATTTCAATATCGGAACAAAATCAGTAGACGTGACTCTAGACTATTTCTATGACACATTTAACTATACACTAGAGTTCACTTTTGGGAGCTTTAAAAAAACATTCCCTAACATGGTTAAGTTTAATAAGATGGAGTTCACAGATGCAGAAGTAATTCAAATGTATCAGCAGGTACCTAACCAACAAAGCGCACAAGCCAATGTATACGCTAGTACAAAGTATAACGGTATAGAGCTAAATGATAACGTCCCTAAAGACCAAAACAAAAAAATCACATTACGAGTAGTTAATAGTGAACCGCAGTACGATGGAGGTTTCACGTACTTAGATTCCAATAGTACTACAGCAACACTAACAGGAAATAACCAGTACATAATCCAAAGTAAGTCAAATCTGCAAGTTAAACTACCTGTAGCCAAAAAAGCAAAACCGACCAATTACTCTACAATTACTCGTTATGAAGTAGCTGTCAATGGTGCGGTGAAATCGGTAAACTTTTCCGATACAGCAGACTTAACGGTAGATTTCGGTACTGTAAATGTAAGCACAAATACTAGTATTGTAGTGTCCGCAGTAGATAGCCGGGGACTTAAAAAATCTGTATCTTCTGTTATATTAGTATTGCCTTATTCGCTACCAACCTATTCATTTAGTGCAGACCGGGTAAATAACTTCGAAACGACAACTAAATTGAATGTAACAGGTTCCGCATCACCATTGAACGTTAGTAACGTCAACAAGAACAGAATTGTTTCCGCTAAATATAAAACGAAACCAGTGGGCGGAGCATACGGAAGTGAATCCGATTTACCGATAACTGGTACGTTCCCGGCTTTCATATCTAACAACGCTTCTGTAGAGTTAAATAACACACAAGCGTGGGAAGTATCTTTAACAATTACAGACGTATTAGGTTCAGTAACAACCGTCAGTACTGTAGCAGTAGGTACGCCTATTCTGTTCATTGACACAAAAAAGAAGTCGATAGGTGTGAACAAATTCCCTACTGGAACTAAAACATTCGAAGTAGCGGGTGACTGGGCGATAGATGGTGCGATAACTCTACAATCAAATCAGTGGTTTTCACAAGGTAAATACGCATTACACGCTAACAACTCTGATTTTATGGGTGTCAATACTATCTACTTTAGTAGCCCGGTAAATACGCAATATCAAGGGTTAAACTTCCTAAGACCCGGTAAGACAGCAGGTTCTATGAATATAAATGACTATAGCACTTTTGGTCTTCTAGACTATTCAATGAGAATGAATAACCAACCCATTTT
TTACCAGTTTGCAGGAACATCTAACCTTCGATTAGCTGGAGAACTACACTCCGCATACACGAATGGTAGTTACATGGATGTATACGGAAATATCAAAGGGCAAACTTCCGCTCAAGGCGGAGAGACTTGGGGTGTCGTAGACTCACAGAACCGTACTAAATTTGTTGTACCTGTAGGTAAGAATGGTGGTAACAACAGTTACAAGTCTTACGGAGGTAACCACAGGTTCGAAAAAGACGATAAATTTATCGTTGAGTTCTATTCTGATGGTGTTAACAACTGTGCTAACTTCGGCGGAGGTATCTTTAAATGGCAATCCAACCAGAACCGTTTTGAACTACGTAACTGTAATGACACAGGTTGGGGAAGTATCGCATTAGACACGCTAGATGCGACTACAGTAAAAACAGTAAACTATGTAAATACTTCATCTAGGGAACTTAAAACAGAAATCCAACCTTTAGAAGAAGACGCACTACAGATTATTCTTGATTCAGAAGTCTGCTCGTATATGATGAAAGCTAACCCAGAATTAGGAAGGAGAGTAGGTTTAATAGCCGAGGATTCCCATGAACTGGTACAAGAGCTTGGAGGTAAAGGGGTTAATGGCTACACAATGAACTCATTATCTTGGAGAGCGATTCAGCAGCTAGACGCAAAAATAAACGCAATACTAGCAAGACAATATAAAACACTATAA
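
The CDS range and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon that is not translated. The helper names are hypothetical, and the start and stop codon sets are those commonly used for bacterial and phage genes, which is an assumption rather than something stated in the record.

```python
# Assumed codon sets (bacterial/phage genetic code); not stated in the record.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Codon count minus the terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1


def looks_like_complete_cds(seq: str) -> bool:
    """True if the sequence is in frame, starts with a start codon,
    and ends with a stop codon. Whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    print(cds_length(60704, 63403))               # 2700
    print(expected_protein_length(60704, 63403))  # 899
```

The result, 2700 nt or 900 codons including the stop, agrees with the 899 AA protein length listed under the physico-chemical properties.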

Tertiary structure

PDB ID
c004082df3da4d6c2040102ae4700754a36acb8c8710d0b22c956c1dc8d283fc
Source ESMFold
Method ESMFold
Resolution 0.7776
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
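
The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch only: the legend does not say which band the boundary values 90, 70, and 50 belong to, so the code assigns them to the lower band as an assumption. The function names are hypothetical, and scores are assumed to be on a 0 to 100 scale.

```python
from collections import Counter


def plddt_category(score: float) -> str:
    """Map one pLDDT score (0-100) to the legend's confidence band.

    Assumption: boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


def summarize_confidence(scores) -> Counter:
    """Count how many residues fall into each confidence band."""
    return Counter(plddt_category(s) for s in scores)


if __name__ == "__main__":
    # Made-up scores for illustration; not taken from the record.
    print(summarize_confidence([95.2, 88.0, 61.5, 42.3, 91.0]))
```

Some tools store pLDDT as a fraction between 0 and 1. Such values would need to be multiplied by 100 before being passed to `plddt_category`.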