Genbank accession
ANT42866.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
Protein sequence
MNYPNPPSPEITTKEKPAFKEITNELLKQLQNALNSNSLFTEQVELSFKGIVRILEVLLSLDFFKNANEIDSSLRNSIEWLTNAGESLKLKMKEYEGFFSEFNTSMKANEQEVSAILNANTENIKSEIKKLENQLIETATRLLTSYQIFLNNAKESANNEITTNRTQAITNINETKVSANNEINTNKTESLEAITQAKESATTQINTNKTESLEAITQAKESATTQINTNKTESLEAITQEKQQATSEINEAKKTAFNELLETLKPKFSGLFVGVYYIRNVIYIQGGWEQKVKDLSDYELVKSKKYEIEVFFQFTSAKIVEKDPFFGLKTNEGFLKESIQKITHKYVSQTYHTKLLVENVSGVLGLYHGYFYGNINYFNEAIITSIKELPNDTIISKVSNGSDFANATTITQNLNDQSGTTQTRS
Physico‐chemical
properties
protein length:425 AA
molecular weight: 48015,21730 Da
isoelectric point:5,27082
aromaticity:0,08706
hydropathy:-0,53059

Domains

Domains [InterPro]
DC_0415
STR
3–202
Coil
Unmapped
114–141
ANT42866.1
1 425
Architecture
STR
STR 3-424 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Helicobacter phage FrGC43A
[NCBI]
1852666 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Helicobacter pylori
[NCBI]
210 cellular organisms > Bacteria > Pseudomonadati > Campylobacterota > Epsilonproteobacteria > Campylobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ANT42866.1 [NCBI]
Genbank nucleotide accession
KX119195.1 [NCBI]
CDS location
range 14574 -> 15851
strand +
CDS
ATGAATTATCCTAATCCACCAAGCCCAGAAATCACCACTAAAGAAAAACCAGCGTTTAAAGAAATCACTAACGAGCTTTTAAAGCAATTACAAAACGCTTTAAATTCTAATAGCCTTTTCACCGAGCAAGTAGAATTAAGCTTTAAAGGGATTGTTAGGATTTTAGAGGTGCTTTTGAGTTTGGATTTTTTTAAGAATGCGAATGAAATTGATAGCAGTTTAAGAAACTCCATTGAATGGCTGACTAACGCCGGCGAAAGCTTAAAATTAAAAATGAAAGAATACGAGGGCTTTTTTAGCGAGTTTAATACGAGCATGAAAGCTAACGAGCAAGAAGTAAGCGCTATTTTAAACGCTAACACTGAAAACATCAAAAGCGAGATTAAAAAGCTAGAAAATCAATTGATAGAAACCGCTACCAGGCTTTTAACGAGCTATCAAATCTTTTTAAACAACGCTAAAGAGAGCGCTAATAATGAAATAACCACCAACAGAACGCAAGCGATAACTAACATTAACGAAACGAAAGTTAGCGCTAACAATGAAATCAACACGAATAAGACTGAAAGCCTTGAAGCGATCACGCAGGCTAAAGAAAGCGCTACAACGCAAATTAACACAAATAAGACTGAAAGCCTTGAAGCGATCACGCAGGCTAAAGAAAGCGCTACAACGCAAATTAACACAAATAAGACTGAAAGCCTTGAAGCGATCACGCAAGAAAAACAACAAGCCACAAGCGAGATTAACGAAGCGAAAAAAACCGCGTTTAACGAACTTTTAGAAACACTAAAGCCGAAGTTTAGCGGCTTGTTTGTGGGCGTTTATTATATTAGGAATGTGATTTACATTCAAGGCGGATGGGAGCAAAAAGTGAAAGACTTGAGCGATTATGAGCTAGTGAAAAGCAAAAAATACGAAATAGAAGTATTTTTCCAATTTACGAGCGCGAAAATTGTAGAAAAAGATCCGTTTTTTGGACTGAAAACGAACGAAGGCTTTTTAAAAGAGAGTATTCAAAAAATCACGCACAAGTATGTATCGCAAACTTATCATACAAAGCTTTTAGTGGAAAATGTTAGCGGTGTTTTAGGATTGTATCATGGCTATTTTTACGGAAATATCAATTATTTCAACGAAGCGATTATAACAAGTATCAAAGAATTACCTAACGATACGATCATAAGCAAGGTAAGTAATGGATCTGATTTTGCCAACGCCACAACGATCACGCAAAATTTGAACGACCAGAGCGGCACAACACAAACAAGGAGTTGA

Genome Context

Genome Context

Tertiary structure

PDB ID
56a1959c91765b89b1b72caf5952f5d0c821701873d553308327047786249b77
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7193
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50