GenBank accession
QZI93167.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1.00
Protein sequence
MSDQNIALEGTPSDKVVGGDEENHAAGGDDLHYGSGIFLEETNPDDVQYGTRVDAPLVAGSTVGDDWPVAFANQIQGNWQTFKNAAQLNKWTKMYKSRLFNTVAFVKDGENHYPSLYSWSGTKKDGTDGNWVFIGYLGGIAITDLNGALLRIDSTIVLGEGLSLEAAGDQGNGVVIKLDQQPPSGDSGKTGGSPITISQMWGERGSVADVTKVNALYPLEINQGTSGPGNKSTGIAVMSIKPGVYEASHGKSCLLKSSGITLVRGQKPKAIYMPQEVVPTGGFFHLNPVAKGVDVQDDTGGDTALTGGQLTEVLASVAFYDTAPDDCNIKVWVEYKDPSNPIDSHILRDANGHPVVFEKQFDAGNEIGTIILSGAFYAKATQPLKVVVETDLLASQQLAVDNNKTMVCINQFGQGWATSVARIEFMRRAGVEITPVIQTFTPNMRSLAKEIRGLTVPAALVSSGEDGDTLNQFGLRAITDVETEIKDGAVTVRDHGAITDFYFDTITDNVETSMLRGQEVECTVTLENPDDSFELEVYGWTGTPDRVGYVYKTRTSRAIDVVKGWIPIANLTIAQDAGGIPATHSLTFTVPNEANNIVVIVRPSLDQQQNTMILSGFTFGTTKEVQVYTEVERNMPHEEHLRYSDSYSEYVLPVSGLAEIRYSLSNNVDGNPMPVGKLGKGKGPVVIDHTVNPVAGSGVPQFDGAMKFTADGEASIGKTYRAINDSKEDSTVTFWDLLFDVDGNASKIPESEKTFTIKANTPYPGKLCTIPAYAVDVEKGQRVGGRSKVAKDGGAYVATRTEKDNLVQTTVHFEKLVS
Physico-chemical properties
protein length: 818 AA
molecular weight: 88030.43520 Da
isoelectric point: 4.81809
aromaticity: 0.07824
hydropathy: -0.26980
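
The aromaticity and hydropathy values above can plausibly be recomputed from the protein sequence. The record does not say how they were derived, so the sketch below makes two assumptions: aromaticity is the fraction of F, W and Y residues, and hydropathy is the Kyte-Doolittle grand average (GRAVY). The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale.
# ASSUMPTION: the record does not name the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    fragment = "MSDQNIALEG"  # first ten residues of the sequence in this record
    print(f"aromaticity = {aromaticity(fragment):.5f}")
    print(f"hydropathy  = {gravy(fragment):.5f}")
```

Under the first assumption, the reported aromaticity of 0.07824 over 818 residues would correspond to 64 aromatic residues, since 64 / 818 ≈ 0.07824.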

Domains [InterPro]
DC_0389 STR 7–818
Architecture (QZI93167.1, residues 1–818): STR 7–818
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

Phage: Vibrio phage PS14A.1 [NCBI]
  Taxonomy ID: 2859353
  Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: Vibrio crassostreae [NCBI]
  Taxonomy ID: 246167
  Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

GenBank protein accession
QZI93167.1 [NCBI]
GenBank nucleotide accession
MW824431.1 [NCBI]
CDS location
range 28172 -> 30628
strand +
CDS
ATGAGCGATCAAAATATAGCATTGGAAGGTACACCGTCAGACAAAGTGGTCGGTGGTGATGAAGAGAACCACGCGGCAGGCGGCGACGATCTGCACTATGGCTCAGGCATTTTCCTAGAAGAAACCAACCCAGATGATGTTCAGTACGGTACGAGAGTGGACGCGCCTTTAGTGGCGGGTTCTACAGTCGGTGACGACTGGCCTGTAGCCTTTGCCAACCAAATACAAGGTAACTGGCAGACGTTCAAGAACGCAGCGCAACTAAACAAGTGGACTAAGATGTACAAGAGCCGCTTGTTTAACACTGTGGCGTTTGTCAAAGACGGAGAGAACCACTACCCAAGTCTGTATTCTTGGTCTGGGACCAAAAAAGACGGCACTGATGGTAACTGGGTATTTATTGGTTATCTTGGCGGCATTGCCATTACCGATTTAAATGGCGCACTGCTTCGAATTGATTCCACTATTGTACTGGGTGAAGGCCTGTCACTTGAAGCGGCAGGCGATCAAGGTAATGGCGTAGTTATAAAGCTTGACCAACAACCTCCGTCTGGTGACAGCGGTAAGACTGGCGGTTCGCCAATCACAATTAGCCAGATGTGGGGCGAACGTGGCTCTGTGGCTGATGTCACTAAGGTCAATGCCCTTTACCCGTTAGAGATCAACCAAGGCACTTCTGGTCCGGGAAACAAATCGACTGGCATTGCGGTTATGTCAATAAAACCCGGAGTGTACGAAGCTTCGCACGGTAAAAGCTGCTTACTTAAATCAAGTGGCATCACACTGGTCAGAGGCCAAAAGCCTAAAGCTATCTACATGCCTCAAGAAGTTGTGCCAACTGGTGGGTTCTTCCACCTTAACCCTGTCGCCAAGGGTGTAGATGTTCAAGATGACACAGGCGGTGACACTGCTTTGACAGGCGGCCAGCTTACCGAAGTGTTAGCCAGTGTCGCGTTCTACGATACCGCACCTGACGACTGCAACATCAAAGTGTGGGTAGAGTACAAAGACCCGTCTAACCCAATTGACAGCCACATACTTAGAGACGCTAACGGCCATCCAGTCGTGTTCGAGAAGCAGTTTGATGCAGGTAATGAGATAGGTACGATTATCCTTTCTGGTGCCTTCTACGCCAAAGCTACTCAGCCTTTGAAAGTGGTGGTTGAAACTGACTTGTTAGCAAGCCAACAGTTGGCCGTTGACAACAATAAAACTATGGTCTGTATAAACCAGTTCGGACAAGGTTGGGCGACCAGCGTTGCAAGAATTGAGTTTATGAGACGAGCGGGTGTTGAAATAACGCCTGTCATCCAGACGTTTACGCCTAACATGCGAAGCCTGGCAAAAGAGATACGCGGTCTAACTGTACCCGCTGCTCTGGTGAGTTCTGGTGAAGACGGTGACACGCTTAACCAGTTTGGTCTAAGAGCCATCACTGACGTTGAGACTGAGATTAAAGACGGCGCTGTCACGGTTAGAGACCATGGTGCTATCACTGATTTCTACTTCGATACGATCACTGACAACGTTGAGACCTCAATGCTTCGTGGTCAAGAGGTTGAGTGTACTGTGACGTTAGAAAACCCTGACGACTCATTTGAATTGGAAGTATACGGCTGGACGGGTACGCCTGACCGAGTAGGTTACGTATACAAAACCAGAACGAGCCGAGCTATCGACGTAGTTAAGGGATGGATTCCAATTGCTAACCTGACGATTGCTCAGGACGCAGGTGGGATACCGGCTACTCACTCACTCACGTTCACCGTTCCTAACGAAGCGAACAATATCGTTGTCATAGTGAGACCTTCGTTAGACCAACAACAGAACACCATGATACTTAGCGGGTTCACTTTTGGTACGACTAAGGAAGTGCAGGTTTACACTGAAGTTGAACGCAACATGCCACACGAAGAACACCTTCGCTACAGTGACTCGTATTCTGAGTATGTGCTTCCAGTGTCCGGTCTTGCAGAGATACGCTACAGCCTAAGCAATAA
CGTTGACGGCAACCCTATGCCTGTAGGTAAGCTAGGTAAAGGTAAAGGGCCAGTCGTAATCGACCACACGGTTAACCCTGTTGCGGGTTCTGGCGTACCTCAGTTCGACGGAGCGATGAAGTTTACGGCTGACGGGGAAGCGTCAATCGGTAAAACGTATCGAGCGATTAACGACAGCAAAGAAGATTCTACGGTGACGTTCTGGGATCTACTCTTTGATGTTGACGGCAATGCTTCTAAGATTCCTGAGTCAGAGAAGACATTCACTATCAAAGCTAACACGCCATATCCCGGCAAACTTTGCACTATCCCTGCCTATGCAGTTGATGTAGAGAAAGGGCAACGAGTCGGTGGCCGATCTAAAGTCGCTAAAGATGGTGGCGCGTATGTTGCTACTCGTACAGAGAAGGACAACTTAGTGCAGACAACCGTACACTTTGAGAAGCTTGTCTCTTAA
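
The CDS coordinates, the nucleotide sequence and the protein length can be cross-checked against each other. The sketch below assumes that the range is 1-based and inclusive (the usual GenBank convention) and that the standard codon assignments apply; the function names are illustrative only.

```python
# Compact standard codon table, bases ordered T, C, A, G at each position.
# ASSUMPTION: standard sense-codon assignments apply to this phage CDS.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of residues encoded, excluding the terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1


def translate(cds: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(cds_length(28172, 30628))               # 2457
    print(expected_protein_length(28172, 30628))  # 818
    print(translate("ATGAGCGATCAAAAT"))           # MSDQN (start of this CDS)
    print(translate("CTTGTCTCTTAA"))              # LVS*  (end of this CDS)
```

For this record, 30628 - 28172 + 1 = 2457 nt, which is 819 codons, or 818 residues plus the stop codon. That agrees with the stated protein length of 818 AA, and the first and last codons translate to the start (MSDQN) and end (LVS) of the protein sequence.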

Tertiary structure

PDB ID
31d27a47cf4c0a3b48490986aaec28c452aae8d460d7c1b78fd235f209ce6307
Source ESMFold
Method ESMFold
Resolution 0.4532
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
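
The confidence bands above amount to a simple threshold lookup on a per-residue pLDDT score. A minimal sketch follows; the function name is hypothetical, and because the legend does not say which band the exact boundary values 90, 70 and 50 belong to, the code assigns them to the lower band as an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    ASSUMPTION: boundary values (exactly 90, 70, 50) fall into the lower band;
    the legend itself leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for s in (95.2, 81.0, 63.5, 42.0):
        print(s, plddt_band(s))
```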