GenBank accession: YP_009056237.1 [GenBank]
Protein name: tail fiber protein
RBP type
TF | Evidence: GenBank | Probability: 1.00
TF | Evidence: Phold | Probability: 1.00
TF | Evidence: RBPdetect | Probability: 0.88
TF | Evidence: RBPdetect2 | Probability: 0.92
Protein sequence
MTIVLNEVRGGFNTSTINSNFEKIEEALNSNVLSVKDGSNQMEQDLDMNSNRLINLPAPINPTEPLRLEDLKTLEGDVSGIVPTVVRHTGDGVTIDFDSGTKVVAGPFSTKVYLNGIRLFPEVDYTTVGTKVRFINYVPTSTDKVDIYSYAPLEVVGPKGPAGPEGPQGPTGEQGPQGIQGQQGIQGPQGPQGTVGPTPIHQFGDGITQPTTSIRFANKIDPDTGAVIEWGPWVNIQGSQGVTGPVGPQGLSPEHEWDGPNLRFRNPDGTWGNYEDLTGPQGPVGATGPVGPKGDNGDSLFINAQGLYSGLSAYDNEVPPFIYFATDYSFDSAFSEQEFVGDGTTKSFPLSFTPNLKSNLYVWQGNSYQTLDTFEVVGNTLTFDTAPKSGVRITVRLQSPFVTKGAIFSKLSAASGDWSTAIPFGQGPQGPQGEQGPMGPTGPQGIRGPQGEKGVQGDQGPQGIQGPQGERGPQGEKGPQGDRGLVGPQGPQGPQGIIGPQGPVGPTGAQGPIGPQGPEGPRGPMGPQGPTGDKGPQGDQGQRGSREWYIETTGTSWSQALIDAINPNPVIYDVGIQYNKSQGYSESRVYRGSGVWEQVELVQGFTIKVMNGVSATKLNIGSATALSSTTFGGTDTPLNQARSFSVRLGVISTPFATSAMVSLEDLQVLLSSFSPSFQATVQMTWSIRVNGSVVYESSRRAYVTADGSTYYGFSGGTWAFTIPSPREGNTIEVFLNGYVTSSGTNFKLSRVTTTGSQFSVVSLKA
Physico‐chemical properties
protein length: 765 AA
molecular weight: 80695.29370 Da
isoelectric point: 4.75205
aromaticity: 0.08627
hydropathy: -0.42418

Domains

Domains [InterPro]
IPR008160 | STR | 157–197
YP_009056237.1 | 1–765
Architecture
STR 8-503 | RBD 546-765
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
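The architecture string above can be read as a list of labelled residue ranges. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that residues not covered by any segment correspond to the "Unmapped" class in the legend. The helper names `parse_architecture` and `unmapped_regions` are hypothetical and are not part of the database.

```python
import re

def parse_architecture(text):
    """Parse a string such as 'STR 8-503 | RBD 546-765' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        match = re.fullmatch(r"\s*([A-Z]+)\s+(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments

def unmapped_regions(segments, protein_length):
    """Return the 1-based inclusive ranges not covered by any segment (assumed 'Unmapped')."""
    gaps = []
    position = 1
    for _, start, end in sorted(segments, key=lambda seg: seg[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= protein_length:
        gaps.append((position, protein_length))
    return gaps

segments = parse_architecture("STR 8-503 | RBD 546-765")
gaps = unmapped_regions(segments, 765)  # 765 AA is the protein length given in this record
```

Under these assumptions the STR segment spans 496 residues and the RBD segment 220 residues, leaving residues 1-7 and 504-545 unassigned.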

Taxonomy

Name | Taxonomy ID | Lineage
Phage: Vibrio phage ICP2_2013_A_Haiti [NCBI] | 1529058 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host: Vibrio cholerae [NCBI] | 666 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

GenBank protein accession: YP_009056237.1 [NCBI]
GenBank nucleotide accession: NC_024791.1 [NCBI]
CDS location
range 23847 -> 26144
strand -
CDS
ATGACTATTGTTCTTAATGAAGTAAGAGGAGGCTTTAACACCTCCACAATCAACTCCAACTTTGAGAAGATTGAAGAAGCTCTAAACTCCAATGTTCTCTCCGTTAAAGATGGTTCCAATCAAATGGAACAAGATTTGGATATGAACTCCAACCGTCTTATTAACCTACCTGCTCCAATTAACCCTACCGAACCTCTCCGACTGGAAGACTTAAAGACGCTAGAAGGTGACGTGTCTGGTATTGTTCCAACGGTGGTTCGACATACTGGTGATGGAGTGACCATTGACTTCGACAGTGGTACTAAGGTAGTTGCAGGTCCTTTCTCTACCAAAGTGTATCTGAACGGTATCCGTCTGTTTCCAGAAGTGGATTACACTACAGTGGGCACAAAGGTGCGATTCATTAACTACGTTCCAACATCTACAGACAAGGTGGACATTTATTCATATGCTCCTCTTGAAGTAGTTGGCCCGAAAGGACCGGCAGGTCCAGAGGGTCCGCAAGGTCCTACCGGAGAACAGGGTCCGCAAGGTATCCAAGGACAACAAGGTATCCAAGGTCCGCAGGGACCACAAGGTACAGTAGGTCCCACTCCAATCCATCAATTTGGTGACGGCATTACTCAACCTACCACATCCATTAGATTCGCCAACAAGATTGACCCAGATACTGGTGCAGTTATTGAGTGGGGTCCTTGGGTAAACATTCAAGGTTCACAGGGTGTAACAGGTCCCGTGGGTCCACAAGGATTGTCTCCAGAACACGAGTGGGATGGCCCAAACCTACGATTCCGCAATCCAGATGGTACGTGGGGTAACTATGAAGACCTGACTGGTCCACAAGGTCCAGTTGGTGCAACAGGGCCTGTTGGTCCTAAAGGTGACAATGGTGATTCACTGTTCATTAATGCCCAAGGGTTATATAGTGGTTTGAGTGCATATGATAATGAGGTTCCTCCTTTTATCTACTTTGCAACTGACTACTCATTTGACTCCGCATTCTCTGAGCAAGAATTCGTTGGAGACGGCACTACTAAGAGTTTCCCGCTATCCTTTACTCCAAACTTGAAAAGTAACCTGTACGTTTGGCAAGGAAACTCATACCAAACACTTGACACGTTTGAGGTAGTTGGCAACACACTGACCTTTGACACTGCACCTAAATCTGGAGTCCGCATCACAGTACGTCTCCAATCTCCCTTTGTTACCAAGGGTGCTATCTTTAGTAAGTTAAGTGCAGCCTCCGGTGATTGGTCAACAGCCATCCCGTTCGGCCAAGGTCCCCAAGGACCGCAGGGTGAGCAAGGCCCAATGGGGCCAACTGGTCCACAGGGTATTCGTGGTCCACAGGGTGAAAAGGGTGTACAGGGCGACCAAGGTCCTCAAGGTATTCAGGGTCCCCAAGGTGAACGTGGCCCACAGGGTGAGAAAGGACCTCAAGGGGATAGAGGGTTAGTCGGTCCACAAGGACCGCAGGGTCCTCAAGGGATTATTGGTCCACAAGGTCCAGTAGGACCAACAGGCGCTCAAGGCCCAATCGGTCCACAAGGACCGGAAGGCCCACGAGGTCCAATGGGACCACAGGGACCAACGGGAGACAAAGGTCCACAAGGGGACCAAGGCCAACGTGGTTCTCGTGAGTGGTACATTGAGACTACAGGTACAAGTTGGAGTCAGGCACTGATTGATGCCATCAACCCAAATCCGGTTATTTACGATGTGGGTATCCAGTACAACAAATCACAAGGATATTCTGAATCTAGAGTGTACCGTGGTTCTGGAGTGTGGGAACAGGTGGAGTTGGTTCAAGGTTTCACAATCAAAGTAATGAACGGTGTAAGTGCTACTAAGTTGAACATCGGGTCAGCAACTGCGCTCTCTAGTACGACATTTGGTGGTACAGATACCCCACTTAACCAAGCACGGTCATTCAGTGTTCGTCTCGGAGTGATTAGTACCCCATTTGCGACTTCTGCAATGGTGAGTTTGGAGGATTTACA
AGTCTTGTTGTCCTCGTTTAGTCCATCATTTCAGGCTACCGTACAGATGACTTGGAGTATTCGAGTGAATGGTTCGGTAGTGTACGAGTCTTCTAGACGTGCCTATGTTACCGCAGACGGCTCAACTTACTACGGCTTTTCTGGTGGTACTTGGGCATTCACTATTCCTTCTCCCAGAGAAGGGAACACTATAGAGGTTTTCCTAAATGGATACGTAACCTCTTCTGGAACTAACTTCAAACTAAGCAGAGTAACGACAACAGGTTCTCAGTTTAGTGTAGTATCTTTGAAAGCATAA
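The CDS coordinates can be cross-checked against the protein length. The following is a minimal sketch, assuming the range is 1-based and inclusive and that the CDS includes one stop codon. The helper names are hypothetical. Because the feature lies on the minus strand, the coding sequence is the reverse complement of the genomic interval; the sequence shown above starts with ATG and ends with TAA, so it appears to be given already in coding orientation. The 12-nt fragment used to illustrate `reverse_complement` is a made-up toy example, not genomic data.

```python
def cds_length(start, end):
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1

def expected_cds_length(protein_length, has_stop_codon=True):
    """Nucleotides needed to encode the protein, optionally with one stop codon."""
    return 3 * (protein_length + (1 if has_stop_codon else 0))

def reverse_complement(dna):
    """Reverse-complement an uppercase DNA string (A, C, G, T only)."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]

span = cds_length(23847, 26144)      # range given in this record
expected = expected_cds_length(765)  # 765 AA plus one stop codon
toy = reverse_complement("TTATGCTTTCAT")  # hypothetical minus-strand fragment
```

Here both `span` and `expected` evaluate to 2298 nt, so the range, the strand annotation, and the 765 AA protein length are mutually consistent under the stated assumptions.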

Tertiary structure

PDB ID 72b0d730127e77e0b4ba4edd5e48befa045d6ef07208afd078284026d7cff567
Source ESMFold
Method ESMFold
Resolution 0.6255
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
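The confidence bands above can be applied to per-residue pLDDT values with a small lookup. The following is a minimal sketch with a hypothetical function name, assuming pLDDT is on a 0-100 scale. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption.

```python
def classify_plddt(plddt):
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; this is
    an assumption, since the legend uses strict inequalities on both sides.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"

bands = [classify_plddt(score) for score in (95.0, 80.0, 60.0, 30.0)]
```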