Genbank accession
CAG9546467.1 [GenBank]
Protein name
tail fiber protein
RBP type
TSP
Evidence RBPdetect
Probability 0,87
TF
Evidence RBPdetect2
Probability 0,95
TF
Evidence Phold
Probability 1,00
Protein sequence
MQTKKQHKRLKVMAKYMISGSKGGGQKPYVPKEMEDNLISINKIKVLLAVSDGECDPDFTLRDLYLDDVPVIASDGTVNYEGVTAEYRPGTQTQDYIQGFTDTSSEVTVARDITGDNPYVISVTNKNLSAVRIKILMPVGIKTEDNGDLVGVRVEYAVDMAVDGGSYSEVMRDVIDGKTRSGYDRSRRIDLPKFDERVLIRVKRLTPDSTSSKVTDKIKLQSYAEVVDAKFRYPLTGLVFVEFDSELFPTQIPNISIKKKWKIINVPSNYDPISREYHGSWDGTFKKAWSNNPAWVLYDLVTNQRYGLDQRELGIQIDKWSLYEAGVYCDQKVPDGKGGTEPRYLCDVVIQNQVEAYQLIRDICSIFRGMSFWNGESLSIVIDKPRDPSYVFTNENVINGDFQYTTASEKSMYTQCNVTFDDEQNMYQQDVEGVFDTEAALRFGYNPTSITAIGCTRRSEANRRGRWVLKTNLRSTTVNFATGLEGMIPSIGDVIAIADNFQSSNLTLNLSGRVMEVSGLQVFVPFKVDARPGDFIIINKPDGKPVKRTISKVSADGKTIELNIGFGFDVKPDTVFAIDRTDLALQQYVVTTISKGDDENEFTYSITAVEYDPNKYDEIDYGVNIDDRPTSIVQPDVMAAPENVQISSYSRVVQGVSVETMVVSWDKVPYASLYEMQWRKGDGNWLNTPQTANKEIEVEGIYSGNYQVRVRSVSASGNTSPWSKIATATLTGKVGEPGAPINLTASDNEVFGIRVKWGMPEGSGDTAYIELHQSPDGTVENSSLLTLIPYPQYEYWHSTLPAGKVVWYRIRSVDRIGNVSGWTDFVRGMASDDVESVLGDILDKIFDTEAGQEIKENAIDSANKIKDQAQSIIQNALANDSDVRVMKKENGKRKAEFRQSIQMIADETEARVTALTQLKAEFDEEITSEVTRLDQAIATESETRATAIEELKSQIGDDIQGQLTRVEEAIASETEARVSADTALTAKFGDLESALTEKLDSWAGVDSVGAQYSMKLGLTYNGQQYSAGMVMQLTQSPQGLISQILFDANRFAIMTSSSGGVYTLPFVVENNQVFINSLLVKDGSITNAMIGNFIQSNNFVQNQQGWRLDKNGIFENYGSTPGEGATKFTNEGLKVKDANGVLRVEVGRITGSW
Physico‐chemical
properties
protein length:1153 AA
molecular weight: 128084,15970 Da
isoelectric point:4,81650
aromaticity:0,08760
hydropathy:-0,38153

Domains

Domains [InterPro]
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_Eco_Sip
[NCBI]
2831640 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAG9546467.1 [NCBI]
Genbank nucleotide accession
OU734268.1 [NCBI]
CDS location
range 8879 -> 12340
strand +
CDS
GTGCAAACCAAGAAACAGCACAAAAGGTTAAAAGTCATGGCTAAATATATGATAAGTGGCAGTAAAGGGGGCGGTCAAAAGCCCTACGTGCCAAAAGAGATGGAAGATAACCTGATCTCGATAAACAAGATTAAAGTTTTGCTGGCTGTATCTGATGGCGAGTGCGATCCAGATTTTACGTTGCGTGATCTTTATCTTGATGATGTTCCGGTTATTGCCAGCGATGGCACTGTTAACTACGAGGGAGTTACGGCTGAATATAGGCCAGGTACGCAGACGCAAGATTACATCCAGGGTTTTACTGACACATCAAGTGAGGTTACAGTTGCCAGAGATATTACCGGAGACAATCCTTATGTTATTTCTGTTACAAATAAAAACCTATCTGCGGTAAGAATAAAGATCCTGATGCCAGTAGGCATTAAAACAGAGGATAATGGCGATCTTGTTGGCGTAAGGGTTGAGTATGCCGTAGATATGGCTGTTGATGGCGGTTCTTATAGCGAGGTTATGAGAGATGTAATTGACGGCAAGACAAGATCAGGATACGACCGCAGCAGAAGGATTGATCTTCCTAAGTTTGATGAGCGCGTTTTAATCAGGGTTAAGCGACTTACCCCAGACAGCACATCTTCAAAGGTGACTGATAAAATCAAACTGCAAAGTTACGCTGAGGTTGTGGATGCAAAATTCCGTTATCCTCTGACTGGCCTTGTATTTGTAGAATTTGACAGCGAATTGTTTCCTACACAAATCCCTAACATTTCTATAAAAAAGAAATGGAAGATTATTAATGTGCCAAGCAACTATGATCCAATATCAAGAGAATATCACGGGTCATGGGATGGGACTTTTAAGAAAGCGTGGTCAAATAATCCTGCATGGGTTCTTTATGATCTGGTGACGAATCAGCGTTATGGGCTTGACCAGCGAGAGTTAGGGATACAGATCGACAAGTGGAGCTTATACGAGGCAGGCGTTTACTGTGATCAGAAAGTTCCAGACGGTAAGGGCGGTACAGAGCCTCGCTACCTATGCGATGTGGTGATTCAGAATCAAGTTGAGGCTTATCAGCTAATCCGTGACATTTGCTCAATCTTTCGCGGAATGAGCTTTTGGAATGGTGAGAGCTTATCAATCGTGATTGATAAGCCGCGCGATCCATCATACGTGTTTACTAATGAAAACGTCATCAACGGTGATTTTCAGTACACAACCGCAAGCGAAAAAAGCATGTACACGCAATGTAATGTGACGTTTGACGACGAACAAAACATGTATCAACAGGACGTCGAGGGGGTTTTTGATACTGAGGCGGCATTACGATTTGGATACAATCCAACAAGCATTACAGCGATCGGGTGTACACGCAGGAGCGAAGCGAATCGTCGAGGTCGGTGGGTTTTGAAAACAAACCTTAGAAGCACTACTGTAAACTTTGCTACTGGACTGGAGGGGATGATTCCATCCATAGGTGATGTGATTGCTATCGCTGATAATTTTCAGAGCAGCAATCTAACGTTAAATCTTTCGGGCCGAGTAATGGAAGTTTCAGGCTTGCAAGTTTTCGTTCCGTTTAAAGTTGATGCGCGTCCTGGTGATTTTATTATCATCAATAAGCCGGACGGCAAGCCAGTTAAGCGAACGATCTCAAAGGTGAGCGCGGACGGAAAAACCATTGAGTTAAATATTGGCTTTGGTTTTGATGTTAAGCCAGACACTGTTTTTGCGATTGACCGTACTGACCTTGCGTTGCAGCAATACGTTGTGACCACTATCAGCAAGGGAGATGACGAAAACGAGTTTACCTATTCAATCACGGCTGTGGAGTACGATCCGAACAAATACGACGAGATTGATTATGGCGTAAACATTGATGACAGGCCTACTTCAATTGTTCAGCCTGACGTGATGGCAGCGCCGGAGAATGTTCAGATCTCATCTTATTCTCGCGTCGTGCAGGGTGTTAGCGTTGAAACTATGGTTGTTTCATGGGATAAGGTTCCTTACGCATCGCTTTATGAAATGCAGTGGCGAAAAGGTGATGGTAACTGGCTGAATACGCCGCAGACCGCTAACAAAGAGATAGAGGTAGAAGGAATTTATTCAGGCAACTACCAAGTAAGGGTGAGATCCGTTTCTGCAAGCGGTAACACTTCCCCGTGGTCAAAGATTGCAACAGCTACCCTGACAGGTAAAGTTGGCGAGCCAGGAGCGCCGATTAATCTTACAGCTTCTGATAATGAAGTTTTTGGCATTCGTGTCAAATGGGGCATGCCGGAAGGATCTGGCGATACGGCTTACATTGAGCTTCACCAATCGCCAGATGGAACGGTGGAAAACTCAAGTTTGCTTACGCTGATTCCATATCCTCAATATGAGTATTGGCATAGCACGTTACCAGCGGGTAAAGTTGTATGGTATAGAATCCGTAGCGTTGACAGGATCGGCAACGTTTCAGGCTGGACTGACTTTGTTCGTGGCATGGCGTCAGATGATGTTGAATCTGTTTTAGGCGACATTCTTGACAAGATTTTTGATACCGAAGCTGGTCAAGAAATCAAAGAGAACGCCATAGACAGTGCCAACAAAATCAAAGACCAGGCGCAATCAATCATCCAGAACGCATTGGCAAATGATTCTGATGTTAGAGTTATGAAAAAGGAGAATGGGAAACGGAAAGCTGAATTTAGGCAATCTATACAGATGATTGCTGATGAAACAGAGGCAAGGGTAACAGCGTTAACGCAATTAAAAGCAGAATTTGATGAGGAGATAACTAGCGAAGTAACAAGACTTGATCAGGCGATCGCAACAGAATCAGAAACGCGAGCAACAGCTATAGAGGAATTAAAATCACAGATTGGTGATGATATTCAGGGGCAGTTAACGAGAGTTGAGGAAGCGATTGCAAGCGAAACAGAAGCGCGCGTTTCTGCCGATACTGCATTAACGGCAAAGTTTGGTGATCTTGAGTCAGCACTTACTGAAAAACTTGATTCATGGGCTGGCGTTGATTCCGTTGGTGCTCAATATTCAATGAAGCTGGGCCTAACCTATAACGGTCAGCAGTACAGCGCCGGAATGGTTATGCAGTTAACGCAATCACCGCAAGGATTAATATCACAGATTCTGTTTGACGCTAACAGATTCGCGATCATGACAAGTTCTAGCGGCGGGGTATATACGCTTCCTTTCGTGGTTGAGAATAACCAGGTTTTCATCAATAGCTTACTTGTGAAAGATGGTTCAATAACTAATGCTATGATTGGAAACTTTATTCAGTCAAACAATTTTGTTCAAAATCAGCAAGGTTGGAGGCTTGATAAAAACGGAATATTTGAAAACTACGGATCAACTCCAGGAGAGGGAGCTACTAAATTCACCAACGAAGGATTGAAGGTAAAAGATGCAAACGGAGTATTGAGGGTTGAAGTCGGAAGGATTACCGGAAGCTGGTAA

Tertiary structure

PDB ID
c6aaf254eb30711a77832d0c84700ee8f9dc6224c8daae6f4fef535b696724e2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7791
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50