Genbank accession
XGU08607.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence RBPdetect
Probability 0,90
TF
Evidence RBPdetect2
Probability 0,95
Protein sequence
MADYNDKVVNAEPILPEGTGGGTFVDDGQFHTEDTTTVTLLGNGNSTTPLKAAVVVDPASGNALVAGPDGLSVGVSPLSDNLLVLAGGKLYVAPPQIVVPISTKTGNVLVEVTDPGEEGLFVPPGEKGDKGDKGDQGDPGVGIYIQGIIGDPSQLPPAATMQEGDTYVIGTHYWTVVKGTWADLGDFAGPQGQDGIGLVIRGAFSDTSLLPTQGNTEGDTYIIQDQMWVWTGDADGWQPVGQVGPAGPQGATGATGPQGPKGDKGDRGEKGDQGIQGIQGLTGPQGPKGDKGDKGNDAAIVKLKGTKATVDELPEFGNAVADAWVVQTDNHVWVWTSDDTWEDIGPVQGPKGDQGDVGPQGPQGQKGDQGPEGPAGAEGPAGAEGPQGIQGPIGPKGDKGDTGLTGPVGPQGPQGPKGDRGETGYGARVLGTKGAVSDLPATGTPGDAWIIVPNLYVWSQADAQWINVGPYVGPKGDKGDTGATGDVGPKGDTGADGPQGPKGDQGDEGPQGPAGATGPQGPMGVSLTARGTVANQAALPSGAATGDLYTTEDTGEAFAFDGTNWVNLGVMRGPQGDPGIQGDQGPMGASIVAKGVIATYEDLLDVQNPEQGWMYSVTSGGNKGYSYVYSGTAWEPMGDLTGPEGPQGPQGPEGQMGAGVEILGKLDNTTELPSTGELGQGYLIEGDFWGWTGTTYENLGPIQGPQGDVGPQGPQGPQGIQGIQGIKGDQGTLWLNFSRNPGPADGRIGDYFINKSTLEYFQKTSATAWASLGYMGGGNVYDTASTIPQARTNAGWVDVPVLEAPADTGYYVRVDNAWKKLDRYDLLVTSTTGAMDVGVSQVFKVDGTANKTMSFTNLPANRAMTIVIVFSGSGASLTWPGNLAWSNGTEVTLGTTRTVVTILWDGTNLTGTTSLTVD
Physico‐chemical
properties
protein length:918 AA
molecular weight: 93738,87650 Da
isoelectric point:4,16422
aromaticity:0,06863
hydropathy:-0,41416

Domains

Domains [InterPro]
IPR050149
Unmapped
115–731
DC_0385
STR
377–488
XGU08607.1
1 918
Architecture
ATT
STR
RBD
STR
RBD
ATT 1-242 | STR 243-586 | RBD 587-701 | STR 702-751 | RBD 752-917 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Salmonella phage vB_Sen_SG_WM_RUS_R7
[NCBI]
3234147 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host Salmonella enteritidis
[NCBI]
149539 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XGU08607.1 [NCBI]
Genbank nucleotide accession
PQ002170.2 [NCBI]
CDS location
range 5412 -> 8168
strand +
CDS
ATGGCAGATTACAATGATAAGGTCGTCAACGCTGAACCTATCCTTCCGGAAGGTACTGGCGGTGGAACCTTTGTGGACGATGGACAATTCCACACAGAAGATACGACCACAGTAACACTTCTCGGTAATGGTAACTCCACTACTCCGCTGAAGGCCGCTGTTGTTGTAGATCCTGCTTCCGGTAACGCCCTTGTAGCTGGCCCTGACGGACTTTCTGTCGGGGTTAGCCCTCTCAGTGATAACCTGCTTGTATTGGCAGGTGGAAAGCTGTATGTTGCTCCTCCTCAGATCGTTGTGCCTATTTCAACCAAGACCGGAAACGTTCTTGTTGAAGTGACCGACCCTGGTGAAGAAGGTCTTTTTGTCCCTCCCGGAGAAAAAGGCGATAAGGGCGACAAGGGTGATCAAGGTGATCCGGGTGTTGGTATTTACATCCAAGGAATTATTGGAGACCCATCCCAACTTCCTCCTGCTGCAACCATGCAAGAGGGAGATACTTACGTAATTGGAACTCACTACTGGACAGTTGTGAAAGGCACTTGGGCTGATCTTGGCGATTTCGCTGGACCTCAAGGGCAAGATGGTATAGGATTGGTGATCAGGGGTGCATTTTCTGACACATCTCTCCTTCCGACTCAAGGAAATACGGAAGGCGATACCTACATCATCCAAGACCAGATGTGGGTTTGGACTGGCGATGCAGACGGATGGCAACCTGTAGGGCAAGTAGGGCCTGCTGGTCCACAAGGGGCTACTGGTGCCACAGGACCGCAGGGGCCTAAAGGCGATAAAGGTGACCGTGGAGAGAAAGGTGACCAAGGTATTCAAGGGATTCAGGGCTTAACAGGGCCTCAAGGACCTAAAGGTGACAAGGGCGATAAAGGTAACGATGCCGCAATTGTAAAACTGAAGGGAACAAAAGCAACAGTTGACGAATTGCCTGAATTTGGTAATGCTGTGGCTGATGCATGGGTTGTTCAAACGGACAATCACGTTTGGGTTTGGACATCTGACGACACTTGGGAAGATATTGGTCCAGTTCAAGGACCTAAAGGTGACCAAGGTGATGTCGGCCCACAAGGGCCTCAAGGACAAAAGGGCGACCAAGGCCCAGAAGGTCCTGCGGGGGCAGAGGGTCCTGCTGGAGCAGAAGGTCCACAGGGTATTCAAGGACCTATCGGTCCAAAGGGAGATAAGGGCGATACTGGTTTAACTGGCCCTGTCGGTCCACAAGGACCTCAAGGTCCGAAGGGTGATCGTGGTGAAACTGGTTATGGTGCTCGTGTTCTTGGGACAAAAGGCGCTGTTTCAGATCTTCCTGCAACAGGAACACCAGGCGATGCTTGGATTATAGTTCCAAACCTTTACGTCTGGAGTCAGGCTGATGCTCAATGGATAAACGTAGGTCCATACGTTGGTCCAAAAGGCGACAAGGGTGACACCGGGGCTACAGGGGATGTCGGTCCTAAAGGTGATACTGGGGCTGATGGCCCACAGGGGCCTAAAGGCGATCAGGGTGATGAAGGACCTCAAGGTCCCGCAGGTGCTACAGGACCACAGGGTCCAATGGGCGTTTCACTGACTGCCAGAGGTACAGTAGCCAATCAAGCAGCTTTGCCGTCCGGCGCAGCAACTGGAGATCTTTACACCACAGAGGACACTGGAGAAGCTTTTGCTTTCGATGGTACTAACTGGGTAAATCTTGGAGTCATGCGTGGCCCACAGGGTGATCCTGGTATCCAAGGTGACCAAGGACCGATGGGTGCAAGTATCGTTGCTAAAGGTGTTATTGCGACTTATGAGGACCTGCTTGATGTTCAAAACCCAGAACAAGGCTGGATGTATTCAGTAACTTCCGGTGGGAACAAAGGTTATTCTTATGTTTACAGCGGGACCGCATGGGAACCGATGGGTGACTTGACAGGTCCAGAAGGCCCTCAAGGACCACAAGGACCAGAGGGTCAGATGGGTGCTGGGGTAGAAATCCTTGGCAAACTCGACAATACCACTGAACTTCCTTCTACTGGAGAGCTTGGTCAGGGATACTTGATTGAGGGAGATTTCTGGGGATGGACTGGAACCACTTATGAGAACTTGGGACCTATCCAAGGGCCTCAAGGTGATGTTGGACCACAAGGTCCACAGGGTCCACAAGGTATTCAGGGTATCCAGGGTATTAAAGGAGACCAAGGAACACTTTGGTTAAACTTCTCCCGTAACCCAGGTCCTGCTGATGGGCGAATTGGTGATTACTTCATCAATAAATCCACATTGGAATACTTCCAGAAAACATCTGCTACCGCATGGGCTTCTCTGGGGTACATGGGTGGAGGAAACGTTTATGATACAGCCTCTACCATTCCGCAAGCTCGAACAAATGCTGGGTGGGTTGATGTTCCGGTACTTGAAGCACCAGCAGATACAGGCTATTATGTCCGTGTAGACAATGCGTGGAAGAAACTTGATCGTTACGATCTTCTGGTTACTTCTACAACTGGGGCTATGGATGTTGGCGTGTCTCAAGTCTTCAAGGTTGACGGTACAGCAAACAAAACCATGAGCTTCACTAACCTACCTGCAAACAGAGCAATGACCATCGTTATCGTGTTCTCTGGTTCCGGTGCGTCACTGACGTGGCCTGGCAACCTAGCATGGTCAAACGGCACAGAGGTTACTCTGGGAACTACCCGCACGGTTGTTACTATCCTCTGGGACGGTACAAACCTGACAGGTACAACCTCTTTGACTGTCGATTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
19cc3a833f7f48234febd82f0667823ead2768a466f7678ab96395f0f911567f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7033
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50