UniProt accession: G5DEY2 [UniProt]
Protein name: Putative tail fibre protein
RBP type: TF | Evidence: GenBank | Probability: 1.00
RBP type: TF | Evidence: Phold | Probability: 1.00
Protein sequence
MAKRGKDGFDDIDLDRLDDWDDFGEPPRPKDEKKRSPILSTLNVARKSALSTIWPEGKRDQVILKGMPKPAADAYEGYQSAAAAAKDIYAHTKGELEKTERTLKMQARQLGPTMKKYLPDAVTKRFDKWSKSDQLDYQNYDPNQALMDRELGDVFSGVPKDAEEQRQLQDSMVEDKLRDSIKEMRADAMHQTMIGMAKDINLLTGLSRGVFLNIERKKLELQYRTLFAIQDIVKMKQSEFDRNTPALEAIVKNTALPDYAKEDFSEVRWANVKRQAAEWMNPLRYADGFMDMIRENTKKKISGIFGEGRGLLESVLGMGVEDDFGMSDSSSLTAERRKTNARDKATAWGSGFLAKKLLGPQIEKLQKWTREEMEKNPEVMKRLQKGAFTFGNLSSISNSAIAGETQGPLADLFRVLNELGIVQPLNREKAFLDERNGETLSRSAKFDRKAYLSLVEVIPAWLAEINKSVRRGYGEHADLEYDITSRGFVDRKVVGNRVRKAVANDEQRLRLQNSINSTVDFVDRGKTLSQKDRQHLADYIESRASQGRAFDVEAILKDPMHLHRYMPGNSAERIKEALQGHSDSLTGGSNELSNELARKISTVQSSITQRQAVIDEAVNIYGERALRDAGIFNYDAKSDTFGVDKDLSDPYTLFNDLAMGKTRSGRALTRDQEIQRKLQNGSALGDYLRRMNQGANGGVDDTSLPPALRGGGKGRGMSPRQLAAVLYGETSTNFVELLSQRNRGEEAPRNNFDGIIEAIRGNNNSDTLQKILEHVRSMDEEGVLLASLAGGAGSGDEEMGPPRPGGSGGGKRRRIIIGEDGLIRRWGGVLFDTAAGIGGFAKRGVKGAWNKLNQFGGWARGKIAGMGGGEGPGFLTRMRGLISGSVRGGFEAVSSFGKGLLGIRDVYDDHGNVVLQGARLEAGEYYQVIDGKMVQLKTLDDIKLGSDIVDSAGNLVLAAADLAAAGKLRYYKGGKIQALTQGLASKIGLGFNKVAKLPKRFLDFLSPKAGSIVGKIKDWLNEIPEGEEKTRLQKAFNRVTSLPGKVLGFAKRGVDRLKDMITDNPLTRWWKGRKDGGGGGFSLFSSTGKKTNHILIRIYKLLNQRLPGEPEDESWTEEMEKGVGGGGSTIGRAVRGAYNRAKSSLSERFGGRWSRTKAFFGRGRDRLRGWFSGFRGRAGDMLDGYRGARHDIATRYEVERRLAGRDDDVAEFYRSHLNAKGGLSGRKVYGDAKEDLETARDAAGRVINKGKNAAKSAGARLFERLDRMIGLQEMSWFNTMRESVSRAGGDDGIIRTMFAKFGKRNKPPEGDEKRDYFNFFKRWREKRKEKKEKAQGSKGKSGGLWDMVKGLPIIGPIVSILGTVGNILGSITKWGVLKPVGLLGKAAWNVGKFAVTRLAAPAVSAVATAASAVVTAVGWPAILIGGAIAAAGYAAYKIATTTYTQYLDKMRLAQYGFRDYDKWSSDDGAKARYLEDALREYVSYAEDGQASLRGLSGKDVQKLAEGFGINVEEKGEMLAFQAFMLQRFIPIYLRWITALKSMPNSIQLADVGDAKKVSKEDMQTLFNKMKMTKDAKAFSSLTDPRKVNQGFFSKAWDVVTFTPKEFLSGEEVMEVQNEVERAIKFRMDDKKARKYGMAPAVEGIKSAGVDEAINKLGQLDNERNKNLAKVEGWEDGTEQVQIQVDWNAVLDQKDMNAMESVRWKTYGFTTIDNATRTLITVFEKNVIKDIDVKTASYKGDWKKAIASMVPDAIGTPKEDRLKRWFFDRFLPVFMTYLVGVKRYLPTADPLNLKLTGGYLYEISLMMSTAYSLKGGIRQSVWEVNINPLGGDANTNPSSIKAELETLKLLSKEADLAVRNMIKAIKNNGKRARWKDRNKNRSSLEVTGEDEEDSNISSGDSLSSDGARASGYIPSGTSGGVPGNLGQVVDAVGGVRNYAAMTTGSSSINLSDVKDGDYKSLAEKYPIEMLGRKGALNVPNIKALITDAANMMGVPPAVALA
MAKAESGFNYTAKNPYASASGLFQFVDGTWNGMMKGYSRKFGIPRVNQMDPWANAILGVQFIRDNIQQAQRDLGGKAPPPAVAYLYHFLGAGGGKKFLEAWKRNPNMAASSAPGITSAILRGNANVFYSNGRIRSVDGVIQELNRRMGAISANEVAADPSKTKDMVAGLSPNSPTNPAAAMGAPAANDPSLSPADNLPADNANRRDDALTQKGAMAAQDAMATAAGNVGPAAPTPTTGGSGTSDASTTAETVASQAAAEGLSATDVAKVKAGAEAQVNAAARPVAAPTSDATASTPTLNGDPIDVQQLKVLIQSRDYLKEIRDILKSNPRAANDTRGGSIQQAANVPPPGSAARRQEITQPTPSLNVSRKAS
Physico-chemical properties
protein length: 2372 AA
molecular weight: 259078.63190 Da
isoelectric point: 9.47371
aromaticity: 0.07336
hydropathy: -0.50388
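
The aromaticity and hydropathy values can in principle be recomputed from the protein sequence. The sketch below assumes the usual definitions: aromaticity as the fraction of F, W and Y residues, and hydropathy as the mean Kyte-Doolittle value (GRAVY). Whether this page used exactly these definitions is an assumption, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes (e.g. X, B, Z).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 30 residues of the sequence above, used only as a short demo input.
fragment = "MAKRGKDGFDDIDLDRLDDWDDFGEPPRPK"
print(f"aromaticity={aromaticity(fragment):.5f} gravy={gravy(fragment):.5f}")
```

Applied to the full 2372-residue sequence, these functions should give values comparable to the ones listed, provided the definitions match.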

Domains [InterPro]
DC_0124 | STR | 1–2078
G3DSA:1.10.530.10 | RBD | 1975–2149
IPR023346 | STR | 1981–2069
IPR008258 | ENZ | 1983–2071
cd00254 | ENZ | 1995–2073
Architecture
STR 1-2078 | RBD 2079-2149 | RBD 2151-2372
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
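
The architecture line can be checked for residues that no segment covers. The sketch below parses the `LABEL start-end | ...` notation used above and lists uncovered positions, assuming 1-based inclusive coordinates; the helper names `parse_architecture` and `uncovered` are hypothetical.

```python
import re
from typing import List, Tuple

Segment = Tuple[str, int, int]  # (class label, start, end), 1-based inclusive


def parse_architecture(text: str) -> List[Segment]:
    """Parse 'STR 1-2078 | RBD 2079-2149' into (label, start, end) tuples."""
    segments = []
    for part in text.split("|"):
        # Accept either a hyphen or an en dash between the two coordinates.
        match = re.fullmatch(r"\s*(\w+)\s+(\d+)\s*[-–]\s*(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognised segment: {part!r}")
        segments.append((match.group(1), int(match.group(2)), int(match.group(3))))
    return segments


def uncovered(segments: List[Segment], length: int) -> List[Tuple[int, int]]:
    """Return the residue ranges in 1..length that no segment covers."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


arch = parse_architecture("STR 1-2078 | RBD 2079-2149 | RBD 2151-2372")
print(uncovered(arch, 2372))  # [(2150, 2150)]
```

With the coordinates as listed, residue 2150 falls between the two RBD segments. The page does not say whether that single-residue gap is intentional.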

Taxonomy

Phage
  Name: Salmonella phage SPN3US [NCBI]
  Taxonomy ID: 1090134
  Lineage: Uroviricota > Caudoviricetes > Chimalliviridae > Seoulvirus SPN3US >
Host
  Name: Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 [NCBI]
  Taxonomy ID: 99287
  Lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella

Coding sequence (CDS)

Genbank protein accession
AEP84072.1 [NCBI]
Genbank nucleotide accession
JN641803 [NCBI]
CDS location
range 205209 -> 212327
strand -
CDS
ATGGCAAAAAGAGGCAAAGACGGTTTCGATGATATCGATTTAGATAGACTCGACGACTGGGATGATTTCGGTGAACCACCCCGTCCCAAAGACGAGAAAAAACGAAGCCCGATACTCAGTACCCTGAATGTTGCCAGGAAGTCGGCGTTGTCAACTATCTGGCCAGAAGGGAAACGTGACCAAGTTATCCTCAAGGGGATGCCTAAGCCGGCAGCTGATGCATACGAGGGCTATCAGTCAGCGGCAGCTGCAGCAAAGGACATTTACGCCCATACGAAAGGGGAGCTTGAGAAGACCGAGCGCACGCTGAAGATGCAAGCACGCCAACTCGGTCCGACGATGAAGAAATACTTGCCGGATGCAGTTACTAAGCGCTTCGACAAGTGGTCAAAATCTGACCAGTTGGATTACCAAAATTACGATCCTAATCAGGCGTTGATGGATCGTGAACTGGGCGACGTTTTCTCAGGTGTGCCAAAGGACGCCGAAGAACAACGTCAGCTGCAGGACAGCATGGTAGAGGATAAGCTCCGCGACAGCATCAAAGAGATGCGTGCGGATGCAATGCACCAAACCATGATTGGGATGGCGAAAGACATTAACCTGCTAACCGGGTTAAGCCGCGGGGTATTCTTAAACATCGAGCGTAAGAAACTCGAACTGCAATACCGTACCCTGTTTGCCATCCAAGACATCGTGAAGATGAAGCAGTCGGAGTTCGATCGTAATACGCCTGCTTTAGAAGCCATCGTGAAGAACACGGCATTGCCGGATTACGCGAAGGAAGATTTCTCAGAAGTCCGTTGGGCGAACGTTAAACGCCAGGCTGCGGAATGGATGAACCCGTTGCGGTATGCTGACGGGTTTATGGACATGATTCGCGAAAATACCAAGAAGAAAATTTCTGGTATATTTGGCGAAGGTCGCGGTTTGTTAGAATCTGTTCTCGGCATGGGCGTTGAAGACGATTTCGGCATGAGCGACAGCTCTTCGTTGACTGCTGAACGTCGCAAAACAAATGCTCGCGATAAAGCAACTGCGTGGGGTAGTGGATTCCTTGCGAAGAAACTACTCGGTCCACAAATCGAGAAACTCCAGAAATGGACTCGCGAGGAGATGGAGAAAAATCCAGAGGTCATGAAACGCCTGCAGAAAGGCGCGTTCACCTTTGGTAATCTTTCCTCTATCTCTAACTCAGCCATCGCAGGCGAAACGCAAGGGCCGTTGGCGGATTTGTTCCGGGTATTGAATGAACTCGGTATTGTCCAACCGTTAAACCGTGAGAAAGCCTTCCTTGACGAACGTAATGGCGAAACGTTAAGCCGTTCGGCGAAATTTGACCGTAAAGCGTATCTTAGCCTGGTTGAAGTTATTCCTGCCTGGCTGGCAGAGATTAACAAATCCGTTCGTCGGGGTTATGGCGAACACGCCGATCTGGAATACGACATCACCAGTCGTGGATTCGTAGACCGCAAAGTGGTCGGTAACCGTGTACGCAAGGCGGTTGCAAACGATGAACAACGCCTGCGTCTCCAAAATTCCATTAACAGCACTGTGGATTTTGTTGACCGCGGCAAAACGCTCTCGCAGAAGGATCGACAGCATCTCGCCGACTATATCGAGTCGCGAGCCTCGCAGGGCCGCGCATTCGACGTAGAAGCGATTCTGAAAGACCCGATGCATCTTCATCGGTATATGCCAGGAAACTCCGCAGAACGCATTAAGGAAGCCTTACAGGGGCATTCTGATAGCCTAACCGGTGGTAGTAACGAATTAAGTAACGAACTGGCTCGTAAAATCTCCACAGTACAATCGTCTATCACCCAGCGTCAGGCAGTTATTGACGAAGCGGTGAACATTTACGGCGAACGTGCGCTGCGTGATGCAGGTATATTCAACTACGACGCAAAGAGTGACACCTTCGGCGTAGATAAAGACCTCTCCGATCCCTACACCTTGTTTAATGACTTGGCGATGGGTAAGACACGCAGTGGTCGTGC
ACTGACTCGTGACCAGGAGATTCAACGTAAACTGCAAAATGGTTCAGCGTTAGGCGACTACTTGCGCCGGATGAACCAAGGGGCAAATGGTGGTGTCGATGACACGTCGTTGCCGCCGGCTTTACGTGGCGGTGGTAAAGGCCGCGGAATGTCGCCTCGGCAGTTAGCCGCAGTTCTCTACGGTGAAACCTCGACTAACTTTGTTGAATTGTTGAGTCAACGTAACCGTGGCGAAGAAGCACCACGGAATAACTTTGACGGCATCATCGAAGCGATTCGGGGTAATAACAACAGTGACACCCTCCAGAAAATTCTGGAACACGTCAGAAGTATGGACGAAGAAGGGGTTCTCCTGGCTTCGTTAGCAGGGGGTGCTGGTTCTGGTGACGAAGAGATGGGACCACCTCGTCCTGGTGGTAGCGGTGGCGGTAAACGCCGCCGTATTATCATCGGTGAGGATGGGCTTATTCGTCGTTGGGGCGGTGTGTTGTTCGACACTGCAGCAGGAATCGGTGGCTTTGCAAAACGTGGTGTTAAAGGTGCCTGGAATAAACTGAATCAGTTCGGTGGCTGGGCGCGCGGTAAAATCGCGGGGATGGGCGGCGGAGAAGGTCCTGGTTTCTTAACCCGGATGCGTGGCCTTATCAGCGGTAGTGTCCGTGGGGGCTTTGAAGCGGTAAGCTCTTTCGGTAAAGGATTGCTGGGTATCCGCGACGTTTACGATGACCACGGTAATGTTGTTCTGCAAGGTGCACGTCTGGAAGCTGGAGAATACTATCAGGTCATTGATGGTAAAATGGTTCAGCTGAAAACGCTGGACGACATCAAACTGGGGAGTGATATTGTTGACTCCGCAGGTAATCTGGTATTAGCAGCCGCTGACTTAGCCGCTGCGGGTAAACTCCGCTACTATAAAGGCGGGAAAATCCAAGCACTGACCCAAGGTCTGGCCAGTAAGATTGGCTTAGGCTTTAATAAGGTGGCTAAGCTACCGAAACGGTTCCTGGATTTCCTGTCACCGAAAGCGGGCAGCATCGTTGGTAAGATTAAGGACTGGCTGAACGAGATTCCGGAAGGCGAAGAAAAGACACGTCTCCAGAAAGCGTTCAACCGCGTAACGAGCTTACCGGGTAAAGTGCTCGGTTTTGCGAAACGTGGTGTCGACCGTCTGAAAGACATGATTACCGACAACCCACTGACGCGTTGGTGGAAAGGCCGTAAAGATGGTGGGGGTGGTGGTTTCAGTCTCTTCTCATCAACCGGCAAGAAAACCAACCACATCCTTATCCGTATCTATAAACTGTTGAACCAACGTTTACCGGGCGAACCGGAAGACGAAAGTTGGACAGAAGAAATGGAGAAGGGTGTCGGTGGCGGTGGTAGTACGATCGGCCGTGCAGTTCGCGGGGCGTATAATCGTGCGAAATCGTCGTTGTCTGAACGTTTTGGTGGACGTTGGTCAAGAACAAAAGCGTTCTTTGGTCGTGGACGTGACCGCCTGCGTGGTTGGTTCAGTGGTTTCCGAGGCCGGGCTGGGGATATGCTCGATGGATATCGTGGAGCACGACACGACATTGCTACACGTTACGAAGTTGAACGTCGGTTGGCTGGACGTGATGATGATGTTGCTGAGTTTTATCGCAGTCATTTGAACGCGAAAGGTGGCCTTTCTGGCCGTAAAGTCTACGGTGATGCGAAGGAAGACCTTGAGACCGCCCGTGATGCAGCAGGGAGAGTTATTAACAAAGGGAAGAATGCTGCCAAGTCTGCAGGTGCAAGGTTATTCGAACGCTTGGACCGAATGATTGGCTTGCAAGAGATGTCGTGGTTTAACACCATGCGCGAATCAGTATCCCGTGCAGGTGGTGACGATGGCATTATTCGTACCATGTTTGCGAAGTTCGGTAAACGTAACAAGCCGCCTGAAGGTGATGAAAAACGCGACTACTTTAACTTCTTCAAACGTTGGCGTGAGAAACGTAAGGAGAAGAAAGAGAAAG
CGCAGGGCTCCAAAGGGAAATCTGGCGGTCTGTGGGATATGGTGAAGGGCTTACCTATCATCGGTCCCATCGTCAGTATCTTGGGAACTGTGGGTAATATACTCGGTTCTATCACGAAATGGGGTGTGTTAAAACCCGTAGGCCTTTTAGGGAAGGCTGCGTGGAATGTTGGTAAGTTCGCAGTAACCCGCTTGGCGGCACCCGCGGTGTCCGCTGTGGCAACTGCGGCATCTGCCGTCGTGACCGCAGTGGGCTGGCCGGCAATTCTTATCGGTGGTGCGATCGCCGCGGCAGGTTACGCCGCGTACAAAATCGCAACGACTACCTATACCCAGTACCTGGATAAGATGCGTTTAGCTCAGTACGGTTTCCGTGATTACGATAAGTGGTCGTCGGACGACGGGGCGAAAGCGCGTTACTTAGAAGACGCATTGCGTGAGTATGTGTCTTACGCGGAAGATGGCCAAGCCAGTCTCCGTGGGCTAAGTGGAAAAGACGTTCAGAAGTTAGCTGAAGGGTTCGGTATCAACGTTGAGGAAAAAGGCGAGATGTTGGCCTTCCAAGCGTTTATGCTTCAGCGCTTCATTCCGATTTATCTCCGTTGGATTACGGCACTGAAATCAATGCCGAATAGTATCCAGTTGGCTGACGTCGGCGATGCGAAGAAAGTATCGAAAGAGGACATGCAGACACTCTTCAACAAAATGAAGATGACTAAGGATGCAAAAGCGTTCTCTTCGTTAACTGACCCACGCAAAGTCAACCAAGGTTTCTTCTCGAAGGCCTGGGACGTCGTAACCTTTACACCGAAGGAGTTTTTGAGCGGTGAAGAGGTTATGGAAGTGCAGAATGAAGTCGAACGTGCAATCAAGTTCAGAATGGACGATAAGAAAGCACGTAAGTACGGAATGGCACCGGCCGTGGAGGGGATTAAGTCCGCGGGCGTTGATGAAGCTATCAACAAACTTGGACAGTTGGATAACGAACGTAATAAAAATCTGGCGAAGGTGGAGGGTTGGGAAGACGGAACAGAGCAGGTTCAGATTCAGGTAGACTGGAATGCGGTGCTCGACCAGAAAGACATGAATGCGATGGAGTCGGTACGTTGGAAGACTTATGGTTTTACCACCATCGACAACGCCACCCGCACGTTGATTACGGTATTCGAGAAGAACGTCATCAAAGACATTGACGTGAAAACCGCAAGCTACAAAGGAGATTGGAAGAAAGCAATCGCCTCGATGGTTCCAGACGCTATCGGTACACCGAAAGAAGATCGTTTGAAACGGTGGTTCTTTGACCGTTTCTTACCGGTATTCATGACGTATTTGGTTGGGGTGAAGCGTTACCTGCCAACAGCCGATCCGCTGAACTTGAAACTGACCGGCGGCTACCTGTACGAAATCAGTCTGATGATGTCAACTGCCTACAGTTTGAAAGGCGGTATAAGACAGTCGGTGTGGGAAGTGAATATCAACCCATTAGGGGGCGATGCTAACACTAATCCGTCGTCTATCAAAGCCGAGCTGGAAACGCTGAAGCTGTTGTCAAAAGAAGCTGACCTTGCCGTGCGTAACATGATTAAAGCCATTAAAAATAATGGCAAGCGTGCACGCTGGAAGGATCGTAACAAGAACCGCAGTTCTCTGGAAGTTACCGGCGAAGATGAAGAAGACTCGAACATCAGCTCGGGAGATTCTTTGTCATCTGACGGTGCTCGCGCCTCAGGCTACATCCCGTCGGGTACGAGTGGTGGTGTGCCGGGTAACTTGGGTCAAGTCGTCGATGCGGTTGGCGGTGTGCGGAACTACGCCGCAATGACCACTGGTTCGTCTTCGATTAACCTGAGTGATGTGAAAGACGGTGATTATAAGTCACTGGCTGAAAAATACCCGATAGAAATGTTGGGTAGAAAGGGTGCGTTGAACGTTCCGAATATCAAAGCATTGATTACCGATGCGGCGAACATGATGGGCGTGCCACCTGCAGTGGCGTTAGCA
ATGGCTAAAGCGGAGTCCGGATTTAACTACACCGCTAAAAACCCGTATGCTTCGGCGTCTGGGTTGTTCCAGTTTGTTGACGGCACGTGGAACGGGATGATGAAGGGGTATTCGCGGAAGTTTGGTATTCCGCGCGTTAACCAAATGGACCCGTGGGCGAATGCTATATTGGGTGTACAGTTCATTCGTGACAACATCCAACAAGCACAGCGTGACCTGGGTGGTAAAGCACCACCTCCAGCCGTGGCTTATCTGTACCACTTCTTGGGTGCAGGCGGCGGTAAGAAGTTCTTGGAAGCATGGAAGCGTAATCCGAATATGGCGGCATCAAGTGCTCCTGGGATTACATCTGCAATACTGAGAGGGAATGCCAACGTCTTCTACAGCAACGGTCGTATACGTAGCGTAGACGGGGTTATTCAGGAACTGAACCGCCGCATGGGCGCAATTTCCGCCAACGAAGTCGCTGCCGATCCGAGTAAGACGAAGGATATGGTTGCAGGCTTGTCGCCTAATTCACCAACCAACCCGGCAGCAGCAATGGGTGCACCGGCCGCTAATGATCCGAGTCTGTCGCCAGCAGATAACCTGCCGGCAGATAATGCTAATCGTCGTGATGACGCATTGACGCAGAAAGGGGCCATGGCGGCACAAGATGCAATGGCTACCGCCGCAGGTAATGTAGGACCAGCAGCACCTACACCAACTACAGGTGGTTCCGGAACATCCGATGCCTCAACAACAGCCGAAACCGTAGCGTCGCAAGCTGCAGCAGAAGGATTGTCTGCAACGGATGTTGCTAAAGTGAAAGCAGGCGCGGAAGCGCAGGTTAACGCAGCAGCTCGTCCAGTTGCCGCTCCAACTTCTGATGCTACAGCATCGACGCCAACGCTGAATGGTGACCCGATAGATGTTCAGCAGCTCAAGGTACTGATTCAGTCTCGCGATTACTTGAAAGAGATTCGTGATATTTTGAAATCAAATCCGAGAGCAGCAAACGACACGCGTGGTGGTAGTATCCAGCAAGCGGCAAATGTGCCTCCTCCGGGCTCAGCTGCACGCAGGCAGGAAATAACCCAACCGACACCGTCGTTAAACGTAAGTCGGAAAGCAAGCTAA
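
The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the CDS includes the stop codon, which is consistent with the sequence above ending in TAA. The function names are illustrative, and `extract_cds` shows how a minus-strand feature would be read from a genome string.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, assuming one terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1  # drop the stop codon


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (unambiguous bases only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a feature out of a genome; minus-strand features are reverse-complemented."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "-" else region


print(cds_length(205209, 212327))               # 7119
print(expected_protein_length(205209, 212327))  # 2372
```

The range 205209 -> 212327 spans 7119 nt, i.e. 2373 codons, which matches the 2372-residue protein plus one stop codon.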

Genome Context

Tertiary structure

PDB ID: 7a7bbc249a81ee512d1c6453da1f4daeef182ce5234392b374ac93a78f8168c1
Source: ColabFold
Method: ColabFold
Resolution: 0.4606
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
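
The confidence bands above can be applied to per-residue pLDDT scores as follows. This is a minimal sketch: the legend uses strict inequalities and does not say where the exact values 90, 70 and 50 belong, so placing them in the lower band is an assumption, and `plddt_band` and `band_counts` are hypothetical helper names.

```python
from collections import Counter
from typing import Dict, Iterable


def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT out of range: {score}")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 counts as "High"
        return "High"
    if score > 50:  # assumption: exactly 70 counts as "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 counts as "Very low"


def band_counts(scores: Iterable[float]) -> Dict[str, int]:
    """Count how many residues fall into each confidence band."""
    return dict(Counter(plddt_band(s) for s in scores))


# Made-up scores for illustration; not taken from the model on this page.
print(band_counts([95.2, 88.0, 71.5, 64.3, 42.0]))
```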