GenBank accession
URC22127.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF (Evidence: Phold; Probability: 1.00)
TF (Evidence: RBPdetect; Probability: 0.89)
TF (Evidence: RBPdetect2; Probability: 0.96)
Protein sequence
MLNEYEDKVVEVIVDVPKASAQSGGGDFNGIVVKDTATVQLVGSGTVDTPLMGSVRISKKAQNQVQSINSGPDAGIYVAPSEPKVSSRAGNTIVELVSSDPAVEGLYSPPLHDRATSPNKTIAIKSVSSDLSKSTIEVAVDGGDKNAIRVTSSGLAVNVSSVLDNTLELKSDGGLYVRPQETKISQDPLNKIETKPDGIFVENIKGDQGVKGDKGDKGDRGEQGLKGDKGDPGEKGDKGDMGLGIKVLDTLSSVSDLPPVADYNDGDTFVIEGHFWTKITRSGTPAWEDLGSFIGPDGRSAYEVAVQEGFQGTVDEWLVSIRGKDGIGLRILGSFDNVNQLPSTGNQSGDAYIVDEQMWVWDTQKWSPVGQVGPEGKSAYQVWIAAGNTGTVQDYLNSIKGEKGDTGPVGPKGETGSAGTNANVLNLKGSKASESDLPSTGNQIGDAWVVGTDVFGWTGTAWENYGPIRGPKGDTGATGQTGPQGARGPQGLQGVKGDTGPQGLQGPRGLNGEKGDKGDVGKGVGVKGTKDQPSDLPSTGNEDGDAYIVNGNLYVWSAGDWHNEGPIVGPQGPVGPQGAKGDKGDVGDRGAQGVKGETGPAGAAGAKGDKGDTGAALKPKGTKPSESDLPSTGNTEGDMWSVNGIGFVWNGTGWTNIGTIQGPKGDTGPAGQNGAKGDTGDRGEQGPKGDTGPEGPQGEKGEQGAGVKVLGKKDSEADLPSTGTLGEGYIIGQDFYVWTGSAYENVGPIQGPKGDQGLRGLTGAQGPVGDKGAKGDKGDQGNIWVVLPRDPQPTDGQQIGDIFMNKNTLEYWRKISATEWASQGHIGGGNVYDASADGRRKVRLDGAWVDETQAQRVAVINTSTPVIDLSRQTVAFTLDNSTNTAKTISFTNVPTNEYMLPLTIVIKGKAGALTWPANVKWSGGSAPTYADNKTVIVMLWDGTEFIGTLGPNY
Physico-chemical properties
protein length: 953 AA
molecular weight: 98703.43720 Da
isoelectric point: 4.72903
aromaticity: 0.05981
hydropathy: -0.56170
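
The record does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of Phe, Trp and Tyr residues and hydropathy is the mean Kyte-Doolittle index (GRAVY), two of the properties could be recomputed from the protein sequence as follows. The function names are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr (assumed definition)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input only: the first ten residues of the record's sequence.
# Substitute the full protein sequence to compare against the listed values.
toy = "MLNEYEDKVV"
print(len(toy), round(aromaticity(toy), 5), round(gravy(toy), 5))
```

Molecular weight and isoelectric point depend on the residue mass table and pKa set used, so they are left out of this sketch.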

Domains [InterPro]
IPR008160 STR 470–522
DC_1340 STR 490–599
URC22127.1 (residues 1–953)
Architecture
ATT 1-95 | ATT 109-337 | STR 338-379 | STR 385-705 | RBD 706-953
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
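
The architecture line lists labelled segments with residue ranges. A small sketch, assuming the ranges are 1-based and inclusive, parses that line and reports the stretches of the 953-residue protein that no segment covers. The helper names `parse_segments` and `unmapped_gaps` are illustrative.

```python
import re

ARCHITECTURE = "ATT 1-95 | ATT 109-337 | STR 338-379 | STR 385-705 | RBD 706-953"
PROTEIN_LENGTH = 953


def parse_segments(text):
    """Return (label, start, end) tuples; coordinates assumed 1-based inclusive."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)))
        for m in re.finditer(r"([A-Z]+) (\d+)-(\d+)", text)
    ]


def unmapped_gaps(segments, length):
    """Return (start, end) residue ranges not covered by any segment."""
    gaps, cursor = [], 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= length:
        gaps.append((cursor, length))
    return gaps


segments = parse_segments(ARCHITECTURE)
gaps = unmapped_gaps(segments, PROTEIN_LENGTH)
print(gaps)  # [(96, 108), (380, 384)]
```

Under that assumption the five segments cover 935 residues, leaving 18 residues in two short gaps.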

Taxonomy

Phage
Name: Serratia phage vB_SmaM-ChuuTotoro [NCBI]
Taxonomy ID: 2943832
Lineage: Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host
Name: Serratia marcescens [NCBI]
Taxonomy ID: 615
Lineage: cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales

Coding sequence (CDS)
GenBank protein accession
URC22127.1 [NCBI]
GenBank nucleotide accession
ON287369.1 [NCBI]
CDS location
range 19163 -> 22024
strand -
CDS
ATGCTTAACGAATACGAAGACAAGGTGGTTGAAGTCATCGTGGATGTACCTAAAGCTTCCGCTCAATCAGGTGGCGGGGACTTCAACGGGATTGTGGTAAAAGATACGGCCACTGTCCAGCTTGTTGGGTCAGGTACGGTAGACACACCACTCATGGGGTCTGTAAGGATCTCGAAAAAAGCTCAGAACCAAGTTCAGTCAATTAACTCTGGCCCTGATGCGGGGATCTATGTAGCCCCTTCAGAGCCGAAAGTATCAAGTCGTGCCGGTAACACCATCGTAGAGCTAGTCTCTTCTGACCCAGCTGTCGAAGGCTTGTACTCCCCTCCTCTGCACGACCGTGCTACCTCCCCAAATAAAACCATCGCCATCAAGAGTGTTTCTTCTGATCTCAGCAAGTCTACGATCGAAGTAGCAGTAGATGGGGGTGATAAGAACGCGATCAGGGTAACCTCATCAGGCCTCGCTGTCAACGTATCTTCTGTACTTGACAACACCCTCGAATTGAAATCAGACGGCGGATTGTATGTTCGCCCACAAGAAACTAAAATCTCTCAGGATCCTCTGAACAAAATCGAGACAAAACCAGATGGTATCTTCGTCGAGAACATCAAAGGTGATCAGGGTGTCAAGGGTGATAAGGGAGACAAAGGCGACCGTGGCGAGCAGGGACTGAAAGGGGACAAGGGAGACCCCGGCGAGAAGGGTGATAAAGGCGACATGGGTCTTGGGATCAAAGTCCTAGATACCCTGTCCAGCGTTTCCGACCTCCCTCCTGTTGCAGACTATAACGATGGTGACACTTTCGTTATCGAGGGTCACTTCTGGACCAAGATCACCAGATCTGGAACTCCAGCGTGGGAAGACCTAGGATCCTTTATTGGACCAGATGGAAGGTCTGCTTACGAGGTTGCAGTCCAAGAAGGCTTCCAAGGGACTGTAGATGAATGGCTGGTGTCCATTCGAGGTAAGGACGGTATTGGACTTAGGATCCTTGGATCATTCGATAACGTCAACCAATTGCCATCTACAGGTAACCAGTCTGGCGATGCGTACATCGTTGACGAGCAAATGTGGGTGTGGGACACTCAGAAATGGTCACCTGTCGGTCAAGTAGGTCCAGAGGGTAAGTCAGCCTACCAAGTGTGGATTGCTGCTGGAAACACCGGTACAGTACAGGACTATCTTAACTCAATCAAGGGTGAGAAAGGAGATACGGGTCCAGTCGGACCTAAAGGTGAGACGGGTTCCGCCGGTACAAACGCCAACGTTCTTAATCTGAAGGGTTCGAAAGCTTCAGAGAGTGACCTTCCTTCCACAGGGAACCAGATTGGCGATGCATGGGTTGTCGGGACAGACGTCTTCGGGTGGACAGGAACCGCATGGGAGAACTATGGCCCTATTCGCGGACCTAAAGGTGACACTGGGGCAACCGGCCAAACTGGACCTCAGGGTGCCAGAGGCCCTCAAGGTCTTCAAGGTGTTAAAGGCGACACCGGCCCTCAAGGTCTTCAAGGCCCTAGAGGCCTGAATGGTGAAAAGGGGGACAAGGGTGATGTTGGTAAGGGTGTCGGAGTAAAAGGCACTAAAGACCAACCTTCTGATCTCCCTTCTACGGGGAATGAAGATGGTGATGCCTACATCGTCAATGGTAACCTCTACGTCTGGTCCGCAGGGGATTGGCACAACGAAGGACCTATCGTCGGACCACAGGGTCCTGTAGGCCCACAGGGGGCCAAGGGCGACAAAGGTGACGTTGGTGACCGTGGTGCACAAGGCGTAAAAGGTGAGACAGGGCCAGCAGGCGCAGCCGGAGCCAAGGGCGACAAAGGTGATACCGGCGCAGCCCTGAAGCCTAAAGGGACTAAGCCTTCCGAATCGGATCTTCCATCGACAGGGAATACCGAAGGGGATATGTGGTCTGTTAACGGGATCGGTTTTGTATGGAACGGTACCGGATGGACAAATATCGGCACCATCCAAGGTCCTAAGGGTGACAC
TGGTCCAGCTGGACAAAATGGGGCGAAAGGCGACACCGGCGATCGCGGGGAGCAAGGACCTAAGGGTGATACCGGTCCAGAAGGTCCTCAAGGAGAGAAGGGAGAGCAAGGTGCCGGGGTTAAGGTTCTTGGCAAGAAAGATAGTGAAGCTGACCTACCTTCTACCGGTACGCTGGGAGAGGGTTACATTATCGGTCAGGACTTCTATGTCTGGACCGGGTCAGCCTATGAAAACGTTGGACCTATCCAAGGACCTAAGGGCGATCAGGGTCTCCGTGGCCTGACCGGCGCTCAAGGGCCTGTCGGGGATAAAGGGGCCAAGGGCGACAAAGGTGACCAAGGTAACATCTGGGTTGTCCTCCCTCGAGATCCTCAGCCTACAGACGGCCAGCAGATTGGCGACATCTTCATGAACAAGAATACGCTGGAGTATTGGCGTAAAATCTCCGCAACTGAATGGGCCTCTCAAGGTCATATTGGTGGCGGTAACGTCTACGATGCTTCGGCAGATGGAAGACGAAAAGTTCGTTTGGATGGAGCATGGGTAGATGAAACTCAAGCCCAGCGGGTCGCCGTAATCAACACCTCAACCCCGGTGATCGACCTGAGTCGTCAGACCGTGGCATTTACTCTTGACAACAGCACAAACACAGCGAAAACCATCTCGTTCACGAATGTGCCAACCAATGAGTACATGCTTCCACTAACCATCGTGATCAAGGGCAAAGCTGGCGCACTTACGTGGCCTGCAAACGTCAAATGGTCTGGTGGTTCCGCGCCAACATACGCAGACAATAAAACCGTCATCGTCATGTTGTGGGATGGCACCGAGTTCATCGGAACCCTTGGGCCGAACTACTAA
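
The CDS location can be checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the GenBank convention) and that a minus-strand CDS is the reverse complement of the genomic interval. `extract_cds` is an illustrative helper, shown on a toy genome rather than on ON287369.1.

```python
CDS_START, CDS_END = 19163, 22024  # range from the record, assumed inclusive
PROTEIN_LENGTH = 953

cds_length = CDS_END - CDS_START + 1
codons, remainder = divmod(cds_length, 3)
print(cds_length, codons, remainder)  # 2862 954 0

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def extract_cds(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a 1-based inclusive interval; reverse-complement it on the minus strand."""
    region = genome[start - 1:end]
    if strand == "-":
        region = region.translate(COMPLEMENT)[::-1]
    return region


# Toy genome: the minus-strand interval 3..8 encodes ATG followed by a stop codon.
toy_genome = "CCTTACATGG"
print(extract_cds(toy_genome, 3, 8, "-"))  # ATGTAA
```

The 2862 nucleotides correspond to 954 codons, that is, 953 amino acids plus the stop codon, which matches the listed protein length.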

Tertiary structure

PDB ID
535232e9fe5f8545b2051b7cd241b707640ec6f17d4f6848f2fdd752f83c50fe
Source: ESMFold
Method: ESMFold
Resolution: 0.6471
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
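
The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is an assumption of this example, as is the function name `plddt_band`.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:  # boundary values fall into the lower band (assumption)
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.5, 42.0):
    print(value, plddt_band(value))
```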