Genbank accession
XDJ14828.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,64
TF
Evidence RBPdetect2
Probability 0,96
Protein sequence
MAESNKKGMISGMDSLISISGEEFLEVIRLESDGSYKNYKLLVSKIRNNQGLSAYEIAVQNGFVGTVDEWLASLEGKSAYQIAVDAGFVGDEAAFIASLKGEKGEDGNEGKSIYDIALENGFIGTEADFLKTLVGKSAYQTALDNGFVGTEAEWLLSIKGDKGDDGDVGPIGPDGKSAFEVWQALPGNAGKTEDEFFEDQKGVTGDSAYEAAVAGGFVGTEAEWLKSLEGKSAYEIAKELDPTLTDESEFIKSLEGKTAFEVAKEAGFQGTEAEWLESLEGKSAYQIAKDGGFVGTESQWLASLEGSDGNDGKSAFDIYKELPGNENKTEAEFIESLKGEDGTNGTNGTNGEDGADGESAFDAYKKIPGNENKTEADFIASLKGEKGEDGTNGTDGKSAYEVAKDDGFVGTVEQWLESLVGEKGDIGQGVNVIATIEQDEYDQIVADGTSQPGDAYIVGVYLYIFNGVDWVKSNSIMGPAGQGLNYLGQWPDNVPLPLGPTYKAGDTYVWKNQGITSLYTLVITKDESGVETKREWVDIGVPGPQGASVYETWLKQPGNAGKPESEFLEAMKIKGDKGERGNDGTNGTNGVDGKSAYEVAVENGFVGTVEQWEASLVGPEGQKGDPAVAFEIKGRLTDESELPTPGVPSEAYYVGKNLFVWLADETKWENFGSLNGDSAYQTWLEQPGNAGKTEEEFLESLKGQSAYKLWADQPENAGKTEQEFLTSLKGKDGINGTNGIDGTDGKNLQVEGTKADLAEIQAIVDPQDQEAWVALDTGHLHIFAKGTWVDAGPFRGEDGKDGTNGTNGTDGKSAYETFKEIPGNEDKTEQEFIDSLKGKDGTNGTNGTDGEDGRNVQITGSVANEAALPTGAAQQDAYTVRDTGHLFMWIATTWVDLGQFKGDKGDTGDQGEIGLTGLGLNIVAEVETLADRPDPATLQSGDAVFVRENKSLYQLNAQGGWNVGINIEGPVGPDGPQGEKGDPGGGLKILGKYATLGELQAEHAQGTAGDAYLVGQNLYAWVTDAWTDLGPVVGPRGEQGPIGKTGLKGEKGVKGDRGALWLKLEDGVLVPTPVHGTPGDWAVNAQFDTYYKVDDTNWVHFGKLTAGDVWKPSELNIKMVWLNTSATTGGWVVLPVDEVSNPEEGVYYVRTRNATDPTKTEWTQLPHIADITAKSDTDQWVRVFKAAADAPEWAKLTVPDAGIPEAPTTAGKGYLRSGASGGSWIEGLTAPATAGKFLRTQTSWEQFNSYDLAYSQSAAATSAIAFDLSKQQMVEVDNSGNNAKTVTLSGIPGNGRCTTAVVVVKGNGGTVAFTAPNMLGGEVKWNSGTQPVYTAGYSVVTLLVYATSANTIVIGATGAQTLS
Physico‐chemical
properties
protein length:1363 AA
molecular weight: 144587,97880 Da
isoelectric point:4,35389
aromaticity:0,08804
hydropathy:-0,46310

Domains

Domains [InterPro]
DC_1725
STR
33–77
DC_1967
ATT
100–136
XDJ14828.1
1 1363
Architecture
STR
ATT
STR
STR
ATT
STR
STR
STR
STR 33-73 | ATT 74-178 | STR 197-231 | STR 254-374 | ATT 386-533 | STR 581-675 | STR 688-972 | STR 977-1290 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudomonas phage RVTF4
[NCBI]
3236931 No lineage information
Host Pseudomonas sp.
[NCBI]
306 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Pseudomonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
XDJ14828.1 [NCBI]
Genbank nucleotide accession
PQ015378.1 [NCBI]
CDS location
range 216934 -> 221025
strand +
CDS
ATGGCAGAGTCGAATAAGAAAGGTATGATTTCGGGTATGGATTCGCTGATCTCGATCAGCGGTGAAGAATTCCTGGAAGTCATCCGCTTGGAGAGTGACGGTTCGTACAAGAACTACAAGTTGCTCGTATCCAAGATCCGCAACAACCAAGGTCTGTCTGCTTACGAGATCGCCGTCCAGAACGGCTTTGTTGGCACGGTAGACGAGTGGCTGGCTTCCCTCGAAGGTAAGTCGGCGTATCAGATCGCTGTTGACGCTGGCTTTGTTGGCGACGAAGCGGCCTTCATCGCTTCGCTGAAAGGTGAGAAGGGTGAAGACGGTAACGAAGGTAAGTCTATCTATGACATCGCTCTGGAGAACGGCTTCATCGGTACCGAAGCAGACTTCCTGAAAACCCTGGTCGGTAAGTCGGCATACCAAACTGCGCTGGACAACGGTTTCGTTGGTACTGAAGCGGAATGGCTGCTGTCGATCAAAGGTGACAAAGGGGATGACGGTGACGTAGGTCCAATCGGTCCTGACGGTAAATCCGCATTCGAAGTCTGGCAAGCTCTGCCAGGCAACGCTGGGAAAACCGAAGACGAGTTCTTCGAAGACCAGAAAGGTGTAACTGGTGATTCTGCCTATGAGGCAGCTGTTGCCGGTGGCTTCGTAGGAACCGAAGCCGAATGGCTGAAATCGTTGGAAGGGAAATCTGCCTACGAAATCGCCAAGGAACTCGATCCAACTCTGACCGATGAGTCCGAGTTCATCAAGTCCCTGGAAGGTAAGACTGCATTCGAAGTCGCTAAAGAAGCTGGCTTCCAGGGTACTGAAGCCGAATGGCTCGAATCCCTGGAGGGCAAGTCTGCCTACCAGATTGCCAAAGACGGTGGTTTTGTCGGCACTGAAAGTCAGTGGCTCGCTTCCCTGGAGGGTTCGGATGGTAACGACGGCAAGTCCGCGTTCGACATCTACAAAGAACTGCCGGGTAACGAGAACAAGACCGAGGCCGAATTCATCGAGTCCCTCAAGGGCGAAGATGGTACCAACGGCACGAACGGCACCAATGGTGAAGACGGTGCTGATGGCGAATCGGCTTTCGATGCCTACAAGAAAATCCCAGGCAACGAAAACAAGACCGAGGCTGACTTCATCGCTTCGCTGAAAGGTGAGAAGGGTGAAGACGGTACCAATGGTACCGACGGTAAGTCCGCTTATGAAGTTGCCAAGGATGATGGCTTCGTAGGTACTGTCGAGCAGTGGCTGGAAAGCCTGGTCGGTGAGAAGGGTGACATTGGCCAAGGCGTCAACGTGATCGCTACCATCGAACAAGATGAGTACGATCAAATCGTGGCCGATGGCACTTCCCAACCTGGTGACGCTTACATCGTCGGTGTGTACCTCTACATCTTCAACGGTGTGGATTGGGTTAAGTCGAACTCGATCATGGGCCCTGCTGGTCAAGGCCTGAACTACCTTGGCCAATGGCCGGACAACGTTCCTCTGCCTCTGGGTCCTACCTACAAGGCTGGCGACACCTACGTCTGGAAGAACCAGGGCATCACTTCGCTGTACACCCTGGTTATCACCAAAGACGAATCCGGTGTAGAAACCAAACGTGAGTGGGTAGACATCGGTGTTCCTGGTCCTCAAGGTGCTTCGGTTTACGAGACCTGGTTGAAACAACCTGGTAACGCAGGTAAGCCTGAATCCGAATTCCTCGAAGCCATGAAGATCAAAGGCGACAAGGGTGAGCGCGGTAACGATGGTACGAACGGCACCAATGGTGTTGATGGTAAGTCGGCTTACGAAGTGGCTGTTGAAAACGGCTTCGTGGGTACCGTTGAACAGTGGGAAGCATCCCTGGTTGGCCCAGAAGGTCAGAAAGGTGATCCCGCTGTTGCGTTCGAGATCAAAGGTCGTCTGACTGATGAATCCGAACTCCCAACTCCAGGTGTTCCATCGGAAGCTTACTACGTCGGTAAGAACCTGTTCGTCTGGCTGGCTGACGAAACCAAGTGGGAGAACTTCGGTTCCCTGAACGGTGACTCTGCTTACCAAACCTGGCTGGAACAACCGGGCAACGCCGGGAAGACTGAAGAAGAATTCCTGGAAAGCCTGAAAGGACAGTCGGCGTACAAACTGTGGGCTGATCAACCAGAGAACGCGGGTAAGACCGAGCAGGAATTCCTGACCTCGCTGAAAGGCAAGGACGGGATCAATGGTACCAACGGTATCGATGGTACTGACGGTAAGAACCTGCAGGTCGAAGGCACCAAAGCTGATCTCGCGGAGATCCAAGCTATCGTTGATCCACAAGATCAAGAGGCCTGGGTTGCCCTGGACACCGGTCACCTGCACATCTTTGCGAAGGGTACATGGGTAGATGCTGGTCCATTCCGTGGTGAAGACGGTAAGGATGGTACCAACGGCACCAATGGTACCGACGGTAAGTCTGCGTACGAAACCTTCAAAGAGATCCCAGGCAACGAAGATAAGACTGAACAGGAGTTCATCGATTCGTTGAAAGGTAAAGACGGTACCAACGGAACTAATGGTACTGACGGCGAAGATGGTCGTAACGTTCAGATCACTGGTTCTGTAGCTAACGAAGCTGCATTGCCAACTGGTGCTGCTCAGCAAGATGCCTACACCGTGCGTGACACAGGTCACCTGTTCATGTGGATCGCTACAACCTGGGTTGACCTTGGTCAGTTCAAAGGTGACAAGGGCGATACTGGTGACCAAGGTGAAATCGGTCTGACTGGTCTCGGTCTGAACATCGTTGCCGAAGTGGAAACACTGGCAGATCGTCCAGATCCTGCAACCCTGCAATCGGGTGATGCGGTATTCGTTCGTGAGAACAAGTCTCTGTATCAGCTGAACGCACAAGGTGGCTGGAACGTCGGTATCAACATCGAAGGTCCAGTAGGTCCAGACGGTCCTCAGGGTGAGAAAGGTGATCCTGGTGGTGGCCTGAAGATCCTCGGCAAATATGCCACTCTTGGAGAGTTGCAGGCAGAACATGCCCAAGGCACTGCCGGTGATGCCTACCTGGTTGGTCAGAACCTTTACGCTTGGGTAACCGATGCGTGGACTGACCTTGGCCCTGTTGTTGGCCCTCGTGGCGAACAAGGTCCAATCGGTAAGACCGGTCTGAAAGGTGAGAAGGGCGTCAAGGGTGACCGTGGTGCTCTGTGGCTGAAACTGGAAGACGGTGTTCTCGTTCCGACTCCAGTTCACGGTACCCCTGGTGACTGGGCCGTAAACGCTCAGTTCGATACCTACTACAAAGTGGATGACACCAACTGGGTCCACTTCGGTAAACTCACCGCTGGTGACGTATGGAAACCATCGGAACTCAACATCAAGATGGTCTGGCTCAATACCAGTGCCACTACTGGTGGTTGGGTTGTCCTGCCGGTCGACGAAGTCTCTAACCCTGAAGAGGGCGTGTACTACGTTCGTACTCGGAATGCCACCGATCCAACCAAGACCGAGTGGACTCAGTTGCCCCACATCGCAGACATCACTGCTAAGTCGGATACCGACCAGTGGGTACGTGTGTTCAAGGCTGCCGCTGACGCTCCGGAATGGGCGAAGCTGACTGTTCCAGATGCCGGTATCCCAGAAGCTCCGACTACTGCTGGTAAAGGGTACTTGCGTTCGGGTGCATCTGGCGGTAGCTGGATCGAAGGTCTGACTGCTCCAGCTACGGCTGGTAAGTTCCTTCGTACCCAGACCTCGTGGGAGCAGTTCAACTCCTACGACCTGGCATACAGCCAGTCTGCTGCCGCAACTTCGGCTATTGCATTCGATCTCTCGAAGCAGCAAATGGTGGAAGTTGACAACAGTGGTAACAACGCGAAGACAGTTACCCTGTCGGGCATTCCTGGAAATGGTCGTTGCACCACCGCTGTTGTCGTCGTCAAAGGCAATGGCGGTACCGTAGCCTTCACCGCACCAAACATGCTTGGTGGCGAGGTGAAGTGGAACAGTGGTACGCAACCGGTTTACACCGCAGGTTACAGTGTTGTTACGTTGCTTGTCTACGCGACTTCTGCGAACACCATTGTCATCGGTGCTACTGGTGCACAAACGTTGTCGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
d81f8210e78cb7583cc541cba134457f43ad77e9b41f3e04fb1937432f7ed361
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6473
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50