Protein
- Genbank accession
- AHC30457.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
- TFTFTF
- Protein sequence
-
MIIRKVLKGAKGGGKGGSKDNSPNSLRSGAYIKLIDLLGEGEIGGLVDGSRSIFFDNTPLMSGGSYNFKGVSWTQRVGLPDQPYVPGFNSVENEEAVGVQVKHGLPITRNVEGEDLDAVRVTVEIPALARQSDGRFKAHSVDFALEVRYDGGPWTNPFGTLTISGKCTSPYDRNYLINLPRNPSGASSPWSIRMVRLSDDSEDATKEQNDTVFASMTKITYGKFTYPNSAYIAMRVEAEQFGQSVPTRSYEVYGRIVQVPTNYTTRLYNADGVITRNASYSGSWNGTFKWAWTDNPAWVLYDIMTNDRFGLGEDIDASQVDKWSLYEIAQYADQNVPDGRGGNKPRFTFNGAMYDQQEAFDAIQQIASVFRGMAYWSSGSVMATQDRPKDVSILAASANAVDGSFSYQGSALTARHTVAVVKFDNPELNYEQDFVVVEDRDGIELYGYNETEILATGCTDRAQAYRIGEWALFTELNETNVVSYKAGLDHAGLRPGDIIAVQDRSFAGPDNAGRLKAGSSASTLLLDHDVTLETGQSYSVSVVLPNGTVEERDIVVSSYDVPINILDVSAPFSVTPDEEALWVLASDTLVPSLWSCISVRESEPHIYEVVGAQYEPAKFEHIDNSARFDPLPTQNNPVVQPPINLLVQESIYTQDGVPRTALLVSWTSPGIEFAVVGYEVAYDGPDGYVQVGRVPSTSVQILDAPPGDYTFYISSVSLSQRVSLPATIDYATLGWQGTHNGTIANVRLKGKTPDITSFTGSSPEFVWDNVWPTGTIFNPDGSSPIFRNNLVSIYDEVTEDLLRQEYTRNSNYVYTLSKNRADNAVYNRGPSRAFRVEVQVCDVYGRQSDPGTISVSNPPPPAVAPVGIPGFGQFVVRWSTLSDPDLAGFLVWVSSAPGFNPLTTEPYFDGMVNAITYAASSGTTYYVRVGAYDTFGKTGLNISGEIECTAVGSDDLDPPDTPTGLTSTTSLISDVQSRVIYTWDANTEPDMMGYTLEIKEEGGDWVGFNTSTAIYPINVLPSTALDARVRAYDVNGNYSAFTAVYSTVAAGDLVPPAVPTNWTAKGGFGIVFLEGDPNSERDFRAFEVYASEATDAPDGATVATHTSSANQVFITDLDDNLTLNFWVRAVDTSGNKSDWTAMFSATTVSSNVLLTTEALEGIVDRTSFATSIEVPGIGNALPDIPFDSSTPKQFFLATEGRMYVQKVDGTGWVLTTSTTILDGQIITGQIAAGAIGTDQLAANAVTAKNLSIRDFTVLADNADMQLGPVKGWANSSRIFNDPATAYPGQTWVAKLIASVTVTVANELEVPCKEGEKFYLSASVKASGAAGSGRKGVRAQFLDSAGTLLTPGTADTTSIPADWVSVSGFATAPAGAVRVRLEMIAYNNEGGTVYIANPRMMRAAAVLIEPNGITSDKITTGEFITLSAQIKDAIITDAKVANLSAAKLTAGTALAGSITVSGTALSTMQSQANDPATRINAQSTQIDPGKILISGATTLASWRDGTDTTLIAGGKLSANSVTANKLTIGARGINLENIQFSYDKTAGTVSWTAGTLRYMSTNATTGVVEQRGLVVSVQALPPGQPGVSTSTGRSPIPTRRSPRLSRSPPRTSSPRRTLPIPSSSPPTTVERSSTPTTARRSSTAQRSRPAPS
- Physico-chemical properties
- protein length: 1651 AA; molecular weight: 177134.04470 Da; isoelectric point: 4.74375; aromaticity: 0.08904; hydropathy: -0.20478
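The record does not state how aromaticity and hydropathy were computed. A minimal sketch, assuming the common definitions (aromaticity as the fraction of Phe, Trp and Tyr residues; hydropathy as the mean Kyte-Doolittle value, often called GRAVY), could look like this. The function names are hypothetical and the result for this protein is not guaranteed to match the values listed above.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = set("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence in this record.
    peptide = "MIIRKVLKGA"
    print(aromaticity(peptide), gravy(peptide))
```

Passing the full 1651-residue sequence instead of the short peptide gives values that can be compared with the ones reported in the record.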
Domains
Domains [InterPro]
| Domain | Type | Range |
|---|---|---|
| IPR053171 | Unmapped | 4–756 |
| DC_0129 | STR | 16–783 |
| IPR055385 | ATT | 91–221 |
| IPR003961 | STR | 1055–1150 |
| DC_1605 | STR | 1145–1407 |

Residue scale: 1–1651
Architecture
STR 16-90 | ATT 91-221 | STR 222-355 | ATT 356-515 | STR 516-755 | ATT 756-953 | STR 957-1407 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
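The architecture line can be read programmatically. The sketch below assumes the format seen in this record (`TYPE start-end` fields separated by `|`) and uses hypothetical helper names. It parses the segments and lists residue ranges between consecutive segments that carry no annotation.

```python
# Architecture string copied from this record.
ARCHITECTURE = (
    "STR 16-90 | ATT 91-221 | STR 222-355 | ATT 356-515 | "
    "STR 516-755 | ATT 756-953 | STR 957-1407 |"
)


def parse_architecture(text):
    """Return a list of (type, start, end) tuples from an architecture string."""
    segments = []
    for field in text.split("|"):
        field = field.strip()
        if not field:  # skip the empty field after the trailing "|"
            continue
        kind, span = field.split()
        start, end = (int(x) for x in span.split("-"))
        segments.append((kind, start, end))
    return segments


def find_gaps(segments):
    """Return (start, end) ranges lying between consecutive segments."""
    ordered = sorted(segments, key=lambda s: s[1])
    gaps = []
    for (_, _, prev_end), (_, next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


if __name__ == "__main__":
    segs = parse_architecture(ARCHITECTURE)
    print(segs)
    print(find_gaps(segs))
```

For this record the only internal gap is residues 954 to 956, between the ATT segment ending at 953 and the STR segment starting at 957.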
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Rhizobium phage vB_RleS_L338C [NCBI] | 1414737 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Rhizobium leguminosarum [NCBI] | 384 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Alphaproteobacteria > Hyphomicrobiales |
Coding sequence (CDS)
Genbank protein accession
AHC30457.1
[NCBI]
Genbank nucleotide accession
KF614509.1
[NCBI]
CDS location
range 30389 -> 35344
strand +
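As a consistency check, the CDS range can be related to the protein length. This sketch assumes GenBank-style 1-based inclusive coordinates and that the CDS ends with a stop codon; the function name is hypothetical.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Protein length implied by a 1-based, inclusive CDS range."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    # The stop codon is part of the CDS but does not encode a residue.
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(cds_protein_length(30389, 35344))
```

The range 30389 to 35344 spans 4956 nucleotides, or 1652 codons, which gives 1651 residues once the stop codon is excluded. That agrees with the protein length listed in the record.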
CDS
ATGATCATCCGTAAGGTTCTTAAAGGCGCGAAGGGCGGCGGCAAGGGCGGTTCGAAGGACAACTCCCCGAACTCCCTCCGCTCCGGCGCGTACATCAAGCTTATCGACTTGCTCGGAGAGGGCGAGATCGGCGGGCTTGTAGACGGTTCGCGCTCTATCTTCTTCGACAACACCCCGCTGATGTCGGGAGGCTCCTACAATTTCAAGGGCGTCTCCTGGACGCAGCGCGTGGGCTTGCCGGACCAGCCTTACGTCCCAGGCTTCAACTCTGTTGAGAACGAAGAGGCTGTCGGCGTCCAGGTCAAGCATGGCCTGCCGATCACCCGCAACGTCGAAGGCGAAGACCTTGATGCAGTCCGCGTGACCGTGGAAATCCCTGCCCTCGCAAGGCAGAGTGACGGGCGCTTCAAGGCGCACTCGGTCGATTTCGCGCTGGAAGTTAGGTACGACGGTGGGCCGTGGACCAACCCTTTCGGCACCTTGACCATCTCCGGCAAGTGCACGTCGCCGTATGACCGTAACTACCTGATCAACTTGCCGCGCAATCCCAGCGGCGCAAGCTCGCCATGGTCGATCCGCATGGTTCGCTTGAGCGATGACAGCGAAGACGCGACCAAAGAGCAGAACGATACCGTCTTCGCTTCGATGACGAAGATCACCTACGGTAAGTTCACCTATCCGAACTCCGCCTATATCGCCATGCGCGTCGAGGCAGAGCAGTTCGGCCAGTCGGTTCCGACCCGCTCATACGAAGTCTATGGACGTATCGTCCAGGTTCCGACGAACTACACGACACGCCTCTACAACGCCGATGGCGTCATCACCCGCAACGCCTCCTACTCGGGAAGCTGGAACGGCACGTTCAAGTGGGCCTGGACGGATAACCCGGCGTGGGTGCTCTACGACATCATGACCAATGATCGCTTCGGTCTTGGCGAGGATATCGACGCCTCGCAGGTGGACAAGTGGTCGCTCTACGAGATCGCGCAATACGCAGACCAGAACGTGCCCGATGGCCGTGGCGGCAACAAGCCGCGCTTCACGTTCAACGGCGCGATGTACGATCAGCAGGAGGCATTCGACGCCATCCAGCAAATCGCTTCCGTGTTCCGTGGCATGGCCTATTGGTCTTCCGGCTCGGTTATGGCGACACAGGATCGCCCGAAGGACGTGAGCATCCTTGCTGCATCTGCCAACGCGGTTGACGGCTCCTTTAGCTATCAGGGCTCGGCCCTCACTGCCCGGCATACGGTCGCCGTGGTCAAGTTCGATAACCCCGAACTGAACTATGAGCAGGATTTCGTAGTTGTCGAGGATCGCGACGGCATCGAACTCTACGGCTATAACGAGACCGAAATCCTTGCAACAGGTTGCACCGACCGGGCGCAGGCCTACCGCATCGGCGAATGGGCGCTGTTCACCGAACTGAATGAAACAAACGTCGTCAGCTACAAGGCTGGTCTCGATCATGCTGGCCTCCGCCCCGGCGACATCATCGCCGTCCAGGACAGGAGCTTTGCCGGTCCCGATAATGCCGGGCGTCTGAAGGCAGGCAGCAGCGCGTCCACCCTTCTGCTCGACCATGACGTGACCCTGGAGACCGGCCAGAGCTATTCAGTTTCGGTCGTGCTGCCGAACGGCACGGTCGAAGAGCGTGATATCGTCGTTTCTTCTTACGACGTTCCTATCAACATCCTTGATGTCTCAGCGCCCTTCAGCGTAACGCCTGACGAGGAAGCCCTTTGGGTTCTCGCGTCTGATACGCTCGTGCCCTCCTTGTGGAGTTGCATCTCTGTCCGTGAGAGCGAGCCGCATATCTATGAAGTGGTGGGAGCGCAGTACGAGCCCGCCAAGTTCGAACATATCGACAACTCCGCCCGCTTCGATCCTCTGCCGACGCAGAACAATCCTGTCGTGCAGCCGCCGATCAACCTGCTTGTGCAGGAAAGCATCTATACCCAGGACGGCGTTCCGCGCACGGCCTTGCTTGTATCGTGGAC
CTCGCCCGGCATCGAGTTTGCCGTGGTCGGTTACGAAGTCGCCTACGACGGCCCGGACGGCTATGTCCAGGTCGGCAGGGTTCCGTCCACGTCCGTGCAGATTTTGGATGCGCCGCCCGGCGATTACACCTTCTATATTTCGAGCGTGTCGCTTAGCCAGCGCGTCTCTCTGCCAGCCACCATCGACTACGCAACTCTCGGCTGGCAGGGTACGCACAATGGCACCATCGCCAACGTTCGCCTGAAGGGCAAGACGCCCGACATCACGTCGTTCACGGGCTCCTCCCCGGAGTTCGTCTGGGATAACGTGTGGCCCACGGGCACCATCTTCAATCCGGACGGCAGCAGCCCGATCTTCCGCAACAACCTCGTCAGCATCTATGACGAAGTCACCGAAGACCTTCTGCGCCAGGAGTACACCCGCAACTCGAACTACGTCTACACGCTGTCGAAGAACCGCGCCGACAACGCTGTCTACAATCGTGGTCCGAGCCGCGCGTTCCGCGTCGAAGTCCAGGTGTGCGACGTTTACGGACGCCAGTCCGATCCGGGCACCATCTCGGTAAGCAACCCGCCGCCTCCCGCCGTCGCCCCTGTAGGCATTCCGGGCTTTGGTCAGTTCGTCGTTCGCTGGTCTACTCTCAGCGATCCTGACCTTGCAGGCTTCTTGGTTTGGGTATCGAGCGCGCCGGGCTTCAACCCGCTGACCACGGAGCCCTACTTCGACGGCATGGTGAACGCCATCACCTATGCGGCGTCTTCCGGCACGACCTACTATGTCCGCGTTGGAGCCTACGACACCTTCGGCAAAACCGGCCTCAACATCTCTGGCGAGATCGAGTGCACGGCGGTCGGCTCCGATGACCTTGACCCGCCGGACACTCCGACCGGCCTGACCTCGACTACGTCGCTGATTTCGGACGTTCAATCCCGCGTCATCTATACCTGGGATGCGAACACCGAACCCGACATGATGGGCTACACGCTGGAGATCAAGGAAGAGGGCGGCGATTGGGTCGGCTTCAATACTTCAACGGCGATCTACCCGATCAACGTCCTGCCCTCCACCGCGCTCGATGCACGGGTAAGGGCCTACGACGTGAACGGCAACTACTCCGCCTTCACCGCTGTCTACTCGACGGTAGCGGCAGGCGACCTCGTACCTCCGGCGGTCCCGACCAACTGGACGGCCAAGGGTGGCTTCGGTATCGTGTTCCTGGAAGGTGACCCGAACAGCGAACGCGACTTCCGCGCCTTCGAAGTCTATGCGAGCGAAGCGACTGATGCGCCCGATGGTGCAACTGTTGCAACGCATACCTCCTCCGCCAATCAGGTGTTCATCACCGATCTTGACGATAACCTGACCCTGAATTTTTGGGTAAGGGCCGTGGACACGTCTGGCAACAAGTCGGACTGGACGGCGATGTTCTCCGCCACCACGGTCTCTTCCAACGTCCTCCTGACGACGGAAGCCCTGGAAGGCATCGTGGATCGCACCTCGTTCGCCACCTCCATCGAAGTGCCGGGCATCGGTAACGCGCTGCCTGACATCCCGTTCGATAGCTCGACGCCCAAGCAGTTCTTCCTTGCCACCGAAGGCAGGATGTACGTGCAGAAGGTGGACGGCACGGGATGGGTGCTGACCACTTCGACCACGATCCTTGACGGCCAGATCATCACAGGCCAGATCGCTGCCGGTGCCATCGGCACTGATCAGCTTGCCGCCAACGCGGTCACGGCCAAGAACCTGTCGATCCGAGACTTCACCGTCCTCGCCGATAACGCAGACATGCAGCTTGGACCGGTGAAGGGCTGGGCGAATTCGTCCCGCATCTTCAACGACCCGGCGACGGCCTACCCCGGTCAGACGTGGGTGGCTAAGCTCATCGCCTCGGTAACGGTCACGGTCGCCAACGAACTGGAAGTCCCTTGCAAGGAAGGCGAGAAGTTCTATCTCTCCGCCAGCGTCAAGGCTTCCGGCGCGGCTGGCAGTGGCCGCA
AGGGTGTGCGCGCTCAGTTCCTGGATAGCGCTGGAACGCTGCTCACGCCCGGCACCGCCGATACCACCAGCATCCCCGCCGATTGGGTATCGGTCTCCGGCTTCGCCACGGCACCTGCTGGCGCTGTCCGCGTTCGCCTGGAAATGATCGCCTACAACAACGAAGGCGGCACGGTCTACATCGCCAATCCTCGGATGATGCGCGCCGCTGCCGTGCTTATCGAGCCGAACGGCATCACCTCCGACAAGATCACGACGGGCGAGTTCATCACGCTCTCCGCTCAGATCAAGGATGCGATCATCACCGACGCCAAGGTTGCCAACCTGTCGGCTGCAAAGTTGACGGCTGGCACCGCGCTTGCTGGAAGCATCACGGTTTCGGGCACGGCGCTTTCGACCATGCAGTCCCAGGCCAATGACCCGGCGACCCGCATCAATGCGCAGTCCACCCAGATCGACCCAGGCAAAATCCTGATCTCCGGCGCTACCACGCTGGCCTCTTGGCGCGACGGCACCGATACCACCTTGATCGCGGGCGGCAAGCTGTCGGCGAACAGCGTGACCGCCAACAAGCTCACTATCGGCGCTCGCGGCATCAACCTGGAGAACATCCAGTTCTCCTACGACAAGACGGCTGGCACTGTCTCGTGGACGGCGGGAACGCTGCGCTACATGAGCACGAATGCCACGACCGGCGTTGTCGAACAGCGTGGCCTCGTGGTCTCGGTGCAGGCACTGCCACCTGGACAACCGGGCGTCTCTACATCTACTGGCAGAAGCCCGATCCCGACCCGGCGCAGTCCACGGCTATCACGTTCTCCTCCACGAACGTCATCGCCACGGCGAACGCTTCCGATACCGTCGTCTTCGCCACCTACGACGGTGGAACGAAGTTCAACGCCAACTACGGCCAGACGATCATCGACGGCGCAACGATCCAGACCGGCACCATCGTAG
Genome Context
Tertiary structure
PDB ID
4f6d440109734f435b5908f99aba01dd4db6816166c47470bad1955f8679ab80
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
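The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a sketch with a hypothetical function name. The record does not say which band the boundary values 90, 70 and 50 belong to, so assigning them to the lower band here is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the record.

    Assumption: exact boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for s in (95.2, 81.0, 63.5, 42.0):
        print(s, plddt_band(s))
```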
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Isolation and characterization of Rhizobium leguminosarum phages from western Canadian soils and complete genome sequence of rhizobiophage vB_RleS-L338C | Restrepo-Cordoba,M., Halmillawewa,A.P., Hynes,M.F. and Yost,C.K. | 2015-09-16 | — | GenBank |