Protein
View in Explore- Genbank accession
- CAB4138816.1 [GenBank]
- Protein name
- Collagen triple helix repeat
- RBP type
-
TFTF
- Protein sequence
-
MPTSPPDPYERQYDFSNHSILQPNTPQPGNKIDLELNEIRESLNATISRLGEIQRDDGKVRQTAVELTVGPQGLPGAKGDKGDKGDTGQAGAQGAQGVPGIQGQQGIQGQQGATGATGATGATGAAGQVGPQGAKGDKGDKGDTGAAGPAGPQGQTGPAGPTGPTGSQGPQGATGAAGATGPVGPVGPQGPQGDKYAGTSTTSLAVSNGTKTLTTQTGLAWTGQQDLTIVYDASNHMHAVVTTYNASTGVLVADVSNHTGSGTYASWTINLEGAIGAQGPVGPQGSTGATGATGATGPAGPAGAQGPQGATGPQGATGATGATGAQGPAGATGATGPAGATGATGATGAAGPAGPVGPEGPQGPQGIQGPEGQQGLQGPVGDPGPQGDAGAPGINGSMLFNWLGAYDNGVTYAANDGVQFNGSAYVMTATIGGAGYDPVGYPGNWSLVVSKGDEGDQGSTGPAGANGAPCVMRGTWADFTSYSAGDLVEYGGLIYSALNTHYAYGDPPPSANWVAVSITTIPGPQGEPGAPGANGADGAQGPQGPAGSPAKTVNDVSATVPYTLQLSDNNNIVFTTATGEYGPGIIVPNDTYITLEDMTVVPSPSNFPIGAVVTIVSSDTNNSISGDSAAPYINGSLSTQAVGQKVITLVKVNSSNWYFA
- Physico‐chemical
properties -
protein length: 660 AA molecular weight: 64099,72180 Da isoelectric point: 4,25141 aromaticity: 0,05303 hydropathy: -0,34924
Domains
Domains [InterPro]
DC_0862
STR
1–70
STR
1–70
IPR050149
Unmapped
69–549
Unmapped
69–549
DC_1266
STR
99–158
STR
99–158
IPR008160
STR
142–194
STR
142–194
IPR036573
STR
476–511
STR
476–511
1
660
Architecture
STR 1-473 | ATT 474-519 | STR 520-654 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
uncultured Caudovirales phage [NCBI] |
2100421 | Uroviricota > Caudoviricetes > Peduoviridae > Maltschvirus maltsch > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
CAB4138816.1
[NCBI]
Genbank nucleotide accession
LR796363
[NCBI]
CDS location
range 22062 -> 24044
strand +
strand +
CDS
ATGCCAACATCGCCGCCCGATCCCTACGAGCGTCAGTACGACTTCTCAAATCACTCGATTTTGCAGCCCAACACGCCGCAACCCGGCAACAAAATCGACCTGGAGCTCAACGAAATCCGCGAGTCCCTCAACGCGACCATCAGCCGGCTCGGCGAGATCCAACGCGATGACGGAAAAGTGCGTCAAACCGCGGTCGAATTGACTGTCGGGCCGCAAGGCTTGCCCGGCGCGAAGGGCGACAAGGGTGACAAAGGCGACACCGGCCAAGCCGGCGCGCAAGGCGCGCAGGGCGTGCCCGGAATTCAGGGCCAGCAAGGCATCCAAGGCCAGCAAGGGGCCACGGGAGCGACCGGTGCGACCGGTGCGACTGGCGCTGCGGGCCAAGTCGGTCCCCAGGGCGCAAAAGGCGACAAAGGCGACAAGGGAGACACCGGTGCAGCGGGCCCTGCGGGCCCTCAGGGCCAGACCGGCCCAGCGGGACCCACGGGACCGACCGGATCGCAAGGGCCCCAAGGGGCCACAGGGGCTGCCGGAGCGACCGGACCTGTCGGACCAGTCGGACCTCAAGGCCCCCAGGGCGACAAGTACGCCGGCACCTCGACAACGAGCCTGGCTGTGTCGAACGGCACCAAGACTTTGACGACTCAGACGGGCCTGGCGTGGACCGGCCAGCAGGATCTGACGATCGTCTACGATGCGAGCAACCACATGCACGCGGTTGTGACGACCTACAACGCCAGCACGGGCGTCTTGGTCGCGGATGTGAGCAACCACACCGGCAGCGGCACATACGCGAGCTGGACGATCAACCTGGAAGGCGCGATCGGCGCGCAAGGCCCGGTCGGGCCCCAGGGAAGCACGGGAGCTACGGGAGCCACGGGGGCTACGGGCCCGGCGGGGCCCGCGGGAGCGCAAGGACCCCAGGGGGCCACGGGGCCGCAAGGCGCAACCGGGGCTACGGGAGCTACGGGGGCCCAGGGACCCGCGGGGGCCACGGGTGCAACGGGGCCCGCAGGGGCCACAGGGGCCACAGGGGCCACGGGAGCTGCGGGCCCGGCGGGACCAGTCGGACCGGAAGGGCCGCAAGGGCCCCAAGGGATCCAAGGGCCCGAGGGCCAGCAGGGCCTTCAGGGCCCTGTGGGTGATCCGGGTCCCCAGGGTGACGCCGGTGCGCCTGGCATCAACGGATCGATGTTGTTTAACTGGCTTGGTGCGTACGACAACGGCGTGACCTACGCGGCGAACGATGGCGTGCAGTTCAACGGCAGCGCGTATGTAATGACGGCCACGATCGGCGGTGCCGGCTACGATCCGGTGGGTTATCCTGGCAACTGGTCGTTGGTCGTGAGCAAGGGTGATGAGGGTGATCAGGGATCTACCGGCCCTGCGGGCGCGAACGGCGCGCCGTGCGTGATGCGCGGCACCTGGGCTGATTTCACGAGCTACTCCGCCGGCGACTTGGTGGAATACGGCGGTCTGATCTACTCGGCGCTGAACACGCATTACGCTTACGGCGACCCGCCCCCCAGCGCGAATTGGGTGGCGGTGAGCATCACCACGATCCCTGGGCCGCAAGGTGAGCCCGGAGCTCCTGGCGCAAACGGCGCTGACGGCGCGCAAGGTCCGCAGGGTCCGGCGGGGTCGCCGGCAAAGACTGTCAACGATGTGTCGGCGACTGTGCCGTACACGCTGCAGCTTTCGGACAACAACAACATCGTCTTCACGACAGCGACCGGTGAATACGGCCCTGGCATCATTGTTCCAAATGACACTTACATCACATTGGAAGACATGACTGTCGTGCCTAGCCCGTCAAACTTCCCGATCGGAGCTGTCGTGACGATCGTTTCCAGCGACACAAACAACAGCATTTCAGGTGACTCCGCAGCTCCTTACATCAACGGAAGCCTTTCAACCCAGGCTGTTGGCCAGAAGGTGATCACTCTGGTGAAGGTCAATTCCAGTAATTGGTATTTCGCCTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
b30033d5870631b7ede0148589ec76decb65e4b325cdd871111d8b154bb4af49
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50