Genbank accession
WMC01262.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TF
Evidence RBPdetect
Probability 0,65
TF
Evidence RBPdetect2
Probability 0,74
Protein sequence
MAEIKDEYIQWPGEATFPGEDTTPAYDRYANGNTTVHSHKGWEWVESDNPFQKAAAALAQSTIEASIRRVRTTFGQVFYQKGNATDKPDFPGEAYGDTARIQDPSTLDIVAEWKWNGLDWERARVSGEQISNLDVGRLTAGSAAINDLAARRIAGDIGKFLQLTTDQLTVTGNASFVDLTAKHVWTRIINARQGEFETIKAGMLDANSVSASNIQGGAIDGQVITGATVQTDRSPTHGLKIDSTGIRAYTGRSSETSFEVNASNGRIKVLGEVGIQDTWSIAKFTDIVEVQSGNDVGQRGDRWGVGILMNGKTFPYKYPALITYKEDPTNAGGILYFQAPSSYDSSTPNMRLSTTGLSVYSGKTSAWSMNLSRSGFGAGAPGKGNIQVNDYMGSITVGGYDAHLYIQGDVFRLRSTQDRWKAVWCNGNAVVMGWDQTHQAIVDRDGFRAVGGKNFIMRVPGEWQKRHMMLQHASTESPHDGIEYWENVELNSEGRATWVLPDYIPKIASPTAPWIVLTSSSASAKLTQVGYGVDAAPWSVEVSGRPGETVAVLVKGARQIDEWDMATDRVALRDRSKESEWVLPPAASPDDEGLTDSAVAYNGHGGYGPSPTPHSPPPTEENRDSK
Physico‐chemical
properties
protein length:626 AA
molecular weight: 68102,62590 Da
isoelectric point:5,26167
aromaticity:0,08946
hydropathy:-0,47923

Domains

Domains [InterPro]
DC_1024
STR
24–518
WMC01262.1
1 626
Architecture
STR
STR 24-518 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
WMC01262.1
1 626
Domain Start End Length (AA) Confidence
N-terminal 1 227 227 0,9909
Central domain 228 426 200 0,2327
C-terminal 427 626 199 0,9545
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-227
Central
228-426
C-terminal
427-626

Taxonomy

  Name Taxonomy ID Lineage
Phage Caudovirus D_HF2_7
[NCBI]
3071194 Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
WMC01262.1 [NCBI]
Genbank nucleotide accession
OR148984 [NCBI]
CDS location
range 15522 -> 17402
strand +
CDS
GTGGCCGAAATCAAGGACGAGTACATTCAGTGGCCTGGTGAGGCCACATTCCCTGGCGAGGACACTACCCCGGCGTATGACCGGTACGCTAACGGCAACACGACTGTTCACTCCCATAAGGGGTGGGAGTGGGTTGAGTCCGATAACCCGTTCCAGAAGGCTGCCGCAGCTCTCGCCCAGTCCACTATCGAGGCGTCTATTCGACGCGTGCGCACCACGTTCGGCCAGGTGTTCTACCAGAAGGGGAACGCGACAGATAAGCCCGACTTCCCCGGCGAAGCATACGGGGATACCGCGCGCATTCAGGACCCGTCTACCCTGGATATTGTTGCCGAGTGGAAGTGGAATGGTCTCGACTGGGAGCGGGCCCGCGTCTCAGGGGAGCAGATAAGCAACCTGGATGTGGGGCGCCTTACCGCAGGCTCCGCAGCGATCAACGACCTTGCCGCACGCCGCATCGCTGGCGACATCGGCAAGTTCCTTCAGCTCACCACCGACCAGCTCACCGTGACCGGGAACGCGTCGTTCGTTGACCTGACCGCGAAGCACGTGTGGACGCGCATCATTAACGCACGCCAGGGTGAGTTCGAGACGATCAAGGCTGGCATGCTCGACGCCAACTCGGTGAGTGCGTCTAACATTCAGGGTGGCGCGATTGATGGTCAGGTGATTACTGGCGCCACCGTCCAGACCGACAGATCCCCCACCCATGGCCTAAAGATCGACTCCACGGGAATCCGCGCCTACACCGGCAGGTCGAGCGAAACATCCTTCGAGGTGAACGCCTCAAATGGGAGGATTAAGGTGCTCGGCGAGGTCGGCATCCAGGACACGTGGTCTATCGCCAAGTTCACGGACATTGTTGAAGTCCAGTCAGGAAATGACGTTGGGCAACGAGGAGACCGCTGGGGCGTGGGAATCCTCATGAACGGCAAGACATTCCCATACAAGTACCCCGCACTGATTACATACAAGGAAGACCCAACCAACGCCGGGGGAATCCTATACTTCCAGGCGCCATCCAGTTACGACAGTTCCACCCCAAACATGCGCCTATCGACCACGGGTCTGAGCGTGTACTCAGGTAAGACTTCTGCCTGGTCTATGAACCTGAGCAGGTCCGGCTTTGGCGCTGGGGCGCCAGGGAAGGGGAACATCCAGGTCAACGACTACATGGGGTCGATCACTGTCGGCGGCTATGATGCGCACCTATACATCCAGGGCGATGTGTTTCGACTGCGATCCACTCAGGACCGATGGAAAGCAGTGTGGTGCAACGGTAACGCCGTAGTCATGGGATGGGATCAAACCCATCAGGCTATCGTTGACAGGGATGGGTTTCGCGCCGTTGGAGGCAAGAACTTCATTATGCGCGTGCCGGGTGAGTGGCAGAAGCGCCACATGATGCTCCAGCACGCTAGCACCGAGTCGCCGCACGACGGCATTGAATACTGGGAGAACGTTGAACTTAACAGCGAAGGGCGCGCCACGTGGGTCCTCCCTGACTACATTCCGAAGATCGCTTCCCCGACTGCGCCATGGATTGTCCTCACGTCGTCATCCGCATCAGCCAAGCTAACTCAAGTCGGGTATGGTGTGGACGCAGCCCCATGGTCGGTCGAGGTATCCGGCCGGCCTGGGGAGACCGTCGCCGTCCTCGTTAAGGGCGCTCGTCAGATCGACGAATGGGATATGGCGACCGATCGTGTTGCCCTTCGCGACCGGTCCAAGGAGTCGGAGTGGGTTCTTCCCCCAGCCGCCAGCCCTGACGATGAGGGTCTGACTGATAGCGCCGTCGCTTACAATGGTCACGGAGGGTATGGGCCTTCACCCACACCACACTCGCCGCCGCCAACTGAGGAGAATCGGGACAGTAAATGA

Genome Context

Genome Context

Tertiary structure

PDB ID
6298ee92272284837f70da8e8563e95a72eb1505a4f8cee6e8c88957cf960ce2
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5648
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Characterization of diverse anelloviruses, cressdnaviruses, and phages in the human oral virome in North Carolina Paietta,E.N., Kraberger,S., Custer,J.M., Vargas,K.L., Epsy,C., Ehmke,E., Yoder,A.D. and Varsani,A. 2023-08-26 GenBank