Protein
- Genbank accession
- AUR83448.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
- TSP
- Protein sequence
-
MTKKLQCFEGFDQWQGMSYAQSRFTETPDDDLDSSYISWAVNRTNITNVGADDRPKMKDIIDVGGGVFGGNALGLTGYGNWDFWPHFSGADGKGDFSPEEVTFADGSVQARQEGFCYMTYGFYLNLNDDSSWRRGVGGHIMEWAIRGAMTSDPQYPISGVPDVPTDRTIARLVAARRPTGDVYLGYMYNESHFESTPVLRGDIFPAITAVTGDGDTLWYINNNPTSTWATSRAWDNNTHPIDEQDIEITDTIGSGVTVRDCDMDRSTGLYYVLTDTQMVHQFNADWTPTGVSFDISALTIDPYGFLYDDENDDFMVSDIWFGDTDNPRNALKVLDKTFTSVETRRGAVDGIPTRDIYAISNEQGDSGERIFVRTSQATYRVNKQGGDTNTGDPNTFYVSSGSSDSPEVVTPGGLYHDGIMLNIPEAVVFTPFEATYQDTRSIGDDNVGNGGIGRPGAGLRSLIDTYGEATQLPRCEIQLNTLGNCFEDFAVKPNTAPYYLNSTIDQWPDEGTHPVSASAPEFLLQFGQSYFIESCYSARPRALKIDNNDTGWSAPKIPLQFWVPGVMTLKVDGTQYPINPFYTTGSTASARMRAPEGRIEQSRPPVDYKRGIFGLSLRLNGGTLAGLNVATMDDFYCITREEPLGNPGFAYGFDPEDYLGRIRIHTLRPTDVDNDGAAWRVPTSLEDNGYFPVDFLNKPYLSGLDSPFISFDVRGSQKVTAYGGEVPSIGGEIIGVSQCCAWAKENLDVNDSDWPDQGEELTDALRMSLATTPNYEAWYSCPNVDAQNRTSAVEIVSATLGIDNTQFPKPDDYSKVVTVTEFVYETVPETGQAWQLADVAKIQGKFAIEFNKYEFWPFYEVYDNQFY
- Physico-chemical properties
- protein length: 867 AA
- molecular weight: 96162.83100 Da
- isoelectric point: 4.32587
- aromaticity: 0.12226
- hydropathy: -0.44925
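Two of these properties can be approximated from the sequence alone. The sketch below assumes, without confirmation from the source, that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are illustrative, not part of any tool used by this page.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of AUR83448.1, used here only as a short input.
peptide = "MTKKLQCFEG"
print(f"aromaticity={aromaticity(peptide):.5f} hydropathy={gravy(peptide):.5f}")
```

Passing the full 867-residue sequence instead of the short peptide gives values that can be compared with the figures listed above.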
Tail Spike Domain Segmentation
The protein is segmented into three structural domains: N-terminal, central, and C-terminal.
Domain layout (AUR83448.1, residues 1 to 867)
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 10 | 10 | 0.2201 |
| Central domain | 11 | 258 | 249 | 0.9151 |
| C-terminal | 259 | 867 | 608 | 0.0995 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain
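As a quick consistency check on the segmentation, the following sketch recomputes each domain's span from its start and end positions and verifies that the three segments tile the protein without gaps. It assumes 1-based positions with both endpoints included; the helper names are hypothetical.

```python
# (name, start, end) copied from the domain table; positions are 1-based.
domains = [
    ("N-terminal", 1, 10),
    ("Central domain", 11, 258),
    ("C-terminal", 259, 867),
]
protein_length = 867


def span(start: int, end: int) -> int:
    """Residue count of a segment, assuming both endpoints are included."""
    return end - start + 1


def tiles(segments, total: int) -> bool:
    """True if the segments cover 1..total with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in segments:
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == total + 1


for name, start, end in domains:
    print(name, span(start, end))
print("contiguous:", tiles(domains, protein_length))
```

Under the inclusive-endpoint assumption the spans come out as 10, 248 and 609 residues. The last two differ by one from the tabulated lengths of 249 and 608, so the source may place the central/C-terminal boundary one residue later than the Start and End columns suggest.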
Taxonomy
Coding sequence (CDS)
- Genbank protein accession
- AUR83448.1 [NCBI]
- Genbank nucleotide accession
- MG592419 [NCBI]
- CDS location
- range 13964 -> 16567, strand +
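The CDS range can be cross-checked against the protein length with a little arithmetic. This sketch assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon.

```python
start, end = 13964, 16567        # CDS range on MG592419
cds_length = end - start + 1     # nucleotides, assuming inclusive coordinates
codons, remainder = divmod(cds_length, 3)
protein_length = codons - 1      # assumes the final codon is a stop codon

print(cds_length, codons, remainder, protein_length)  # 2604 868 0 867
```

The result, 867 residues, agrees with the protein length reported for AUR83448.1.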
CDS
ATGACTAAGAAACTACAATGCTTCGAGGGATTCGACCAATGGCAGGGCATGAGCTATGCACAAAGCCGATTCACTGAGACTCCTGATGATGATTTGGATAGTAGCTACATTAGTTGGGCGGTCAATCGTACAAACATAACGAATGTGGGTGCTGACGACAGACCTAAGATGAAAGACATCATTGATGTTGGTGGCGGTGTATTCGGAGGGAATGCACTAGGACTGACTGGCTATGGTAATTGGGACTTCTGGCCACACTTTAGTGGCGCTGACGGTAAGGGTGATTTCAGTCCAGAGGAAGTGACCTTTGCTGATGGGAGTGTGCAGGCACGCCAAGAAGGTTTCTGCTACATGACCTACGGGTTCTACTTGAACTTGAATGACGATAGTTCATGGAGACGTGGAGTGGGTGGTCACATTATGGAGTGGGCTATTCGTGGCGCTATGACATCTGACCCTCAATATCCAATCAGCGGAGTACCTGATGTACCGACAGACCGTACAATCGCTCGCTTGGTTGCGGCTCGTCGCCCTACTGGTGATGTGTACTTGGGTTACATGTACAACGAGAGCCACTTCGAGTCTACACCAGTACTGCGCGGTGACATCTTCCCTGCAATAACAGCGGTGACAGGTGATGGCGACACCTTGTGGTACATCAACAACAACCCTACGTCTACGTGGGCTACTAGCCGTGCATGGGACAACAACACTCACCCTATTGATGAGCAGGACATAGAGATTACTGACACTATAGGCAGTGGCGTGACAGTGCGCGACTGTGACATGGACAGGTCTACAGGGCTGTACTATGTACTGACTGATACTCAAATGGTTCACCAATTCAATGCTGATTGGACACCGACAGGAGTATCGTTTGATATATCCGCGCTGACTATAGACCCTTACGGGTTCTTATACGACGACGAGAATGATGATTTTATGGTATCGGACATATGGTTCGGTGATACAGATAATCCTCGCAATGCTCTAAAGGTACTGGATAAAACTTTCACCTCAGTAGAGACTCGACGCGGTGCAGTCGATGGCATACCAACAAGAGACATTTACGCAATATCAAATGAGCAAGGTGACTCAGGCGAGCGTATCTTCGTCCGTACTTCGCAGGCAACGTACCGTGTCAATAAGCAGGGCGGTGATACCAATACAGGCGACCCTAATACGTTCTATGTATCTTCTGGTTCGTCAGACTCACCGGAAGTGGTGACACCGGGAGGTCTGTACCATGACGGCATCATGCTAAATATACCGGAGGCGGTGGTGTTCACGCCTTTCGAGGCTACCTACCAAGACACACGTAGTATAGGCGATGACAATGTAGGTAATGGCGGTATAGGCAGACCGGGTGCGGGGCTACGCAGTCTGATAGATACCTATGGAGAGGCTACCCAACTACCTCGATGTGAGATTCAATTGAACACACTAGGTAACTGCTTTGAAGATTTCGCAGTGAAGCCTAATACGGCTCCGTACTACCTTAACTCCACCATAGACCAGTGGCCAGATGAGGGTACTCACCCTGTATCAGCGAGCGCTCCTGAGTTCTTGTTGCAGTTCGGACAAAGCTACTTCATAGAATCATGTTACTCGGCTCGACCTAGAGCTTTAAAGATAGACAATAATGACACGGGTTGGTCTGCCCCGAAGATACCACTACAGTTTTGGGTGCCGGGAGTGATGACACTCAAGGTGGATGGAACGCAGTACCCAATCAATCCGTTCTACACGACAGGCTCTACTGCGTCTGCGCGTATGCGTGCACCGGAAGGCAGGATTGAACAGTCACGACCACCAGTGGATTACAAGCGTGGCATCTTTGGTCTATCCCTGAGATTGAATGGCGGTACATTGGCAGGACTGAACGTGGCAACGATGGATGACTTCTACTGCATTACTCGTGAGGAGCCACTAGGCAATCCGGGATTTGCTTATGGCTTCGACCCGGAAGATTACTTAGGTAGAATACGCATCCATACCCT
CCGACCTACTGATGTCGATAATGATGGAGCCGCTTGGCGAGTACCAACTTCACTAGAAGATAACGGCTACTTCCCTGTGGACTTCTTGAACAAGCCTTACCTGTCTGGATTAGACTCACCGTTCATCTCGTTTGATGTACGTGGTAGTCAGAAGGTGACAGCTTACGGTGGTGAAGTACCATCAATAGGTGGTGAGATAATCGGGGTATCACAGTGCTGTGCTTGGGCTAAAGAGAACTTGGATGTGAACGATAGTGATTGGCCAGACCAAGGTGAAGAACTGACGGACGCATTGCGTATGTCTCTAGCTACCACACCGAACTACGAGGCGTGGTATAGTTGTCCTAACGTTGATGCACAGAACAGGACTTCTGCGGTGGAGATAGTATCGGCAACGCTAGGTATAGATAACACACAGTTCCCGAAACCTGACGACTACAGCAAGGTCGTTACTGTGACGGAGTTTGTGTACGAGACTGTACCGGAGACTGGACAGGCGTGGCAACTAGCAGACGTGGCCAAGATACAAGGCAAGTTCGCAATAGAGTTTAACAAGTACGAGTTCTGGCCGTTCTATGAAGTGTACGACAACCAGTTCTACTAG
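To relate the CDS to the protein sequence, the coding sequence can be translated codon by codon. The sketch below assumes the standard genetic code (the page does not state a translation table) and uses a hypothetical `translate` helper. It is demonstrated on the first 30 and last 15 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code in the conventional TCAG order (an assumption here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGACTAAGAAACTACAATGCTTCGAGGGA"))  # MTKKLQCFEG
print(translate("AACCAGTTCTACTAG"))                 # NQFY*
```

Both fragments match the start (`MTKKLQCFEG`) and end (`NQFY`) of the protein sequence listed for AUR83448.1.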
Genome Context
Tertiary structure
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| A major lineage of nontailed dsDNA viruses as unrecognized killers of marine bacteria | Kauffman,K.M., Hussain,F.A., Yang,J., Arevalo,P., Brown,J.M., Chang,W.K., VanInsberghe,D., Elsherbini,J., Cutler,M.B., Kelly,L. and Polz,M.F. | 2018-01-24 | — | GenBank |