Genbank accession
AUR83448.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
Protein sequence
MTKKLQCFEGFDQWQGMSYAQSRFTETPDDDLDSSYISWAVNRTNITNVGADDRPKMKDIIDVGGGVFGGNALGLTGYGNWDFWPHFSGADGKGDFSPEEVTFADGSVQARQEGFCYMTYGFYLNLNDDSSWRRGVGGHIMEWAIRGAMTSDPQYPISGVPDVPTDRTIARLVAARRPTGDVYLGYMYNESHFESTPVLRGDIFPAITAVTGDGDTLWYINNNPTSTWATSRAWDNNTHPIDEQDIEITDTIGSGVTVRDCDMDRSTGLYYVLTDTQMVHQFNADWTPTGVSFDISALTIDPYGFLYDDENDDFMVSDIWFGDTDNPRNALKVLDKTFTSVETRRGAVDGIPTRDIYAISNEQGDSGERIFVRTSQATYRVNKQGGDTNTGDPNTFYVSSGSSDSPEVVTPGGLYHDGIMLNIPEAVVFTPFEATYQDTRSIGDDNVGNGGIGRPGAGLRSLIDTYGEATQLPRCEIQLNTLGNCFEDFAVKPNTAPYYLNSTIDQWPDEGTHPVSASAPEFLLQFGQSYFIESCYSARPRALKIDNNDTGWSAPKIPLQFWVPGVMTLKVDGTQYPINPFYTTGSTASARMRAPEGRIEQSRPPVDYKRGIFGLSLRLNGGTLAGLNVATMDDFYCITREEPLGNPGFAYGFDPEDYLGRIRIHTLRPTDVDNDGAAWRVPTSLEDNGYFPVDFLNKPYLSGLDSPFISFDVRGSQKVTAYGGEVPSIGGEIIGVSQCCAWAKENLDVNDSDWPDQGEELTDALRMSLATTPNYEAWYSCPNVDAQNRTSAVEIVSATLGIDNTQFPKPDDYSKVVTVTEFVYETVPETGQAWQLADVAKIQGKFAIEFNKYEFWPFYEVYDNQFY
Physico‐chemical
properties
protein length:867 AA
molecular weight: 96162,83100 Da
isoelectric point:4,32587
aromaticity:0,12226
hydropathy:-0,44925

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

AUR83448.1
1 867 aa
Domain Start End Length (AA) Confidence
N-terminal 1 10 10 0,2201
Central domain 11 258 249 0,9151
C-terminal 259 867 608 0,0995
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Vibrio phage 1.034.X._10N.261.46.B7 [NCBI] · taxon 1881190

Coding sequence (CDS)

Genbank protein accession
AUR83448.1 [NCBI]
Genbank nucleotide accession
MG592419 [NCBI]
CDS location
range 13964 -> 16567
strand +
CDS
ATGACTAAGAAACTACAATGCTTCGAGGGATTCGACCAATGGCAGGGCATGAGCTATGCACAAAGCCGATTCACTGAGACTCCTGATGATGATTTGGATAGTAGCTACATTAGTTGGGCGGTCAATCGTACAAACATAACGAATGTGGGTGCTGACGACAGACCTAAGATGAAAGACATCATTGATGTTGGTGGCGGTGTATTCGGAGGGAATGCACTAGGACTGACTGGCTATGGTAATTGGGACTTCTGGCCACACTTTAGTGGCGCTGACGGTAAGGGTGATTTCAGTCCAGAGGAAGTGACCTTTGCTGATGGGAGTGTGCAGGCACGCCAAGAAGGTTTCTGCTACATGACCTACGGGTTCTACTTGAACTTGAATGACGATAGTTCATGGAGACGTGGAGTGGGTGGTCACATTATGGAGTGGGCTATTCGTGGCGCTATGACATCTGACCCTCAATATCCAATCAGCGGAGTACCTGATGTACCGACAGACCGTACAATCGCTCGCTTGGTTGCGGCTCGTCGCCCTACTGGTGATGTGTACTTGGGTTACATGTACAACGAGAGCCACTTCGAGTCTACACCAGTACTGCGCGGTGACATCTTCCCTGCAATAACAGCGGTGACAGGTGATGGCGACACCTTGTGGTACATCAACAACAACCCTACGTCTACGTGGGCTACTAGCCGTGCATGGGACAACAACACTCACCCTATTGATGAGCAGGACATAGAGATTACTGACACTATAGGCAGTGGCGTGACAGTGCGCGACTGTGACATGGACAGGTCTACAGGGCTGTACTATGTACTGACTGATACTCAAATGGTTCACCAATTCAATGCTGATTGGACACCGACAGGAGTATCGTTTGATATATCCGCGCTGACTATAGACCCTTACGGGTTCTTATACGACGACGAGAATGATGATTTTATGGTATCGGACATATGGTTCGGTGATACAGATAATCCTCGCAATGCTCTAAAGGTACTGGATAAAACTTTCACCTCAGTAGAGACTCGACGCGGTGCAGTCGATGGCATACCAACAAGAGACATTTACGCAATATCAAATGAGCAAGGTGACTCAGGCGAGCGTATCTTCGTCCGTACTTCGCAGGCAACGTACCGTGTCAATAAGCAGGGCGGTGATACCAATACAGGCGACCCTAATACGTTCTATGTATCTTCTGGTTCGTCAGACTCACCGGAAGTGGTGACACCGGGAGGTCTGTACCATGACGGCATCATGCTAAATATACCGGAGGCGGTGGTGTTCACGCCTTTCGAGGCTACCTACCAAGACACACGTAGTATAGGCGATGACAATGTAGGTAATGGCGGTATAGGCAGACCGGGTGCGGGGCTACGCAGTCTGATAGATACCTATGGAGAGGCTACCCAACTACCTCGATGTGAGATTCAATTGAACACACTAGGTAACTGCTTTGAAGATTTCGCAGTGAAGCCTAATACGGCTCCGTACTACCTTAACTCCACCATAGACCAGTGGCCAGATGAGGGTACTCACCCTGTATCAGCGAGCGCTCCTGAGTTCTTGTTGCAGTTCGGACAAAGCTACTTCATAGAATCATGTTACTCGGCTCGACCTAGAGCTTTAAAGATAGACAATAATGACACGGGTTGGTCTGCCCCGAAGATACCACTACAGTTTTGGGTGCCGGGAGTGATGACACTCAAGGTGGATGGAACGCAGTACCCAATCAATCCGTTCTACACGACAGGCTCTACTGCGTCTGCGCGTATGCGTGCACCGGAAGGCAGGATTGAACAGTCACGACCACCAGTGGATTACAAGCGTGGCATCTTTGGTCTATCCCTGAGATTGAATGGCGGTACATTGGCAGGACTGAACGTGGCAACGATGGATGACTTCTACTGCATTACTCGTGAGGAGCCACTAGGCAATCCGGGATTTGCTTATGGCTTCGACCCGGAAGATTACTTAGGTAGAATACGCATCCATACCCTCCGACCTACTGATGTCGATAATGATGGAGCCGCTTGGCGAGTACCAACTTCACTAGAAGATAACGGCTACTTCCCTGTGGACTTCTTGAACAAGCCTTACCTGTCTGGATTAGACTCACCGTTCATCTCGTTTGATGTACGTGGTAGTCAGAAGGTGACAGCTTACGGTGGTGAAGTACCATCAATAGGTGGTGAGATAATCGGGGTATCACAGTGCTGTGCTTGGGCTAAAGAGAACTTGGATGTGAACGATAGTGATTGGCCAGACCAAGGTGAAGAACTGACGGACGCATTGCGTATGTCTCTAGCTACCACACCGAACTACGAGGCGTGGTATAGTTGTCCTAACGTTGATGCACAGAACAGGACTTCTGCGGTGGAGATAGTATCGGCAACGCTAGGTATAGATAACACACAGTTCCCGAAACCTGACGACTACAGCAAGGTCGTTACTGTGACGGAGTTTGTGTACGAGACTGTACCGGAGACTGGACAGGCGTGGCAACTAGCAGACGTGGCCAAGATACAAGGCAAGTTCGCAATAGAGTTTAACAAGTACGAGTTCTGGCCGTTCTATGAAGTGTACGACAACCAGTTCTACTAG

Genome Context

Tertiary structure

AUR83448.1
ESMFold structure
Source ESMFold
pLDDT 30.6
Oligomeric state monomer

Literature

Title Authors Date PMID Source
A major lineage of nontailed dsDNA viruses as unrecognized killers of marine bacteria Kauffman,K.M., Hussain,F.A., Yang,J., Arevalo,P., Brown,J.M., Chang,W.K., VanInsberghe,D., Elsherbini,J., Cutler,M.B., Kelly,L. and Polz,M.F. 2018-01-24 GenBank