Genbank accession
AWT50525.1 [GenBank]
Protein name
hypothetical protein
RBP type
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,88
TSP
Evidence RBPdetect2
Probability 0,97
Protein sequence
MPYTRPTIVSGVTRATKAFFDNLLDGIDERVTKTDADATYAPVWKASTVYALGAKVVSPLTGDVIRRLTAGTSRESFDTTEQALWLAAYGTVTRTVAAVDTPMPFRARADYLCDGTADLATIQQALDALPDDNAVSGEIVLLAGNYSDATNATLSVGSPSSATLNPRKVLRFERGARINVSGRTGRKAVVKVESPDCQIINPNIAGNAAFGNGTGISIGGDVATLGGFWGKVANRVLIYEPILSNLETGLEFASIDGGPGVGGSTGDCKVHGGYIFQCKTGIRAAGYTNTMYAPTLANNNKAIWVEARRSEAQLRVHDPAIVGWNEVGILVEGGFGSVFSNTWMEQNPASASTATEAIRLGQSGTVRANATKFTGTTHVQLVNEQYAIKYVGAIGTEVEELVLSTSGAVPSVAVARNEMQSTSKNNRIRRYTFGPNAIPSHTSLSIDAAAWGELFIDRVSGLVGADGFSTRRNPSTRASSTPVARKLTDTSKTSDATMGSDSELTVNLNPGTVYALDGIIFFDAGQTGDFKMALSVSGTNSTISWVGVGPASSHTNAVGVSSVTTQRATSGFVMTWGGAAAGTVIGVPIKGVVSVTELATITMTWAQAVADATPTILKAGSYLELTPIS
Physico‐chemical
properties
protein length:629 AA
molecular weight: 65696,99780 Da
isoelectric point:6,06543
aromaticity:0,06995
hydropathy:-0,00541

Domains

View on InterPro
AWT50525.1
1 629 aa
STR 114–348 ·

ATT Attachment Domain STR Structural Domain RBD Receptor-Binding Domain CBM Carbohydrate-Binding Module LEC Lectin-like Domain ENZ Enzymatic Domain CHP Intramolecular Chaperone LNK Linker/Spacer Domain TAS Tail-Associated Structural TTP Tail Tubular Protein UNK Uncharacterized Domain Unmapped

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

AWT50525.1
1 629 aa
Domain Start End Length (AA) Confidence
N-terminal 1 117 117 0,9816
Central domain 118 466 350 0,9896
C-terminal 467 629 162 0,9834
N-terminal Central domain C-terminal

View these domains on the 3D structure via the Color by → Tail spike option in the Tertiary structure section below.

Taxonomy

Phage
Gordonia phage Sitar [NCBI] · taxon 2182348
Host No host information

Coding sequence (CDS)

Genbank protein accession
AWT50525.1 [NCBI]
Genbank nucleotide accession
MH153809 [NCBI]
CDS location
range 20616 -> 22505
strand +
CDS
ATGCCGTACACACGACCGACCATCGTCAGCGGCGTCACCCGCGCGACAAAAGCATTCTTCGACAACCTGCTCGACGGCATCGACGAACGCGTCACAAAAACGGACGCCGACGCCACTTATGCCCCGGTCTGGAAGGCGTCCACGGTGTACGCCCTCGGCGCGAAGGTGGTGTCGCCGTTGACGGGCGACGTCATCCGCAGGCTGACTGCGGGCACCTCGCGCGAGAGTTTCGACACCACCGAACAGGCGCTGTGGCTCGCCGCCTACGGCACGGTCACACGGACCGTTGCGGCCGTCGACACCCCGATGCCGTTCCGCGCCCGGGCGGACTACCTCTGCGACGGCACTGCTGACCTCGCGACCATTCAGCAGGCACTCGACGCCCTGCCTGACGACAACGCCGTGTCTGGCGAAATCGTGCTGCTCGCAGGCAATTACTCCGACGCGACCAACGCCACCCTGTCGGTCGGCTCTCCGTCGTCGGCCACGCTCAACCCCCGCAAGGTGCTGCGCTTCGAGCGCGGCGCCCGGATTAACGTCAGCGGGCGGACCGGACGCAAAGCCGTCGTCAAGGTCGAATCCCCCGACTGCCAGATCATCAACCCGAACATCGCGGGCAACGCTGCCTTCGGCAACGGCACGGGTATCTCGATCGGCGGCGACGTCGCCACACTCGGCGGGTTCTGGGGCAAGGTCGCCAATCGTGTGCTGATCTACGAGCCGATCCTGTCCAACCTGGAGACCGGTCTGGAGTTCGCCAGCATCGACGGCGGGCCCGGCGTCGGAGGATCGACCGGGGACTGCAAGGTTCACGGCGGCTACATCTTCCAGTGCAAGACCGGCATCCGGGCGGCCGGCTACACCAACACTATGTACGCCCCGACATTGGCGAACAACAACAAGGCAATCTGGGTAGAAGCTCGGCGATCGGAAGCCCAGCTCCGCGTGCATGACCCTGCGATCGTCGGCTGGAACGAGGTCGGCATCCTCGTCGAGGGCGGCTTCGGGTCGGTGTTCTCGAACACCTGGATGGAGCAGAACCCGGCGTCGGCGAGCACCGCGACCGAAGCCATTCGGCTCGGACAGTCGGGAACCGTCCGCGCCAACGCGACAAAGTTCACCGGCACCACACACGTGCAGCTCGTGAACGAGCAGTACGCAATCAAGTACGTCGGCGCGATCGGCACCGAGGTCGAGGAACTCGTGCTCTCGACGAGCGGGGCGGTCCCGTCGGTCGCCGTGGCACGCAACGAGATGCAATCGACCAGCAAGAACAACCGCATCCGGCGCTACACCTTCGGACCGAACGCGATCCCCAGCCACACCTCGCTCTCGATCGACGCGGCTGCGTGGGGCGAGCTGTTCATCGACCGCGTGTCCGGGCTGGTCGGGGCAGACGGATTCAGCACCCGACGCAACCCGTCCACCCGCGCATCATCCACGCCGGTCGCGCGGAAGCTGACCGACACGTCGAAGACATCCGACGCCACGATGGGGTCCGACTCAGAGCTGACGGTGAACCTCAACCCTGGCACCGTCTACGCGCTCGACGGCATCATCTTCTTCGATGCGGGGCAGACGGGCGACTTCAAGATGGCGCTGTCGGTGTCGGGCACCAACTCGACGATCTCGTGGGTGGGTGTCGGACCAGCGTCCTCGCACACCAACGCTGTCGGGGTGTCGAGTGTGACGACGCAGCGCGCGACGTCGGGGTTCGTGATGACGTGGGGCGGGGCCGCGGCTGGCACGGTGATCGGCGTGCCGATCAAGGGCGTCGTCTCGGTTACTGAGTTGGCGACCATCACGATGACGTGGGCTCAGGCCGTGGCGGATGCGACCCCCACCATCCTCAAAGCCGGTTCGTACCTCGAGCTCACCCCGATCAGTTGA

Genome Context

Tertiary structure

AWT50525.1
ESMFold structure
Source ESMFold
pLDDT 81.3
Oligomeric state monomer