Protein
View in Explore- Genbank accession
- AXH66313.1 [GenBank]
- Protein name
- tail spike protein
- RBP type
-
TFTSP
- Protein sequence
-
MPLPPERVVTGTYVNPVNGEPYDGSNGDHYLIFEPVPDRWTDRAGNQILLGGGKVTLDENGHFSEDLVCTDAADVYPVEGRLWRVRQFVGGSWDSGTFALPEGDDPLDITDILSVDICGVDYVPVPGPPGPQGPPGPPDGPPGPPGPEGASAYEVAVEQGFSGTEDEWLASLQGEQGEPGATGGPGPQGEPGTPGEDGPSAYQVALDNGFVGTEAQWLASLQGPAGTDGAPGPEGDSAYEVAVENGFEGSEAQWLISLRGAPGAPGDTGPQGEPGPEGPSAYEVAVSGGFSGTEAEWLASLVGPEGDTGATGATGPAGDSAYQVALDEGFVGTEEEWLASLVGPQGPAGRDGGMDTGISLGGDITASETDPLAVVINPLTGRIVDYFADPVTITDVEVTSPITVTLDSVAQERTITWLLMDADQIVYQQQARPSPEDRRSFLVLGMVAQDDGEIFLAQSIPTIARQPVNQLYDLMDAIGAFGIMGNDVSPNGANLQLNVGAGQVFSRGWNHFDGGTETINPHIVTTLGATPASWTHALRSSDLEHASASTTVDVGHYDLNGTLTAVGGDTDTSVVHQLWMFPTNEGSEVHILQYGQQLFDTLEDAISGAGTAAFVTNTALPGNAIMLGYLAVRGVATDLSDAAQAMFIKAGKFGASPGGGASVDLSGYAMLAGAEFTGQIATNMAQPDAVAQSSRVTTNGSDSWRRQVDGEMQWGPGDGPMDAFLRRLGIGMLAFLNTDLLVGQEDAKSYRFRQSGTSLDLDASGADLFLSVFELVNFLGEQYTYLRLESGEFTAHASGKWVFGDGPFDGAGHTLDGTANQAGFFGATPVGQQTVSGARTTGAGLQSLLDALEALGLIVDSTTAGVAVVETVNGQAGPDVSLTASDVGVGDVIVAANDSRNKDGADFVCTGTNDHLVIQQAIDLVDAAPGKGTVRLLDGTFWLGATVTIPSGAGLNLTGTGWGTVLKISPGVEDYAITFDGDETRARFSHFRIDGNLSGQTAGPCGGIWAAGAVECVFQHINFSHCWSAGLVLTAINGGAFGHNNYVLHCLFDNGMTSTEEGNGIYLSSSDENFIVACDFQFLGGAFGAGGAGIYDQAGTQTILGCNFVGGGNSLPAIRVQDASATKITACNFDGVGGDAVFLAATNCLVEGNTIFGVGAIGTAGTFSGIHLEYGATGNLIANNSISSHTVNGAARSLIREESVGDSGNNLVVGNMLITKGTLSVGALDLNAPGTIARSNFGGGADGDPVVTTDMIGVADGVASLDATGKVPAAQMPDGLVTSVNGEDGDVTLTAADVGALDEAAGDARYTQQDALYLNAKEHGAVGDETTDDSTVINALLSTSPAGSTVVLPPGTYGISVPLVVPPGKTLMGLRTNLMQITDVYDPQVSIKPLAGFTGVAAVRFLDQATGGYDDISGEQRLIDIMLDGSALTAGVDGLQAKGNIQNVGLRDVTIRFFPNSGIYSGLEGGIGPYSWRMHRVMLDNNHAHGMYAEKMVDLTMVDCQAIGNWSNGFMLNNAANSQAVACRAEWSGNHGFYLTGDWGDGQGSGGMLLSACSTDRNGFNGVFIDSTGNAPIVISNLMTRRDGRNGGTGGGGYAGLAVNAAEMPVIVGDWTNYPGVDDNGASTSSPDYGASVTGSTHVQIDNAYLHAATAGLNDGGGNTALILGANITYATGATTAPVRAVQQLAAARIPDLAASKVTSGTFDAARIPDLAASKVTSGTFDAARIPALNYVPTGSAVLLTGAQTVGGLKTFSSGLAVKDGTNAGDVHIKDSQDVNLLVVESSNATAAAGVMKLIAGQASQAAFLGTWMGAATNAWSIRADGRLEAGDGATGRDTYLTRTAAGVWQVTSQIRASASAPANAADLTRKDYVDGLDTANVKLTGAQTVAGIKTFSSIPVGPASNPTTDSQLARKAYVDQMEPNGFAPDDIGLLAWSSDPACCVSTPAFCGVGVMRMTAVVLHTAQTVGKIVWYAGGYAGGLLSGSWAGIWNAAGTTRLAGVAAMHTGASEPAEVHNAGGAMHYAPLDGGSVSLAAGVYLVAWRHVYTASPADGPMILQYENAAGSPPNTLALNTVKRFGYITSGASTLPSSISGVLTDGGNRFWVGLAT
- Physico‐chemical
properties -
protein length: 2111 AA molecular weight: 216104,70790 Da isoelectric point: 4,26272 aromaticity: 0,07153 hydropathy: -0,01421
Domains
Domains [InterPro]
DC_1109
ATT
1–194
ATT
1–194
G3DSA:1.20.5.320
STR
125–170
STR
125–170
PTHR24637
Unmapped
125–353
Unmapped
125–353
IPR039448
ENZ
1007–1186
ENZ
1007–1186
1
2111
Architecture
ATT 1-277 | STR 278-462 | STR 758-850 | STR 894-1244 | ATT 1245-1424 | STR 1425-1750 | RBD 1789-2105 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
2111
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 914 | 914 | 0,6554 |
| Central domain | 915 | 1730 | 817 | 0,8529 |
| C-terminal | 1731 | 2111 | 380 | 0,8322 |
Note: Constraints were applied during segmentation.
Fixed 163 C-terminal predictions appearing before Central domain
Fixed 163 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-914
1-914
Central
915-1730
915-1730
C-terminal
1731-2111
1731-2111
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Streptomyces phage Satis [NCBI] |
2283264 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AXH66313.1
[NCBI]
Genbank nucleotide accession
MH576962.1
[NCBI]
CDS location
range 94356 -> 100691
strand -
strand -
CDS
ATGCCGCTGCCGCCCGAGCGTGTCGTCACGGGTACCTACGTCAACCCGGTCAACGGCGAGCCGTACGACGGCTCGAATGGCGACCACTACCTGATTTTCGAGCCCGTCCCCGACCGCTGGACCGATCGGGCCGGAAACCAGATTCTCCTCGGCGGCGGAAAGGTCACCCTCGACGAGAACGGACACTTCTCGGAAGACCTCGTCTGCACGGACGCGGCCGACGTGTACCCCGTGGAAGGCCGTCTGTGGCGCGTGCGTCAGTTCGTGGGCGGCTCCTGGGATTCCGGCACTTTCGCCCTGCCTGAAGGCGACGATCCGCTCGACATCACCGACATCCTCTCGGTGGACATTTGCGGCGTGGACTACGTCCCCGTGCCGGGTCCCCCCGGGCCGCAGGGTCCTCCCGGGCCGCCCGACGGCCCGCCTGGGCCGCCCGGACCCGAAGGTGCATCCGCCTACGAAGTGGCGGTCGAGCAGGGATTCTCCGGCACCGAGGACGAGTGGCTGGCCAGCCTCCAGGGGGAACAGGGTGAGCCCGGCGCGACCGGCGGCCCCGGCCCGCAGGGTGAGCCCGGCACGCCTGGCGAGGACGGCCCCTCCGCCTATCAGGTGGCCCTCGACAACGGGTTCGTCGGCACCGAGGCGCAGTGGCTCGCTTCACTCCAGGGACCGGCCGGTACGGACGGGGCGCCGGGCCCCGAAGGTGACTCGGCGTACGAAGTCGCCGTGGAGAACGGCTTCGAGGGCTCCGAGGCGCAGTGGCTGATCTCCCTCCGAGGCGCCCCCGGTGCCCCGGGCGACACCGGGCCGCAGGGTGAACCCGGTCCGGAGGGTCCTTCGGCCTACGAGGTGGCGGTCTCCGGAGGCTTCTCCGGTACGGAGGCGGAATGGCTCGCCTCTCTCGTGGGGCCCGAAGGTGACACCGGAGCCACCGGGGCCACCGGTCCTGCTGGCGATTCCGCCTACCAGGTCGCCCTCGACGAAGGCTTCGTGGGAACCGAGGAGGAGTGGCTTGCCTCGCTCGTGGGACCCCAGGGTCCCGCCGGTCGCGACGGCGGCATGGACACCGGCATCAGCCTCGGCGGAGACATCACCGCCAGCGAGACCGACCCGCTCGCAGTGGTCATCAACCCTCTCACGGGTCGCATCGTCGACTACTTCGCCGACCCGGTCACCATCACCGACGTCGAGGTGACCTCGCCCATCACGGTGACTCTCGACTCGGTGGCGCAGGAGCGCACGATCACCTGGCTCCTGATGGACGCGGACCAGATCGTCTACCAGCAGCAGGCCCGCCCCTCACCCGAGGACCGCCGGTCCTTCCTGGTGCTCGGCATGGTCGCCCAGGACGACGGCGAGATCTTCCTCGCGCAGTCGATCCCCACCATCGCGCGCCAGCCGGTGAACCAGCTCTACGACCTGATGGACGCCATCGGCGCCTTCGGCATCATGGGCAACGACGTGAGCCCGAACGGGGCGAACCTCCAGCTCAACGTCGGGGCCGGTCAGGTCTTCTCGCGCGGCTGGAATCACTTCGACGGCGGCACGGAAACCATCAACCCCCACATCGTCACCACCCTGGGCGCCACGCCTGCGTCCTGGACGCACGCCCTTCGGTCATCGGACCTGGAGCACGCCTCTGCCTCGACGACCGTGGACGTAGGGCACTACGACTTGAACGGCACGCTCACAGCCGTCGGCGGGGACACGGACACCTCCGTGGTGCACCAGCTCTGGATGTTCCCGACCAACGAGGGCTCGGAAGTTCACATCCTCCAGTACGGCCAGCAGCTTTTCGACACCCTCGAAGACGCCATCTCGGGCGCGGGCACCGCCGCCTTCGTGACGAACACGGCCCTGCCCGGCAACGCCATCATGCTCGGATACCTCGCGGTGCGGGGGGTGGCCACCGATCTCTCCGACGCCGCACAGGCCATGTTCATCAAGGCGGGCAAGTTCGGGGCGAGCCCCGGCGGCGGCGCCTCTGTGGACCTGAGCGGTTACGCCATGCTGGCGGGCGCCGAGTTCACGGGCCAGATCGCCACGAACATGGCGCAGCCGGACGCCGTCGCGCAGTCCTCGCGCGTCACGACGAACGGCTCGGACTCCTGGCGCAGGCAGGTCGACGGCGAGATGCAGTGGGGCCCCGGTGACGGCCCCATGGATGCGTTTCTGCGGCGCCTCGGCATCGGGATGCTCGCCTTCCTCAACACCGACCTGCTCGTGGGCCAGGAGGATGCGAAGTCGTACCGTTTCCGCCAGTCCGGTACCAGCCTGGATCTGGACGCCTCCGGCGCCGACCTGTTCCTGTCGGTGTTCGAGCTGGTCAACTTCCTCGGCGAGCAGTACACGTACCTCCGGCTGGAGTCCGGCGAGTTCACGGCGCACGCCTCCGGTAAGTGGGTCTTCGGCGACGGCCCGTTCGACGGGGCCGGGCACACGCTGGACGGCACCGCGAACCAGGCTGGCTTCTTCGGTGCGACGCCCGTGGGCCAGCAGACGGTGTCGGGTGCCCGAACCACTGGAGCGGGCCTCCAGAGCCTGCTGGACGCCCTCGAAGCACTCGGTCTGATCGTGGACTCCACGACCGCCGGAGTGGCCGTCGTGGAAACCGTCAACGGCCAGGCCGGGCCCGACGTGAGTCTCACCGCCTCCGACGTGGGCGTGGGCGACGTCATCGTGGCGGCGAACGACTCGCGCAACAAGGACGGCGCCGACTTCGTCTGCACGGGCACCAACGACCACCTCGTGATCCAGCAGGCGATCGATCTGGTGGACGCGGCGCCCGGCAAGGGAACGGTCCGGCTGCTCGACGGCACCTTCTGGCTGGGCGCGACCGTCACCATCCCGAGCGGCGCCGGTCTGAACCTGACGGGCACCGGGTGGGGCACCGTCCTGAAGATCTCCCCCGGGGTCGAGGACTACGCGATCACCTTCGACGGGGACGAGACGCGGGCGCGGTTCTCCCACTTCCGGATCGACGGAAACCTGAGCGGCCAGACGGCGGGTCCCTGCGGAGGCATCTGGGCAGCCGGGGCCGTCGAGTGCGTCTTCCAGCACATCAACTTCTCACACTGCTGGAGCGCCGGGCTGGTCCTCACCGCCATCAACGGTGGAGCCTTCGGCCACAACAACTACGTCCTGCACTGCCTGTTCGACAACGGCATGACCAGCACCGAAGAAGGCAACGGGATCTACCTCTCGTCCAGCGACGAGAACTTCATCGTCGCCTGCGACTTCCAGTTCCTGGGCGGCGCCTTCGGTGCGGGCGGCGCGGGCATTTACGACCAGGCGGGCACGCAGACCATCCTGGGCTGCAACTTCGTCGGTGGCGGCAACAGCCTTCCGGCGATCCGCGTGCAGGACGCCTCGGCCACGAAGATCACAGCCTGCAACTTCGACGGAGTCGGGGGCGACGCGGTCTTCCTGGCTGCGACGAACTGCCTGGTCGAGGGCAACACCATCTTCGGTGTCGGCGCCATCGGCACGGCGGGCACCTTCTCCGGGATTCACCTGGAGTACGGAGCGACCGGCAATCTCATCGCCAACAACTCGATCTCCTCCCACACCGTGAACGGCGCCGCACGCTCTCTCATCCGAGAGGAGTCGGTCGGCGATTCAGGCAACAACCTCGTCGTGGGCAACATGCTCATCACGAAGGGAACGCTGTCGGTCGGGGCGCTCGACCTCAACGCACCCGGCACCATCGCCCGGTCCAACTTCGGCGGCGGCGCGGACGGTGACCCGGTCGTCACCACGGACATGATCGGCGTAGCCGACGGCGTGGCCTCCCTCGACGCCACGGGCAAGGTGCCCGCCGCGCAGATGCCCGACGGGCTGGTCACCTCGGTCAACGGCGAGGACGGGGATGTCACTCTTACCGCAGCGGACGTGGGTGCGCTCGACGAAGCGGCGGGCGACGCGCGGTACACTCAGCAGGACGCGCTCTACCTGAACGCGAAGGAGCATGGGGCGGTCGGCGACGAGACGACGGACGACTCGACCGTCATCAACGCGCTGCTCTCCACGTCCCCGGCGGGCAGCACCGTGGTCCTGCCGCCCGGCACCTATGGCATCAGCGTGCCGCTCGTGGTTCCGCCGGGGAAGACCCTCATGGGCCTGCGCACGAACCTGATGCAGATCACCGACGTCTATGATCCGCAGGTCAGCATCAAGCCGCTCGCTGGCTTCACGGGCGTCGCGGCGGTGCGCTTCCTGGACCAGGCGACCGGTGGCTACGACGACATCTCCGGCGAGCAGCGGCTGATCGACATCATGCTCGACGGCTCCGCGCTCACCGCAGGAGTCGACGGACTCCAGGCCAAGGGCAACATCCAGAACGTCGGATTGCGCGATGTGACGATCCGGTTCTTCCCGAACTCCGGAATCTACTCCGGACTGGAGGGCGGCATCGGCCCGTACTCCTGGCGGATGCACCGGGTGATGCTCGACAACAACCACGCCCACGGCATGTACGCCGAGAAGATGGTCGACCTGACGATGGTCGACTGCCAGGCCATCGGCAACTGGTCCAACGGCTTCATGCTGAACAACGCGGCGAACTCCCAGGCGGTCGCCTGCCGCGCGGAGTGGAGCGGCAACCACGGCTTCTACCTCACGGGCGACTGGGGCGACGGCCAGGGCTCCGGCGGGATGCTCCTCTCGGCCTGCTCCACCGACCGCAACGGCTTCAACGGCGTCTTCATCGACTCCACCGGCAACGCGCCCATCGTCATCTCGAACCTGATGACGCGCCGCGACGGACGCAACGGAGGAACGGGCGGCGGCGGTTACGCCGGTCTCGCAGTGAACGCCGCCGAGATGCCGGTCATCGTCGGCGACTGGACGAACTACCCCGGGGTGGACGACAACGGCGCCTCGACCAGCAGTCCCGACTACGGGGCCTCCGTCACCGGCAGCACGCACGTGCAGATCGACAACGCCTACCTGCACGCCGCCACCGCCGGGCTCAACGATGGCGGCGGCAACACGGCCCTGATCCTGGGCGCGAACATCACGTACGCCACGGGCGCGACGACGGCTCCGGTGCGCGCGGTGCAGCAGCTCGCGGCGGCCCGCATCCCCGATCTCGCCGCATCCAAAGTGACCTCGGGAACCTTCGACGCCGCGCGCATCCCGGATCTGGCAGCCTCGAAGGTCACGTCGGGCACGTTCGACGCGGCCCGGATCCCGGCCCTGAACTACGTGCCCACCGGCTCGGCCGTTCTCCTCACGGGAGCGCAGACGGTCGGCGGACTGAAGACCTTCAGCTCCGGGCTCGCAGTGAAGGACGGCACCAACGCGGGCGACGTGCATATCAAGGACTCGCAGGACGTGAACCTGCTGGTCGTGGAGTCGAGCAACGCCACGGCCGCCGCCGGAGTCATGAAGCTCATCGCCGGGCAGGCCAGTCAGGCTGCCTTCCTCGGAACCTGGATGGGTGCGGCGACCAACGCCTGGTCGATCCGCGCCGACGGCCGTCTCGAAGCCGGTGACGGCGCTACCGGACGTGACACCTACCTGACCCGCACGGCAGCCGGAGTATGGCAGGTCACCAGCCAGATCAGGGCCAGCGCGTCGGCCCCCGCGAACGCGGCGGACCTCACGCGCAAGGACTACGTGGACGGCCTCGACACGGCGAACGTGAAGCTCACGGGCGCGCAGACCGTAGCGGGCATCAAGACCTTCAGCTCGATTCCCGTGGGTCCCGCCTCGAATCCGACCACCGACAGCCAGCTCGCCCGCAAGGCGTACGTGGATCAGATGGAGCCGAACGGGTTCGCTCCCGACGACATCGGCCTGCTGGCCTGGTCCTCCGACCCGGCCTGCTGTGTCTCGACTCCCGCCTTCTGTGGTGTGGGCGTGATGCGCATGACGGCGGTCGTGCTGCACACCGCGCAGACTGTGGGGAAGATCGTCTGGTACGCGGGCGGGTATGCGGGTGGTCTGCTCTCGGGATCCTGGGCGGGCATCTGGAATGCGGCCGGAACCACCCGGCTGGCTGGCGTGGCGGCCATGCACACGGGTGCCAGTGAACCCGCCGAGGTCCACAACGCGGGCGGTGCGATGCACTACGCACCGCTGGACGGCGGATCGGTCTCCCTGGCGGCCGGGGTCTACCTGGTGGCCTGGCGGCACGTCTACACGGCCTCCCCTGCCGACGGGCCGATGATCCTCCAGTACGAGAACGCGGCCGGATCCCCGCCGAACACCCTCGCACTGAACACGGTGAAGCGGTTCGGGTACATCACCTCCGGCGCGTCCACCCTCCCGAGCAGCATCTCGGGCGTGCTCACCGACGGTGGCAACCGCTTCTGGGTGGGCCTGGCCACCTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
367ef37744762e133dccd7c9b1d43a0e02a758e6d33e78a1de2263990dbdc062
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50