Protein
- Genbank accession
- QNN98311.1 [GenBank]
- Protein name
- minor tail protein
- RBP type
- TSP
- Protein sequence
-
MRILLRDLTPGTDYNIQLRANDGTNVSDWSRIFPLTTIQDTLAPAAPTGLTWVVNRSAFSAKWNAVTQNEDASPLEDFSHYLVKIAIPGGSYVTIKTTNTFYDFPFETNKAQFGTPQASLEATVYAVDLTGNVSVASTTLTATNPPPPDPTGVVAAGIVGGVSMRWDVQVIDDLAAYDVYMSTSGSGFTPSNANRIYSGTGNTVVYDSSSLGVVHYFKIRSRDVFDSISNYVTVSATPISPTDVDTTAPGVPTGLAATMAVDTNDSAFAVATVSWTAPSDTDLAGYVVRYKQNADTGYDYVNVPVGTTSIIIGGLTVGVQYNFGVQAYDRSTNRSAYSTNVNATAANTAPSTPAAPTAVADVMSIQVSHSLQKATSGRLEADVSYLEVHLGTTSTFTASDSTMIGQLQVEPGSTFVSEVFNTPAQDSALARWVRVIAVDRGGLKSASSAVASVTVGLISNAYIANATITSAKISNLDANKITAGTGIINALLIKNSLTVDTGGTIKSTNYVAGTTGYQLSNNTLEINGGTIRAAALLLQDAPNIILPQYADFEFQSTWYTGKQVTFNDGGTTSWTIATAPELTPKFGTQCLKHTWTGGGTFSRVYQGSSYTDYNVVVEPNTDYIASVWVFNPSGSGDKTVGFGVKMGDGVTYPQPGGTPIVVANGTWTRISGTFNTGANSTLMTYMSLYNTGSVYFDGIQIERKLTSSTVASPWRAPSTTSIDGGIIRTGEIRSTALANGLSGQPAWSINMTGGAQFGDATVRGRIVVGDPSNPSADGVNSRIHSSNYVAGTSGWIIRNDGYAEFRNLAVNSIKVTALDAPMQNNTYAKLFDYMQDGSLWLSNGAVLQKTDPGAYSAESLFEFTGSGLILRNAVGVTKVAYDPTILYRISARVRAYSVATLNANPGFETNTTGWFAYGTNSITRDTTKFFTGVASVRWDQVGSSNGVYGMGTVVNVKAGYTYTFSARMLPNSTVIRDNLKMNIVWKDAANATISTTFNDMPPPVDVNGTPIPIDGTTWSQFSSTGTAPTGAVTASFEIQAGISGSPVAGIQGWFDDVTITTPPRIKVGLFGFDNNNNIIDWDYVDDATTPTKKHAMPTDYSLLSGYSANQYMMVQNNSEVQIATGSSATTADWITVTGYVRGRGGAGATGVLGTQGEHQDPYSPASLNQSVRYLVPYVEWDVATGSIAQLDQFSIEAYENGAPAKVATTDVNGQKAVSVENIQGSIFDHAIRFYSGEADEIYPGMIGHVMDGDWNDASHIRIAPALINKNSDYGSVFIGVYDRNPNYIYDATFEDGITGWTGMANTTLSQETTVGREDTNSLKILATGTISNPATTELLGKYNVSTLVNQELIGQKVTVTGYAMMGTATGRNVRLVVKFLDEAGAMINGYFIEKAVTNTAWVNFSFISPIVIPDTCYSVEFSFSWFNGATGDIVYVDDVQLEMGTRTDFRTASASKIMLQSDYVKSNGGIIISKSELDLPDNVWGSTNGGRRDTPFYPGVILQSEGGTGGFRYVNYTDSAGNRASTQVTNFSPSGVEEHGIRFYGMNDATYPGRWVLTNATGQFVMSSYANTDEDMTTARNVRVYGHLDVEGAPAWASATLLNGATAVSGAYTPSYVLNNSTTYFRGAVAGWTKGSGTPFMTLPSGLRPAKTIYLSTVSYTTTSDAQATVLVVTTAGNVYVYSSVATKTNVMLDGLSFSTAPDTYTTPPTGDTTAPGTPTGFKITPLSSSTTTGTYRLNWTNPSASDTAGVKVIWRSDRYPTVTIAGSGTKTLTTDGKIITVTGGASAAKQYDHSGLPVNKTIYYRVVSYDTSGNHSTYVSASRYLLASPVTVTANSSDSYRLGYGGMWRNDGDEVYQGDWTGNDNHRGIYLYGTKIYDALNTGGVVRTPTKATIYLKRLSTAHGNNTGVGINLRGHIYQTKPSGDPVGGMTNEGSDGDDIVFLSRGEAATVTIPSSWYNNIVDSTAANRIEGFGVYGSTTSDYAVMYGVSSGSSYGKIT
LYHKG
- Physico-chemical properties
- protein length: 2005 AA; molecular weight: 213323.98620 Da; isoelectric point: 5.04750; aromaticity: 0.10075; hydropathy: -0.15830
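The aromaticity and hydropathy figures above can be approximated with standard definitions. The sketch below assumes, since the page does not say how the values were computed, that aromaticity is the fraction of F, W and Y residues and that hydropathy is the mean Kyte-Doolittle score (GRAVY). The function names are illustrative, and the example input is only the first 20 residues of the protein, so its values will differ from the full-length figures.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only as a short demo input.
n_terminal_fragment = "MRILLRDLTPGTDYNIQLRA"
print(len(n_terminal_fragment))
print(round(aromaticity(n_terminal_fragment), 5))
print(round(gravy(n_terminal_fragment), 5))
```

Passing the complete 2005-residue sequence instead of the fragment should give values comparable to those listed, provided the assumed definitions match the ones the database uses.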
Domains
Domains [InterPro]
| Signature | Type | Residues |
|---|---|---|
| DC_2195 | ATT | 1–43 |
| IPR003961 | STR | 1–40 |
| IPR036116 | STR | 3–102 |
| IPR003961 | STR | 249–335 |
| IPR003961 | STR | 249–344 |
| IPR003961 | STR | 251–351 |
| IPR003961 | STR | 271–341 |

Sequence axis: residues 1–2005
Architecture
ATT 1-494 | STR 495-1827 | RBD 1828-1996 |
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
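The architecture string can be turned into explicit segments to see which residues fall outside any annotated region. This is a minimal sketch: the helper names `parse_architecture` and `unmapped_ranges` are hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
import re


def parse_architecture(text: str) -> list[tuple[str, int, int]]:
    """Parse 'ATT 1-494 | STR 495-1827 | ...' into (type, start, end) tuples."""
    return [
        (m.group(1), int(m.group(2)), int(m.group(3)))
        for m in re.finditer(r"([A-Z]+)\s+(\d+)-(\d+)", text)
    ]


def unmapped_ranges(segments, length: int) -> list[tuple[int, int]]:
    """Return 1-based inclusive ranges not covered by any segment."""
    gaps = []
    position = 1  # next residue that still needs to be covered
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start > position:
            gaps.append((position, start - 1))
        position = max(position, end + 1)
    if position <= length:
        gaps.append((position, length))
    return gaps


segments = parse_architecture("ATT 1-494 | STR 495-1827 | RBD 1828-1996 |")
print(segments)
print(unmapped_ranges(segments, 2005))
```

For the architecture listed here, the only stretch left uncovered is residues 1997–2005, which corresponds to the "Unmapped" legend entry.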
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout (residues 1–2005)
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 248 | 248 | 0.6821 |
| Central domain | 249 | 545 | 297 | 0.4570 |
| C-terminal | 546 | 2005 | 1460 | 0.2756 |
Note: Constraints were applied during segmentation.
- Fixed 7 C-terminal predictions appearing before Central domain
- Sequence started with non-N-terminal domain
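A quick consistency check on a segmentation like this is to confirm that the domains tile the sequence from residue 1 to 2005 with no gaps or overlaps. The sketch below assumes 1-based inclusive Start/End coordinates; `check_tiling` is a hypothetical helper, not part of the database's pipeline.

```python
def check_tiling(domains, length: int) -> bool:
    """True if the domains cover 1..length contiguously, without gaps or overlaps."""
    expected_start = 1
    for _name, start, end in domains:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


domains = [
    ("N-terminal", 1, 248),
    ("Central domain", 249, 545),
    ("C-terminal", 546, 2005),
]
print(check_tiling(domains, 2005))
```

The boundaries in the table pass this check, so every residue is assigned to exactly one of the three domains.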
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-248
- Central: 249-545
- C-terminal: 546-2005
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Streptomyces phage LilMartin [NCBI] | 2767566 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | | |
Coding sequence (CDS)
Genbank protein accession: QNN98311.1 [NCBI]
Genbank nucleotide accession: MT684590 [NCBI]
CDS location: range 43837 -> 49854, strand +
CDS
ATGAGAATTCTTCTAAGAGACCTGACACCAGGCACAGACTACAACATTCAGCTACGTGCGAATGATGGAACAAACGTCTCCGACTGGAGTCGCATCTTTCCGCTTACCACAATTCAGGACACGCTAGCTCCAGCAGCCCCAACGGGGCTTACTTGGGTTGTCAACAGAAGCGCATTCTCTGCTAAGTGGAATGCTGTCACTCAGAATGAAGATGCCAGTCCATTGGAAGACTTTTCTCACTATCTTGTGAAGATTGCCATTCCGGGCGGCTCATATGTCACCATCAAGACTACCAATACATTCTACGATTTCCCATTCGAAACAAATAAGGCTCAGTTCGGAACTCCACAAGCTTCTCTAGAAGCTACTGTTTATGCTGTAGACCTTACCGGAAATGTGTCAGTAGCGTCCACAACTCTTACTGCTACAAACCCACCACCACCTGACCCAACTGGAGTAGTTGCGGCCGGTATTGTGGGTGGCGTTAGCATGCGTTGGGACGTGCAGGTAATTGATGACCTAGCGGCCTATGACGTATACATGAGTACGTCAGGCTCAGGATTCACTCCAAGCAATGCAAATAGGATTTATAGTGGCACAGGAAACACCGTCGTATATGACTCTTCATCTCTCGGTGTCGTCCACTACTTCAAGATTCGCTCACGTGACGTATTCGATAGCATTTCAAACTACGTCACTGTTTCAGCGACTCCTATTTCCCCAACCGACGTAGACACAACTGCGCCCGGCGTTCCAACAGGTCTCGCTGCAACTATGGCAGTAGATACCAATGACTCTGCTTTTGCTGTAGCTACAGTTTCATGGACAGCTCCATCAGACACCGACCTTGCCGGGTATGTTGTCCGATACAAGCAGAATGCAGACACTGGATATGACTACGTGAATGTCCCTGTCGGAACAACATCAATTATTATCGGTGGTCTCACCGTAGGTGTTCAGTACAATTTTGGTGTACAGGCTTATGACCGTAGCACAAACAGGAGCGCTTACTCTACGAATGTAAATGCGACGGCAGCCAATACCGCCCCGTCAACTCCTGCCGCTCCAACTGCTGTCGCAGATGTTATGAGTATTCAGGTAAGTCACTCATTGCAGAAGGCAACCTCTGGACGCCTTGAAGCAGACGTAAGCTATCTCGAAGTTCACCTGGGAACCACCTCTACATTCACCGCAAGCGACTCTACAATGATTGGGCAATTGCAGGTCGAGCCTGGTAGCACCTTCGTTTCTGAAGTTTTCAATACGCCTGCTCAAGACTCTGCTTTGGCTCGTTGGGTTCGAGTTATTGCTGTTGACCGAGGTGGACTGAAGTCAGCATCTTCTGCTGTTGCATCAGTCACTGTCGGTCTAATCAGCAATGCCTACATCGCCAATGCAACCATTACCTCAGCCAAGATTAGCAATCTAGACGCCAACAAGATTACGGCTGGTACAGGTATTATCAATGCTCTGCTTATCAAGAACTCATTGACAGTAGACACTGGCGGTACTATCAAGAGCACCAACTACGTAGCTGGCACGACCGGATACCAGCTATCCAATAACACCCTTGAAATCAATGGTGGAACAATTCGTGCCGCAGCTCTATTGCTACAGGATGCTCCAAACATCATTCTTCCTCAGTACGCAGACTTTGAATTCCAGTCAACCTGGTACACAGGAAAGCAAGTCACCTTCAATGATGGAGGAACAACGTCATGGACAATTGCTACTGCCCCTGAATTGACTCCTAAGTTCGGCACTCAATGTCTTAAGCACACCTGGACTGGTGGAGGAACATTCTCCAGAGTTTATCAGGGTTCAAGCTACACAGACTACAATGTCGTTGTAGAACCAAACACTGATTATATCGCTTCTGTATGGGTATTCAATCCATCAGGTTCTGGTGACAAGACAGTCGGATTCGGTGTCAAGATGGGAGACGGTGTCACTTATCCACAGCCTGGTGGAACTCCAATTGTTGTCGCTAATGGAACATG
GACCAGAATTTCTGGAACATTCAATACTGGGGCAAACAGCACTCTCATGACTTACATGAGTCTGTACAATACTGGCTCTGTGTATTTCGATGGTATTCAGATTGAGCGAAAGCTAACAAGCTCAACTGTTGCTTCTCCATGGCGTGCGCCAAGTACCACCTCAATTGATGGTGGAATTATTCGTACTGGTGAAATTCGTTCCACTGCTCTGGCCAATGGTCTTTCTGGTCAGCCAGCATGGTCAATCAATATGACTGGTGGAGCACAGTTTGGTGACGCTACTGTCCGAGGACGAATTGTTGTTGGTGACCCAAGCAATCCATCAGCCGATGGTGTTAACTCTCGTATCCATTCATCAAACTATGTGGCCGGTACGAGTGGTTGGATTATCCGCAATGACGGATACGCTGAATTCCGTAACCTTGCAGTCAACTCAATCAAGGTAACAGCCCTTGACGCTCCAATGCAGAACAATACCTACGCAAAGCTATTTGACTACATGCAGGACGGTAGCCTATGGCTATCAAATGGTGCAGTTTTGCAGAAGACCGACCCTGGAGCTTATTCCGCTGAATCACTATTTGAGTTCACTGGCTCTGGTCTGATTCTGCGTAATGCTGTCGGTGTCACCAAGGTTGCCTACGACCCAACGATTCTCTACCGAATCTCTGCTCGTGTCCGTGCGTACTCCGTAGCAACCCTCAACGCCAATCCAGGCTTTGAAACAAACACGACTGGGTGGTTTGCCTATGGTACAAACAGCATTACGCGAGACACCACGAAGTTCTTCACAGGGGTCGCATCTGTTCGATGGGACCAGGTGGGCTCATCAAACGGTGTCTACGGAATGGGAACAGTTGTCAACGTAAAGGCTGGATACACTTACACATTCTCAGCTAGAATGCTTCCAAACAGCACAGTCATCAGAGACAATCTCAAGATGAACATTGTCTGGAAGGACGCTGCGAATGCAACAATTAGCACGACTTTCAATGACATGCCACCTCCAGTCGATGTAAACGGAACTCCAATTCCAATTGATGGAACGACATGGTCACAGTTCTCATCCACGGGAACTGCTCCAACAGGTGCAGTGACTGCTAGCTTTGAAATCCAGGCCGGTATCTCAGGAAGTCCGGTAGCTGGAATTCAGGGATGGTTTGATGATGTAACTATTACCACTCCTCCACGCATCAAGGTAGGTCTGTTCGGATTCGACAACAACAATAACATCATTGACTGGGATTATGTTGACGATGCGACAACGCCAACCAAGAAGCATGCCATGCCAACCGATTACAGCCTGCTTTCTGGATATTCTGCAAATCAGTATATGATGGTTCAGAATAATTCAGAAGTACAAATTGCGACCGGTAGCTCAGCAACCACTGCTGACTGGATTACTGTAACTGGATATGTTCGTGGTCGAGGTGGAGCCGGTGCAACAGGTGTTCTCGGTACTCAGGGAGAGCATCAGGACCCATATAGCCCAGCATCCCTCAACCAGTCCGTTCGTTACCTTGTTCCTTATGTGGAATGGGATGTTGCAACAGGCTCAATTGCTCAGCTTGACCAGTTCTCCATTGAAGCATATGAAAATGGTGCCCCTGCAAAGGTCGCTACGACAGATGTCAATGGTCAGAAGGCAGTCTCTGTTGAGAACATTCAGGGTTCAATCTTTGACCACGCCATTAGGTTCTACAGTGGCGAGGCTGATGAAATCTATCCTGGAATGATTGGTCATGTAATGGATGGTGACTGGAACGATGCATCTCACATCAGAATAGCACCTGCGCTCATCAACAAGAATTCCGACTATGGCTCTGTGTTCATTGGAGTTTATGACCGTAATCCAAACTACATCTATGATGCTACATTTGAGGATGGTATTACGGGATGGACCGGAATGGCGAATACCACGCTTTCTCAGGAAACGACAGTCGGTAGAGAAGACACAAACTCTCTGAAGATTCTGGCTACAGGAACCATCTCAAATCCAG
CAACAACTGAGCTTCTTGGTAAGTACAATGTAAGCACTCTGGTCAATCAGGAACTCATCGGTCAGAAGGTAACGGTTACTGGTTATGCAATGATGGGAACCGCGACCGGCAGGAATGTTCGCCTTGTTGTTAAGTTCCTTGATGAAGCCGGAGCCATGATTAATGGTTACTTTATCGAAAAGGCTGTCACCAATACGGCATGGGTGAACTTCTCATTCATCTCTCCAATTGTCATTCCAGACACCTGCTACTCTGTTGAGTTCTCATTCTCATGGTTCAATGGTGCAACTGGCGACATTGTGTACGTTGATGATGTCCAGCTTGAAATGGGAACAAGAACAGACTTCCGTACCGCATCTGCAAGCAAGATTATGCTTCAGTCCGACTACGTAAAGAGCAATGGTGGAATCATCATTTCAAAGTCCGAGCTTGACCTTCCTGACAATGTCTGGGGTTCAACAAACGGTGGGCGTCGTGACACACCATTCTATCCTGGTGTAATTCTACAGTCTGAAGGTGGAACTGGTGGATTCCGTTACGTAAACTACACAGACTCTGCTGGTAACCGGGCATCAACTCAGGTAACGAACTTCTCTCCATCTGGAGTCGAAGAACACGGAATTCGATTCTATGGAATGAATGACGCAACGTATCCTGGACGATGGGTTCTCACGAACGCTACGGGTCAGTTTGTAATGTCTAGCTATGCAAATACTGATGAGGACATGACGACTGCTCGAAACGTTCGAGTATATGGCCACCTTGATGTTGAAGGCGCACCGGCTTGGGCTTCTGCAACACTTCTTAATGGAGCTACCGCAGTTTCTGGTGCATACACTCCTTCATACGTTTTGAACAATTCAACAACGTATTTTCGTGGCGCGGTAGCTGGTTGGACAAAGGGGTCAGGAACTCCATTTATGACCCTCCCATCTGGCTTGAGACCAGCAAAGACTATTTACCTGTCAACAGTTTCATACACAACTACTTCCGACGCTCAGGCAACGGTTCTTGTTGTTACAACTGCTGGTAACGTATACGTCTATTCATCTGTTGCAACAAAGACAAACGTAATGTTGGACGGTCTTTCATTCTCTACTGCTCCTGATACATATACGACTCCACCAACTGGTGATACGACAGCTCCTGGAACCCCTACCGGATTCAAGATTACACCATTGTCATCAAGCACCACAACTGGTACTTATCGTCTGAACTGGACCAACCCTTCAGCATCAGACACGGCCGGTGTCAAGGTTATCTGGAGAAGCGACAGATATCCTACTGTAACAATTGCAGGAAGCGGTACTAAGACTCTTACTACAGATGGAAAAATCATTACCGTAACAGGTGGTGCAAGCGCTGCAAAGCAGTACGACCACTCAGGTCTACCAGTTAACAAGACCATCTACTACCGAGTTGTTTCATATGACACAAGTGGTAACCACTCAACATATGTTTCAGCAAGCAGGTATCTACTGGCTTCACCAGTCACTGTTACAGCGAACTCTTCAGACTCCTACCGTCTTGGATATGGTGGTATGTGGAGAAACGACGGTGACGAAGTCTATCAGGGTGACTGGACAGGCAATGACAATCACCGAGGAATCTATCTGTATGGAACGAAGATTTATGATGCTCTCAATACGGGCGGCGTTGTAAGAACTCCTACAAAGGCTACTATTTACCTCAAGCGACTAAGCACAGCTCATGGTAACAACACTGGAGTAGGAATTAATCTTCGAGGTCATATCTATCAGACTAAGCCTTCTGGTGACCCTGTTGGAGGTATGACTAATGAAGGCTCTGATGGAGATGACATTGTGTTCCTTTCAAGAGGAGAGGCAGCGACAGTAACAATTCCATCAAGCTGGTATAACAATATTGTCGATTCAACTGCTGCAAACCGAATTGAAGGATTTGGTGTTTATGGAAGTACTACAAGTGACTACGCAGTAATGTACGGAGTCAGTTCAGGTTCTTCTTACGGAAAGATTACG
CTCTACCACAAGGGCTGA
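The CDS coordinates and the protein record can be cross-checked with a small translation sketch. It assumes 1-based inclusive coordinates, a CDS that includes its stop codon, and the standard sense-codon assignments (which the bacterial/phage table, NCBI table 11, shares with the standard code). The function names are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(start: int, end: int) -> int:
    """Protein length implied by inclusive CDS coordinates, minus the stop codon."""
    nucleotides = end - start + 1
    return nucleotides // 3 - 1


print(expected_protein_length(43837, 49854))  # 2005
print(translate("ATGAGAATTCTTCTAAGA"))        # MRILLR (first six codons of the CDS)
print(translate("CTCTACCACAAGGGCTGA"))        # LYHKG (last codons, ending in the stop)
```

The 6018-nucleotide range corresponds to 2005 codons plus a stop, matching the stated protein length, and the first and last codons translate to the start (MRILLR) and end (LYHKG) of the protein sequence.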
Genome Context
Tertiary structure
PDB ID
4a1d33062bd44104154f1eff828361270c5d98c6ca6f56d45ef4ab6e6a10fd3b
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
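The confidence bands can be applied to per-residue pLDDT scores with a small lookup. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so this sketch assigns them to the lower band as an assumption. `plddt_band` is a hypothetical helper.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; this is an
    assumption, since the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for example_score in (95.2, 81.0, 63.5, 42.0):
    print(example_score, plddt_band(example_score))
```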