Protein
View in Explore- UniProt accession
- A0A8S5NVC4 [UniProt]
- Protein name
- Tail protein
- RBP type
-
TSPTSP
- Protein sequence
-
MIRLVIARNPFDLTTKQETIVPFVEGKKLNQYFTEPGEWVHSINGELVDDTASPTDEAYVVVLPKLEKQAFAILLSIGLSIATAGIASGAIFGITSVLGRTLAAMAIGMIGNAIISKIAAPKTDNSNTEQSATYGWQGAQTIIGQGHPLAITYGKCKSAGMLISRHVTSDGEKQYLNLLYCAGEGPIDAITDVKLNGNPIGNYKEVQLDVRLGTNDQEIIPNFNDNYADQPLTYELTNDWSIHQTQGNLSTALEVTISLPNGLYYSNDKGGLSETSVTIEGGYRKVGSAEWIPLPISNNGGQSAMLEKTDNRWFKRNSHSRTSIDNSQYTGVIKDSSNKAIYRVFRFDVKEPGQYEIRMRCAHKDGNSNRHVNKVYWSQLTQIVYDDFIHPGKVLIGIKALATDQLNGNDPNVTWIQERKTVWVFNTYTGAYEAKPANNPAWACYDILHHCRKIGDEYVVKGAPRERFVYDAFKAWADKCDEKHITFNYIYDNASQVWDALKYAENVGRGKVIPLGTRFSCIYDYAATPTQLFTVGNIKMDSFMEEFQATSSRANAIEVSFLNKAKDYERDVLPVFSEEYDVTTSLTSPAQVELMGCVDVDQAYNYAKHYLRANKYEVRTCTFEAFTDAIACTIGDVILLQHDVTDWGQGGRVESAVGNKVTLDREVTFEQGKTYRLMVRNAKTDALESYDVTGVTGKTLTLASNAVIQTDDLYTYGEATKEAKPFRVLSISKSNSEMTRKISCIEYYPELYAGDEGSVPIIDYTTKSDVIKVINLVLLADVKTLKDGTVLCDINGTWQLPRGKVAKNIIVYYKPVTAKEWQQFKVLDGSATSVTIPSVATDVNYDVKIVCTNNTGAAYEGVERAVYVSGKEIPPATPKGFKVTQDAVNSSVLHLSWEPNTEADLHGYTLYDGNDVVLIKHIGGTSYSYFIPNTGNYLFKLSAIDTSGNESGKAEARITASVSAESVATPKAPARGEVKIGKTITAAWDPVENTYIDYYEVRLDSNVGQSNNLLAKTTDIRSEIKLSARRGAVFVYAHNPVKGYGPALRLDYNAAVPKAPTNVKVKGNITGVSVVFDSIPDTCIGANIYIGTEKYFVITNVNMIPHDPGVFDVKVAYVDVFGEGTYSNIIGSSVPASIDPALIDKESLGIKAMDDKIKELTKTANAYSTQVKNLTTNMATQFSQLEDGIDLKLKALNGDELISRINLSSTGTRIDGKLLHVTGQALFDNNIITKRMLAAKAVSADKLNVSSLSAISANLGEVTGGKIIGGTIQNKTGTFKVDANGNIVGANITGSRIDAQSIMQAGFKIRNIDVQIYKVRHGDYCPLLEGFTESQCTFIPVGYKMTEDYSDVTGGTSDGREKWDIANGRRIDDCTIYFQSNISSNYHDTKPTIGLNGRRAVCQSIWYRYFSNRDDDGYHHHISFGELYVLVIGKK
- Physico‐chemical
properties -
protein length: 1435 AA molecular weight: 157842,12570 Da isoelectric point: 5,84318 aromaticity: 0,09199 hydropathy: -0,26544
Domains
Domains [InterPro]
DC_0129
STR
1–970
STR
1–970
IPR053171
Unmapped
184–951
Unmapped
184–951
IPR055385
ATT
230–385
ATT
230–385
NF040662
Unmapped
331–753
Unmapped
331–753
IPR036116
STR
797–878
STR
797–878
1
1435
Architecture
STR 1-229 | ATT 230-385 | STR 386-524 | ATT 525-648 | STR 649-970 | RBD 971-1430 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1435
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 283 | 283 | 0,6802 |
| Central domain | 284 | 482 | 200 | 0,1771 |
| C-terminal | 483 | 1435 | 952 | 0,1305 |
Note: Constraints were applied during segmentation.
Fixed 3 C-terminal predictions appearing before Central domain
Fixed 3 C-terminal predictions appearing before Central domain
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-283
1-283
Central
284-482
284-482
C-terminal
483-1435
483-1435
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Myoviridae sp. ctPT18 [NCBI] |
2825098 | Uroviricota > Caudoviricetes > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
DAD98689.1
[NCBI]
Genbank nucleotide accession
BK015266
[NCBI]
CDS location
range 31021 -> 35328
strand +
strand +
CDS
ATGATTAGATTAGTAATTGCTCGAAACCCATTCGACCTTACCACTAAACAAGAGACCATTGTGCCTTTTGTTGAAGGTAAAAAGCTAAACCAATATTTCACTGAACCAGGTGAATGGGTGCACTCCATAAATGGTGAGTTAGTAGATGATACCGCATCACCTACTGATGAAGCTTATGTAGTGGTATTGCCTAAACTTGAAAAGCAAGCATTCGCTATCTTATTATCTATTGGTTTATCAATAGCGACTGCCGGTATTGCCTCCGGTGCGATATTCGGTATTACTAGCGTATTAGGTCGTACGTTAGCAGCAATGGCTATCGGTATGATTGGTAACGCGATCATATCTAAAATAGCTGCACCTAAGACAGATAACTCTAATACAGAGCAGTCCGCTACTTATGGTTGGCAAGGGGCACAGACTATTATTGGCCAAGGTCATCCTTTAGCCATTACCTATGGTAAGTGTAAAAGTGCCGGTATGCTTATATCTCGCCACGTAACGAGCGACGGTGAAAAGCAATATCTTAACCTCTTATACTGTGCGGGTGAGGGGCCTATTGACGCTATAACGGACGTTAAATTAAACGGTAACCCTATCGGCAACTACAAGGAAGTTCAACTCGATGTAAGACTCGGTACGAATGACCAAGAGATTATCCCTAACTTTAACGATAACTATGCTGACCAACCTTTGACGTATGAGCTTACCAACGACTGGTCAATACATCAAACGCAAGGTAACTTATCTACTGCGTTAGAGGTTACTATATCACTCCCTAATGGTTTGTATTATTCAAATGATAAGGGCGGACTGAGTGAAACGTCAGTCACTATTGAAGGTGGCTATCGTAAAGTAGGCTCTGCAGAGTGGATACCATTACCGATTAGTAACAATGGTGGCCAAAGCGCTATGCTTGAAAAGACAGATAATCGTTGGTTTAAACGGAACAGTCACTCAAGAACGTCTATCGACAATAGCCAATATACTGGCGTTATCAAGGATAGCTCGAATAAAGCTATCTATCGTGTGTTCCGGTTCGATGTAAAAGAACCAGGGCAATACGAAATCCGTATGCGATGTGCACATAAGGACGGTAATTCTAACCGCCACGTAAACAAAGTATACTGGTCACAGTTAACTCAGATTGTCTACGATGACTTTATTCATCCTGGTAAGGTGCTCATCGGTATTAAGGCACTAGCTACTGACCAATTAAATGGTAATGATCCAAACGTAACTTGGATTCAAGAGCGTAAAACAGTATGGGTATTTAACACCTACACTGGAGCGTATGAAGCTAAGCCTGCTAATAATCCTGCATGGGCTTGCTACGATATCCTTCATCATTGCCGTAAGATTGGCGATGAGTATGTAGTTAAAGGTGCTCCTCGTGAGCGCTTCGTATATGACGCATTTAAGGCGTGGGCCGATAAGTGCGACGAGAAGCATATTACATTTAACTACATTTATGACAATGCTAGCCAAGTATGGGATGCACTTAAATACGCTGAGAATGTAGGTAGAGGTAAGGTAATACCTTTAGGTACTCGGTTTAGTTGTATTTATGATTATGCTGCTACACCTACTCAGCTATTTACTGTAGGTAATATCAAAATGGATTCCTTTATGGAAGAGTTCCAAGCTACATCATCTAGGGCAAATGCTATCGAGGTATCATTCCTCAATAAAGCAAAAGACTATGAGCGTGATGTACTTCCTGTGTTTAGTGAAGAGTATGACGTGACTACATCGCTCACCAGTCCAGCGCAAGTCGAACTCATGGGATGTGTGGATGTAGACCAAGCCTACAATTACGCTAAACACTACCTAAGAGCGAATAAGTACGAGGTGCGTACTTGTACCTTTGAGGCTTTCACAGACGCCATAGCGTGTACGATAGGGGATGTAATCCTACTACAACACGATGTGACGGACTGGGGGCAAGGTGGCCGTGTAGAGTCTGCTGTAGGTAATAAAGTAACTCTTGATAGAGAGGTTACTTTTGAACAAGGTAAGACTTATAGGCTCATGGTGCGTAACGCTAAAACGGATGCTTTAGAGTCTTACGACGTAACTGGTGTAACCGGTAAGACCTTAACGCTTGCTAGTAATGCGGTTATTCAGACCGACGATTTATACACCTATGGTGAAGCTACAAAAGAAGCTAAACCATTTAGGGTATTATCCATTAGCAAGTCTAACTCTGAAATGACTCGTAAGATATCCTGTATTGAGTACTACCCTGAGTTGTACGCCGGTGATGAAGGATCAGTACCAATCATCGACTACACAACAAAGTCTGATGTAATTAAGGTTATTAACTTAGTGCTCTTAGCTGACGTCAAGACATTAAAAGACGGTACTGTACTTTGTGATATCAATGGTACTTGGCAACTGCCACGGGGTAAGGTGGCCAAAAATATTATCGTGTATTACAAGCCTGTTACCGCTAAAGAGTGGCAACAGTTCAAAGTACTAGATGGCAGTGCTACTAGCGTAACTATTCCAAGTGTAGCGACCGACGTTAACTACGACGTTAAGATTGTATGCACCAATAATACTGGTGCTGCGTATGAAGGAGTAGAGCGTGCAGTGTATGTGAGTGGTAAGGAAATACCACCGGCTACACCTAAAGGCTTTAAGGTGACTCAGGACGCAGTAAATAGTAGCGTACTTCACTTATCATGGGAGCCTAATACAGAGGCTGACCTACATGGTTACACTTTATATGACGGAAATGATGTAGTGCTGATTAAACATATAGGCGGTACATCCTACTCATACTTCATTCCGAATACTGGCAATTACCTGTTCAAGCTATCTGCTATTGATACATCTGGTAATGAAAGTGGTAAGGCTGAAGCTCGTATTACTGCGAGTGTATCCGCTGAGAGTGTGGCTACACCTAAAGCACCGGCTCGAGGTGAGGTGAAAATTGGTAAGACGATCACTGCTGCATGGGACCCAGTAGAGAATACCTACATCGATTACTACGAAGTTCGACTTGATAGTAATGTTGGACAGTCCAATAATCTATTAGCCAAGACTACAGACATTCGCTCTGAAATTAAGTTATCGGCTCGTAGAGGTGCGGTATTCGTTTACGCACACAATCCTGTTAAAGGTTACGGTCCGGCGCTTAGACTTGACTATAATGCAGCAGTTCCTAAAGCTCCGACGAATGTCAAAGTAAAAGGTAATATTACAGGCGTTAGCGTGGTCTTTGATAGCATACCGGATACTTGTATAGGCGCTAATATCTACATCGGCACAGAGAAGTATTTCGTTATTACAAACGTAAATATGATACCGCATGACCCAGGTGTATTTGATGTAAAAGTTGCCTATGTTGACGTGTTCGGCGAAGGTACATACTCCAATATTATTGGTAGCTCTGTACCGGCTAGTATTGACCCGGCTTTAATTGACAAGGAATCTCTTGGCATTAAGGCTATGGACGATAAGATTAAGGAGCTCACAAAGACTGCTAATGCATATTCTACTCAAGTTAAAAACTTAACTACTAATATGGCTACTCAATTCAGCCAATTAGAAGACGGCATTGACTTGAAATTAAAAGCATTAAATGGTGATGAGCTAATCAGTCGTATCAATCTAAGCTCTACAGGAACAAGAATTGACGGTAAGTTACTGCACGTTACAGGGCAAGCATTATTCGACAATAATATCATTACTAAACGGATGCTCGCTGCCAAAGCAGTATCTGCAGACAAGCTAAACGTTAGCTCCTTAAGTGCTATCTCAGCTAACCTAGGTGAAGTAACAGGCGGTAAGATTATCGGCGGTACGATCCAAAATAAAACCGGTACATTTAAAGTTGACGCCAACGGTAATATAGTAGGTGCTAACATTACAGGCTCTCGTATTGACGCTCAATCCATTATGCAAGCCGGGTTTAAAATCAGAAACATTGACGTACAAATCTACAAAGTACGTCATGGCGACTATTGTCCCCTACTAGAAGGCTTTACAGAGTCTCAATGTACGTTTATCCCTGTTGGCTATAAAATGACAGAAGATTATAGTGACGTAACAGGAGGTACTAGCGATGGTCGAGAAAAATGGGATATCGCTAATGGGCGAAGGATTGATGATTGCACAATATATTTCCAGTCTAATATATCGAGCAACTATCACGATACTAAGCCAACCATTGGACTAAATGGTCGCAGGGCTGTTTGCCAATCGATATGGTATCGTTATTTCAGCAATCGGGATGATGATGGCTATCATCATCATATCTCCTTTGGGGAACTATACGTTCTCGTCATTGGTAAAAAGTAG
Genome Context
Genome Context
Tertiary structure
PDB ID
9ae5087464aacd2f400410c919c68ac90fd9d25fb5770c47e69380634445a32a
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50