Protein
View in Explore- Genbank accession
- YP_009909918.1 [GenBank]
- Protein name
- tail fiber protein
- RBP type
-
TFTFTFTSPTF
- Protein sequence
-
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPVEGPVDGLKSVLLNSTPVLDSEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVVGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQNCDQSVPDGFGGTEPRITCNAWLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKVWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISIGGRVLAVNSQTRTLTLDREITLPFSGTTLISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVTADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKSFPKQAGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYSDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR
- Physico‐chemical
properties -
protein length: 1165 AA molecular weight: 127305,36860 Da isoelectric point: 5,85171 aromaticity: 0,07983 hydropathy: -0,32644
Domains
Domains [InterPro]
IPR053171
Unmapped
1–837
Unmapped
1–837
DC_0014
STR
1–1161
STR
1–1161
IPR055385
ATT
86–207
ATT
86–207
IPR055383
STR
610–714
STR
610–714
IPR036116
STR
617–717
STR
617–717
IPR003961
STR
618–710
STR
618–710
IPR003961
STR
618–701
STR
618–701
IPR003961
STR
620–715
STR
620–715
1
1165
Architecture
STR 1-85 | ATT 86-207 | STR 208-330 | ATT 331-498 | STR 499-715 | ATT 716-818 | STR 819-1164 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1165
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 1024 | 1024 | 0,9059 |
| Central domain | 1025 | 1154 | 131 | 0,1330 |
| C-terminal | 1155 | 1165 | 10 | 0,9803 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-1024
1-1024
Central
1025-1154
1025-1154
C-terminal
1155-1165
1155-1165
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage ev099 [NCBI] |
2847061 | Uroviricota > Caudoviricetes > Radostvirus > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
YP_009909918.1
[NCBI]
Genbank nucleotide accession
NC_049953.1
[NCBI]
CDS location
range 15491 -> 18988
strand +
strand +
CDS
ATGGGTAAAGGCAGCAGTAAGGGGCATACTCCGCGCGAAGCGAAGGACAACCTGAAGTCCACGCAGTTGCTGAGTGTGATCGATGCCATCAGCGAAGGGCCGGTTGAAGGTCCGGTGGATGGATTAAAAAGCGTGCTGCTGAACAGTACGCCAGTGCTGGACAGTGAGGGGAATACCAATATCTCCGGCGTCACGGTGGTGTTCCGGGCCGGTGAGCAGGAGCAGACACCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACGGAAGTGAAATACGACACGCCGATCACCCGGACCATCACGTCGGCAAACATTGACCGACTGCGTTTTACCTTCGGCGTGCAGGCACTGGTGGAAACCACCTCAAAGGGGGACAGGAATCCGTCGGAAGTCCGCCTGCTGGTTCAGATCCAGCGTAATGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGGTGGGTAACCTGCCGCCGCGCCCGTTCAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACCGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGATTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTGCGCGGGCGCATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAACCGGCATACAGCAACAACCCGGCCTGGTGTCTGTGGGATATGCTGACCCACCCGCGCTACGGCATGGGGAAACGTCTTGGTGCGGCGGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGAATTGCGACCAGTCGGTGCCGGACGGCTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCCTGGCTGACCACACAGCGCAAGGCGTGGGATGTGCTCAGTGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGACCATCAGATAAGGTGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAACAACGGCTGGGAGACGGCGACAGAGCTTGTGGAGGACACGCAGGCCATTGCCCGTTACGGTCGTAACGTCACGAAGATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACATCGCGCCGGGCTGTGGCTGATTAAAACAGAACTGCTGGAAACGCAGACCGTGGACTTCAGCGTGGGCGCAGAAGGGCTTCGCCATGTGCCGGGCGATGTCATTGAAATCTGTGATGATGACTATGCCGGTATCAGCATCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATTCTCCGGTACCACGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTGTGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAAGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTGAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACTGCCGAAGTCACCGCAGACAGCGGGGAATATCAGGTGCTGGCGCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTCCGTCTGACCGTAACAGCGGATGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCGCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGTGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGATCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCGGATTGCGGATATCAGGCAGGTTGAAACCACAGCACGCTATCTTGGCACGGCGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTTGGTCAGCCGAGTGATGATGCATCCGGTTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGGCTGAAATCAGGACGTCCATTACGGATGTCAGCAATGAAATCACGCAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCGGCAATTCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGTGACTTCGTTAAAGCTGTATCCAAATCATTCCCGAAACAGGCTGGTACGTGGGGTAACACGGAAACACCAAACGGGACGGTTACAGTCACCATAAGCGATGATCATAACTTTGACCGTCAAATCATTATTCCGCCCATTATCTTTAACGGAATAGCGTATAGCGATCCGGGAAGTGGTAATAACCCGGGAGGTACAAGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGTGTATTAATCGCATCCAGAGAAACTAAAGGGGCTATTCCCGGTAGCTACAGTGCGGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGGTTTTCCATAAAGGCAATCAGTGGGCAGGTAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCCGCTTCCGGCATCAGTATTCGTTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
a81b470d2b6945f8e9273073f97e3bbb41de3811055e5135b99a44b81d357431
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50