Protein
View in Explore- Genbank accession
- BCI50034.1 [GenBank]
- Protein name
- phage host specificity protein
- RBP type
-
TFTSPTF
- Protein sequence
-
MGKGGGKAHTPVEAKDNLKSTQMMSVIDAIGEGPIEGPVKGLQSILVNKTPLTDTDGNPVIHGVTAVWRAGEQEQTPPEGFESSGAETALGVEVTKAKPVTRTITSANIDRLRVTFGVQSLVQTTSQGDRNPASVRLLIQLQRNGNWVTEKDVTINGKTTSQFLASVILENLPPRPFNIRMVRETADSTTDQLQNKTLWSSYTEIIDVKQCYPNTAIVGLQVDAEQFGGQQMTVNYHIRGRIIQVPSNYDPEKRTYSGIWDGSLKPAYSNNPAWCLWDMLTHPRYGMGKRLGAADVDKWALYAIAQYCDQTVPDGFGGTEPRMTFNAYLSQQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDVVWPYTNCDVVVDDNGVGFRYSFSALKDRHTAVEVNYTDPQNGWQTSTELVEDPEAILRYGRNLLKMDAFGCTSRGQAHRAGLWVIKTGLLETQTVDFTLGSQGLRHTPGDIIEICDNDYAGTMTGGRILSIDAASRTLTLDREVTLPETGTSTVNLINGSGKPVSVDITAHPAPDRIQVSTLPDGVATYGVWGLSLPSLRRRLFRCVSIRENTDGTFAITAVQHVPEKEAIVDNGASFEPQSGTLNSVIPPAVQHLTVEVSAADGQYLAQAKWDTPRVVKGVRFSLRLTSGSGEDSRLVTTAITADTEHRFSGLPPGEYTLTVRAINSYGQQGEPATTTFRINAPVVPATIELTPGYFQITAVPRLAVYDPTVQFEFWFSETKIADTSQVETSARYLGTGSQWSVSGPHIKPGKDFWFYVRSVNLVGKSAFVEASGRASNDAEGYLGLFREKIGKLHLAQGLWELIDNSQLADEMAEMKTTITETRNEITQTVSKTLEDQSATIQQIQRVQKDTNDDLAALYMLKVQKTKDGIPYVAGIGAGIEDTDGQPLSNILLLADRIAMINPESGNSTPLFVAQGNQLFMNDVFLKRLFAVSITSSGNPPAFSLTPDGRLTAKNADISGSVNANSGTLNNVTINENCQIKGKLSANQIEGDIVKTVSKSFPRTSTYASGTITVRISDDQKFDRQVMIPPVLFRGGKHENFNSNNQQSYWYSTCRLRVTRNGQEIFNQSTTDAQGVFSSVIDMPAGQGTLTLTFTVSSSGANNWTPTTSISDLLVVVMKKSTAGISIS
- Physico‐chemical
properties -
protein length: 1158 AA molecular weight: 126422,61930 Da isoelectric point: 5,60799 aromaticity: 0,07599 hydropathy: -0,30130
Domains
Domains [InterPro]
IPR053171
Unmapped
1–856
Unmapped
1–856
DC_0023
STR
1–1155
STR
1–1155
IPR055385
ATT
86–207
ATT
86–207
IPR055383
STR
609–712
STR
609–712
IPR036116
STR
617–709
STR
617–709
IPR003961
STR
617–706
STR
617–706
IPR003961
STR
617–699
STR
617–699
IPR003961
STR
619–716
STR
619–716
1
1158
Architecture
STR 1-85 | ATT 86-207 | STR 208-329 | ATT 330-498 | STR 499-713 | ATT 714-814 | STR 815-1158
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1158
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 1018 | 1018 | 0,9244 |
| Central domain | 1019 | 1147 | 130 | 0,1500 |
| C-terminal | 1148 | 1158 | 10 | 0,9875 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-1018
1-1018
Central
1019-1147
1019-1147
C-terminal
1148-1158
1148-1158
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Stx2a-converting phage Stx2_EH1910 [NCBI] |
2751387 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
BCI50034.1
[NCBI]
Genbank nucleotide accession
LC567834
[NCBI]
CDS location
range 33461 -> 36937
strand +
strand +
CDS
GTGGGCAAAGGTGGCGGCAAGGCGCACACGCCGGTTGAGGCAAAGGACAATCTTAAGTCCACGCAGATGATGAGCGTGATTGATGCGATTGGTGAAGGGCCGATTGAAGGTCCGGTGAAGGGGCTGCAGAGTATTCTGGTGAACAAAACCCCGCTGACGGACACGGACGGTAATCCTGTGATACATGGTGTGACAGCGGTCTGGCGCGCCGGGGAGCAGGAGCAGACACCACCTGAAGGCTTTGAGTCCTCCGGGGCGGAAACCGCACTGGGCGTGGAAGTGACGAAGGCAAAGCCTGTGACGCGCACCATTACATCCGCGAACATTGACCGCCTGCGGGTCACCTTCGGGGTGCAGTCACTGGTGCAGACCACCTCACAGGGTGACCGTAACCCGGCATCCGTCCGCCTGCTGATTCAGCTGCAGCGTAACGGTAACTGGGTGACGGAAAAGGATGTCACCATTAACGGCAAGACCACCTCACAGTTCCTCGCTTCGGTGATTCTGGAGAATCTGCCTCCCCGCCCCTTTAACATCCGGATGGTCAGGGAGACGGCGGACAGCACCACGGACCAGCTGCAGAATAAGACGCTCTGGTCGTCATACACCGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCGATTGTGGGGCTGCAGGTGGATGCGGAGCAGTTTGGCGGTCAGCAGATGACGGTGAACTACCATATCCGCGGTCGCATCATCCAGGTACCGTCAAACTATGACCCGGAAAAACGCACGTACAGCGGCATCTGGGACGGCAGCCTGAAACCGGCATACAGCAACAACCCGGCCTGGTGCCTGTGGGACATGCTGACTCACCCGCGCTACGGCATGGGAAAACGTCTGGGGGCGGCAGACGTGGACAAATGGGCGCTGTATGCCATTGCGCAGTACTGCGACCAGACGGTCCCGGATGGTTTCGGGGGCACAGAGCCGCGGATGACCTTTAATGCGTACCTGTCACAACAGCGTAAGGCGTGGGACGTTCTCAGTGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGCCAGACGCTGACGTTCGTTCAGGACCGCCCGTCGGATGTGGTGTGGCCGTACACCAACTGCGATGTGGTGGTGGATGATAACGGCGTGGGGTTTCGCTACAGCTTCAGCGCCCTGAAGGACCGCCACACGGCGGTGGAGGTGAATTACACCGACCCGCAGAACGGCTGGCAGACCTCCACGGAACTGGTGGAAGACCCGGAAGCCATACTGCGCTACGGGCGCAACCTGCTGAAGATGGATGCGTTCGGTTGCACCAGTCGCGGTCAGGCCCACCGTGCCGGGCTGTGGGTGATAAAGACCGGACTGCTGGAAACGCAGACGGTGGATTTCACGCTCGGGTCACAGGGGCTGCGTCACACACCCGGTGACATTATTGAAATCTGTGATAACGACTATGCCGGGACCATGACCGGCGGACGTATCCTGTCCATCGATGCCGCCAGCCGCACCCTGACACTGGACCGTGAGGTGACCCTGCCGGAGACAGGTACATCGACGGTGAACCTGATTAACGGCAGCGGTAAGCCGGTGAGTGTGGACATCACCGCACACCCCGCGCCGGACCGGATACAGGTCAGTACCCTGCCGGATGGCGTGGCGACATACGGTGTGTGGGGACTCTCCCTGCCGTCACTGCGTCGTCGCCTGTTCCGCTGTGTTTCCATCCGGGAAAACACGGACGGCACCTTTGCCATCACGGCGGTGCAGCACGTACCGGAAAAGGAAGCCATCGTGGATAACGGGGCCAGCTTTGAGCCGCAGTCAGGCACCCTGAACAGCGTTATTCCACCGGCAGTGCAGCACCTGACGGTGGAGGTGAGCGCGGCTGACGGTCAGTATCTGGCACAGGCGAAATGGGACACGCCGCGGGTGGTGAAGGGTGTGCGCTTCAGTCTGCGCCTGACCAGCGGAAGCGGAGAAGACAGCCGTCTGGTGACCACCGCCATCACTGCGGATACAGAGCATCGTTTCAGTGGTCTGCCGCCCGGGGAATACACCCTGACAGTCAGGGCAATTAACAGTTATGGCCAGCAGGGCGAACCGGCCACCACCACGTTCAGGATTAATGCACCTGTGGTACCCGCCACGATTGAGCTGACACCGGGCTATTTTCAGATAACAGCGGTCCCGCGTCTTGCGGTGTATGACCCGACGGTACAGTTTGAGTTCTGGTTTTCGGAGACAAAAATCGCAGACACATCTCAGGTGGAAACCTCTGCCCGTTATCTGGGGACCGGCAGTCAGTGGAGTGTATCCGGCCCGCACATTAAGCCCGGGAAGGATTTCTGGTTTTACGTGCGCAGCGTCAACCTGGTGGGGAAATCTGCTTTTGTGGAAGCCAGTGGCCGGGCCAGCAATGATGCAGAAGGGTATCTGGGGCTGTTTCGGGAAAAAATAGGAAAACTGCATCTGGCTCAGGGGCTGTGGGAGCTGATAGACAACAGCCAGCTTGCGGATGAGATGGCGGAGATGAAGACCACCATCACCGAAACCCGCAATGAAATCACACAGACGGTCAGTAAAACGCTGGAAGACCAGAGCGCCACCATACAGCAGATACAGCGCGTGCAGAAGGACACAAATGATGACCTGGCTGCGCTGTACATGCTGAAGGTTCAAAAAACGAAAGACGGCATTCCCTATGTGGCCGGGATTGGTGCAGGGATTGAGGATACTGATGGCCAGCCACTGAGCAACATACTGCTGCTGGCTGACCGTATCGCGATGATAAATCCGGAGAGCGGCAACAGCACGCCGTTATTTGTGGCGCAGGGGAATCAGCTGTTCATGAACGACGTGTTCCTGAAACGACTGTTTGCGGTGAGCATCACGTCATCCGGCAATCCTCCGGCATTTTCCCTGACGCCGGACGGGCGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGGACGCTCAACAACGTCACGATTAATGAGAACTGTCAGATTAAGGGGAAACTGTCAGCCAACCAGATTGAAGGCGATATTGTCAAAACGGTCAGCAAGTCTTTCCCCCGCACGAGCACTTATGCCAGTGGCACTATCACGGTAAGAATCAGTGATGATCAGAAGTTTGACCGGCAGGTCATGATACCGCCAGTGTTATTCCGCGGTGGTAAGCATGAGAATTTCAACAGTAATAACCAACAGTCATACTGGTATTCAACCTGCCGGTTAAGAGTGACCCGCAATGGTCAGGAGATTTTTAATCAGTCCACGACGGATGCTCAGGGCGTATTTTCCTCAGTTATAGATATGCCTGCCGGACAGGGGACGCTGACACTGACATTCACCGTATCTTCATCAGGAGCGAATAACTGGACACCAACAACCAGTATCAGCGATCTGCTGGTTGTGGTGATGAAAAAATCCACAGCAGGTATCAGTATCAGCTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
908f80bcd0895729b5209f374601b80bb8cde3e284c6c33c6834b8fae9f4162a
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| Dynamics of Stx phages in STEC O145:H28 | Nakamura,K., Ogura,Y., Iyoda,S. and Hayashi,T. | 2024-08-06 | — | GenBank |