Protein
View in Explore- Genbank accession
- QIN95398.1 [GenBank]
- Protein name
- hypothetical protein
- RBP type
-
TSP
- Protein sequence
-
MAQTIRLISANVSPAPEKYILNKMYNCEISLEFVSQIDKMAFYNNNIVDADGNKLTTGGINLTPREPNSPFKISGISFIVYKKKDMSYFIEFRDIDGTIHKVPLVNITADQIEDGNSTPKILTCANNSSGEEYLYTLSPYRMDAGAFNNMFVIDSVDFFEYSNKETPFHTELLPSVGTSYIIPKTDRRTAVYAVVNISDRDTGKKLINSFYSFSEIHPNKTGPTTLLSSLKYLVDSTGTELSIFPKISVTPPIPNLIFNHNLYTPIQRGVSLAPNNRLDQTINMYIDGSTTAAGAVVASDNASFLTRPASTSTTISMANLKNGTKLFLKDYDQKVIKELDFVDTEYIILPISNLKVKSMKPMDFTKPVWEYEFSPILPGISETEPYVVFESFVKETMKKLGDRFYASDLKGAIVPNSDSIDNMLSSNTAGIEYYWLHIWYPNLGKMYLLRNSNGDGPGVYPLIKTGKEQVTVSKTGEVIHGKNIELAATEFGFSEINAGKRTFQWYKDGVELPGQTTRALLITNLKGEDSGDYSVKVTATPSHSYAEDSIMEFSSTPFNINVDTTRTITAELTCTPMPIVMGQPFTLSGKISGGYGTTIEYVRLIKNSVILEDYPLDSMTTSTVIPNATLTSSGIYTLAVSYFDNGVKRLALSEPVKAFFADSKPLNLTCVLAGPDMANEGDDVLFNANVAVSSDASLTPVYTMHWYKDDIPVSDSSPDNLSLSITNAHYPEDEGVYYCIVSAHVEGYEPATIESNKIDLTILTDIRLVARLYSDNAIINKGDTTIIKIGFQADRPVNPTWKWHHKDGTLLQENGTELMVSPTSTTTYYAIAYPGYGEGIQKPTMTNEFTIEVIEPSVDADCDIYIHPLMPGRQGGFLWVGWWVIDEINQAIKDGFDWKADPTNSRFKYPCTIKAIVRAMNDFGGVEAQESRNGYILKDDYFNR
- Physico‐chemical
properties -
protein length: 944 AA molecular weight: 104915,46330 Da isoelectric point: 4,94638 aromaticity: 0,10487 hydropathy: -0,22214
Domains
Domains [InterPro]
DC_0400
STR
116–556
STR
116–556
IPR007110
STR
457–551
STR
457–551
IPR013783
STR
470–551
STR
470–551
IPR003599
STR
473–563
STR
473–563
IPR036179
RBD
475–540
RBD
475–540
PTHR10075
Unmapped
502–867
Unmapped
502–867
cd00096
STR
696–753
STR
696–753
1
944
Architecture
STR 116-563 | STR 626-763 | RBD 855-942 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
944
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 182 | 182 | 0,3791 |
| Central domain | 183 | 518 | 337 | 0,8062 |
| C-terminal | 519 | 944 | 425 | 0,0803 |
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-182
1-182
Central
183-518
183-518
C-terminal
519-944
519-944
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage MN01 [NCBI] |
2711182 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
QIN95398.1
[NCBI]
Genbank nucleotide accession
MT129652
[NCBI]
CDS location
range 14623 -> 17457
strand +
strand +
CDS
ATGGCACAAACTATACGTCTTATTTCGGCTAATGTATCACCTGCCCCAGAAAAATACATATTGAACAAGATGTACAACTGCGAGATTTCTTTAGAATTTGTATCACAAATAGACAAAATGGCGTTTTATAATAATAATATAGTAGACGCGGATGGAAACAAACTTACTACCGGTGGTATAAATCTTACACCACGAGAACCTAATTCGCCATTTAAAATTAGTGGTATCTCATTTATAGTATATAAAAAGAAAGATATGTCTTATTTTATAGAATTCCGAGATATCGACGGAACTATACATAAAGTACCTCTTGTTAATATTACTGCAGACCAAATAGAAGATGGAAACTCGACACCAAAAATATTGACCTGTGCTAATAATTCTTCAGGTGAAGAGTATCTTTATACTTTAAGTCCATACAGAATGGATGCGGGCGCTTTTAACAATATGTTTGTTATAGATTCGGTAGATTTCTTTGAATATTCTAATAAAGAAACACCGTTTCACACTGAACTTTTACCAAGTGTAGGTACTTCCTACATAATACCTAAAACCGATAGAAGGACTGCGGTATATGCTGTAGTTAATATATCTGACCGTGACACAGGTAAAAAACTTATAAATTCATTTTATTCGTTTTCGGAAATCCACCCAAATAAAACAGGGCCCACAACCCTTCTTTCAAGTCTCAAATATTTAGTTGATTCTACAGGCACAGAATTATCGATATTTCCTAAAATATCAGTCACACCACCTATTCCAAATTTAATTTTTAACCACAATTTATACACACCTATACAAAGAGGAGTTTCTTTAGCCCCTAATAATAGATTAGACCAAACTATAAACATGTATATAGATGGTTCTACAACTGCTGCTGGAGCTGTAGTTGCGTCTGATAATGCTTCATTTCTTACTAGACCTGCTAGCACTAGCACTACAATATCAATGGCTAATCTAAAAAATGGGACCAAACTATTTTTAAAAGATTATGACCAAAAGGTAATAAAAGAATTAGATTTTGTTGATACAGAATATATCATATTACCTATATCAAATCTTAAAGTAAAATCAATGAAACCTATGGATTTTACTAAGCCTGTTTGGGAATATGAATTTTCTCCTATATTACCCGGTATAAGTGAAACGGAACCTTATGTTGTCTTCGAAAGCTTTGTAAAAGAGACAATGAAAAAATTAGGTGACCGTTTTTATGCTAGTGATTTAAAAGGTGCTATAGTTCCTAATTCAGATTCCATAGATAATATGTTATCTAGTAATACTGCTGGTATAGAATATTATTGGCTGCATATATGGTATCCTAATTTAGGAAAAATGTACTTATTAAGAAACTCTAATGGAGATGGTCCTGGTGTATATCCTCTAATAAAAACGGGCAAAGAACAAGTAACCGTTTCTAAAACCGGTGAAGTTATCCATGGTAAAAATATAGAACTCGCTGCAACTGAATTTGGATTTTCAGAAATAAACGCAGGAAAACGCACATTCCAATGGTACAAGGACGGGGTTGAATTACCAGGCCAAACTACAAGGGCCCTTTTAATTACAAATTTAAAAGGCGAAGACAGCGGCGATTATAGTGTTAAAGTAACAGCTACACCTAGTCACTCGTACGCTGAAGATTCTATTATGGAATTTAGTTCTACTCCATTTAACATCAATGTTGATACCACAAGGACCATAACAGCCGAATTGACGTGCACCCCAATGCCTATAGTAATGGGCCAGCCTTTTACTCTTTCTGGTAAAATAAGCGGTGGGTATGGAACTACTATAGAATATGTCAGGCTGATTAAAAACTCGGTAATCTTGGAAGACTATCCATTAGACTCTATGACAACTAGCACTGTTATCCCTAATGCAACTCTTACTTCAAGTGGAATTTATACTCTTGCCGTAAGTTATTTTGATAACGGAGTAAAACGTCTTGCTCTGTCTGAACCTGTTAAAGCATTTTTTGCTGATAGCAAACCATTAAATCTAACCTGTGTATTGGCAGGTCCTGATATGGCTAATGAGGGCGATGATGTTTTATTCAATGCAAATGTTGCGGTGAGTTCGGACGCCAGTTTGACACCTGTATATACAATGCATTGGTATAAAGACGATATACCGGTATCAGATTCATCTCCTGACAATTTGAGTCTAAGTATAACCAATGCTCATTATCCTGAAGATGAAGGGGTGTATTATTGTATAGTTTCGGCGCATGTTGAAGGATATGAACCTGCTACTATTGAGAGTAATAAAATTGATTTAACCATACTTACTGACATAAGATTGGTAGCAAGACTTTATTCCGACAACGCTATAATCAATAAAGGAGACACCACAATTATTAAAATAGGGTTCCAAGCAGATAGACCTGTTAACCCTACATGGAAATGGCACCACAAAGATGGTACTCTGTTACAAGAAAACGGAACCGAATTAATGGTTTCTCCAACATCAACTACTACCTATTATGCTATTGCATATCCTGGGTACGGCGAAGGAATACAAAAACCTACAATGACTAATGAATTCACTATAGAAGTTATCGAGCCATCAGTAGACGCTGATTGTGATATCTACATTCATCCATTGATGCCTGGACGTCAAGGTGGCTTTTTATGGGTGGGGTGGTGGGTTATTGATGAAATCAACCAGGCTATAAAAGATGGATTTGATTGGAAAGCAGACCCTACTAATAGTAGATTTAAGTACCCGTGTACTATAAAGGCTATTGTTAGAGCCATGAATGACTTTGGGGGAGTTGAAGCTCAAGAATCCAGAAATGGATACATTCTTAAAGACGACTATTTCAACAGGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
11577ab60a4fc1f8450250c3fbe4401fd0daaeb13dd2ef67cdb20e21e49f809f
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50