Protein
View in Explore- Genbank accession
- NP_958185.1 [GenBank]
- Protein name
- tail needle protein
- RBP type
-
TF
- Protein sequence
-
MADSNLNVPVIIQATRLDTSVLPRNIFSQSYLLYVIAQGTDVGNVANKANEAGQGAYDAQVRNDEQDVILADHEQRISAAEATLVNHEERIRQAESTLQDHETRIAQNESDISSLDTRVQSLESQVSDHETRIDALEYATTRKKSEVVYSGVSVTIPTAPTNLVSLLKTLTPSSGTLAPFFDTVNNKMVVFNENKTLFFKLSIVGTWPSGTANRSMQLTFSGSVPDTLVSSRNSATTTDNILLATFFSVDKDGFLATNGSTLTIQSNGASFTATTIKIIAEQ
- Physico‐chemical
properties -
protein length: 282 AA molecular weight: 30597,61360 Da isoelectric point: 4,79979 aromaticity: 0,06028 hydropathy: -0,20745
Domains
Domains [InterPro]
SSF57997
STR
40–140
STR
40–140
SSF57997
STR
50–140
STR
50–140
Coil
Unmapped
70–139
Unmapped
70–139
G3DSA:1.20.5.170
STR
70–126
STR
70–126
IPR032395
RBD
108–282
RBD
108–282
1
282
Architecture
STR 40-282
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Shigella phage Sf6 [NCBI] |
10761 | Uroviricota > Caudoviricetes > Lederbergvirus > |
| Host |
Shigella flexneri [NCBI] |
623 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
NP_958185.1
[NCBI]
Genbank nucleotide accession
NC_005344
[NCBI]
CDS location
range 8254 -> 9102
strand +
strand +
CDS
ATGGCGGATTCGAATCTCAATGTGCCGGTAATCATCCAAGCTACGCGGCTCGATACATCAGTCCTTCCACGCAATATCTTCTCGCAGTCGTATCTGCTTTACGTTATCGCACAGGGCACTGATGTTGGTAACGTGGCTAACAAAGCCAACGAGGCCGGACAGGGCGCTTATGATGCACAGGTCAGGAACGATGAGCAGGATGTGATTCTCGCTGACCATGAGCAGCGAATTTCTGCTGCGGAAGCAACGCTTGTTAATCATGAGGAGCGAATCAGACAGGCAGAATCAACTCTTCAGGACCATGAAACACGAATAGCTCAGAATGAAAGCGATATTTCGTCGCTTGACACAAGAGTTCAGTCGCTGGAGTCGCAGGTTTCAGACCATGAAACGCGCATCGATGCTCTGGAGTATGCAACCACACGCAAGAAGTCAGAGGTTGTTTACTCTGGTGTATCTGTAACCATCCCGACAGCGCCGACCAACCTTGTTAGCCTGCTGAAAACGCTCACACCGTCATCCGGCACGTTGGCACCATTCTTCGACACCGTTAACAACAAGATGGTTGTGTTCAACGAGAACAAAACCCTGTTCTTCAAGCTGTCGATTGTCGGGACGTGGCCCAGTGGAACCGCCAACAGGTCAATGCAGCTAACCTTTTCCGGCTCTGTTCCTGACACACTGGTAAGCAGTCGCAACTCGGCGACAACGACCGATAACATCCTGTTAGCTACGTTCTTCAGCGTGGATAAAGACGGCTTTCTTGCCACAAATGGCAGCACGTTAACCATTCAGTCGAATGGTGCGTCGTTTACTGCCACAACCATCAAGATAATCGCGGAGCAGTAA
Genome Context
Genome Context
Tertiary structure
1 / 2
PDB ID