Protein
View in Explore- Genbank accession
- YP_009621320.1 [GenBank]
- Protein name
- tail protein
- RBP type
-
TF
- Protein sequence
-
MADNIEIEQRVLDGDLIGPVNAEIDTEDFTGRVALENSLLDGTGSEISIELSESLGVVGIDDLVEVKSEIDKSVLKAESTVAEAQRQIDKANDTIAALNSISSQITNLVGEAKQAASTATNEAADAERSENAAAGSAKAASDSQKAAAGSAKAASDSQKAAATSQAQALASQKAAKTSETNASGSEARTAQSEKNAAASAKEADTDQKAAKAAQKAAEAARDAAKSSENKSNTYAGNAYSSKTDAETALAAAKKSEAAAAQSAKEAETSASSMTQSVADARAARDAAQNARDRSESARDISVSKASLATTKASEASSSASSASSSATAAKTSETNALNSQRAAKTSETNSKTSETNSKASERSALASKDSAAKSAAAASTSAAAAKKSQDAAKASRDEAEHFAEELRKGSVYRGIWNPNSNKYPDVPPTNSRWDVQLNNGQLEKIFDGKEWNWGDRLIYVLETKSFDIIDSGTGVTSINGETGSVTINRDTLGALGKTEKAADSAKLEGSTKAQIIAKAREGVASAGTSYNKAESDSKYLPKTGKAADSEKVGGVDITRIVYGTNRFATMQQTNFNNNRKSGFIEMSNTISASKPGRASSWVWGWQTAHTGNDTSNKYGAQLVINQDSQLYYRVQNSGGVGTWATVYSTANKPTAADVGAVPKTGGTFTSNVTVSAGGANRNALELNTGNTNGGGIHLKMTDQAGSYTQYGTIGYYHQDDNSYGGQAAFVVGTSSDTEKTNFVLTGTGNFKVGNNEVYHEGNKPTLEDVTSIASVNKSLKLTANTWTTLLTRPDGLYESGVYQILVSYNSSDKGGGAYSQNYTGQFYWYAETTNSSNVCEIPLHHMGHADNDEYIYLRTRARGRSSSETQSVDIKCNKTFTSSALFTVKLIKLM
- Physico‐chemical
properties -
protein length: 894 AA molecular weight: 93652,74740 Da isoelectric point: 5,64539 aromaticity: 0,05705 hydropathy: -0,60045
Domains
Domains [InterPro]
DC_1362
ATT
15–202
ATT
15–202
Coil
Unmapped
74–101
Unmapped
74–101
1
894
Architecture
ATT 15-202 | STR 270-805 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Vibrio phage Thalassa [NCBI] |
2570301 | Uroviricota > Caudoviricetes > Demerecviridae > Thalassavirus > Thalassavirus thalassa |
| Host |
Vibrio harveyi [NCBI] |
669 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
YP_009621320.1
[NCBI]
Genbank nucleotide accession
NC_042095
[NCBI]
CDS location
range 31652 -> 34336
strand +
strand +
CDS
ATGGCAGATAATATTGAAATTGAACAAAGAGTATTAGACGGGGACTTAATTGGTCCTGTCAATGCTGAAATAGATACTGAAGATTTCACTGGTCGAGTAGCCCTAGAGAACAGCCTACTTGATGGTACAGGCTCTGAAATCTCAATAGAGCTATCTGAAAGTCTTGGCGTTGTTGGTATCGACGACTTAGTAGAGGTTAAGAGTGAGATAGATAAGTCTGTACTCAAGGCTGAGTCAACTGTAGCAGAAGCTCAGAGACAGATTGATAAAGCTAATGATACAATTGCTGCGCTTAACTCTATCAGTTCTCAAATAACCAATTTAGTAGGTGAGGCTAAACAAGCAGCATCTACTGCTACTAATGAAGCTGCTGATGCAGAGAGAAGTGAAAATGCAGCGGCTGGTAGTGCAAAAGCTGCGTCTGACTCTCAAAAAGCAGCAGCAGGTAGTGCTAAGGCTGCATCCGATTCACAAAAAGCAGCCGCTACAAGTCAAGCGCAAGCACTGGCATCACAAAAAGCAGCTAAGACTTCAGAAACTAATGCGAGCGGTTCCGAAGCTAGAACTGCCCAGAGTGAGAAAAATGCAGCTGCCTCAGCAAAGGAAGCTGATACAGATCAAAAAGCAGCAAAAGCAGCGCAAAAAGCAGCAGAAGCTGCAAGAGATGCAGCTAAATCTAGCGAGAATAAGTCTAATACATACGCAGGTAATGCTTATAGTTCTAAAACAGATGCAGAAACCGCTCTAGCGGCAGCGAAGAAATCAGAAGCAGCTGCTGCACAATCAGCTAAAGAAGCGGAAACGTCAGCATCCTCTATGACTCAATCTGTAGCAGATGCTCGTGCCGCCCGAGATGCAGCACAGAACGCTAGAGATCGCTCTGAGTCAGCTAGAGATATTTCAGTTAGTAAAGCCAGCTTAGCTACTACAAAAGCCTCAGAAGCTTCTTCTAGCGCATCGAGTGCTTCTTCTAGTGCAACAGCAGCTAAAACTAGTGAAACAAATGCGCTAAACAGCCAACGTGCAGCTAAGACTTCTGAGACAAATTCAAAAACCTCAGAGACTAATTCAAAAGCTTCTGAAAGAAGTGCTTTGGCCTCGAAAGATTCCGCAGCTAAAAGTGCAGCAGCAGCCAGTACTAGTGCAGCAGCTGCTAAGAAATCTCAAGATGCCGCTAAAGCCTCACGCGACGAGGCAGAACATTTTGCAGAAGAGCTACGCAAAGGCAGTGTATATAGAGGTATTTGGAATCCAAACTCTAATAAGTACCCAGATGTTCCACCAACTAACTCTCGTTGGGACGTGCAGCTTAACAATGGGCAACTAGAGAAAATATTTGATGGTAAAGAATGGAACTGGGGTGACAGACTAATATACGTATTAGAAACCAAATCCTTCGATATAATCGACTCTGGTACAGGTGTTACTAGCATTAATGGCGAAACCGGATCAGTTACTATCAATAGAGATACTCTAGGTGCTCTAGGTAAAACTGAAAAGGCAGCAGATAGTGCAAAACTAGAGGGTAGCACTAAAGCTCAAATTATTGCTAAGGCTCGTGAAGGAGTGGCTTCAGCCGGTACTAGCTATAATAAAGCTGAGTCTGATAGCAAATACCTACCTAAAACAGGTAAGGCAGCAGATTCTGAAAAAGTTGGCGGAGTTGATATTACCAGAATAGTATATGGTACTAATCGTTTTGCAACAATGCAACAAACTAATTTCAATAATAATAGAAAATCTGGTTTTATTGAGATGAGTAATACTATTTCAGCTAGTAAACCCGGTAGAGCATCTTCTTGGGTATGGGGGTGGCAGACAGCTCATACTGGCAATGATACGTCTAATAAGTATGGTGCACAGTTAGTTATAAACCAGGATAGCCAATTATACTATCGCGTTCAGAATAGTGGTGGCGTAGGCACTTGGGCTACTGTATACTCTACAGCTAACAAACCTACGGCTGCTGATGTAGGTGCTGTTCCTAAAACTGGCGGTACATTTACATCTAATGTAACTGTTTCAGCAGGTGGAGCAAATAGAAACGCCCTTGAATTGAATACTGGTAATACTAATGGTGGTGGTATTCATCTTAAGATGACAGACCAGGCTGGTAGCTATACTCAATATGGTACTATAGGGTACTATCACCAAGACGATAATTCTTACGGAGGACAGGCTGCCTTTGTAGTAGGTACTTCTTCAGACACCGAGAAGACCAACTTTGTACTAACCGGTACTGGTAACTTCAAAGTTGGTAACAACGAAGTATACCATGAAGGTAACAAACCAACACTAGAGGATGTTACTAGTATAGCATCTGTTAATAAGAGCCTGAAACTAACAGCCAATACTTGGACTACTCTTCTAACTAGGCCGGATGGCTTATATGAGAGTGGTGTTTATCAAATACTAGTATCCTATAACTCCTCTGACAAGGGTGGTGGTGCATATAGCCAGAATTATACTGGTCAGTTCTATTGGTATGCTGAAACTACAAACTCTAGCAATGTTTGCGAAATACCTTTACACCATATGGGGCATGCAGATAATGACGAATATATTTACCTTAGAACTAGGGCTAGAGGCCGTTCAAGCAGTGAAACTCAGTCAGTAGATATTAAATGTAATAAAACTTTTACTAGTTCGGCTTTATTCACTGTGAAACTAATTAAACTAATGTAA
Genome Context
Genome Context
Tertiary structure
PDB ID
30033e80cf4f18b58d194de319e8f4a8476403e7372d0b340a85f7ae0222073d
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50