Genbank accession
YP_009621320.1 [GenBank]
Protein name
tail protein
RBP type
TF
Evidence RBPdetect
Probability 0,89
Protein sequence
MADNIEIEQRVLDGDLIGPVNAEIDTEDFTGRVALENSLLDGTGSEISIELSESLGVVGIDDLVEVKSEIDKSVLKAESTVAEAQRQIDKANDTIAALNSISSQITNLVGEAKQAASTATNEAADAERSENAAAGSAKAASDSQKAAAGSAKAASDSQKAAATSQAQALASQKAAKTSETNASGSEARTAQSEKNAAASAKEADTDQKAAKAAQKAAEAARDAAKSSENKSNTYAGNAYSSKTDAETALAAAKKSEAAAAQSAKEAETSASSMTQSVADARAARDAAQNARDRSESARDISVSKASLATTKASEASSSASSASSSATAAKTSETNALNSQRAAKTSETNSKTSETNSKASERSALASKDSAAKSAAAASTSAAAAKKSQDAAKASRDEAEHFAEELRKGSVYRGIWNPNSNKYPDVPPTNSRWDVQLNNGQLEKIFDGKEWNWGDRLIYVLETKSFDIIDSGTGVTSINGETGSVTINRDTLGALGKTEKAADSAKLEGSTKAQIIAKAREGVASAGTSYNKAESDSKYLPKTGKAADSEKVGGVDITRIVYGTNRFATMQQTNFNNNRKSGFIEMSNTISASKPGRASSWVWGWQTAHTGNDTSNKYGAQLVINQDSQLYYRVQNSGGVGTWATVYSTANKPTAADVGAVPKTGGTFTSNVTVSAGGANRNALELNTGNTNGGGIHLKMTDQAGSYTQYGTIGYYHQDDNSYGGQAAFVVGTSSDTEKTNFVLTGTGNFKVGNNEVYHEGNKPTLEDVTSIASVNKSLKLTANTWTTLLTRPDGLYESGVYQILVSYNSSDKGGGAYSQNYTGQFYWYAETTNSSNVCEIPLHHMGHADNDEYIYLRTRARGRSSSETQSVDIKCNKTFTSSALFTVKLIKLM
Physico‐chemical
properties
protein length:894 AA
molecular weight: 93652,74740 Da
isoelectric point:5,64539
aromaticity:0,05705
hydropathy:-0,60045

Domains

Domains [InterPro]
DC_1362
ATT
15–202
Coil
Unmapped
74–101
YP_009621320.1
1 894
Architecture
ATT
STR
ATT 15-202 | STR 270-805 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage Thalassa
[NCBI]
2570301 Uroviricota > Caudoviricetes > Demerecviridae > Thalassavirus > Thalassavirus thalassa
Host Vibrio harveyi
[NCBI]
669 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_009621320.1 [NCBI]
Genbank nucleotide accession
NC_042095 [NCBI]
CDS location
range 31652 -> 34336
strand +
CDS
ATGGCAGATAATATTGAAATTGAACAAAGAGTATTAGACGGGGACTTAATTGGTCCTGTCAATGCTGAAATAGATACTGAAGATTTCACTGGTCGAGTAGCCCTAGAGAACAGCCTACTTGATGGTACAGGCTCTGAAATCTCAATAGAGCTATCTGAAAGTCTTGGCGTTGTTGGTATCGACGACTTAGTAGAGGTTAAGAGTGAGATAGATAAGTCTGTACTCAAGGCTGAGTCAACTGTAGCAGAAGCTCAGAGACAGATTGATAAAGCTAATGATACAATTGCTGCGCTTAACTCTATCAGTTCTCAAATAACCAATTTAGTAGGTGAGGCTAAACAAGCAGCATCTACTGCTACTAATGAAGCTGCTGATGCAGAGAGAAGTGAAAATGCAGCGGCTGGTAGTGCAAAAGCTGCGTCTGACTCTCAAAAAGCAGCAGCAGGTAGTGCTAAGGCTGCATCCGATTCACAAAAAGCAGCCGCTACAAGTCAAGCGCAAGCACTGGCATCACAAAAAGCAGCTAAGACTTCAGAAACTAATGCGAGCGGTTCCGAAGCTAGAACTGCCCAGAGTGAGAAAAATGCAGCTGCCTCAGCAAAGGAAGCTGATACAGATCAAAAAGCAGCAAAAGCAGCGCAAAAAGCAGCAGAAGCTGCAAGAGATGCAGCTAAATCTAGCGAGAATAAGTCTAATACATACGCAGGTAATGCTTATAGTTCTAAAACAGATGCAGAAACCGCTCTAGCGGCAGCGAAGAAATCAGAAGCAGCTGCTGCACAATCAGCTAAAGAAGCGGAAACGTCAGCATCCTCTATGACTCAATCTGTAGCAGATGCTCGTGCCGCCCGAGATGCAGCACAGAACGCTAGAGATCGCTCTGAGTCAGCTAGAGATATTTCAGTTAGTAAAGCCAGCTTAGCTACTACAAAAGCCTCAGAAGCTTCTTCTAGCGCATCGAGTGCTTCTTCTAGTGCAACAGCAGCTAAAACTAGTGAAACAAATGCGCTAAACAGCCAACGTGCAGCTAAGACTTCTGAGACAAATTCAAAAACCTCAGAGACTAATTCAAAAGCTTCTGAAAGAAGTGCTTTGGCCTCGAAAGATTCCGCAGCTAAAAGTGCAGCAGCAGCCAGTACTAGTGCAGCAGCTGCTAAGAAATCTCAAGATGCCGCTAAAGCCTCACGCGACGAGGCAGAACATTTTGCAGAAGAGCTACGCAAAGGCAGTGTATATAGAGGTATTTGGAATCCAAACTCTAATAAGTACCCAGATGTTCCACCAACTAACTCTCGTTGGGACGTGCAGCTTAACAATGGGCAACTAGAGAAAATATTTGATGGTAAAGAATGGAACTGGGGTGACAGACTAATATACGTATTAGAAACCAAATCCTTCGATATAATCGACTCTGGTACAGGTGTTACTAGCATTAATGGCGAAACCGGATCAGTTACTATCAATAGAGATACTCTAGGTGCTCTAGGTAAAACTGAAAAGGCAGCAGATAGTGCAAAACTAGAGGGTAGCACTAAAGCTCAAATTATTGCTAAGGCTCGTGAAGGAGTGGCTTCAGCCGGTACTAGCTATAATAAAGCTGAGTCTGATAGCAAATACCTACCTAAAACAGGTAAGGCAGCAGATTCTGAAAAAGTTGGCGGAGTTGATATTACCAGAATAGTATATGGTACTAATCGTTTTGCAACAATGCAACAAACTAATTTCAATAATAATAGAAAATCTGGTTTTATTGAGATGAGTAATACTATTTCAGCTAGTAAACCCGGTAGAGCATCTTCTTGGGTATGGGGGTGGCAGACAGCTCATACTGGCAATGATACGTCTAATAAGTATGGTGCACAGTTAGTTATAAACCAGGATAGCCAATTATACTATCGCGTTCAGAATAGTGGTGGCGTAGGCACTTGGGCTACTGTATACTCTACAGCTAACAAACCTACGGCTGCTGATGTAGGTGCTGTTCCTAAAACTGGCGGTACATTTACATCTAATGTAACTGTTTCAGCAGGTGGAGCAAATAGAAACGCCCTTGAATTGAATACTGGTAATACTAATGGTGGTGGTATTCATCTTAAGATGACAGACCAGGCTGGTAGCTATACTCAATATGGTACTATAGGGTACTATCACCAAGACGATAATTCTTACGGAGGACAGGCTGCCTTTGTAGTAGGTACTTCTTCAGACACCGAGAAGACCAACTTTGTACTAACCGGTACTGGTAACTTCAAAGTTGGTAACAACGAAGTATACCATGAAGGTAACAAACCAACACTAGAGGATGTTACTAGTATAGCATCTGTTAATAAGAGCCTGAAACTAACAGCCAATACTTGGACTACTCTTCTAACTAGGCCGGATGGCTTATATGAGAGTGGTGTTTATCAAATACTAGTATCCTATAACTCCTCTGACAAGGGTGGTGGTGCATATAGCCAGAATTATACTGGTCAGTTCTATTGGTATGCTGAAACTACAAACTCTAGCAATGTTTGCGAAATACCTTTACACCATATGGGGCATGCAGATAATGACGAATATATTTACCTTAGAACTAGGGCTAGAGGCCGTTCAAGCAGTGAAACTCAGTCAGTAGATATTAAATGTAATAAAACTTTTACTAGTTCGGCTTTATTCACTGTGAAACTAATTAAACTAATGTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
30033e80cf4f18b58d194de319e8f4a8476403e7372d0b340a85f7ae0222073d
ColabFold
Source ColabFold
Method ColabFold
Resolution 0,4132
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50