Protein
- Genbank accession
- WWA79516.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MRGLVNGYRSVYLQGTPLQNSDGTFNFQNVAVETRSGTQDQSYIPGFPSVENEVIVGVELRADTPIVRTVSGSDLSAVRVRVAVDSLQKVDTSNGDTVGYSVSYAIEVATDGGAYTTVLNSAFTGKTTTQYERSHRIDLPPGSQWQIRIRRLTPNANSSTVADTTRVQSITEIIDAKLRYPNSALLAVSVDAQQFQSLPTIACNVYGRIIKVPANYDPELRTYATTGPGTTNGVWNGTFKSAWTNNPAWVFYDIVTNDRFGLGNNIPAAWVDAWNLYQIAQYCDEMVSDGFGGQEPRFTCNVYMQTRADAYKVLQDLAAVFRGVSYYAVGQMIASADMPKQVSVGYTNADVVDGRFEYESSARKVRHTVALVSWSDQTDMGRQKVERVEFRPGLIRYGIQETEVTAIGCTSRGQAQRIGNHILVSENLERETIAFKVGLEGVLVSPGDVFNVSNRNRAGRRMGGRIGDFTADSVTVDQIPDDIIAGDTIKVMLSTGKIEARTIQSVSGRIVTVTVDWSASPAKQGVFIFEKTELVAETFRCMGVADNEDGTYTISGLSYRADKFAYIDDGTRLEQPPISVIPPSVQAPPANVALTSFSVIDQAIARTDVTIQWDAAPNAIEYEVEWRRDDMDWVRMGRVSSTSVDIRGAYSGQYLARVRAVNALGARSIPATSMLTQIDGKTTPPPSITSLTTTSIVFAIGLRWGFPQGATDTERTEIWYSQGPDRASAIKLGDFAYPQDRHQINGLAAGAKFYFWGRLIDRSGNIGPWYPTDTGVVGESSIDQTEYDAYFSGRITESALGQDLLSKIDSIDNIVPLIWDANATYEEGQTVIYDGRIYSWTNTTPGNNQPPSADWQDVGEAIAQAGAIVGRVDQLELDVSEIEGQVVAQGQKVDGLFASIDTQFTGDEDDYTADNDVFAGTTTIQTVIATQDYALGKRVETVQASVGDVSDELNYVSASVQETSQALVDLDGQISASWTLKLQIAANGQYYAAGMGVGIENQPDGSFQSQILMQADRFAVINVANGQVTSPFVIQGGQTFISQALIGTSWITNAMIANAAITNAKISGTIQSDDYVSGQTGWRINKAAGGGFEFNGSIAGGYRLNITNQGVYIYYPNGNPAVELGVLL
- Physico-chemical properties
- protein length: 1128 AA
- molecular weight: 122430.71450 Da
- isoelectric point: 4.61330
- aromaticity: 0.09131
- hydropathy: -0.20683
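The aromaticity and hydropathy values above can be recomputed from the sequence. The following is a minimal sketch, assuming aromaticity means the fraction of F, W, and Y residues and hydropathy means the mean Kyte-Doolittle score (GRAVY); the page does not state which definitions it used, and the function names here are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (published per-residue values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of residues that are aromatic (F, W, or Y). Assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues. Assumed definition."""
    return sum(KD[aa] for aa in seq) / len(seq)


# First 25 residues of the protein sequence listed above, used as a short demo.
toy = "MRGLVNGYRSVYLQGTPLQNSDGTF"
print(len(toy), round(aromaticity(toy), 5), round(gravy(toy), 5))
```

Running the same two functions on the full 1128-residue sequence would show whether these assumed definitions reproduce the listed values.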
Domains
Domains [InterPro]
| Accession | Type | Range |
|---|---|---|
| IPR053171 | Unmapped | 2–907 |
| DC_0014 | STR | 3–1127 |
| IPR055385 | ATT | 50–176 |
| IPR003961 | STR | 588–686 |
| IPR003961 | STR | 588–673 |
| IPR036116 | STR | 602–674 |

Sequence axis: 1–1128
Architecture
STR 3-49 | ATT 50-176 | STR 177-306 | ATT 307-466 | STR 467-815 | ATT 816-861 | STR 862-1127 |
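The architecture line is a pipe-separated list of `TYPE start-end` segments. As a minimal sketch (the function names are illustrative, not part of the source page), it can be parsed and checked for gaps or overlaps like this:

```python
import re

ARCH = ("STR 3-49 | ATT 50-176 | STR 177-306 | ATT 307-466 | "
        "STR 467-815 | ATT 816-861 | STR 862-1127 |")


def parse_architecture(text):
    """Return a list of (type, start, end) tuples from an architecture string."""
    return [
        (label, int(start), int(end))
        for label, start, end in re.findall(r"(\w+)\s+(\d+)-(\d+)", text)
    ]


def is_contiguous(segments):
    """True if each segment starts one residue after the previous one ends."""
    return all(prev[2] + 1 == cur[1] for prev, cur in zip(segments, segments[1:]))


segments = parse_architecture(ARCH)
for label, start, end in segments:
    print(label, start, end, end - start + 1)
print("contiguous:", is_contiguous(segments))
```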
Legend: ATT, STR, RBD, CBM, LEC, ENZ, CHP, LNK, TAS, TTP, UNK, Unmapped
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout (residues 1–1128)
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 466 | 466 | 0.8304 |
| Central domain | 467 | 1067 | 602 | 0.0421 |
| C-terminal | 1068 | 1128 | 60 | 0.9470 |
Note: Constraints were applied during segmentation.
Fixed 70 C-terminal predictions appearing before Central domain
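A quick consistency check on the segmentation table is to compare each listed length with the inclusive span `end - start + 1`. In the sketch below the row values are copied from the table; the helper names are illustrative. With the boundaries as listed, the central and C-terminal lengths each differ from the inclusive span by one residue, while the listed lengths still sum to 1128. The page does not say which convention is intended, so this only flags the difference.

```python
# (domain, start, end, listed length) copied from the segmentation table.
ROWS = [
    ("N-terminal", 1, 466, 466),
    ("Central domain", 467, 1067, 602),
    ("C-terminal", 1068, 1128, 60),
]


def inclusive_length(start, end):
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def check_rows(rows):
    """Return (domain, listed, computed) for rows whose lengths disagree."""
    return [
        (name, listed, inclusive_length(start, end))
        for name, start, end, listed in rows
        if inclusive_length(start, end) != listed
    ]


mismatches = check_rows(ROWS)
for name, listed, computed in mismatches:
    print(f"{name}: listed {listed}, inclusive span {computed}")
```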
Legend: N-terminal, Central domain, C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
- N-terminal: 1-466
- Central: 467-1067
- C-terminal: 1068-1128
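The coloring scheme amounts to a lookup from residue number to domain. A minimal sketch, assuming 1-based residue numbering and using the boundaries and colors stated above (the function name is hypothetical):

```python
from bisect import bisect_right

# Domain start positions and (name, color) pairs, taken from the page.
STARTS = [1, 467, 1068]
DOMAINS = [("N-terminal", "blue"), ("Central", "green"), ("C-terminal", "pink")]
PROTEIN_LENGTH = 1128


def domain_of(residue):
    """Return (domain name, color) for a 1-based residue number."""
    if not 1 <= residue <= PROTEIN_LENGTH:
        raise ValueError(f"residue {residue} outside 1-{PROTEIN_LENGTH}")
    # bisect_right finds how many domain starts are <= residue.
    return DOMAINS[bisect_right(STARTS, residue) - 1]


for pos in (1, 466, 467, 1067, 1068, 1128):
    print(pos, domain_of(pos))
```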
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Xanthomonas phage Kintu [NCBI] | 3120010 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host | Xanthomonas vasicola pv. musacearum [NCBI] | 454958 | Pseudomonadota > Gammaproteobacteria > Lysobacterales > Lysobacteraceae > Xanthomonas > Xanthomonas vasicola |
Coding sequence (CDS)
- Genbank protein accession: WWA79516.1 [NCBI]
- Genbank nucleotide accession: PP313117.1 [NCBI]
- CDS location: range 19453 -> 22839, strand +
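The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive and that the CDS ends with a single stop codon (helper names are illustrative):

```python
def cds_length(start, end):
    """Nucleotide count of a 1-based, inclusive coordinate range."""
    return end - start + 1


def encoded_residues(start, end):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n // 3 - 1  # drop the stop codon


print(cds_length(19453, 22839), encoded_residues(19453, 22839))
```

Under these assumptions the range spans 3387 nucleotides, which is 1129 codons, consistent with the 1128-residue protein plus a stop codon.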
CDS
ATGCGTGGGCTTGTCAACGGCTATAGGTCCGTATACCTGCAAGGCACTCCGCTACAGAATTCAGACGGCACATTCAACTTCCAGAATGTAGCTGTTGAGACGCGCTCAGGCACTCAGGATCAGTCTTATATCCCTGGCTTCCCAAGTGTGGAGAATGAGGTAATCGTCGGGGTAGAGCTTCGCGCTGATACCCCTATTGTCCGCACCGTATCGGGATCGGACCTTTCTGCCGTTCGTGTCCGCGTGGCGGTAGATTCTTTGCAGAAAGTTGATACCAGCAACGGTGACACGGTTGGCTATAGTGTCTCCTATGCTATTGAAGTGGCTACAGATGGTGGTGCATACACCACCGTATTGAATAGCGCCTTCACTGGCAAGACAACCACTCAGTATGAGCGCAGCCATCGTATTGACCTTCCCCCCGGCAGTCAGTGGCAAATCCGCATCCGCCGCCTGACGCCTAATGCGAACTCTTCTACGGTTGCTGATACTACTCGCGTCCAGTCGATCACTGAGATTATCGACGCCAAGCTGAGGTATCCGAATAGCGCACTCTTGGCTGTGTCTGTTGATGCTCAGCAGTTCCAGAGTCTTCCTACGATTGCCTGCAACGTATATGGCCGAATCATCAAGGTTCCGGCTAACTACGATCCCGAGCTAAGGACATACGCCACTACTGGCCCTGGCACTACTAACGGCGTCTGGAACGGCACATTCAAATCAGCATGGACTAATAACCCAGCATGGGTATTCTATGACATTGTTACCAATGACCGATTTGGCCTAGGCAACAACATTCCTGCTGCATGGGTAGATGCGTGGAACCTGTATCAGATCGCGCAGTATTGCGATGAGATGGTTTCGGATGGCTTTGGTGGACAGGAGCCTCGCTTTACCTGCAACGTCTACATGCAGACCCGCGCTGACGCCTATAAGGTGCTTCAGGACTTGGCTGCGGTTTTCCGTGGAGTAAGCTACTACGCCGTAGGGCAGATGATCGCATCCGCCGATATGCCTAAGCAGGTTTCGGTTGGATATACGAATGCAGATGTTGTTGATGGCCGTTTTGAATACGAAAGCTCAGCCCGTAAGGTCCGCCATACCGTTGCGCTTGTGTCGTGGTCAGATCAGACAGATATGGGGCGTCAGAAGGTTGAGCGCGTTGAATTTCGACCTGGCCTGATCCGCTATGGCATACAAGAGACTGAAGTAACCGCCATTGGCTGTACGTCTCGCGGGCAGGCTCAGCGCATTGGTAATCACATTCTAGTATCTGAGAACCTTGAGCGCGAAACGATTGCGTTTAAGGTTGGCCTAGAAGGTGTGCTTGTATCGCCTGGTGATGTTTTCAACGTCTCCAATAGAAACCGCGCCGGTCGCCGCATGGGTGGCCGTATCGGCGATTTCACTGCCGATAGCGTAACTGTCGATCAAATCCCTGATGACATTATTGCAGGTGATACGATTAAGGTAATGCTATCGACTGGAAAGATCGAAGCTAGGACTATTCAATCTGTGTCAGGTCGCATTGTCACTGTTACTGTAGATTGGTCAGCAAGTCCTGCTAAGCAAGGTGTTTTCATCTTTGAAAAGACTGAGCTTGTGGCGGAAACATTCCGCTGTATGGGCGTGGCTGATAATGAGGACGGAACCTATACGATTTCCGGACTATCCTATCGAGCAGACAAGTTTGCCTATATCGATGATGGCACACGTCTAGAGCAGCCGCCTATCAGCGTCATCCCGCCTAGTGTCCAGGCTCCTCCTGCGAACGTTGCGCTTACTTCATTCTCTGTTATCGACCAAGCGATTGCCCGCACTGATGTTACTATCCAGTGGGATGCTGCGCCTAACGCCATCGAGTATGAAGTTGAGTGGCGTCGAGATGATATGGATTGGGTGCGCATGGGCAGGGTTAGCTCTACTAGCGTAGACATTCGCGGAGCTTACTCAGGCCAATACCTTGCGCGTGTGCGTGCAGTCAATGCGCTTGGGGCCAG
GTCGATCCCAGCGACTAGTATGTTGACTCAGATCGATGGCAAGACTACCCCGCCGCCGTCCATTACCTCGCTGACAACCACTAGCATTGTGTTTGCCATTGGCCTGCGCTGGGGCTTCCCGCAAGGTGCAACAGACACAGAGCGCACTGAGATTTGGTATTCGCAGGGTCCAGATAGGGCTAGCGCGATCAAGCTTGGTGACTTTGCCTATCCGCAGGATCGTCATCAGATCAACGGGCTTGCTGCTGGTGCTAAGTTCTATTTTTGGGGTCGCTTGATTGATCGCAGCGGGAATATAGGCCCTTGGTATCCTACGGATACGGGCGTTGTCGGTGAATCTAGCATCGATCAAACTGAATATGATGCTTACTTCTCTGGCCGCATTACTGAAAGCGCGCTTGGTCAGGATTTGCTGTCTAAGATCGACAGTATCGACAACATCGTACCTTTGATTTGGGACGCTAATGCAACCTATGAGGAAGGCCAGACAGTAATTTACGATGGCAGAATCTATAGTTGGACCAATACAACTCCTGGCAACAATCAGCCCCCTAGTGCTGACTGGCAGGACGTTGGTGAGGCTATTGCGCAGGCTGGCGCTATCGTTGGGCGAGTGGACCAGCTTGAACTTGACGTGTCTGAGATTGAGGGCCAAGTAGTTGCTCAGGGCCAAAAGGTAGATGGCCTATTTGCTTCTATCGACACTCAATTCACTGGCGACGAAGACGACTACACTGCTGATAATGATGTATTTGCTGGCACTACTACGATTCAGACAGTAATCGCTACCCAGGACTATGCGCTAGGAAAGCGAGTAGAAACTGTGCAAGCCAGCGTTGGCGATGTTTCGGATGAGCTAAATTATGTATCTGCTAGTGTCCAGGAGACTTCTCAAGCTTTGGTTGATCTTGATGGGCAGATCAGCGCATCGTGGACACTGAAGCTACAGATTGCCGCAAATGGGCAGTATTACGCCGCTGGTATGGGTGTTGGCATCGAGAACCAGCCTGACGGCAGTTTCCAGTCTCAGATTCTCATGCAGGCTGATAGGTTCGCTGTAATTAATGTTGCCAATGGGCAGGTAACTAGTCCGTTTGTCATTCAAGGCGGTCAGACATTCATCAGTCAGGCGCTAATCGGCACTAGCTGGATTACCAACGCCATGATTGCCAATGCGGCGATCACCAATGCCAAGATCAGCGGTACAATTCAATCCGATGACTATGTTTCGGGCCAGACGGGATGGAGAATTAACAAGGCTGCTGGCGGTGGGTTTGAGTTTAACGGCTCTATCGCTGGCGGATACCGCTTGAACATCACCAACCAAGGCGTATATATCTACTACCCTAATGGTAATCCGGCTGTTGAATTGGGAGTGCTGCTGTAA
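The relationship between the CDS and the protein sequence can be illustrated by translating codons with the standard genetic code. This is a minimal sketch using only the first 24 and last 15 nucleotides of the CDS above; it assumes the standard codon assignments and is not a statement of how the page derived its protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


head = translate("ATGCGTGGGCTTGTCAACGGCTAT")  # first 8 codons of the CDS
tail = translate("GGAGTGCTGCTGTAA")           # last 5 codons, including stop
print(head, tail)
```

The two fragments translate to `MRGLVNGY` and `GVLL*`, matching the start and end of the protein sequence listed earlier.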
Genome Context
Tertiary structure
PDB ID
c3a539f4aee8be88b0688ba5e1f6f3d527b8167f56377bf97678d7c2cb602040
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
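The confidence bands can be expressed as a small classifier. This is a sketch only: the legend uses strict inequalities and does not say where values of exactly 90, 70, or 50 belong, so assigning them to the higher band is an assumption made here.

```python
def plddt_band(plddt):
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: exact boundary values (90, 70, 50) go to the higher band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt >= 90:
        return "Very high"
    if plddt >= 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.1):
    print(score, plddt_band(score))
```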