Protein
View in Explore- Genbank accession
- UNA07266.1 [GenBank]
- Protein name
- central tail fiber J
- RBP type
-
TFTSPTF
- Protein sequence
-
MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPIEGPVDGLKSVLLNSTPVLDTEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVMGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKTWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTALISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWELKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGEQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVTADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRITDIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVELTEDNASRLEEFSKEWKDANDKWNAMWAVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAANRIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGSPPVFSLTSDGKLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPDGTVTVTISDDHNFDRQIIIPPIIFNGIAYDNPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGGGSVTLEFKVFHKGNQWAGNITDCTVIVTKKAASGISIR
- Physico‐chemical
properties -
protein length: 1137 AA molecular weight: 124680,45070 Da isoelectric point: 5,72718 aromaticity: 0,08531 hydropathy: -0,34723
Domains
Domains [InterPro]
IPR053171
Unmapped
1–840
Unmapped
1–840
DC_0014
STR
1–1134
STR
1–1134
IPR055385
ATT
86–207
ATT
86–207
IPR055383
STR
610–714
STR
610–714
IPR036116
STR
617–719
STR
617–719
IPR003961
STR
618–701
STR
618–701
IPR003961
STR
618–710
STR
618–710
IPR003961
STR
620–715
STR
620–715
1
1137
Architecture
STR 1-85 | ATT 86-207 | STR 208-330 | ATT 331-498 | STR 499-715 | ATT 716-818 | STR 819-1136 |
Legend:
ATT
STR
RBD
CBM
LEC
ENZ
CHP
LNK
TAS
TTP
UNK
Unmapped
Tail Spike Domain Segmentation
Tail Spike Domain Segmentation
This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.
Domain Layout
1
1137
| Domain | Start | End | Length (AA) | Confidence |
|---|---|---|---|---|
| N-terminal | 1 | 995 | 995 | 0,8916 |
| Central domain | 996 | 1126 | 132 | 0,1516 |
| C-terminal | 1127 | 1137 | 10 | 0,9806 |
Note: Constraints were applied during segmentation.
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Sequence started with non-N-terminal domain|C-terminal too short, adjusted boundary
Legend:
N-terminal
Central domain
C-terminal
3D Structure with Domain Coloring
The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).
Domain Coloring
N-terminal
1-995
1-995
Central
996-1126
996-1126
C-terminal
1127-1137
1127-1137
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Escherichia phage Lambda h434 imm21 [NCBI] |
2918896 | Viruses > Duplodnaviria > Heunggongvirae > Uroviricota > Caudoviricetes |
| Host |
Escherichia coli [NCBI] |
562 | cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Enterobacterales |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
UNA07266.1
[NCBI]
Genbank nucleotide accession
OM418627.1
[NCBI]
CDS location
range 15505 -> 18918
strand +
strand +
CDS
ATGGGTAAAGGAAGCAGTAAGGGGCATACCCCGCGCGAAGCGAAGGACAACCTGAAGTCCACGCAGTTGCTGAGTGTGATCGATGCCATCAGCGAAGGGCCGATTGAAGGTCCGGTGGATGGCTTAAAAAGCGTGCTGCTGAACAGTACGCCGGTGCTGGACACTGAGGGGAATACCAACATATCCGGTGTCACGGTGGTGTTCCGGGCTGGTGAGCAGGAGCAGACTCCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACGGAAGTGAAATATGACACGCCGATCACCCGCACCATTACGTCTGCAAACATCGACCGTCTGCGCTTTACCTTCGGTGTACAGGCACTGGTGGAAACCACCTCAAAGGGTGACAGGAATCCGTCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGATGGGTAACCTGCCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACTGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTGCGCGGGCGTATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAACCGGCATACAGCAACAACATGGCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGTGCGGCGGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGTACTGCGACCAGTCAGTGCCGGACGGCTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAGCGTAAGGCGTGGGATGTGCTCAGCGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGACGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAACAACGGCTGGGAGACGGCGACAGAGCTTGTTGAAGATACGCAGGCCATTGCCCGTTACGGTCGTAATGTTACGAAGATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATTAAAACAGAACTGCTGGAAACGCAGACCGTGGATTTCAGCGTCGGCGCAGAAGGGCTTCGCCATGTACCGGGCGATGTTATTGAAATCTGCGATGATGACTATGCCGGTATCAGCACCGGTGGTCGTGTGCTGGCGGTGAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCCTCCGGTACCGCGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGTGTTGCTGAATACAGCGTATGGGAGCTGAAGCTGCCGACGCTGCGCCAGCGACTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTGCCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGAACAGAGTGGCACGGTGAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACTGCAGACAGCGGGGAATATCAGGTGCTGGCGCGATGGGACACACCGAAGGTGGTGAAGGGCGTGAGTTTCCTGCTCCGTCTGACCGTAACAGCGGACGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTCCGGGCGGTAAATGCGTGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATCGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCCGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCGGATTACCGATATCAGGCAGGTTGAAACCACAGCCCGCTATCTTGGTACGGCGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAAGGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTGGAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAGACTGGAGGAGTTTTCGAAAGAGTGGAAGGACGCCAACGATAAGTGGAATGCCATGTGGGCTGTCAAAATTGAGCAGACCAAAGACGGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAGGAAGGCAAACTGAGTCAGTTTCTGGTTGCCGCTAACCGTATCGCATTTATTGACCCGGCAAACGGGAATGAAACGCCGATGTTTGTGGCGCAGGGCAATCAGATATTCATGAACGACGTGTTCCTGAAGCGCCTGACGGCCCCCACCATTACCAGTGGTGGCAGTCCTCCGGTATTTTCCCTGACATCAGACGGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGGGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAGACGGTACGGTTACAGTCACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGTATGACAATCCGGGGAGCGGAAATAACCCGGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCTAGTGGTGGTGGTAGCGTCACTCTGGAGTTTAAGGTTTTCCATAAAGGCAATCAGTGGGCAGGTAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCCGCTTCCGGCATCAGTATTCGTTGA
Genome Context
Genome Context
Tertiary structure
PDB ID
38d38f4cfd16c46977c86a0ae2ee001c33aa8f5f2f5355e32e795e98f4c576ab
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
Literature
| Title | Authors | Date | PMID | Source |
|---|---|---|---|---|
| The complete genome sequences of lambdoid phages 21, 434, 434B, and several lambda hybrids | Feiss,M., Adhya,S., Georgopoulos,C.P., Hendrix,R.W., Hatfull,G.F., Gilcrease,E.B., Casjens,S.R., Ramsey,J. and Young,R.F. | 2022-05-19 | — | GenBank |