Genbank accession
YP_010078908.1 [GenBank]
Protein name
tail fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TSP
Evidence DepoScope
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,91
TSP
Evidence RBPdetect2
Probability 0,95
Protein sequence
MTNLSGATTAVFIISLSSALEVPVNVAWETKDGTAKAGTDYEAASGSVTFEAGDTQKQIQVTVYGRETGDTATRKFSILLYPPENAILDQTLTDVEIQVTDSDGVAVTSLVVATGPRGLKGDPGLSAYALAKLQGYEGTLEEWLESQNPSAAMLQKTLRVSESSIPMLPSAENRKNRILAFDNNGNPLLLFPESGSAADVLIELAKPKGFSQVGQVDSFTALRSVVPSYEGQSILLRAHPVGWAAMSHGPVGGGEFIARKGAAVDDGGYICVPTGQTEYYWQRIPKKPGKVCATEFGLYDGAGLDDILTRAVGYCIKNSLGYLSIPALGPSGYTLAGGLEFVNGTNGLVIEGPGMATKGVSPVITHTGANIGITFKRNSQAQSLFNPVILKNFTAVGNALATAFVRFSDFYGGSVFDAMIRDYTIGTGIDVYNNKGWTEVIRVDNVIVRTSQKGIWFHSNPESTDDQTLSFYGARVSNFAFQHGIVGASSGIYVGDGTRADNLYNCDIDMMGWAEGGGDSTAIYVANKARVDGAANFRYDGFAASAITSGTQPFRLVKKAGTTGYVKLNCKNYKHQASLELTAGVTELKIRPWLAIAEIIAGVATPHPTLPGESIVNVPGMKCKLVGKLFKGQNSVVSVVGLPPFHRYKVTTRCNLSSTAQQQYIVNVPSDNNGGITTRTDSVPAVTTNTNTTISGTTATSTSTSMAKNKNFEPIYISNAGSLVDNAYSVTNRQGFDIHLDGTQPNVVNDEYPVSIEIEAID
Physico‐chemical
properties
protein length:762 AA
molecular weight: 80933,08250 Da
isoelectric point:5,56121
aromaticity:0,08399
hydropathy:-0,09528

Domains

Domains [InterPro]
IPR038081
ATT
2–115
IPR038081
ATT
6–105
IPR038081
ATT
6–90
IPR003644
ATT
8–85
YP_010078908.1
1 762
Architecture
ATT
ATT
STR
ATT 2-115 | ATT 192-325 | STR 326-759 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_010078908.1
1 762
Domain Start End Length (AA) Confidence
N-terminal 1 303 303 0,9946
Central domain 304 597 295 0,9863
C-terminal 598 762 164 0,9582
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-303
Central
304-597
C-terminal
598-762

Taxonomy

  Name Taxonomy ID Lineage
Phage Klebsiella phage SopranoGao
[NCBI]
2026944 Uroviricota > Caudoviricetes > Lastavirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_010078908.1 [NCBI]
Genbank nucleotide accession
NC_054966.1 [NCBI]
CDS location
range 25390 -> 27678
strand -
CDS
ATGACCAACTTATCCGGCGCTACTACCGCTGTGTTTATCATTAGCCTTTCATCTGCACTGGAAGTGCCTGTTAATGTCGCTTGGGAAACGAAAGATGGCACCGCTAAAGCAGGAACAGATTACGAGGCTGCAAGCGGATCTGTGACCTTCGAAGCTGGCGATACGCAGAAGCAAATTCAGGTTACTGTCTATGGGCGTGAGACTGGGGATACCGCCACTCGCAAATTCAGCATCCTTCTTTATCCGCCAGAAAACGCAATTCTCGACCAGACGCTCACCGATGTAGAAATTCAGGTTACTGATAGCGATGGCGTGGCTGTTACGTCGCTGGTGGTCGCAACTGGCCCGCGAGGATTGAAAGGCGATCCAGGACTGAGTGCCTATGCCTTGGCAAAGTTGCAGGGTTACGAAGGAACGCTGGAAGAATGGCTTGAAAGCCAAAACCCATCGGCCGCGATGCTTCAAAAAACGCTCAGAGTTTCTGAATCAAGTATTCCGATGCTTCCTAGTGCAGAAAATAGAAAGAACCGGATTCTTGCATTTGATAACAATGGAAATCCTCTGTTGTTATTTCCGGAATCAGGTTCAGCTGCAGATGTATTAATTGAACTGGCGAAACCAAAAGGGTTTAGCCAGGTTGGTCAGGTAGACTCATTTACTGCTCTGCGCTCTGTTGTTCCTTCATATGAAGGTCAAAGTATTTTATTGCGCGCCCATCCTGTCGGTTGGGCTGCAATGTCTCACGGTCCCGTTGGTGGCGGAGAATTTATTGCCAGAAAAGGCGCGGCTGTAGATGACGGCGGTTATATTTGTGTGCCTACAGGTCAGACAGAGTATTACTGGCAACGCATCCCAAAAAAACCCGGTAAGGTATGTGCGACGGAGTTTGGGTTGTATGACGGAGCTGGGTTGGACGACATCTTAACTAGGGCCGTTGGGTACTGTATTAAAAACTCACTGGGATATCTTTCGATTCCAGCGCTTGGACCATCAGGATATACACTTGCTGGCGGACTTGAGTTCGTCAATGGAACTAACGGACTCGTGATCGAAGGCCCGGGAATGGCCACCAAAGGAGTCAGCCCGGTAATAACCCACACTGGGGCAAATATTGGGATTACATTCAAACGTAACTCGCAGGCACAAAGTCTGTTTAATCCGGTTATTCTTAAGAATTTCACCGCAGTTGGGAATGCGCTTGCTACTGCGTTTGTCAGGTTCTCAGATTTTTATGGCGGGTCGGTATTCGACGCCATGATTCGGGATTACACCATCGGAACGGGTATAGATGTTTACAACAACAAAGGCTGGACTGAAGTTATCAGGGTGGACAATGTCATAGTAAGGACGTCACAGAAAGGAATCTGGTTCCACTCAAACCCCGAATCCACTGATGATCAAACCCTGTCTTTTTACGGCGCCCGTGTTTCCAACTTCGCCTTCCAGCATGGTATCGTCGGAGCCTCATCCGGAATCTATGTAGGCGATGGTACGCGTGCAGATAACTTATATAACTGCGACATCGACATGATGGGGTGGGCGGAAGGCGGCGGCGATAGCACAGCGATTTACGTAGCCAATAAAGCGCGTGTCGATGGTGCAGCCAACTTCCGATACGACGGTTTTGCTGCAAGTGCGATTACATCGGGGACTCAGCCTTTCCGTCTTGTGAAGAAGGCTGGTACGACCGGTTATGTTAAGCTCAACTGTAAAAACTATAAGCATCAAGCCAGCCTTGAATTAACTGCTGGAGTTACCGAGTTAAAAATTCGTCCGTGGTTAGCTATTGCTGAAATAATAGCTGGTGTGGCAACTCCACATCCAACTCTCCCTGGCGAAAGCATAGTCAATGTACCAGGAATGAAATGCAAGTTGGTCGGGAAGTTGTTTAAAGGCCAGAACTCGGTTGTTTCTGTCGTTGGGTTGCCGCCGTTCCACCGGTACAAGGTAACTACTCGTTGTAACTTATCAAGCACCGCCCAACAGCAGTACATTGTTAATGTACCGAGTGACAATAACGGCGGCATCACCACCCGTACTGATTCGGTTCCAGCTGTAACTACCAACACTAACACCACAATAAGTGGCACCACCGCAACAAGTACATCAACCTCAATGGCTAAGAATAAGAACTTTGAGCCTATTTACATTTCTAATGCGGGCAGTCTGGTTGATAACGCGTATAGTGTCACGAACAGACAAGGTTTTGATATTCACCTAGATGGTACGCAGCCTAACGTCGTTAATGACGAGTATCCGGTTTCGATTGAGATTGAAGCTATTGATTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
dfa235a7594ba38b7c8fb03e73b8125b216551020146d5db2845513d9661ed90
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7233
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50