GenBank accession: AYR01830.1 [GenBank]
Protein name: structural protein
RBP type: TSP
Evidence: DepoScope
Probability: 1.00
Protein sequence
MARLVPGSGAVIEPIFDDIFGVRAVRVVNGGSGYDPADPPRLTIDGCGTPDQEAILYPIIAEGSGKIVHVRVLRRGRGYDPLRVEIVPQQETPNVVRSFDINRIWQRHPNSLTRGTFTDDRLRIESDNHPKPTWTQAEAAPGGGPLVDRSFDQTFVYRGGKDVPNFGTRLAQEDKVTGILSNGGLLHTPDWASDGGAPGTFSIDTVKYDYVKNADVYDAITESNIRYYSSSKTIDEFALENGVFQWGKLEQFTWNVKTELDNLLLFIDPASLDQTLGTIEVGRIITQIGGNARGEIAKVITDNNGLPTRIYIREVQSTFASGDKILGSNGFSFTIQSAPITFPTGIFYIDFGSEASEFGPFVPGTYYMAPKNILVKKNYLIIWNQSDSSNQNHPMRFSTTPDGPLNQSSPGTILYTSSGSSSAPAADYENEYQALFLMNEDETNRIYYHCAIHNYMSGYTGDEGYMILDTSTDDDDDVNMNTYYIEDFYQPGDTSTIDRSRHVDGHSKILGMSFDGYPIYGPWGYNSSGAVAREVSSYRLRTGNEVAGNREEIVTPSTVTYAITVANGQFLVDGSVVPFLNLKRGKTYVFNQDDSSNDANHLFISTTEDGWHVGAPPVIGDTTYLYSQPHFATYYIDGSQVTYTQYLSQFTTASQREMRFFVPVDAPNNLYTFAYSTSGLGFRLTQDGYVLGDFVEDYVYDSSVGTLDEFNGKFAVTPEYPNGTYAYFMTEDSSGNPAYPYAIGPKYYGVPLFEGDTVPQKPDIFPTRAEGEVALNPDGTIAYVNVTQQGDNYFGPTTARILGGEGSGALVNPVVQTVTGLTLLNPGQGYTVAPNLQFSGGGGQDAEGAAEVSPTGKVTSISINDPGEFYQEPPYILITGGGGSGARATAEVNQGQISAINITDQGAGYTSNPQVIFTKLVNLKRKTQARQSLNSDIRYLTGLVKNVTASDTNIYVDDTSAFPGSGSFIINKETVSYTSKTSGKFTGLTRGTNFNYDQRVIVDNSQLDDDGNSTYKFNVGDVVIRKVESASNKLARVYDWNPATRELLVTFEVDELAFIDAGIPSTEDAIVQFDGGVYSSSASSQLPHVVLTSQGNSITLLTEPITTLANSAFEDDDELDGVGDGIADLVNTGTQYEGQISLDGGIILGEPGETGRDSKFGIEETVGGQNTTLFQNGDQIKDASIPFKFSTITTAGGLSEGVEHIGLITLQLDPNNANGGNFSVNEVITGQVSGVQATVVSWDPTTSKLTIKDTVPFNTGDSNKGENGFLYEFSHNSTVVDIIVQNPGTNYTLAPNVAIENIGDIEATGTAVLTGAGDQVASVTITNGGYGITQSVDSGYNLHPTITFSAASGDTTGSGAAAYAILGGEDILGTGGSRYRIKGIDYQTIIRS
Physico-chemical properties
protein length: 1392 AA
molecular weight: 150145.42040 Da
isoelectric point: 4.43739
aromaticity: 0.10201
hydropathy: -0.31006
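
The aromaticity and hydropathy values can be approximated from the sequence alone. The following is a minimal sketch, assuming aromaticity is the fraction of F, W and Y residues and hydropathy is the mean Kyte-Doolittle score (GRAVY). The page does not state which definitions it uses, so both are assumptions, and the function names are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    peptide = "MARLVPGSGA"  # first ten residues of the protein sequence above
    print(len(peptide), aromaticity(peptide), round(gravy(peptide), 3))
```

Passing the full 1392-residue sequence instead of the short peptide should give values comparable to those listed, provided the page uses the same definitions.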

Domains

Domains [InterPro]
DC_0832 ATT 80–1029
AYR01830.1 1–1392
Architecture
ATT 3-1390
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central, and C-terminal.

Domain          Start  End   Length (AA)  Confidence
N-terminal      1      117   117          0.9854
Central domain  118    980   863          0.2798
C-terminal      981    1392  412          0.0273
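
Segment lengths follow from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state explicitly, and checks that the three segments tile the full protein without gaps or overlaps. The helper names are illustrative.

```python
from typing import List, Tuple

Segment = Tuple[str, int, int]  # (name, start, end), assumed 1-based inclusive

SEGMENTS: List[Segment] = [
    ("N-terminal", 1, 117),
    ("Central domain", 118, 980),
    ("C-terminal", 981, 1392),
]


def segment_length(start: int, end: int) -> int:
    """Number of residues in an inclusive [start, end] range."""
    return end - start + 1


def tiles_protein(segments: List[Segment], protein_length: int) -> bool:
    """True if the segments cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(segments, key=lambda s: s[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


if __name__ == "__main__":
    for name, start, end in SEGMENTS:
        print(name, segment_length(start, end))
    print(tiles_protein(SEGMENTS, 1392))
```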
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal: 1-117
Central: 118-980
C-terminal: 981-1392

Taxonomy

Phage
Name: Synechococcus phage S-P4 [NCBI]
Taxonomy ID: 2484640
Lineage: Uroviricota > Caudoviricetes > Pantevenvirales > Leucotheavirus > Leucotheavirus sp4
Host
Name: Synechococcus sp. WH 7803 [NCBI]
Taxonomy ID: 32051
Lineage: Bacteria > Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

GenBank protein accession: AYR01830.1 [NCBI]
GenBank nucleotide accession: MH920639 [NCBI]
CDS location: range 40814 -> 44992, strand +
CDS
ATGGCAAGACTAGTTCCTGGATCTGGTGCCGTAATTGAACCGATTTTTGATGATATATTTGGTGTTCGAGCGGTAAGAGTAGTAAATGGTGGTAGCGGATATGATCCTGCTGATCCACCTAGACTTACTATTGATGGATGCGGCACTCCTGATCAGGAGGCGATATTATATCCAATCATTGCAGAAGGTTCTGGTAAAATTGTTCACGTTCGTGTTCTCAGAAGAGGAAGAGGATACGATCCACTTCGTGTTGAAATTGTTCCTCAACAAGAAACTCCAAATGTTGTAAGATCTTTTGATATTAATAGGATCTGGCAACGTCACCCCAACTCATTAACTAGGGGAACTTTTACAGATGATAGACTGAGAATTGAGTCTGATAATCATCCAAAACCTACATGGACTCAAGCGGAAGCAGCACCTGGTGGTGGTCCATTAGTAGATAGATCTTTTGATCAGACCTTTGTATACAGAGGTGGTAAAGATGTACCTAATTTTGGTACGAGACTTGCACAAGAAGATAAAGTAACTGGCATTCTTTCCAATGGCGGTCTTTTACATACTCCAGACTGGGCATCTGATGGTGGAGCACCAGGTACTTTCTCTATTGATACTGTAAAGTACGATTATGTAAAAAATGCAGATGTATATGATGCTATTACCGAAAGTAATATTAGATATTATTCATCATCCAAAACAATTGATGAATTTGCCTTAGAAAATGGTGTTTTTCAATGGGGTAAATTAGAACAATTTACATGGAATGTAAAAACTGAACTTGATAATTTATTATTATTCATTGATCCCGCATCTCTTGATCAAACACTGGGAACCATTGAAGTTGGTAGAATTATTACACAAATTGGTGGAAATGCCCGAGGAGAAATTGCTAAGGTTATTACCGATAATAATGGTCTTCCTACAAGAATTTATATAAGAGAAGTTCAATCTACTTTTGCATCTGGAGATAAAATTCTTGGTTCTAATGGATTTAGTTTCACAATTCAAAGTGCTCCTATTACATTCCCCACAGGTATTTTTTACATTGACTTTGGATCAGAAGCGTCTGAGTTTGGTCCTTTTGTGCCAGGGACTTATTACATGGCACCAAAAAATATTCTGGTCAAAAAGAATTACTTAATTATTTGGAATCAATCGGATAGTAGTAATCAAAACCATCCGATGCGTTTCAGTACAACTCCAGATGGTCCTTTAAATCAATCTTCACCTGGTACGATTTTATACACCAGTAGTGGATCGTCTTCAGCACCCGCTGCGGATTATGAAAATGAGTATCAGGCTTTATTCTTAATGAATGAAGATGAGACCAATAGAATCTATTATCATTGTGCCATTCATAATTATATGTCTGGTTACACTGGTGATGAAGGATATATGATTCTTGATACATCGACGGACGATGACGACGATGTAAATATGAACACATACTACATCGAAGATTTTTATCAACCTGGTGATACATCAACCATTGATCGCAGTAGACATGTAGATGGTCACTCTAAGATTTTGGGTATGTCTTTTGATGGATATCCCATTTATGGTCCATGGGGATATAATTCTAGTGGTGCTGTAGCAAGAGAAGTTTCTTCATACAGACTAAGAACTGGTAATGAGGTTGCTGGTAATCGCGAAGAAATTGTCACTCCATCAACGGTTACTTATGCAATCACTGTTGCAAATGGTCAGTTTTTAGTAGATGGTTCTGTAGTTCCATTTTTGAATCTAAAAAGAGGTAAAACCTACGTCTTTAACCAAGATGATTCTTCAAATGATGCAAATCATTTGTTTATTTCTACAACTGAAGACGGGTGGCATGTAGGTGCTCCTCCTGTTATTGGAGATACAACTTATCTTTATTCGCAACCTCATTTTGCAACTTATTATATTGATGGATCTCAAGTAACGTATACTCAGTATCTCAGTCAATTTACTACTGCATCTCAACGGGAGATGAGATTTTTTGTACCTGTAGATGCTCC
AAACAATCTATATACATTTGCATATTCTACTTCTGGATTAGGATTCAGACTTACTCAAGATGGATATGTTCTTGGCGATTTTGTTGAGGATTATGTATATGATTCATCTGTTGGTACTTTAGATGAATTTAATGGCAAATTTGCCGTTACTCCTGAGTATCCCAACGGAACATATGCTTACTTCATGACCGAAGATAGCAGTGGTAATCCAGCATATCCTTATGCTATTGGTCCAAAATATTATGGTGTTCCTTTATTTGAAGGTGATACAGTTCCTCAAAAACCAGATATTTTCCCAACTAGAGCAGAGGGAGAGGTTGCACTAAATCCTGATGGAACCATTGCATACGTTAATGTCACTCAACAGGGTGATAATTATTTTGGTCCTACCACTGCTAGAATTTTAGGTGGTGAAGGAAGTGGAGCACTTGTTAATCCAGTTGTTCAAACAGTTACTGGTCTAACACTTTTAAATCCAGGACAAGGTTATACAGTTGCACCTAACCTACAATTCAGTGGTGGCGGTGGTCAAGATGCTGAAGGTGCTGCAGAAGTAAGTCCTACTGGTAAAGTTACTAGCATCAGTATTAACGATCCTGGTGAGTTCTACCAAGAACCTCCTTACATTTTAATTACTGGTGGTGGTGGATCTGGTGCAAGAGCAACAGCGGAAGTAAATCAAGGACAGATTTCTGCAATCAATATTACTGATCAGGGTGCTGGTTATACATCTAATCCTCAAGTTATTTTTACAAAACTTGTAAATTTAAAAAGAAAGACTCAAGCAAGACAATCTTTGAACTCGGATATTCGTTATCTGACAGGTCTTGTTAAGAACGTTACTGCTTCTGATACTAATATCTACGTTGATGATACTAGTGCTTTTCCTGGTTCTGGTTCATTTATTATCAATAAAGAGACAGTTTCTTATACCTCAAAAACATCTGGTAAATTTACTGGACTTACAAGAGGAACTAACTTTAATTATGATCAGAGAGTCATTGTTGACAATAGTCAACTTGATGATGATGGAAATTCCACTTACAAATTTAACGTAGGTGATGTTGTAATCCGAAAAGTTGAAAGTGCTTCTAATAAATTAGCACGAGTATACGACTGGAATCCTGCAACTAGAGAACTCCTTGTTACATTTGAAGTTGATGAACTTGCATTTATTGATGCAGGTATTCCCTCTACCGAGGATGCTATTGTTCAGTTTGATGGTGGTGTTTATAGTTCTAGTGCATCCTCACAACTTCCACATGTAGTGCTCACTTCTCAAGGCAATTCGATTACATTACTAACCGAACCTATTACAACTCTGGCAAATAGTGCCTTTGAAGATGATGATGAATTGGATGGTGTTGGTGATGGTATTGCCGATTTGGTTAATACTGGAACTCAATATGAGGGTCAAATTAGTTTGGATGGTGGTATTATTCTTGGTGAACCTGGCGAAACTGGTAGAGACTCTAAATTTGGTATTGAAGAAACTGTTGGTGGTCAAAACACTACATTATTCCAAAATGGAGACCAAATCAAAGATGCTTCTATTCCATTCAAATTCTCCACAATCACAACTGCTGGAGGTTTGAGTGAAGGTGTTGAGCATATTGGTTTAATAACTCTTCAGTTGGATCCTAATAATGCTAATGGTGGAAACTTTAGTGTTAATGAAGTTATTACTGGTCAAGTTTCGGGAGTACAGGCAACAGTAGTTTCTTGGGATCCAACTACATCAAAACTAACTATCAAAGATACAGTACCCTTCAACACTGGCGATTCCAACAAAGGCGAGAATGGATTCTTGTATGAATTCTCACACAATTCTACAGTTGTTGATATCATTGTTCAAAATCCTGGAACAAACTACACATTGGCACCTAATGTTGCAATTGAAAACATTGGTGATATTGAGGCAACTGGAACAGCAGTTCTTACTGGCGCTGGTGACCAAGTTGCTTCAGTAACTATAACTAACGGCGGATATGGCATAACAC
AATCTGTTGATAGTGGATATAACTTACATCCCACAATAACATTCTCTGCAGCAAGTGGCGACACTACAGGTAGTGGTGCTGCAGCGTATGCTATTTTGGGTGGTGAGGACATTTTGGGAACGGGCGGATCTAGATATAGAATCAAAGGAATCGATTATCAAACAATCATTCGTTCGTAA
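
The CDS range can be cross-checked against the protein length. This is a minimal sketch, assuming the range is 1-based and inclusive and that the final codon is a stop codon (the CDS above ends in TAA). The function names are illustrative.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Nucleotide length of an inclusive [start, end] CDS range."""
    return end - start + 1


def encoded_residues(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by the CDS, excluding the stop codon."""
    length = cds_length_nt(start, end)
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    codons = length // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    print(cds_length_nt(40814, 44992))     # nucleotides in the CDS
    print(encoded_residues(40814, 44992))  # residues, stop codon excluded
```

Under these assumptions the range spans 4179 nucleotides, that is 1393 codons, which matches the 1392-residue protein plus one stop codon.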

Genome Context

Tertiary structure

PDB ID: 1f51b1300901f23562687ab7d4a417eb2c2ddf845d902f46a41c252fea670019
Source: ColabFold
Method: ColabFold
Resolution: 0.8222
Oligomeric State: monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
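
The confidence bands can be applied to per-residue pLDDT scores with a small helper. The sketch below is an illustration only: the legend uses strict inequalities, so the assignment of the exact boundary values 90, 70 and 50 to the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_category(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band here;
    the legend does not say how they are assigned, so this is an assumption.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_category(score))
```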