Genbank accession
YP_003969401.1 [GenBank]
Protein name
long tail fiber protein proximal connector
RBP type
TF
Evidence UniProt/TrEMBL
Probability 1,00
TSP
Evidence RBPdetect
Probability 0,89
Protein sequence
MAELVIARVTEQGIQARDLSETFSGDNELVIKGQQGNAGGECVATLNGISLSPSQTKGLNCIHLTEDFKATGMAFDFSIAADTDRFTNWVNSLSTGIILLMSHTENVTNDKLNTYFDQIGSVGWKYYWNPAKGTNRSSYVAIIDCPLKKIMTEQFMGHGKTTMQAQLCIVFDTFADIGVTGYGDMIAWEENEVISSTAGYAVVKIIDKPLSELNIKVGEYIEMNAEVFHEQKAITDKTGCALIMQFWNGNTYISGMHIKPTAVDVWTPGSNMSRVPTNCNRIELSLYRYPSVPTSTAAIGMRDVMVKLRKPEVNKSRSHGTVGQWGFITSTANEGDAEASFVASSKVGDLVSQSYEELDQEWLPQFDQTGYFDIPDWIPGSKFEVSQTISYDYPDITDTWMRSGMGGTTDGTENRNLKVGIRTVHHDRGEGHFFSYGGFGYDFDARIEYGKSYTYKIEVDGSNVKFYLDGVEVQSGIATAPFKQITHFAVGADKRGNVYMHKLRGSVHELNMIDKTRLNGTNDRFYNFVRPGQLDDKRTVPGKSRYSGKEYFATNISPDNSANALYCGPGSKLVPGGLCDESKDVFVTILDAGGNTLLPGLELNKRYSCVVREIASPRADWVQSSSQYSIRNPLDVNNGAWIAEADGKNVNTMKFKIECVISPKINGKVTNVDWIDINKQIAPLNGVDQYSDIQPWNGPGEFQIKFNSRSGKGYILEGRTLTETINQFGGIYIATTGILTAYGNYVISTVNGTPFTAGMQVLANTDYVLSGYIKSGSRVVRVGARYNNTEFFDGYIWDFSLDGKGKDDRYYKNYNMSRYDAYNNYVKDELAKADVIASKTYVSDLSLFSTAVRQNEKSYKFNTATTPSGGTLKFSAPKGSIVRIDFDIEGETRIELRYSPAQSGSAPLIKSLPAGRCKGSVAYKITDEAPGVYIRTVAPKLNEVVKIHKLNVSRVYTDVIIENSSGSEYKLEDRETPIMKTVNRTKWIPDYRTGYLHIQNAWKPDPKDWAIRVKCKAGALKDDINPILSGPVYDTSTICVDQYRSVSSVRVFSYNAAKQLNTGITIDAGAKVGDELDIFVRVVGNTVTITVNGKTATGAWNPDGTESINLIGARAGGYRFNNDITLVELIDSSTKLCNSRRYDLSQYSTVKPVDTFIENELGAIKYIRIGEIDATNKIYGWDITKGEENQIGRMKDSHYIDGVQMDLFTTINTSGYSAYLHFVGDKRPYNAIDVELYQIDDNNNEQYIGTFTAIQAINATGYGIPENAVVTSWFSGLNTNKKVIFRPRGAGSQLALADAKPWVAI
Physico‐chemical
properties
protein length:1305 AA
molecular weight: 144517,71910 Da
isoelectric point:5,84080
aromaticity:0,10421
hydropathy:-0,35188

Domains

Domains [InterPro]
DC_0912
STR
1–1305
IPR013320
STR
366–515
YP_003969401.1
1 1305
Architecture
STR
STR 1-1305
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Tail Spike Domain Segmentation

Tail Spike Domain Segmentation

This protein has been segmented into three structural domains: N-terminal, central domain, and C-terminal.

Domain Layout
N-terminal
Central
C-terminal
YP_003969401.1
1 1305
Domain Start End Length (AA) Confidence
N-terminal 1 142 142 0,2918
Central domain 143 440 299 0,6344
C-terminal 441 1305 864 0,1483
Legend: N-terminal Central domain C-terminal
3D Structure with Domain Coloring

The structure is colored according to the domain segmentation: N-terminal (blue), Central (green), C-terminal (pink).

Domain Coloring
N-terminal
1-142
Central
143-440
C-terminal
441-1305

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage phiAS5
[NCBI]
879630 Uroviricota > Caudoviricetes > Pantevenvirales > Chrysonvirus > Chrysonvirus as5
Host Aeromonas salmonicida
[NCBI]
645 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Aeromonadales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_003969401.1 [NCBI]
Genbank nucleotide accession
NC_014636 [NCBI]
CDS location
range 72129 -> 76046
strand +
CDS
ATGGCTGAATTAGTTATTGCCCGTGTTACAGAACAGGGTATTCAAGCTCGCGATCTGTCAGAAACATTTTCTGGCGATAACGAGTTAGTTATTAAGGGTCAACAAGGCAACGCAGGAGGTGAATGCGTTGCCACCTTAAACGGAATCAGCCTTAGCCCTTCTCAAACGAAGGGCTTAAACTGCATTCATTTGACCGAAGATTTTAAAGCAACCGGTATGGCATTTGACTTTTCAATTGCCGCGGATACAGATAGATTTACAAACTGGGTTAATAGTTTGTCTACTGGTATTATTCTTCTAATGAGTCACACAGAAAACGTGACTAATGACAAGCTTAATACATATTTCGATCAAATTGGTTCTGTTGGTTGGAAGTATTATTGGAATCCAGCAAAAGGAACAAATAGAAGTTCTTACGTTGCTATTATTGATTGTCCACTGAAAAAGATCATGACTGAACAATTCATGGGTCATGGTAAAACAACTATGCAAGCCCAGCTTTGTATTGTATTTGATACATTCGCTGATATTGGTGTTACCGGTTACGGTGATATGATTGCGTGGGAAGAAAACGAAGTAATATCATCCACTGCTGGATATGCAGTTGTTAAAATCATCGATAAGCCATTATCTGAGTTGAATATCAAAGTCGGCGAATATATCGAAATGAATGCGGAAGTGTTTCACGAACAAAAAGCAATTACAGATAAAACTGGTTGCGCTCTTATTATGCAGTTCTGGAATGGAAATACATATATTTCTGGTATGCACATTAAGCCAACTGCGGTAGATGTTTGGACGCCGGGTTCAAATATGTCTAGGGTTCCGACAAACTGCAATCGAATTGAGTTGTCATTATATCGATATCCTAGTGTGCCAACAAGCACAGCAGCTATTGGTATGCGCGACGTAATGGTAAAGCTTAGAAAGCCGGAAGTTAATAAGTCTAGATCACATGGAACAGTTGGACAATGGGGCTTTATTACTTCTACCGCCAATGAAGGCGATGCGGAAGCGTCTTTCGTTGCTAGTTCTAAGGTAGGCGATCTAGTATCTCAATCATACGAAGAACTAGATCAAGAATGGTTGCCACAATTCGATCAAACTGGATATTTCGATATACCAGACTGGATTCCGGGTTCTAAATTCGAAGTTAGTCAAACTATCTCTTATGACTATCCAGATATTACCGATACGTGGATGCGTTCTGGTATGGGTGGAACTACCGACGGAACGGAGAATAGAAATCTAAAAGTTGGTATTAGAACAGTTCATCATGATAGAGGAGAAGGACACTTCTTTAGTTATGGAGGATTTGGATACGATTTTGATGCTAGAATCGAATATGGTAAATCATACACATATAAGATTGAGGTTGACGGCTCAAATGTTAAGTTCTATCTGGATGGTGTAGAAGTTCAATCTGGAATCGCTACAGCTCCGTTTAAACAAATAACTCATTTTGCAGTCGGAGCAGATAAGCGAGGAAACGTTTATATGCATAAACTCCGCGGATCTGTCCATGAACTAAACATGATTGACAAAACTCGCCTGAATGGAACAAATGATCGCTTTTATAATTTCGTTAGACCCGGTCAACTAGATGATAAACGAACAGTTCCTGGCAAATCACGCTATAGCGGCAAAGAATATTTTGCTACCAACATTTCTCCTGATAATAGTGCAAATGCTCTTTATTGTGGACCTGGATCTAAACTTGTTCCCGGTGGTCTATGTGACGAAAGTAAAGATGTTTTCGTGACGATTTTAGACGCCGGCGGAAACACATTGCTTCCTGGATTGGAACTCAATAAACGATATTCTTGCGTTGTTCGTGAGATCGCTAGTCCAAGAGCAGATTGGGTACAAAGCTCAAGTCAATATTCTATCAGAAATCCACTCGATGTTAATAATGGAGCGTGGATAGCAGAAGCCGACGGTAAAAACGTTAACACAATGAAGTTTAAGATTGAGTGTGTTATTAGTCCGAAGATTAACGGCAAAGTGACTAACGTTGATTGGATTGATATTAATAAGCAAATCGCTCCATTGAATGGCGTAGATCAATATTCAGATATTCAGCCTTGGAATGGTCCCGGTGAGTTTCAAATCAAATTCAATTCTCGTTCTGGTAAAGGATATATTCTTGAAGGTAGAACATTAACTGAAACTATCAATCAGTTTGGTGGAATTTACATTGCAACAACTGGAATATTAACTGCATACGGAAATTATGTGATTTCTACTGTAAACGGAACTCCATTCACTGCCGGAATGCAAGTTCTTGCAAATACGGATTACGTTCTTAGTGGATATATCAAGTCGGGTAGTAGAGTTGTTCGGGTTGGTGCAAGATATAATAACACCGAGTTTTTCGATGGATATATCTGGGACTTCTCACTTGATGGAAAGGGTAAAGATGATCGTTATTACAAAAATTATAACATGAGTCGATATGATGCCTATAACAACTATGTCAAGGACGAACTGGCAAAAGCAGATGTAATCGCAAGCAAAACTTACGTTTCAGATCTGAGTTTGTTTAGCACTGCCGTTAGACAAAATGAAAAGTCGTATAAGTTCAATACCGCAACGACTCCTAGTGGAGGCACTCTGAAATTTAGTGCACCAAAGGGTTCTATTGTTCGAATTGATTTTGACATCGAAGGAGAAACTCGGATTGAGTTGAGATATTCACCGGCACAAAGTGGTAGTGCTCCATTGATCAAATCTCTTCCTGCTGGAAGATGCAAAGGGTCAGTTGCTTATAAAATCACGGACGAAGCTCCTGGCGTCTATATAAGAACGGTTGCACCTAAGCTCAATGAGGTAGTCAAGATTCATAAGCTAAACGTCTCCAGAGTTTACACCGACGTTATTATCGAGAATTCTAGTGGAAGTGAATATAAACTAGAAGATCGAGAGACTCCGATCATGAAAACTGTGAATCGAACTAAGTGGATTCCTGATTACAGAACTGGATATCTTCATATTCAAAACGCTTGGAAGCCAGATCCTAAAGATTGGGCAATTCGTGTTAAGTGTAAAGCTGGTGCTCTAAAAGACGATATCAATCCTATTTTGAGCGGGCCAGTCTATGATACCAGCACGATTTGCGTTGACCAATATCGAAGTGTTAGCTCAGTTCGTGTGTTCTCCTATAATGCGGCCAAGCAGCTCAATACAGGAATAACAATCGACGCTGGTGCTAAAGTGGGTGACGAGCTTGATATATTTGTGCGTGTTGTTGGTAACACAGTCACGATAACAGTAAATGGAAAAACTGCTACCGGGGCTTGGAATCCAGATGGTACAGAAAGTATAAATTTGATCGGCGCAAGGGCTGGAGGATATAGATTCAACAATGATATCACGCTTGTTGAGTTGATTGACAGTAGCACTAAGTTGTGCAATAGCCGTCGTTATGATCTATCACAATATTCAACCGTTAAACCAGTTGATACTTTCATAGAAAACGAGTTGGGCGCAATCAAATATATTAGAATTGGTGAGATTGACGCAACAAACAAGATCTATGGATGGGATATAACTAAAGGCGAAGAAAATCAGATTGGCAGAATGAAAGACTCACATTATATCGACGGTGTTCAAATGGATCTTTTTACTACTATAAATACAAGTGGTTATAGTGCTTACTTGCACTTTGTTGGCGATAAAAGACCATATAACGCAATCGATGTTGAGCTGTATCAAATCGACGATAATAATAACGAGCAGTATATCGGAACGTTTACTGCAATTCAGGCAATCAACGCAACTGGATATGGAATTCCAGAAAACGCTGTTGTCACTTCTTGGTTCAGTGGATTGAACACTAACAAGAAAGTTATTTTCCGTCCCCGTGGAGCAGGATCGCAACTAGCTTTAGCAGATGCGAAACCGTGGGTGGCAATCTAA

Genome Context

Genome Context

Tertiary structure

PDB ID
7170d460997e256ca0bd9fa4ae05744f562f206ad53f812ee8a8f6c58fd04a3d
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,5503
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
Complete genome sequence of bacteriophage phiAS4 and phiAS5 infecting Aeromonas salmonicida Kim,J.H., Son,J.S., Choi,Y.J., Kim,K.S. and Park,S.C. 2011-11-25 GenBank