Genbank accession
YP_004323404.1 [GenBank]
Protein name
tail fiber protein; host specificity
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,85
Protein sequence
MAIPYNTTNAGTSVRNALRHSSIKTGGNWQHLEDVHVKHSGSWRDVKEVHVKSGGSWRLVHEGEHFLFNASLNSNSQGEWSLSSYISGLGYGGNKIKGLVTVTGGNTRRQVNLGNFSSDSLIYLRIESNNRIQARGGNGANVGGNGSNGQRALYTRTNFVLDNGGIIAGGGGGGSGGNNSNYSYEVQQAYGCQKGSTCYRQQTITEFIPGGGGGGGAGYPNSSGGSGGSNSYNGAGGNFNSGGSGGDAASGGTSNAGGDGGNLGQNGQDTAGGGSAGSSGTAIDGWSYRTGQSGSNDGDIRGPKTN
Physico‐chemical
properties
protein length:306 AA
molecular weight: 30876,51050 Da
isoelectric point:9,03584
aromaticity:0,07516
hydropathy:-0,69967

Domains

Domains [InterPro]
DC_1762
ATT
20–195
IPR007932
RBD
75–302
YP_004323404.1
1 306
Architecture
ATT
RBD
ATT 20-195 | RBD 196-302 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Prochlorococcus phage P-HM2
[NCBI]
445696 Uroviricota > Caudoviricetes > Eurybiavirus >
Host Prochlorococcus marinus subsp. pastoris str. CCMP1986
[NCBI]
59919 Bacteria > Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus > Prochlorococcus marinus

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
YP_004323404.1 [NCBI]
Genbank nucleotide accession
NC_015284.1 [NCBI]
CDS location
range 27671 -> 28591
strand +
CDS
ATGGCAATACCATATAATACTACTAATGCAGGAACTAGCGTACGTAACGCACTCAGGCACAGTTCTATAAAAACTGGTGGTAACTGGCAACACCTTGAGGATGTACATGTAAAACATAGTGGATCATGGCGTGATGTTAAAGAGGTTCACGTTAAGTCAGGTGGTTCATGGAGATTAGTTCATGAAGGTGAGCATTTCTTATTCAACGCATCTCTCAATAGTAATAGTCAAGGTGAGTGGAGTTTGTCAAGTTATATCAGTGGTTTAGGATATGGTGGTAATAAAATAAAAGGTCTTGTAACTGTAACAGGTGGAAACACAAGACGTCAAGTTAATCTTGGTAACTTCTCATCTGATTCTCTGATATATCTAAGAATAGAATCAAACAATAGAATACAGGCAAGAGGTGGAAATGGTGCTAACGTAGGTGGTAACGGAAGCAATGGACAACGTGCACTATATACTAGAACAAACTTTGTTTTGGATAATGGTGGTATCATCGCAGGAGGAGGCGGTGGTGGCTCAGGAGGTAACAACTCCAACTATTCATATGAAGTACAACAAGCATATGGTTGTCAAAAAGGTTCCACATGCTATAGACAACAGACTATCACAGAGTTCATACCTGGTGGTGGTGGAGGAGGAGGAGCAGGATATCCAAACTCCTCTGGTGGATCAGGTGGATCCAATTCTTACAATGGAGCAGGAGGAAACTTCAACTCTGGTGGGTCTGGTGGTGATGCAGCATCTGGAGGTACTTCCAATGCAGGAGGTGACGGTGGTAATCTAGGTCAAAATGGTCAAGATACCGCAGGAGGTGGTTCAGCTGGTAGCTCAGGAACCGCAATTGATGGTTGGTCATATAGAACTGGTCAATCAGGTAGTAATGATGGAGACATCCGAGGTCCCAAAACTAATTAG

Genome Context

Genome Context

Tertiary structure

PDB ID
dd3c9653e59ef76692aee9500aaed8a6581676ff56274e01ac67e652978efd65
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,6775
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Literature

Title Authors Date PMID Source
The Genome Sequence of Cyanophage M4-259 Henn,M.R., Sullivan,M.S., Osburne,M.S., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Erlich,R., Young,S.K., Koehrsen,M., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Borenstein,D., Chen,Z., Engels,R., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heiman,D., Hepburn,T., Howarth,C., Jen,D., Larson,L., Lewis,B., Mehta,T., Park,D., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., Walk,T., White,J., Yu,Q., Coleman,M.L., Huang,K.H., Weigele,P.R., DeFrancesco,A.S., Kern,S.E., Thompson,L.R., Fu,R., Hombeck,B., Chisholm,S.W., Haas,B., Nusbaum,C., Galagan,J. and Birren,B. 2011-09-23 GenBank