GenBank accession
YP_012022215.1 [GenBank]
Protein name
prophage tail fiber N-terminal domain-containing protein
RBP type
TF | Evidence: UniProt/TrEMBL | Probability: 1.00
TF | Evidence: GenBank | Probability: 1.00
TF | Evidence: RBPdetect | Probability: 0.85
TF | Evidence: RBPdetect2 | Probability: 0.51
Protein sequence
MAVTTKIIVQQILNIDDTKATASKFPRYTVILGNSISSITASELVSSIEAAAKSAAAAKDSEIAAKTSELNAKNSEQEAAISAGASEASATQSATSATQSAASADKSAESAAAAKVSETNAKASETKAKTSETNAKTSETNAKSSETKAKASETNAKASETNATASAAAAKLSEDNAKVSETNAAASAADSSGFRNEAETFSTQAATSASAAKTSETNAKASETNAKASETKTKASETNAASSATSASQSVTTIQGLKSDVEQLKSDTQAIKNSAVTETTAIKADVAQLKTDTQGIKNSAITETTALKNQAATSATSAANSATEAGKQATSAANSANTAKTEADRSKTEADRSEAAANSTPDIQPLPDVWIPFNDSLDMITGFAPGYKKITVGDEEITLPSDKIVSFTRASTATYINKSGALTIAEINEPRFEKEGLLIEGQRTNYFANSNAPELWNSNSGLSKSETKTDDRGFKYATFGPGVYSGSTGTYGIILGNSQNNISVVKDDAVTLSFRARGHNLRFVARFSKGETPATVAVLFIDSDTLATFTSGQDASNITVKNVVQDGEWVAIEVVYKVTDNSAYINGGIQIVQKADATYDDSSFVEVTTPQIEKGSCASSFIITSSTPATRASDMVLIPTDCNQPSSIPLSLLVEVNRNWDIAPNSAPRIVHVANAPEDQLLVAFRVPSSDTVEPLPYSQLGVSQSFTPVSTKTSGKMVTGFVCNKSSELRCVTNAVFGAPVKTTWKAGLSKNLRIGGISADGGKHLFGHVRNFRIWHKELTDRQMRESV
Physico-chemical properties
protein length: 790 AA
molecular weight: 82538.02410 Da
isoelectric point: 5.50681
aromaticity: 0.05063
hydropathy: -0.35722
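
Values such as length, aromaticity and hydropathy can be approximated directly from the amino-acid string. The sketch below is an assumption about how such values are commonly computed, not this database's documented method: it takes aromaticity as the fraction of F, W and Y residues and hydropathy as the mean Kyte-Doolittle score (GRAVY). The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Assumption: aromatic residues are Phe, Trp and Tyr.
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W or Y."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First residues of the record's protein sequence, used as a toy input.
    toy = "MAVTTKIIVQQ"
    print(len(toy), aromaticity(toy), gravy(toy))
```

Under this definition, an aromaticity of 0.05063 over 790 residues would correspond to 40 aromatic residues (40 / 790 ≈ 0.05063).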

Domains

Domains [InterPro]
DC_0608 | ATT | 2–203
G3DSA:1.20.5.1700 | Unmapped | 239–317
Coil | Unmapped | 254–274
YP_012022215.1 | residues 1–790
Architecture
ATT 2-203 | STR 231-738
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped
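
The architecture line is a compact, pipe-separated list of labelled residue ranges. As a minimal sketch, assuming each segment has the form `LABEL start-end` with 1-based inclusive coordinates, it could be parsed as follows. `Segment`, `parse_architecture` and `segment_length` are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple


class Segment(NamedTuple):
    label: str
    start: int
    end: int


# Accepts an ASCII hyphen or an en dash between the two coordinates.
_SEGMENT_RE = re.compile(r"^([A-Za-z]+)\s+(\d+)\s*[-\u2013]\s*(\d+)$")


def parse_architecture(text: str) -> List[Segment]:
    """Parse a string such as 'ATT 2-203 | STR 231-738 |' into segments."""
    segments = []
    for chunk in text.split("|"):
        chunk = chunk.strip()
        if not chunk:  # tolerate a trailing '|'
            continue
        match = _SEGMENT_RE.match(chunk)
        if match is None:
            raise ValueError(f"unrecognised segment: {chunk!r}")
        segments.append(
            Segment(match.group(1), int(match.group(2)), int(match.group(3)))
        )
    return segments


def segment_length(seg: Segment) -> int:
    """Number of residues covered, assuming inclusive coordinates."""
    return seg.end - seg.start + 1


if __name__ == "__main__":
    for seg in parse_architecture("ATT 2-203 | STR 231-738 |"):
        print(seg.label, seg.start, seg.end, segment_length(seg))
```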

Taxonomy

  | Name | Taxonomy ID | Lineage
Phage | Escherichia phage JLBYU40 [NCBI] | 2894749 | Uroviricota > Caudoviricetes > Demerecviridae > Tequintavirus > Tequintavirus JLBYU40
Host | No host information

Coding sequence (CDS)

GenBank protein accession
YP_012022215.1 [NCBI]
GenBank nucleotide accession
NC_105514.1 [NCBI]
CDS location
range 83403 -> 85775
strand -
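
The coordinates above can be cross-checked against the stated protein length. The sketch below assumes 1-based inclusive coordinates and a CDS that includes its stop codon; the function names are hypothetical. For a minus-strand feature, the coding sequence is the reverse complement of the genomic interval.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Nucleotide length of a feature with 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Residue count implied by the CDS coordinates."""
    length = cds_length_nt(start, end)
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length // 3
    return codons - 1 if has_stop_codon else codons


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Reverse complement, as needed to read a minus-strand CDS 5'->3'."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(cds_length_nt(83403, 85775))            # 2373 nucleotides
    print(expected_protein_length(83403, 85775))  # 790 residues
```

The interval 83403 to 85775 spans 2373 nucleotides, or 791 codons; subtracting the stop codon gives 790 residues, which agrees with the protein length listed in this record.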
CDS
ATGGCGGTAACTACTAAAATTATTGTGCAACAAATACTGAATATAGATGATACGAAAGCTACTGCTAGTAAATTTCCTAGATATACAGTAATTCTTGGAAATTCTATTAGTTCTATTACTGCTAGTGAGTTAGTATCTTCTATAGAGGCCGCTGCTAAATCTGCTGCAGCTGCAAAAGATTCTGAGATAGCAGCTAAGACCTCAGAGCTTAATGCTAAGAATTCGGAGCAGGAAGCTGCTATTTCTGCCGGAGCTTCAGAAGCTTCTGCTACTCAGTCTGCTACATCTGCTACTCAATCTGCTGCATCAGCGGATAAATCTGCAGAATCAGCCGCAGCAGCTAAAGTATCCGAGACTAATGCGAAAGCTAGTGAAACTAAGGCTAAAACCTCCGAGACTAATGCAAAAACATCAGAAACTAATGCAAAGTCTAGTGAAACCAAAGCTAAGGCCTCCGAGACTAATGCTAAGGCTAGTGAAACTAATGCTACTGCTTCGGCTGCAGCGGCAAAACTTAGTGAGGATAATGCAAAGGTCAGTGAGACTAATGCTGCCGCTTCCGCTGCGGATTCCAGTGGTTTTAGGAATGAGGCGGAAACATTCTCTACACAAGCTGCTACATCAGCTTCTGCTGCAAAAACCTCTGAAACTAATGCAAAAGCTAGTGAGACTAATGCAAAAGCTAGTGAGACTAAAACCAAAGCTAGTGAGACTAACGCAGCTAGTTCCGCAACTTCAGCTAGTCAGTCAGTAACTACTATTCAAGGCTTAAAATCAGATGTTGAGCAACTAAAATCAGATACTCAGGCCATTAAAAACAGTGCTGTAACAGAGACAACAGCTATAAAGGCAGATGTTGCGCAGTTAAAAACAGATACACAAGGCATTAAAAATAGCGCGATAACAGAAACTACCGCGCTAAAAAACCAAGCTGCAACTTCTGCAACTAGTGCTGCCAATTCTGCTACTGAAGCAGGAAAACAAGCTACTAGTGCTGCCAATAGTGCTAATACTGCTAAAACTGAAGCCGACCGTTCAAAAACTGAGGCTGATAGATCAGAAGCTGCTGCTAATTCTACCCCCGACATTCAACCTCTTCCAGATGTATGGATACCGTTTAATGATTCTCTAGATATGATCACCGGCTTTGCACCAGGCTATAAAAAAATAACAGTCGGTGATGAGGAAATAACACTGCCTAGCGACAAGATTGTTAGCTTTACCCGTGCATCAACTGCGACATATATTAATAAGTCCGGCGCTCTTACCATTGCAGAAATTAATGAGCCACGTTTTGAAAAAGAAGGTCTGCTTATTGAAGGTCAGAGAACTAACTATTTTGCGAATTCAAACGCGCCAGAATTATGGAACTCGAATTCAGGGCTAAGCAAAAGTGAAACAAAAACCGATGATCGTGGTTTTAAATATGCAACGTTCGGGCCTGGTGTTTACTCTGGCTCTACTGGTACATATGGGATTATATTAGGTAACTCGCAAAACAATATATCTGTAGTTAAAGATGACGCAGTTACACTATCGTTTAGGGCAAGAGGACATAATCTAAGATTTGTTGCTAGATTTAGCAAAGGAGAAACACCTGCTACTGTTGCTGTTCTTTTTATTGACAGCGACACGCTAGCTACATTCACGTCCGGGCAGGATGCGTCTAACATCACTGTGAAAAATGTTGTTCAGGATGGTGAATGGGTTGCTATAGAGGTTGTTTATAAAGTTACAGATAACTCAGCATATATTAACGGTGGAATTCAGATTGTGCAAAAAGCCGATGCCACTTATGATGATTCTAGTTTTGTTGAGGTGACTACTCCGCAAATTGAAAAAGGGTCATGCGCTTCATCGTTCATAATTACAAGCAGTACGCCCGCCACAAGAGCTAGTGACATGGTCCTAATACCAACTGATTGCAATCAACCATCCTCTATACCGTTAAGTCTACTTGTTGAGGTAAATAGAAATTGGGATATAGCCCCAAACTCAGC
ACCAAGGATAGTACATGTAGCAAATGCACCAGAAGACCAGTTATTAGTTGCTTTCAGGGTTCCATCAAGCGATACAGTAGAGCCGCTGCCTTATTCTCAGTTGGGGGTCAGTCAGTCATTTACGCCAGTATCAACAAAAACAAGCGGGAAAATGGTGACGGGTTTTGTTTGCAACAAAAGCTCAGAATTAAGATGCGTAACAAACGCTGTGTTCGGTGCGCCTGTGAAAACAACATGGAAAGCTGGTTTATCTAAAAATCTTCGTATCGGCGGCATAAGTGCAGATGGTGGGAAACATCTTTTCGGGCATGTCAGAAATTTTAGAATCTGGCATAAAGAATTAACGGATCGTCAAATGAGGGAGTCTGTATGA
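
To relate the CDS to the protein sequence, the nucleotides can be translated codon by codon. This is a minimal sketch assuming the standard genetic code and a sequence already given in coding orientation (as the CDS above is); `translate` is a hypothetical helper, and whitespace is stripped first because the sequence is wrapped across lines.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the CDS above.
    print(translate("ATGGCGGTAACTACTAAA"))  # MAVTTK
```

The first six codons give `MAVTTK`, matching the start of the protein sequence in this record.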

Genome Context

Tertiary structure

PDB ID
9ba526d36db19ecf40f752732938fc9b7c63517594e70edc900eba47702a39f9
ESMFold
Source ESMFold
Method ESMFold
Resolution 0.6909
Oligomeric State monomer
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
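
The confidence bands above can be applied to per-residue pLDDT scores as in the sketch below. The legend uses strict inequalities, so how scores of exactly 90, 70 or 50 are binned is an assumption here (they fall into the lower band); `plddt_class` is a hypothetical helper and expects scores on a 0 to 100 scale.

```python
def plddt_class(score: float) -> str:
    """Map a pLDDT score (0-100 scale) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_class(value))
```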