UniProt accession
K7NTM0 [UniProt]
Protein name
Proximal tail fiber subunit
RBP type
Type  Evidence        Probability
TF    UniProt/TrEMBL  1.00
TSP   RBPdetect       0.55
TF    RBPdetect2      0.67
Protein sequence
MVAKSFRARSGLDAAGEKVINVGKADRNTLSDGVNVDFFNEFNGIQQYDPTRGYSQDMAIIYARRIWYAKQNIASPAGAFDESKWIATRNDPKWVYSNVTTPDGSIIESGSYIMADGRFTELLYLLPDNPTEGDVITFKDCGGLVGVNSILVKSNTRQIRLRTVQSAQYRLTHPYMIATFIYNGNVWRVAETLDNRDSEIVNATGTGSFQLQSGMTVFRNSATGKITLQLPKYANDGDVITTYDADKMNSINVAVLQIYPGSGHTISDGAITGVTSVTSQKSGFGMFIFDAQNSQWKVYDADNRVRLRRIYSDLNTVPNDYVFVTANPSGTIPNVTVTLPTDVADGDRVYVSLYMMGKNQNCTIKVKDGTTDKIRTNKNMMQFPQRKDYPPDDWFSVTSLAFNAASDYLPYIEFSYMKATKEWVVANYRPIVERVDATNRSRTGVIALAAQAEVNKNLEDNPNDETAITPMTLANKTATETRRGIARLATTAEVNKLSTDTYLDDVIVTPKKLNERTATETRRGLAEIATQAETNGSTDDITIVTPKKLHNRIASPTLTGILALVATGGAPNTNTDRAQAGTGVYDHSDYQKAVTPKTLREYKATQLQSGAVWLASETEVINGTVASANIPTVVTPEMLHKKTSTDGRIGLIEIATQAETNAGTDYTRAVTPKTLNDRAATETLTGIIAIATTAEVSAGTVTDKAIVPSKLKGYLDDTSHITVATADGLTQSGTIWTTVNIGIQSATETQRGTLRVATQSETNAGTLDTVFVTPKKLHAKKATESAEGIIQVATAAETTAGTVANKAVSPKNLKNTIQVDTSWQATDLVRGTVKLSKGLGTWSGNDVAGSTLPDDGYAAVGVAVSPYELNLTLKHYLPIGAKAVDADKLDNLDSSQFIRRDVDQTVNGALTLTRTTTVQANIDSSADATFRVMNVNGDLNVGDGSSMGKLRLNGGSSNDWSIQASSASGRIAMISTGNTSTVHLSVYNDTRGVVANVKFQAPEIQAISKVTLGNDTVITAAGSVLSMGTNNKTTKILTSDAGNIVAEESANSYKVFTEKNAQTLLNPTYVRKAGDTMSGRLTVNNSSIIIAGQAAWSTLDAVTEASRGNWTAEITASEQYNLLPGYAVPVLEPDPINPEIMIVTRYTYVKAPGTLTQFGNGTAFTYQIWAPRPTSGTGVNALAQSFWIRQMNPITGKFDGWGRMYTSNNPPTAGEIGATSAVGTTVKNMTVTDWIKVGNVKIYPDPVTQTVKFEWVA
Physico-chemical properties
protein length: 1257 AA
molecular weight: 135451.20420 Da
isoelectric point: 6.14841
aromaticity: 0.07080
hydropathy: -0.27661
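The aromaticity and hydropathy entries can in principle be recomputed from the protein sequence. The following is a minimal sketch, assuming that aromaticity means the fraction of F, W and Y residues and that hydropathy means the mean Kyte-Doolittle value (GRAVY). The page does not state which definitions or scale it uses, and the function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed; the source does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration on the first 30 residues of the protein sequence above;
# the full 1257-residue sequence would be passed in the same way.
fragment = "MVAKSFRARSGLDAAGEKVINVGKADRNTL"
print(f"aromaticity={aromaticity(fragment):.5f} gravy={gravy(fragment):.5f}")
```

Values computed this way should be compared with the listed ones before relying on either, since a different scale or residue set would shift the result.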

Domains

K7NTM0, 1–1257 aa
ATT 1103–1205

Legend: ATT, Attachment Domain; STR, Structural Domain; RBD, Receptor-Binding Domain; CBM, Carbohydrate-Binding Module; LEC, Lectin-like Domain; ENZ, Enzymatic Domain; CHP, Intramolecular Chaperone; LNK, Linker/Spacer Domain; TAS, Tail-Associated Structural; TTP, Tail Tubular Protein; UNK, Uncharacterized Domain; Unmapped.

Tail Spike Domain Segmentation

Segmented into three structural domains: N-terminal, central, and C-terminal.

Domain          Start  End   Length (AA)  Confidence
N-terminal      1      829   829          0.8121
Central domain  830    1028  200          0.2340
C-terminal      1029   1257  228          0.6473
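The segment coordinates can be cross-checked against the listed lengths and the protein length. The sketch below assumes 1-based, inclusive Start/End coordinates, a convention the page does not state. The `Segment` type and helper names are hypothetical.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    name: str
    start: int          # assumed 1-based, inclusive
    end: int            # assumed 1-based, inclusive
    listed_length: int  # value from the "Length (AA)" column


# Rows copied from the segmentation table above.
SEGMENTS = [
    Segment("N-terminal", 1, 829, 829),
    Segment("Central domain", 830, 1028, 200),
    Segment("C-terminal", 1029, 1257, 228),
]
PROTEIN_LENGTH = 1257


def span(seg: Segment) -> int:
    """Number of residues covered under the inclusive-coordinate assumption."""
    return seg.end - seg.start + 1


def is_contiguous(segments, total_length: int) -> bool:
    """True if the segments tile residues 1..total_length with no gap or overlap."""
    expected_start = 1
    for seg in segments:
        if seg.start != expected_start:
            return False
        expected_start = seg.end + 1
    return expected_start == total_length + 1


def length_mismatches(segments):
    """Rows whose listed length differs from End - Start + 1."""
    return [(s.name, span(s), s.listed_length)
            for s in segments if span(s) != s.listed_length]


print(is_contiguous(SEGMENTS, PROTEIN_LENGTH))
print(length_mismatches(SEGMENTS))
```

Under the inclusive assumption the coordinates tile residues 1–1257 without gaps, while the Central and C-terminal rows each differ by one residue from their listed lengths. The listed lengths still sum to 1257, so the boundary between those two segments may simply be reported under a different convention.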


Taxonomy

Coding sequence (CDS)

Genbank protein accession
AEX26741.1 [NCBI]
Genbank nucleotide accession
HQ918180 [NCBI]
CDS location
range 159003 -> 162776
strand +
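The CDS range can be related to the protein length with simple arithmetic. This is a minimal sketch, assuming the range is 1-based and inclusive (the usual GenBank convention, not stated on the page) and that the CDS ends with a single stop codon. The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive genomic range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by the range, optionally excluding one stop codon."""
    nt = cds_length(start, end)
    if nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt // 3
    return codons - 1 if has_stop else codons


print(cds_length(159003, 162776))               # nucleotides in the range
print(expected_protein_length(159003, 162776))  # residues, stop codon excluded
```

With the range above this gives 3774 nt, that is 1258 codons including the stop, which agrees with the listed protein length of 1257 AA.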
CDS
ATGGTAGCTAAATCATTCCGCGCACGAAGTGGCCTTGATGCTGCTGGTGAAAAAGTTATCAACGTTGGGAAGGCTGATCGTAATACGCTGAGTGACGGCGTTAACGTTGACTTTTTCAATGAATTTAACGGAATCCAGCAGTATGACCCGACCCGTGGCTATTCTCAAGACATGGCGATCATTTATGCGCGTCGTATCTGGTACGCAAAACAGAATATTGCTTCCCCTGCTGGGGCTTTCGACGAATCCAAATGGATCGCAACTCGTAATGACCCGAAATGGGTTTATAGTAACGTAACCACTCCAGACGGTAGCATTATTGAATCTGGTAGTTACATTATGGCTGATGGACGCTTTACTGAACTTCTGTACCTGCTGCCGGACAACCCGACAGAGGGCGACGTAATCACGTTTAAAGATTGCGGTGGTTTAGTAGGTGTGAACAGTATCCTTGTTAAGTCTAACACCCGTCAGATCCGTTTACGTACTGTACAATCAGCACAGTATCGACTGACCCACCCGTACATGATCGCAACGTTTATCTATAACGGTAACGTATGGCGTGTTGCTGAAACTCTGGATAACCGCGATTCTGAGATCGTGAACGCAACCGGAACAGGTTCATTCCAGTTACAATCTGGTATGACAGTTTTCCGTAACAGTGCTACGGGTAAAATCACTCTCCAGTTGCCGAAATATGCAAATGATGGTGATGTGATCACAACCTATGACGCTGATAAAATGAACTCTATCAACGTAGCCGTTTTGCAGATTTATCCAGGTAGTGGACACACTATTTCTGACGGTGCAATTACTGGTGTTACTTCGGTAACTAGCCAGAAATCTGGCTTTGGTATGTTCATCTTTGATGCACAGAATAGCCAATGGAAAGTATATGATGCTGATAATCGTGTTCGTCTGCGTCGTATCTATAGTGATTTAAACACAGTTCCTAACGATTACGTTTTTGTTACTGCAAACCCTTCTGGGACTATACCAAACGTTACAGTTACCCTTCCAACTGATGTTGCTGATGGTGATCGCGTTTACGTATCCCTTTATATGATGGGTAAAAACCAGAACTGTACAATCAAAGTTAAAGATGGTACAACGGACAAGATCCGCACCAACAAAAATATGATGCAGTTCCCGCAGCGCAAAGACTATCCGCCTGATGATTGGTTTAGTGTAACTTCATTAGCATTTAATGCTGCTAGTGATTACCTCCCGTATATCGAGTTTTCTTACATGAAAGCTACGAAAGAATGGGTTGTTGCTAATTATCGTCCGATCGTTGAACGTGTTGACGCAACTAACAGATCGCGTACTGGTGTTATTGCTCTGGCTGCACAGGCAGAAGTAAACAAAAACTTGGAAGACAACCCGAACGACGAAACTGCTATTACTCCAATGACGTTAGCAAATAAAACTGCTACAGAAACCCGTCGTGGTATTGCACGGTTAGCAACAACTGCTGAGGTTAACAAACTTTCAACCGATACCTATTTGGATGATGTGATTGTTACTCCTAAGAAGCTGAACGAAAGAACAGCGACTGAAACCCGTCGTGGATTGGCAGAAATCGCAACTCAGGCAGAAACAAACGGAAGCACCGATGATATTACGATTGTAACCCCGAAAAAGTTGCATAACCGTATTGCATCGCCGACCTTAACTGGTATCCTTGCCCTTGTTGCTACAGGTGGTGCTCCTAACACCAACACGGATCGTGCTCAGGCTGGTACTGGGGTTTATGATCATTCAGATTATCAGAAAGCGGTAACGCCTAAAACTCTTCGTGAGTATAAAGCGACTCAGTTACAATCTGGCGCTGTATGGCTGGCTTCTGAAACCGAAGTTATTAATGGTACTGTTGCAAGTGCAAACATTCCGACCGTAGTTACTCCGGAAATGCTGCACAAGAAAACCTCTACTGATGGTCGTATTGGTTTGATCGAGATTGCAACTCAGGCAGAAACAAACGCTGGTACTGATTACAC
GCGAGCGGTAACGCCTAAAACGCTTAACGATCGTGCTGCAACGGAAACGCTGACAGGTATCATTGCAATTGCAACCACTGCCGAAGTATCAGCAGGTACTGTAACGGATAAAGCGATCGTACCGTCTAAACTGAAAGGTTATCTGGACGATACAAGCCATATTACTGTTGCTACTGCTGACGGGTTAACTCAATCTGGGACTATTTGGACTACGGTTAACATCGGTATTCAATCAGCAACTGAAACTCAACGTGGTACTTTACGCGTTGCTACGCAGTCTGAGACGAACGCAGGGACATTAGATACAGTATTTGTCACCCCTAAGAAGTTACACGCTAAGAAAGCGACTGAGAGCGCAGAAGGTATCATTCAGGTGGCTACGGCTGCTGAAACTACCGCTGGCACCGTTGCAAACAAGGCTGTTTCTCCTAAGAACTTGAAAAATACAATTCAGGTTGATACTTCATGGCAAGCTACCGATCTGGTACGCGGTACTGTGAAACTGTCTAAGGGGCTTGGTACTTGGTCTGGTAATGATGTGGCTGGTTCTACTCTTCCGGATGATGGTTATGCCGCTGTAGGTGTTGCTGTTTCTCCTTATGAATTGAACCTGACGCTGAAACATTATCTGCCGATCGGGGCTAAAGCGGTTGATGCTGATAAGCTGGATAACCTGGATTCTTCCCAGTTCATTCGTCGTGATGTTGATCAGACGGTCAACGGGGCGTTGACTCTAACGAGAACAACCACTGTACAGGCCAACATTGATTCAAGCGCTGATGCAACATTCCGCGTAATGAATGTTAATGGAGATCTGAACGTTGGTGATGGTTCGTCAATGGGTAAACTTCGTTTGAATGGTGGTTCATCTAACGACTGGTCAATTCAAGCAAGTTCAGCCTCTGGGCGTATTGCAATGATCTCAACGGGTAACACTAGCACAGTTCATCTTTCTGTGTATAATGATACTCGCGGCGTTGTAGCTAACGTTAAATTCCAGGCTCCTGAAATTCAGGCGATTAGCAAAGTGACTCTGGGTAATGATACCGTGATCACTGCTGCTGGTTCTGTTCTATCTATGGGTACGAACAACAAGACAACTAAGATCCTGACCTCTGATGCTGGCAACATCGTAGCGGAAGAATCTGCGAACTCTTATAAAGTCTTTACTGAGAAGAACGCGCAAACCTTGCTTAACCCTACGTATGTACGCAAAGCAGGTGATACGATGTCTGGACGGTTGACAGTAAACAACAGTTCAATTATCATCGCAGGTCAAGCGGCTTGGTCAACACTGGATGCAGTAACAGAAGCATCTAGGGGTAACTGGACGGCTGAAATCACAGCATCGGAACAGTACAACTTACTTCCTGGTTACGCGGTTCCGGTTCTTGAACCAGATCCGATTAATCCAGAAATCATGATTGTGACCCGTTATACCTATGTTAAAGCACCGGGTACTTTAACGCAGTTTGGTAACGGAACTGCATTCACTTATCAGATTTGGGCACCTCGTCCGACCTCTGGTACTGGTGTTAATGCGCTGGCGCAATCCTTCTGGATCCGTCAAATGAACCCAATTACGGGTAAATTTGATGGTTGGGGCCGCATGTATACTAGCAACAACCCGCCTACTGCTGGTGAGATTGGTGCAACGTCTGCTGTTGGTACTACGGTTAAAAACATGACTGTTACCGATTGGATCAAAGTTGGTAACGTTAAGATTTACCCAGATCCAGTTACTCAGACAGTTAAATTTGAGTGGGTGGCATAA
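The CDS can be checked against the protein sequence by translating it. The following is a minimal sketch that assumes the standard codon assignments (which bacterial and phage translation table 11 shares for elongation) and a plus-strand CDS, as listed above. `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a plus-strand CDS; stop at the first stop codon if to_stop."""
    cds = "".join(cds.split()).upper()  # tolerate wrapped or spaced input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS above.
print(translate("ATGGTAGCTAAATCATTCCGCGCACGAAGT"))
print(translate("GAGTGGGTGGCATAA"))
```

The two fragments translate to the first ten and last four residues of the protein sequence above. Because the helper strips whitespace, the wrapped CDS can be passed in as a single string without manual joining.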

Genome Context

Tertiary structure

K7NTM0
ESMFold structure
Source ESMFold
pLDDT 54.1
Oligomeric state monomer
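The pLDDT value can be read against the confidence bands used by the AlphaFold Protein Structure Database. The sketch below assumes the ESMFold score is on the same 0–100 scale and that those bands are a reasonable guide for it, which the page does not state. `plddt_band` is a hypothetical helper.

```python
def plddt_band(score: float) -> str:
    """Map a mean pLDDT (0-100) to an AlphaFold DB style confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected on a 0-100 scale")
    if score > 90:
        return "very high"
    if score > 70:
        return "confident"
    if score > 50:
        return "low"
    return "very low"


print(plddt_band(54.1))
```

Under these assumptions a mean of 54.1 falls in the low-confidence band, so domain boundaries and the overall fold in the predicted model should be treated with caution.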