Genbank accession
NP_899594.1 [GenBank]
Protein name
tail collar fiber protein
RBP type
TF
Evidence GenBank
Probability 1,00
TF
Evidence Phold
Probability 1,00
TF
Evidence RBPdetect
Probability 0,91
Protein sequence
MSKGTQIFNHVIDDAGTVTVEVAGTAFDGQTGGNDDLQTCLELIQDHAVQPLPDYPVASTTVAGITKLSDEAAVVDPLNTDSAVTPSSLDYWMQNHATATELQYGFVKLITESTIDTVAPSDPVEAAQKHAFTLKTLNYALNTRFYATESDPGAVRLATNAQATTTGTLSTTVAMTPQRVKEMLDVWANTTASDASETTKGLIRLANGTEVNSTLATEDNLAISPYRFNFRTATTTRKAGFYLPDATVANARASNEHAVTVGTLNLFSANSSRVGVAKIANNLTTNDPLQALSAAMGYKLNNEKIGDAGGTVTGTLKINNVQSVGGTQLMTNGLIESQAMLNMYPVGSVYMSLVSTSPATLFGGTWARLAQGRVLVSEGSYGGRTFAVRQTGGEYEVQLTEATIPAHKHAGWGEHYDGNGIGFGVAKQYGRNNPGSRRTDSDNYLYYTSPVGGNQAHNNVQPYYTVYMWERTA
Physico‐chemical
properties
protein length:473 AA
molecular weight: 50436,33440 Da
isoelectric point:5,22273
aromaticity:0,07822
hydropathy:-0,27230

Domains

Domains [InterPro]
DC_0176
STR
1–450
IPR054500
STR
279–303
IPR053827
ATT
343–472
IPR044916
STR
372–466
NP_899594.1
1 473
Architecture
STR
ATT
STR 1-342 | ATT 343-472 |
Legend: ATT STR RBD CBM LEC ENZ CHP LNK TAS TTP UNK Unmapped

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage KVP40 (isolate Vibrio parahaemolyticus/Japan/Matsuzaki/1991)
[NCBI]
75320 Uroviricota > Caudoviricetes > Pantevenvirales > Schizotequatrovirus > Schizotequatrovirus KVP40
Host Vibrio
[NCBI]
662 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales
Host Vibrio parahaemolyticus
[NCBI]
670 cellular organisms > Bacteria > Pseudomonadati > Pseudomonadota > Gammaproteobacteria > Vibrionales

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
NP_899594.1 [NCBI]
Genbank nucleotide accession
NC_005083.2 [NCBI]
CDS location
range 208485 -> 209906
strand +
CDS
ATGAGTAAAGGAACTCAAATTTTCAATCACGTGATTGATGATGCGGGTACGGTTACAGTTGAAGTCGCTGGTACTGCCTTTGACGGGCAGACTGGCGGCAACGATGATCTTCAGACGTGCCTTGAATTGATTCAAGATCACGCTGTTCAGCCGTTACCTGATTATCCAGTCGCGTCAACGACTGTGGCTGGTATTACGAAATTAAGTGATGAAGCCGCTGTAGTTGACCCGCTAAACACAGATAGCGCGGTTACGCCAAGTTCGCTTGATTATTGGATGCAAAATCACGCAACTGCAACTGAATTGCAGTATGGTTTCGTAAAGCTGATAACCGAAAGCACAATCGACACAGTAGCACCGTCAGATCCCGTAGAAGCTGCACAAAAGCACGCATTCACGTTAAAAACGCTGAACTATGCTCTTAATACTAGATTTTACGCGACTGAATCTGACCCAGGTGCTGTGCGTCTAGCGACGAACGCGCAAGCAACGACAACAGGCACACTTTCAACGACTGTAGCAATGACGCCGCAGCGAGTTAAAGAAATGCTTGACGTGTGGGCAAATACAACTGCATCAGACGCATCCGAAACTACAAAAGGGTTGATTCGACTAGCAAACGGGACTGAAGTCAACAGTACATTAGCGACTGAAGATAACTTGGCAATTTCGCCATATCGTTTTAACTTTAGAACAGCAACTACGACTCGTAAGGCAGGGTTTTATCTACCCGACGCTACAGTTGCCAACGCTCGTGCGTCGAACGAACATGCAGTTACAGTGGGCACATTGAATCTATTCAGTGCAAATAGTTCTCGTGTGGGTGTTGCTAAAATAGCAAATAACCTGACGACGAATGATCCGTTACAAGCTCTAAGCGCAGCAATGGGTTATAAGTTGAACAACGAGAAGATTGGCGATGCAGGTGGTACAGTCACGGGTACCCTGAAGATCAACAATGTTCAGTCAGTTGGCGGAACTCAGTTGATGACTAATGGTCTAATTGAATCGCAAGCAATGCTTAATATGTATCCAGTCGGTTCAGTTTACATGTCGTTAGTATCAACATCACCGGCTACACTTTTCGGTGGTACATGGGCTAGATTGGCACAAGGTCGTGTTCTTGTTTCAGAGGGTTCATATGGCGGGAGAACTTTTGCAGTTCGCCAAACTGGCGGCGAGTATGAAGTTCAGCTAACTGAAGCAACAATCCCTGCTCATAAACACGCTGGCTGGGGTGAACATTACGATGGTAATGGTATAGGATTTGGTGTTGCGAAGCAATACGGTCGTAACAACCCAGGTTCTCGTAGAACAGACAGTGACAACTATCTATACTACACATCACCAGTCGGTGGTAACCAAGCGCACAACAACGTACAGCCATACTATACTGTTTACATGTGGGAAAGAACTGCATAA

Genome Context

Genome Context

Tertiary structure

PDB ID
59b1361de4493c60e1960071742633d8a5aef353e5d44115d1b060d4d327038f
ESMFold
Source ESMFold
Method ESMFold
Resolution 0,7089
Oligomeric State monomer
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50