Abstract:
In a typical de novo motif discovery process, it is quite common that many of the motif candidates output from motif discovery programs are either already known motifs or...Show MoreMetadata
Abstract:
In a typical de novo motif discovery process, it is quite common that many of the motif candidates output from motif discovery programs are either already known motifs or motif-like decoy/repeat patterns. To prevent the false discovery and also to increase the chance of authentic novel motif discovery, some motif discovery programs employ a pre-processing stage in order to mask certain repeat positions in the input sequences. There are a few approaches to sequence masking aimed at avoiding the false discovery. This paper introduces a novel approach and a tool, called MotifHider, to sequence masking problem. MotifHider exploits sensitive knowledge hiding principles from database sharing. By hiding certain patterns, it provides successive motif discovery programs to avoid false discovery and rediscovery. At the same time, it avoids overly distortion of the input sequences so as to retain most of the authentic motifs.
Date of Conference: 14-16 September 2009
Date Added to IEEE Xplore: 23 October 2009
ISBN Information: