Extracting biologically significant patterns from short time series gene expression data
Abstract
Background: Time series gene expression data analysis is widely used to study the dynamics of various cell processes. Most of the time series data available today consist of only a few time points, which makes standard clustering techniques difficult to apply.

Results: We developed two new algorithms that are capable of extracting biological patterns from short time series gene expression data. The two algorithms, ASTRO and MiMeSR, are inspired by the rank order preserving framework and the minimum mean squared residue approach, respectively. However, ASTRO and MiMeSR differ from previous approaches in that they take advantage of the relatively small number of time points to reduce the problem from NP-hard to linear. In tests on well-defined short time series expression data, we found that our approaches are robust to noise, as well as to random patterns, and that they can correctly detect the temporal expression profile of relevant functional categories. Evaluation of our methods was performed using Gene Ontology (GO) annotations and chromatin immunoprecipitation (ChIP-chip) data.

Conclusion: Our approaches generally outperform both standard clustering algorithms and algorithms designed specifically for clustering of short time series gene expression data. Both algorithms are available at http://www.benoslab.pitt.edu/astro/.
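To illustrate why a small number of time points can make rank-order grouping cheap, here is a minimal sketch. It is not the authors' ASTRO implementation; the function names and the toy data are hypothetical. It assumes each gene is reduced to the ordering of its time points by expression level, so genes with the same ordering can be grouped in a single pass over the data.

```python
from collections import defaultdict


def rank_pattern(profile):
    """Return the time-point indices ordered from lowest to highest expression."""
    return tuple(sorted(range(len(profile)), key=lambda t: profile[t]))


def group_by_rank_pattern(expression):
    """Group genes whose profiles share the same rank-order pattern.

    Each gene is visited once, so the cost grows linearly with the number
    of genes when the number of time points is small and fixed.
    """
    groups = defaultdict(list)
    for gene, profile in expression.items():
        groups[rank_pattern(profile)].append(gene)
    return dict(groups)


# Hypothetical toy data: three genes measured at four time points.
toy = {
    "geneA": [0.1, 0.5, 0.9, 0.4],
    "geneB": [1.0, 2.0, 3.5, 1.5],
    "geneC": [0.9, 0.4, 0.2, 0.6],
}

groups = group_by_rank_pattern(toy)
for pattern, genes in groups.items():
    print(pattern, genes)
```

Here geneA and geneB fall into the same group because their expression rises and falls in the same order, even though their absolute values differ. Ties and noise handling are ignored in this sketch.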
Authors (4)
A. Tchagang
Kevin V. Bui
Thomas McGinnis
P. Benos
Quick Access
- Year published: 2009
- Language: en
- Total citations: 22
- Source database: Semantic Scholar
- DOI: 10.1186/1471-2105-10-255
- Access: Open Access