Frequent Subgraph Mining Across Massive
Biological Networks for Functional Discovery
Prof. Jasmine Zhou
University of Southern California
Abstract:
The rapid accumulation of biological network data translates into
an urgent need for computational methods for graph pattern mining. One
important problem is to identify recurrent patterns across multiple networks to
discover biological modules. However, existing algorithms for frequent pattern
mining become very costly in time and space as the pattern sizes and network
numbers increase. Currently, no efficient algorithm is available for mining
recurrent patterns across large collections of genome-wide networks. We
developed a novel algorithm, CODENSE, to efficiently mine frequent coherent
dense subgraphs across large numbers of massive graphs. Compared with previous
methods, our approach is scalable in the number and size of the input graphs
and adjustable in terms of exact or approximate pattern mining. Applying
CODENSE to 39 co-expression networks derived from microarray datasets, we
discovered a large number of functionally homogeneous clusters and made
functional predictions for 169 uncharacterized yeast genes.
Speaker Bio:
Jasmine Zhou is an assistant professor in Molecular and
Computational Biology at University of Southern California. She received her
PhD in Bioinformatics from Swiss Federal Institute of Technology in 2000, and
did her post-doc training at UCLA and Harvard from 2000 to 2003. Her research
includes integrative genomics analysis, network algorithms, and prediction of
gene functions and regulation. She is a recipient of the Alfred Sloan
fellowship in 2006.