Results: We demonstrate the effectiveness of DiffRank on synthetic and real datasets. For the synthetic dataset, we develop a simulator for generating synthetic differential scale-free networks, and we compare our method with existing methods. The comparisons show that our algorithm outperforms the existing methods. For the real datasets, we apply the proposed algorithm on collaboration networks and and on five gene expression datasets.
Co-authorship Networks: In scientific co-authorship networks, the nodes are authors of academic papers and the edges represent co-authorship (or collaboration) relationships between the authors. Two authors may have different relationships in two different research topics such as data mining and database. The differential hubs in this case include the authors who are highly active in one topic but not active in the other topic, or they may include the authors who are active in both topics but with different collaborators in each topic. Differential networking can also be used to analyze two co-authorship networks that are constructed from two mutually exclusive time intervals to identify the authors whose collaborations change over time (The DBLP dataset). The networks and the results are availabl in the paper. .
Biological Networks: Microarray studies are used to measure the expression level of thousands of genes under different conditions. These conditions could be different tissue types (normal vs cancerous), different subject types (e.g., male vs female), different group types (African-American and Caucasian American) , different stage of cancer (early stage vs developed stage) or different time points. Here, the nodes are the genes, and the edges represent the interactions between the genes. Since the genes that have strongly altered connectivity play an important role in the disease phenotype, finding the differential genes can be used in several applications such as identifying disease-causing genes and examining the effects of a certain treatment. Links to the five gene expression datastes used in the paper The Leukemia dataset The Medulloblastoma dataset The Lung cancer dataset The Colon cancer dataset The Gastric cancer dataset The results of the gene expression datasets were evaluated using the DAVID tool. In addition, we compare our results with the previously published results. We show that the proposed method provides biologically interesting rankings.The Results of the gene expression data is available here
Omar Odibat and Chandan K. Reddy, "Ranking Differential Genes in Co-expression Networks", Journal of Bioinformatics and Computational Biology (JBCB), 2011. Invited Paper (in press.) [ PDF ]