首页 | 本学科首页   官方微博 | 高级检索  
     


Identifying condition specific key genes from basal-like breast cancer gene expression data
Affiliation:1. Institute of Biochemistry and Biotechnology, Chung Shan Medical University, Taichung, Taiwan;2. Clinical Laboratory, Chung Shan Medical University Hospital, Taichung, Taiwan;3. Department of Pathology, Changhua Christian Hospital, Changhua, Taiwan;4. Department of Medical Technology, Jen-Teh Junior College of Medicine, Nursing and Management, Miaoli, Taiwan;5. School of Medicine, Chung Shan Medical University, Taichung, Taiwan;6. Department of Gastroenterology, Changhua Christian Hospital, Changhua, Taiwan;1. Department of Obstetrics and Gynecology, Qilu Hospital of Shandong University, Jinan, China;2. Department of Gynecology, Yucheng People''s Hospital, Dezhou, China
Abstract:
Mining patterns of co-expressed genes across the subset of conditions help to narrow down the search space for the analysis of gene expression data. Identifying conditions specific key genes from the large-scale gene expression data is a challenging task. The conditions specific key gene signifies functional behavior of a group of co-expressed genes across the subset of conditions and can be act as biomarkers of the diseases. In this paper, we have propose a novel approach for identification of conditions specific key genes from Basal-Like Breast Cancer (BLBC) disease using biclustering algorithm and Gene Co-expression Network (GCN). The proposed approach is a two-stage approach. In the first stage, significant biclusters have been extracted with the help of ‘runibic’ biclustering algorithm. The second stage identifies conditions specific key genes from the extracted significant biclusters with the help of GCN. By using difference matrix and gene correlation matrix, we have constructed biologically meaningful and statistically strong GCN. Also, presented the proposed approach with the help of a process diagram and demonstrated the procedure with an example of bicluster number 93 (Bic93). From the experimental results, we observed that 95% and 85% of the extracted biclusters are found to be biologically significant at the p-values less than 0.05 and 0.01 respectively. We have compared proposed approach with the Weighted Gene Co-expression Network Analysis (WGCNA) based approach. From the comparison, our approach has performed effectively and extracted biologically significant biclusters. Also, identified conditions specific key genes which cannot be extracted using the WGCNA based approach. Some of the important identified known key genes are PIK3CA, SHC3, ERBB2, SHC4, PTOV1, STAG1, ZNF215 etc. These key genes can be used as a diagnostic and prognostic biomarker for the BLBC disease after the rigorous analysis. The identified conditions specific key genes can be helpful to reduce the analysis time and increase the accuracy of further research such as biomarker identification, drug target discovery etc.
Keywords:Bicluster  BLBC  Gene expression data  GCN  Bioinformatics  Data mining
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号