期刊首页 优先出版 当期阅读 过刊浏览 作者中心 关于期刊 English

《中国工程科学》 >> 2011年 第13卷 第9期

知识发现系统框架及其理论体系的构造方法论

北京科技大学计算机与通信工程学院,北京 100083

资助项目 :国家自然科学基金项目(60675030、60875029、61175048) 收稿日期: 2010-07-30 发布日期: 2011-09-15 09:59:51.000

下一篇 上一篇

摘要

当前知识发现的主流发展是围绕着寻求在各类数据库和应用背景下高性能、高扩展性的挖掘算法这一主题而展开的。事实上有比算法重要的决定挖掘流程的过程模型的研究,还有更为重要的决定模型和算法的内在机理(反映知识发现系统或过程本身规律)的研究,尚未得到应有的重视。笔者另辟蹊径,将所论三者有机地融合与集成,构造了一类基于认知心理特征的自主知识发现“系统框架”,通过对类似的几类“系统框架”的交叉融合、综合集成,构造出基于内在认知机理的知识发现理论体系KDTICM。研究与实验结果表明:这种高起点、高层次的构造方法论研究,有可能形成高效能挖掘系统与新的研究方向;这种构造方法论的研究,可使长时间得不到解决的“领域知识实质性地介入到知识发现过程中”对知识库进行“动态实时维护”等重要问题得以解决;通过揭示知识发现的潜在规律与复杂性,可反作用于主流发展。最后,给出了此种构造方法有效性的有力佐证。

图片

图1

图2

图3

图4

图5

图5

图6

图7

图8

图9

参考文献

[ 1 ] Chen M S,Han J,Yu P S.Data mining:an overview from a data‐ base perspective[J].IEEE Transactions on Knowledge and Data Engineering,1996,8(6):866 -883. 链接1

[ 2 ] Han J,Kamber M.Data Mining:Concepts and Techniques[M]. San Francisco:Morgan Kaufmann,2001.

[ 3 ] Indranil Bose,Mahapatra R K.Business data mining -a machine learning perspective[J].Information & Management,2001,39: 211 -225. 链接1

[ 4 ] Witten I H,Frank E.Data Mining:Practical Machine Learning Tools and Techniques with Java Implementations[M].San Fran‐ cisco:Morgan Kaufmann,2000.

[ 5 ] Friedman H.Data mining and statistics:what is the connection? [C]//Keynote Speech of the 29th Symposium of the Interface: Computing Sciences and Statistics,Houston,TX,1997.

[ 6 ] Hand D,Mannila H,Smyth P.Principles of Data Mining [M ]. Cambridge:MIT Press,2001.

[ 7 ] Kleinberg J,Papadimitriou C,Raghavan P.A microeconomic view of data mining[J].Data Mining and Knowledge Discovery,1998, 2(4):311 -324. 链接1

[ 8 ] 杨炳儒.知识发现进展中的两大核心问题[J].中国科学技 术 前沿:中国工程科学版,2006(9):205 -269.

[ 9 ] 杨炳儒.基于内在认知机理的知识发现[M].北京:国防工业 出版社,2009.

[10] Wang J F ,Lee T T.An invariant for hypergraphs[J].Chinese AC‐ TA Mathematical Application Sinica,1996,2(2):113 -120.

[11] Piatetsky -shapiro G,Matheus C J.Knowledge discovery work - bench for exploring business databases[J].International Journal of Intelligent Systems,1992,7:668 -675. 链接1

[12] Yang Bingru.KDK based double -basis fusion mechanism and its structural model [J].International Journal of Artificial Intelli‐ gence Tools,2005,14(3):399 -423. 链接1

[13] 杨炳儒,李晋宏,宋威,等.面向复杂系统的知识发现过程模型KD(D&K)及其应用[J].自动化学报,2007,33(2):151-155. 链接1

[14] 杨炳儒,宋威,,徐章艳.基于知识发现创新技术的专家系统新构造[J].中国科学(E辑),2007,37(6):738-747. 链接1

[15] Yang Bingru,Xiong Fanlun.KD (D&K)and double -bases co‐ operating mechanism [J].Journal of Systems Engineering and Electronics,1999,10(2):48 -54.

[16] Yang Bingru,Tang Jing.Research of discovery feature sub‐space model (DFSSM)based on complex type data[C]//Proceedings of 2002 International Conference on Machine Learning and Cyber‐ netics,2002,1:256 -260.

[17] Yang Bingru,Sun Haihong,Xiong Fanlun.Mining quantitative association rules with standard SQL queries and its evaluation [J].Journal of Computer Research and Development,2002,39 (3):307 -312.

[18] Kevin Karplus,Barrett C,Hughey R,et al.Sequence comparisons using multiple sequences detect twice as many remote homologues as pairwise methods [J].Journal of Molecular Biology,1998, 284:1201 -1210. 链接1

[19] David T,Jones.Protein secondary structure prediction based on position -specific scoring matrices[J].J Mol Biol,1999,292: 195 -202. 链接1

[20] Li Wenmin,Han Jiawei,Pei Jian.CMAR:accurate and efficient classification based on multiple class‐association rules[C]//Proc of the 2001 IEEE International Conference on Data Mining,San Jose,California,2001:369 -376.

[21] Yang Bingru,Hou Wei,et al.KAAPRO:an approach of protein secondary structure prediction based on KDD 倡 in the compound pyramid prediction model [J].Expert Systems with Applica‐ tions,2009,36(5):9000 -9006.

[22] Rost B,Sander C.Prediction of secondary structure at better than 70 % accuracy[J].J Mol Biol,1993,232(2):584 -599.

[23] Cuff J A,Barton G J.Evaluation and improvement of multiple sequence methods for protein secondary structure prediction[J]. Proteins:Structure,Function and Genet,1999,34:508 -519. 链接1

[24] Protein Structure Prediction Center. http://predictioncenter org /.

[25] Baldi P,Brunak S,Frasconi P,et al.Bidirectional dynamics for protein secondary structure prediction [C ]//Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI99),Stockholm,Sweden,1999.

[26] Karplus K,Karchin R,Draper J,et al.Combining local‐struc‐ ture,fold‐recognition,and new‐fold methods for protein structure prediction[J].Proteins,2003,53:491 -496. 链接1

[27] Rost B,Sander C,Schneider R.PHD‐an automatic mail server for protein secondary structure prediction[J].Comput Appl Bios‐ ci,1994,10:1153 -1160. 链接1

[28] Ouali M,King R. Cascaded multiple classifiers for secondary structure prediction [J]. Protein Science, 2000, 9: 1162 - 1176. 链接1

[29] Cuff J,Clamp M,Siddiqui A,et al.JPRED:a consensus second‐ ary structure prediction server [J].Bioinformatics,1998,14: 892 -893. 链接1

[30] Hyunsoo Kim,Haesun Park.Protein secondary structure predic‐ tion based on an improved support vector machines approach[J]. Protein Engineering,2003,16(8):553 -560. 链接1

[31] Hu H J,Pan Yi,Robert Harrison,et al.Improved protein sec‐ ondary structure prediction using support vector machine with a new encoding scheme and an advanced tertiary classifier [J]. IEEE Transactions on NanoBioscience,2004,3(4):265 -271. 链接1

[32] Xie Xiao,Yang Bo,Chen Yuehui.Protein secondary structure prediction based on nerve network [J].Journal of University of Jinan (Science and Technology),2008,(2):111 -115.

[33] Chen Jfinmiao,Narendra S Chaudhari.Cascaded bidirectional recurrent neural networks for protein secondary structure predic‐ tion[J].IEEE /ACM Transactions on Computational Biology and Bioinformatics,2007,4(4):572 -582. 链接1

[34] Paras Chopra,Andreas Bender. Evolved cellular automata for protein secondary structure prediction imitate the determinants for folding observed in nature[J].Silico Biol,2007,7 (1 ):87 - 93. 链接1

[35] Liu Yan,Jaime Carbonel,Judith Klein‐Seetharaman,et al.Con‐ text sensitive vocabulary and its application in protein secondary structure prediction[C]//Proceedings of the 27 th Annual Inter‐ national ACM SIGIR Conference on Research and Development in Information Retrieval,Sheffield,United Kingdom,ACM,2004, 538 -539.

[36] Guo Jian,Chen Hu,Sun Zhirong,et al.A novel method for pro‐ tein secondary structure prediction using dual‐layer SVM and pro‐ files[J].Proteins,2004,54(4):738 -743. 链接1

[37] Wang Longhui,Liu Juan,Li Yanfu,et al.Predicting protein sec‐ ondary structure by a support vector machine based on a new cod‐ ing scheme[J].Genome Informatics,2004,15(2):181 -190. 链接1

相关研究