首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于DSFCI-tree的分布式数据流频繁闭合模式挖掘
引用本文:程转流,胡为成,胡学钢.基于DSFCI-tree的分布式数据流频繁闭合模式挖掘[J].微电子学与计算机,2007,24(9):120-122,125.
作者姓名:程转流  胡为成  胡学钢
作者单位:1. 合肥工业大学,计算机与信息学院,安徽,合肥,230009;铜陵学院,计算机系,安徽,铜陵,244000
2. 铜陵学院,计算机系,安徽,铜陵,244000
3. 合肥工业大学,计算机与信息学院,安徽,合肥,230009
摘    要:频繁闭合模式集可惟一确定频繁模式完全集且数据量要小几个数量级。根据分布式数据流的特点,提出了一种挖掘频繁闭合项集的算法,该算法采用K叉树形结构,以叶子节点接收各条数据流,创建DSFCI_tree结构来存贮各条数据流中的每段闭合模式,然后逐层往上合并更新,从而在根节点可得整个分布式数据流的频繁闭合模式。

关 键 词:数据挖掘  分布式数据流  关联规则  频繁闭合项集
文章编号:1000-7180(2007)09-0120-03
修稿时间:2007-06-28

Mining Frequent Closed Patterns over Distributed Data Streams Based on DSFCI-tree
CHENG Zhuan-liu,HU Wei-cheng,HU Xue-gang.Mining Frequent Closed Patterns over Distributed Data Streams Based on DSFCI-tree[J].Microelectronics & Computer,2007,24(9):120-122,125.
Authors:CHENG Zhuan-liu  HU Wei-cheng  HU Xue-gang
Institution:1 College of Computer Science, Hefei Technology University, Hefei 230009, China; 2 Department of Computer Science, Tongling College, Tongling 244000, China
Abstract:The set of frequent closed patterns uniquely determines the complete set of all frequent patterns, and it can be orders of magnitude smaller than the latter. According to the features of distributed data streams, a new algorithm is proposed for mining the frequent closed patterns. This algorithm uses K-children tree structure, receives each data stream by the leaf node, founds the DSFCI_tree to store each section of closed patterns in each data stream, then the cascade merges upward and renews, thus may result in the root node the frequent closed pattern in the entire distributional data streams. The experiments and analysis show that the algorithm has good performance.
Keywords:data mining  distributed data streams  association rule  frequent closed itemsets
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号