首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于语句查询扩展和高性能计算平台的分布式信息检索系统DQSSQE
引用本文:彭敏,杨铭,孙松涛,何炎祥.基于语句查询扩展和高性能计算平台的分布式信息检索系统DQSSQE[J].武汉大学学报(理学版),2012,58(3):243-250.
作者姓名:彭敏  杨铭  孙松涛  何炎祥
作者单位:武汉大学计算机学院,湖北武汉,430072
基金项目:国家自然科学基金,国家软件工程重点实验室开放基金,武汉市科技晨光计划
摘    要:提出了一种基于语句的查询扩展方法以及语句向量的融合策略,使得扩展后的查询语句的查询性能优于原始查询语句;基于微软高性能计算平台HPC Server和查询扩展策略,设计实现了一个分布式文本检索系统DQSSQE.实验结果表明,在检索性能方面,所提出的查询扩展策略能够有效的提高查准率,召回率上也有一定的提高;在分布式检索计算性能方面,DQSSQE系统具有较好的计算加速比,随着文本集规模的增加,其计算性能的优越性体现明显.

关 键 词:信息检索  查询扩展  高性能计算  分布式

A Distributed Information Retrieval System DQSSQE Based on Sentences Query Expansion and High Performance Computing Platform
PENG Min,YANG Ming,SUN Songtao,HE Yanxiang.A Distributed Information Retrieval System DQSSQE Based on Sentences Query Expansion and High Performance Computing Platform[J].JOurnal of Wuhan University:Natural Science Edition,2012,58(3):243-250.
Authors:PENG Min  YANG Ming  SUN Songtao  HE Yanxiang
Institution:(School of Computer,Wuhan University,Wuhan 430072,Hubei,China)
Abstract:In this paper,a query expansion method based on sentences and a sentence vectors combination strategy are proposed to improve the query performance.A distributed text retrieval system DQSSQE is designed based on Microsoft HPC Server platform and query expansion strategy.The experiment result shows that the proposed query expansion strategy improves the precision ration greatly,and improves the recall ratio as well.At the same time,DQSSQE system gets a higher computation speedup ratio,and the more large the text set is,the higher performance the system will get,compared to the ordinary text retrieval systems.
Keywords:information retrieval  query expansion  high performance computing  distributed systems
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号