首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Extractive summarization using complex networks and syntactic dependency
Authors:Diego R Amancio  Maria GV NunesOsvaldo N Oliveira Jr  Luciano da F Costa
Institution:
  • a Instituto de Física de São Carlos, Universidade de São Paulo, CP 369, PO BOX 13560-970, São Carlos, SP, Brazil
  • b Instituto de Ciências Matemáticas e de Computação Universidade de São Paulo, CP 668, 13560-970, São Carlos, SP, Brazil
  • Abstract:The realization that statistical physics methods can be applied to analyze written texts represented as complex networks has led to several developments in natural language processing, including automatic summarization and evaluation of machine translation. Most importantly, so far only a few metrics of complex networks have been used and therefore there is ample opportunity to enhance the statistics-based methods as new measures of network topology and dynamics are created. In this paper, we employ for the first time the metrics betweenness, vulnerability and diversity to analyze written texts in Brazilian Portuguese. Using strategies based on diversity metrics, a better performance in automatic summarization is achieved in comparison to previous work employing complex networks. With an optimized method the Rouge score (an automatic evaluation method used in summarization) was 0.5089, which is the best value ever achieved for an extractive summarizer with statistical methods based on complex networks for Brazilian Portuguese. Furthermore, the diversity metric can detect keywords with high precision, which is why we believe it is suitable to produce good summaries. It is also shown that incorporating linguistic knowledge through a syntactic parser does enhance the performance of the automatic summarizers, as expected, but the increase in the Rouge score is only minor. These results reinforce the suitability of complex network methods for improving automatic summarizers in particular, and treating text in general.
    Keywords:Summarization  Complex networks  Diversity metrics  Entropy  Syntactical dependency
    本文献已被 ScienceDirect 等数据库收录!
    设为首页 | 免责声明 | 关于勤云 | 加入收藏

    Copyright©北京勤云科技发展有限公司  京ICP备09084417号