码迷,mamicode.com
首页 > 其他好文 > 详细

关键词提取新方法-YAKE! Collection-independent Automatic Keyword Extractor

时间:2020-06-30 20:36:13      阅读:83      评论:0      收藏:0      [点我收藏+]

标签:tween   line   ber   produce   rom   sam   form   源地址   rpo   

Extracting keywords from texts has become a challenge for individuals and organizations as the information grows in complexity and size. The need to automate this task so that texts can be processed in a timely and adequate manner has led to the emergence of automatic keyword extraction tools. Despite the advances, there is a clear lack of multilingual online tools to automatically extract keywords from single documents. In this paper, we present Yake!, a novel feature-based system for multi-lingual keyword extraction, which supports texts of different sizes, domain or languages. Unlike most of the systems, Yake! does not rely on dictionaries nor thesauri, neither is trained against any corpora. Instead, we follow an unsupervised approach which builds upon features extracted from the text, making it thus applicable to documents written in different languages without the need for further knowledge. This can be beneficial for a large number of tasks and a plethora of situations where the access to training corpora is either limited or restricted. In this demo, we offer an easy to use, interactive session, where users from both academia and industry can try our system, either by using a sample document or by introducing their own text. As an add-on, we compare our extracted keywords against the output produced by the IBM Natural Language Understanding and Rake system. This will enable users to understand the distinctions between the three approaches.

 

开源地址:https://boiling-castle-88317.herokuapp.com/

paper:A Text Feature Based Automatic Keyword Extraction Method for Single Documents

关键词提取新方法-YAKE! Collection-independent Automatic Keyword Extractor

标签:tween   line   ber   produce   rom   sam   form   源地址   rpo   

原文地址:https://www.cnblogs.com/demo-deng/p/13215615.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!