Text Mining in Keywords Extraction

From REU@MU
Revision as of 16:22, 3 June 2016 by Phucnguyen (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Project Description and Goal

Text mining or text analysis refers to the use of computational techniques to discover new and unknown information from unstructured textual resources. Within text mining, keywords extraction is one of the most important tasks that automatically identifies and retrieves the most relevant information from unstructured texts. Despite being commonly used in search engines to locate information, appropriate keywords are difficult to generate since the process is time-consuming for humans amid the massive amount of information available nowadays. Thus, many traditional methods have been used over years and new solutions are constantly proposed to tackle this problem. Examples of some prevalent algorithms or models are TF-IDF (Term Frequency-Inverse Document Frequency), RAKE (Rapid Automatic Keywords Extraction), or TextRank as well as some less popular ones such as using lexical chains or using Bayes classifier.