Text Mining Infrastructure in R

Meyer, David and Hornik, Kurt and Feinerer, Ingo (2008) Text Mining Infrastructure in R. Journal of Statistical Software, 25 (5). pp. 1-54. ISSN 1548-7660

Available under License Creative Commons Attribution Austria.

Abstract

During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package, which provides a framework for text mining applications within R. We give a survey of text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classification, and string kernels. (authors' abstract)

Item Type: Article
Additional Information: Article contains supplementary files. See http://dx.doi.org/10.18637/jss.v025.i05
Keywords: text mining / R / count-based evaluation / text clustering / text classification / string kernels
Divisions: Departments > Finance, Accounting and Statistics > Statistics and Mathematics > Hornik
Version of the Document: Published
Variance from Published Version: None
Depositing User: ePub Administrator
Date Deposited: 08 Oct 2013 10:53
Last Modified: 14 Apr 2016 14:50
Related URLs:
FIDES Link: https://bach.wu.ac.at/d/research/results/43299/
URI: http://epub.wu.ac.at/id/eprint/3978