International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Issue 2
Published: April 2014
Authors: Tanu Verma, Renu, Deepti Gaur
Tanu Verma, Renu, Deepti Gaur. Tokenization and Filtering Process in RapidMiner. International Journal of Applied Information Systems. 7, 2 (April 2014), 16-18. DOI=10.5120/ijais14-451139
@article{ 10.5120/ijais14-451139, author = { Tanu Verma and Renu and Deepti Gaur }, title = { Tokenization and Filtering Process in RapidMiner }, journal = { International Journal of Applied Information Systems }, year = { 2014 }, volume = { 7 }, number = { 2 }, pages = { 16-18 }, doi = { 10.5120/ijais14-451139 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2014 %A Tanu Verma %A Renu %A Deepti Gaur %T Tokenization and Filtering Process in RapidMiner %J International Journal of Applied Information Systems %V 7 %N 2 %P 16-18 %R 10.5120/ijais14-451139 %I Foundation of Computer Science (FCS), NY, USA
Text mining is a knowledge-intensive process in which a user interacts with a document collection. As in data mining [2,4,9], text mining seeks to extract useful information from data sources through the identification and exploration of interesting patterns. A key element of text mining is its focus on the document collection, which can be any grouping of text-based documents. Most text mining solutions aim to discover patterns across very large document collections, ranging from many thousands to millions of documents. This paper shows how text mining is implemented in RapidMiner.