Survey on Pre-Processing Techniques for Text Mining

Arjun Srinivas Nayak, Ananthu P Kanive, Naveen Chandavekar, Dr. Balasubramani R

Vol 5, Issue 6 (2016) · Published: Jan 01, 2018

Abstract

Data mining is a versatile subfield of computer science. It is the computational process of detecting patterns in large data sets. This paper gives an overview of the different pre-processing techniques used to mine text data. Text mining applications include information retrieval, information extraction, categorization, and natural language processing. Pre-processing for text mining starts with tokenization, followed by stop-word removal, and ends with stemming. This paper evaluates Porter's and Krovetz's stemming algorithms, highlighting their applications and drawbacks.
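
The three pre-processing stages named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names, the regular expression used for tokenization, and the tiny stop-word list are assumptions chosen for illustration. The stemming stage implements only Step 1a of Porter's algorithm (plural suffix handling), not the full stemmer, and it does not attempt Krovetz's dictionary-based approach.

```python
import re

# Hypothetical, deliberately tiny stop-word list; real systems use much larger ones.
STOP_WORDS = {"a", "an", "and", "are", "the", "is", "of", "in", "to", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def porter_step_1a(word):
    """Apply only Step 1a of Porter's algorithm (plural suffixes).

    Rules, tried in order: SSES -> SS, IES -> I, SS -> SS, S -> (removed).
    Words of one or two letters are left unchanged, a convention
    followed by common implementations.
    """
    if len(word) <= 2:
        return word
    if word.endswith("sses"):
        return word[:-2]   # caresses -> caress
    if word.endswith("ies"):
        return word[:-2]   # ponies -> poni
    if word.endswith("ss"):
        return word        # boss -> boss
    if word.endswith("s"):
        return word[:-1]   # cats -> cat
    return word


def preprocess(text):
    """Tokenize, remove stop words, then stem each remaining token."""
    return [porter_step_1a(t) for t in remove_stop_words(tokenize(text))]


print(preprocess("The ponies and the cats are in the caresses of the boss."))
```

The order of the stages mirrors the abstract: stop words are removed before stemming so that the stemmer only processes tokens that will be kept. A stem such as "poni" is not a dictionary word, which illustrates one commonly cited drawback of suffix-stripping stemmers.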

This is an open access article under the terms of the CC BY license.
