PREPARATORY DOCUMENT STRUCTURING TECHNIQUE

Date
2020
Publisher
International Journal of Psychosocial Rehabilitation, Vol.24, Issue 02
Abstract
The need for mining structured data has increased in the past few years. This structured data is used as input for data mining tasks. Text mining is the part of data mining in which the data takes the form of unstructured text. Text mining can handle unstructured or semi-structured data sets such as emails, HTML files, and full-text documents. Unstructured data usually refers to information that does not reside in a traditional row-column database; it is the opposite of structured data. To extract information from text, preprocessing steps are needed. This paper discusses the theoretical basis of document preprocessing for text mining. Brief descriptions of some representative approaches, such as NLP tasks and information extraction, are provided as well.
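As a rough illustration of what such preprocessing might look like, the sketch below turns raw text into one row of term counts per document. It is not the paper's method: the function names `preprocess` and `to_term_counts`, the tiny stop-word list, and the sample documents are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; real lists are much longer.
STOP_WORDS = {"a", "an", "and", "is", "in", "of", "the", "to"}


def preprocess(text):
    """Strip HTML-style tags, lowercase, tokenize, and drop stop words."""
    text = re.sub(r"<[^>]+>", " ", text)             # remove markup such as <p>
    tokens = re.findall(r"[a-z0-9]+", text.lower())  # simple alphanumeric tokens
    return [t for t in tokens if t not in STOP_WORDS]


def to_term_counts(documents):
    """Map each raw document to a row of term counts (a structured form)."""
    return [Counter(preprocess(doc)) for doc in documents]


# Illustrative input only: one semi-structured (HTML) and one plain-text document.
docs = [
    "<p>Text mining is part of data mining.</p>",
    "The email is unstructured text.",
]
rows = to_term_counts(docs)
```

Each entry of `rows` can then be treated like a record in a row-column table, which is the structured input that downstream data mining tasks expect.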
Keywords
text mining, document structuring, information extraction