Publication Type:

Conference Paper

Source:

Second International Symposium on Emerging topics in Computing and Communication, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE, Jaipur, India (2016)

ISBN:

9781509020294

URL:

https://ieeexplore.ieee.org/document/7732268

Keywords:

Algorithm design and analysis, class specific document, Classification algorithms, Data mining, data mining classification, Dictionaries, dictionary discriminatory strength, Dictionary learning, dictionary learning algorithm, dictionary-based approach, Document Classification, Document collection, document handling, document representation, Image coding, K-LDA, K-SVD, LDA, learning (artificial intelligence), Linear discriminant analysis, Matching pursuit algorithms, OMP, Pattern classification, Signal processing algorithms, Singular value decomposition, singular value decomposition updation, Sparse coding, Sparse matrices, supervised learning method, SVD updation

Abstract:

Over the past few years there are many developments in the area of classification in data mining. Classification is a supervised learning method, that maps data into predefined groups or classes. Nowadays classification techniques are extensively used in different applications. In this area most of the research works are done on text, image, signal etc. The main goal of this paper is to use a dictionary-based approach to learn, represent and classify documents. We consider dictionary as a collection of documents and each document in the dictionary is represented as a collection of vectors. An algorithm is also implemented to easily locate a class specific document in the dictionary and if it is not present, update the dictionary. The existing method is based on a dictionary learning algorithm which only improves the document representation based on Singular Value Decomposition (SVD) updation. Since SVD will not be helpful for discrimination of data, so our proposed algorithm is Linear Discriminant Analysis (LDA) for learning a discriminating dictionary. On applying the proposed algorithm on well known dataset, the overall results obtained shows 90% improvement in accuracy. The advantage is that it can be used for both representation as well as classification.

Cite this Research Publication

Remya Rajesh, Kini, N. V., and Krishnan, G. A., “Harnessing the Discriminatory Strength of Dictionaries”, in Second International Symposium on Emerging topics in Computing and Communication, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, India, 2016.