Induction from multi-label examples

Hind Hazza Alsharif, Wadee Saleh Alhalabi, Miroslav Kubat

Research output: Contribution to journalArticlepeer-review

Abstract

The task of text categorization is to assign one or more classes to a document. The simplest machine learning approach to such domains, simply induces a binary classifier separately for each class, and then uses these classifiers in parallel. An example of motivating application is a digital library collection that used to be classified into classes and sub-classes in a hierarchical order. Another important issue that we are considering is the document might belong to more than one class, in this case we will be working on a high performance multi-class label classifier. The study we are intending to do herein is going to show how much we can gain from machine learning. This mean, if we need something like 10 to 15% of the data for training, and testing or do we need > 50% of the data set for training and testing. In the latter case, the machine learning may don't contribute that much. However, if 10 to 15% of the data set is needed, then, machine learning has a great contribution. The last issue we are working on in this research is the inter-class relation. Which means, if the example is classified to belong to a class C, does this mean, the example belong to parents and grandparents classes of the class C, and on the opposite way too? We will use a framework to classify documents automatically and this can indeed answer these questions.

Original languageEnglish (US)
Article number67
Pages (from-to)495-511
Number of pages17
JournalLife Science Journal
Volume11
Issue number10
StatePublished - Jan 1 2014

Keywords

  • Induction process
  • Inter-class relation
  • KNN algorithms
  • Multi-label classifiers
  • Naïve Bays algorithms
  • Text categorization

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)

Fingerprint Dive into the research topics of 'Induction from multi-label examples'. Together they form a unique fingerprint.

Cite this