Deep Learning for Imbalanced Multimedia Data Classification

Yilin Yan, Min Chen, Mei Ling Shyu, Shu Ching Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

77 Scopus citations


Classification of imbalanced data is an important research problem as lots of real-world data sets have skewed class distributions in which the majority of data instances (examples) belong to one class and far fewer instances belong to others. While in many applications, the minority instances actually represent the concept of interest (e.g., fraud in banking operations, abnormal cell in medical data, etc.), a classifier induced from an imbalanced data set is more likely to be biased towards the majority class and show very poor classification accuracy on the minority class. Despite extensive research efforts, imbalanced data classification remains one of the most challenging problems in data mining and machine learning, especially for multimedia data. To tackle this challenge, in this paper, we propose an extended deep learning approach to achieve promising performance in classifying skewed multimedia data sets. Specifically, we investigate the integration of bootstrapping methods and a state-of-the-art deep learning approach, Convolutional Neural Networks (CNNs), with extensive empirical studies. Considering the fact that deep learning approaches such as CNNs are usually computationally expensive, we propose to feed low-level features to CNNs and prove its feasibility in achieving promising performance while saving a lot of training time. The experimental results show the effectiveness of our framework in classifying severely imbalanced data in the TRECVID data set.

Original languageEnglish (US)
Title of host publicationProceedings - 2015 IEEE International Symposium on Multimedia, ISM 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages6
ISBN (Electronic)9781509003792
StatePublished - Mar 25 2016
Event17th IEEE International Symposium on Multimedia, ISM 2015 - Miami, United States
Duration: Dec 14 2015Dec 16 2015

Publication series

NameProceedings - 2015 IEEE International Symposium on Multimedia, ISM 2015


Other17th IEEE International Symposium on Multimedia, ISM 2015
Country/TerritoryUnited States


  • classification
  • convolutional neural network (CNN)
  • deep learning
  • imbalanced data
  • semantic indexing

ASJC Scopus subject areas

  • Computer Science Applications
  • Hardware and Architecture
  • Software
  • Computer Networks and Communications


Dive into the research topics of 'Deep Learning for Imbalanced Multimedia Data Classification'. Together they form a unique fingerprint.

Cite this