Enhancing multimedia imbalanced concept detection using VIMP in Random Forests

Saad Sadiq, Yilin Yan, Mei-Ling Shyu, Shu Ching Chen, Hemant Ishwaran

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

Recent developments in social media and cloud storage lead to an exponential growth in the amount of multimedia data, which increases the complexity of managing, storing, indexing, and retrieving information from such big data. Many current content-based concept detection approaches lag from successfully bridging the semantic gap. To solve this problem, a multi-stage random forest framework is proposed to generate predictor variables based on multivariate regressions using variable importance (VIMP). By fine tuning the forests and significantly reducing the predictor variables, the concept detection scores are evaluated when the concept of interest is rare and imbalanced, i.e., having little collaboration with other high level concepts. Using classical multivariate statistics, estimating the value of one coordinate using other coordinates standardizes the covariates and it depends upon the variance of the correlations instead of the mean. Thus, conditional dependence on the data being normally distributed is eliminated. Experimental results demonstrate that the proposed framework outperforms those approaches in the comparison in terms of the Mean Average Precision (MAP) values.

Original languageEnglish (US)
Title of host publicationProceedings - 2016 IEEE 17th International Conference on Information Reuse and Integration, IRI 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages601-608
Number of pages8
ISBN (Electronic)9781509032075
DOIs
StatePublished - 2016
Event17th IEEE International Conference on Information Reuse and Integration, IRI 2016 - Pittsburgh, United States
Duration: Jul 28 2016Jul 30 2016

Other

Other17th IEEE International Conference on Information Reuse and Integration, IRI 2016
CountryUnited States
CityPittsburgh
Period7/28/167/30/16

    Fingerprint

Keywords

  • Multimedia imbalanced concept detection
  • Multivariate regression
  • Random forests
  • Variable importance (VIMP)

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management

Cite this

Sadiq, S., Yan, Y., Shyu, M-L., Chen, S. C., & Ishwaran, H. (2016). Enhancing multimedia imbalanced concept detection using VIMP in Random Forests. In Proceedings - 2016 IEEE 17th International Conference on Information Reuse and Integration, IRI 2016 (pp. 601-608). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IRI.2016.87