Enhancing rare class mining in multimedia big data by concept correlation

Yilin Yan, Mei-Ling Shyu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

The development in information science has enabled an explosive growth of data, which attracts more and more researchers to engage in the field of big data analytics. Noticeably, in many real-world applications, large amounts of data are imbalanced data since the events of interests occur infrequently. However, the detection of these events is such an important research problem and has attracted significant research efforts as lots of real-world big data sets have skewed class distributions. Despite extensive research efforts, rare class mining remains one of the most challenging problems in information science, especially for multimedia big data. Though inter-concept correlations have been utilized to address this issue recently, the very small number of instances in the minority class often lead to the detection of imprecise correlations and unsatisfactory classification results. This paper proposes a novel concept correlation analysis strategy framework using the correlations between the retrieval scores and labels. By integrating the correlation information, the proposed framework can help imbalance data classification and enhance rare class (or concept) mining even with trivial scores from the minority class. Experimental results on the TRECVID multimedia big benchmark data set demonstrate the effectiveness of the proposed framework with promising performance.

Original languageEnglish (US)
Title of host publicationProceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages281-286
Number of pages6
ISBN (Electronic)9781509045709
DOIs
StatePublished - Jan 18 2017
Event18th IEEE International Symposium on Multimedia, ISM 2016 - San Jose, United States
Duration: Dec 11 2016Dec 13 2016

Other

Other18th IEEE International Symposium on Multimedia, ISM 2016
CountryUnited States
CitySan Jose
Period12/11/1612/13/16

Fingerprint

Information science
Labels
Big data

Keywords

  • Concept correlation
  • Imbalanced data
  • Information integration
  • Multimedia big data
  • Rare class mining

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Media Technology
  • Computer Science Applications

Cite this

Yan, Y., & Shyu, M-L. (2017). Enhancing rare class mining in multimedia big data by concept correlation. In Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016 (pp. 281-286). [7823629] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ISM.2016.75

Enhancing rare class mining in multimedia big data by concept correlation. / Yan, Yilin; Shyu, Mei-Ling.

Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016. Institute of Electrical and Electronics Engineers Inc., 2017. p. 281-286 7823629.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yan, Y & Shyu, M-L 2017, Enhancing rare class mining in multimedia big data by concept correlation. in Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016., 7823629, Institute of Electrical and Electronics Engineers Inc., pp. 281-286, 18th IEEE International Symposium on Multimedia, ISM 2016, San Jose, United States, 12/11/16. https://doi.org/10.1109/ISM.2016.75
Yan Y, Shyu M-L. Enhancing rare class mining in multimedia big data by concept correlation. In Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016. Institute of Electrical and Electronics Engineers Inc. 2017. p. 281-286. 7823629 https://doi.org/10.1109/ISM.2016.75
Yan, Yilin ; Shyu, Mei-Ling. / Enhancing rare class mining in multimedia big data by concept correlation. Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 281-286
@inproceedings{a8a5d23ae35f479b8eb5892880ac01e3,
title = "Enhancing rare class mining in multimedia big data by concept correlation",
abstract = "The development in information science has enabled an explosive growth of data, which attracts more and more researchers to engage in the field of big data analytics. Noticeably, in many real-world applications, large amounts of data are imbalanced data since the events of interests occur infrequently. However, the detection of these events is such an important research problem and has attracted significant research efforts as lots of real-world big data sets have skewed class distributions. Despite extensive research efforts, rare class mining remains one of the most challenging problems in information science, especially for multimedia big data. Though inter-concept correlations have been utilized to address this issue recently, the very small number of instances in the minority class often lead to the detection of imprecise correlations and unsatisfactory classification results. This paper proposes a novel concept correlation analysis strategy framework using the correlations between the retrieval scores and labels. By integrating the correlation information, the proposed framework can help imbalance data classification and enhance rare class (or concept) mining even with trivial scores from the minority class. Experimental results on the TRECVID multimedia big benchmark data set demonstrate the effectiveness of the proposed framework with promising performance.",
keywords = "Concept correlation, Imbalanced data, Information integration, Multimedia big data, Rare class mining",
author = "Yilin Yan and Mei-Ling Shyu",
year = "2017",
month = "1",
day = "18",
doi = "10.1109/ISM.2016.75",
language = "English (US)",
pages = "281--286",
booktitle = "Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Enhancing rare class mining in multimedia big data by concept correlation

AU - Yan, Yilin

AU - Shyu, Mei-Ling

PY - 2017/1/18

Y1 - 2017/1/18

N2 - The development in information science has enabled an explosive growth of data, which attracts more and more researchers to engage in the field of big data analytics. Noticeably, in many real-world applications, large amounts of data are imbalanced data since the events of interests occur infrequently. However, the detection of these events is such an important research problem and has attracted significant research efforts as lots of real-world big data sets have skewed class distributions. Despite extensive research efforts, rare class mining remains one of the most challenging problems in information science, especially for multimedia big data. Though inter-concept correlations have been utilized to address this issue recently, the very small number of instances in the minority class often lead to the detection of imprecise correlations and unsatisfactory classification results. This paper proposes a novel concept correlation analysis strategy framework using the correlations between the retrieval scores and labels. By integrating the correlation information, the proposed framework can help imbalance data classification and enhance rare class (or concept) mining even with trivial scores from the minority class. Experimental results on the TRECVID multimedia big benchmark data set demonstrate the effectiveness of the proposed framework with promising performance.

AB - The development in information science has enabled an explosive growth of data, which attracts more and more researchers to engage in the field of big data analytics. Noticeably, in many real-world applications, large amounts of data are imbalanced data since the events of interests occur infrequently. However, the detection of these events is such an important research problem and has attracted significant research efforts as lots of real-world big data sets have skewed class distributions. Despite extensive research efforts, rare class mining remains one of the most challenging problems in information science, especially for multimedia big data. Though inter-concept correlations have been utilized to address this issue recently, the very small number of instances in the minority class often lead to the detection of imprecise correlations and unsatisfactory classification results. This paper proposes a novel concept correlation analysis strategy framework using the correlations between the retrieval scores and labels. By integrating the correlation information, the proposed framework can help imbalance data classification and enhance rare class (or concept) mining even with trivial scores from the minority class. Experimental results on the TRECVID multimedia big benchmark data set demonstrate the effectiveness of the proposed framework with promising performance.

KW - Concept correlation

KW - Imbalanced data

KW - Information integration

KW - Multimedia big data

KW - Rare class mining

UR - http://www.scopus.com/inward/record.url?scp=85015186015&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85015186015&partnerID=8YFLogxK

U2 - 10.1109/ISM.2016.75

DO - 10.1109/ISM.2016.75

M3 - Conference contribution

AN - SCOPUS:85015186015

SP - 281

EP - 286

BT - Proceedings - 2016 IEEE International Symposium on Multimedia, ISM 2016

PB - Institute of Electrical and Electronics Engineers Inc.

ER -