Supporting Semantic Concept Retrieval with Negative Correlations in a Multimedia Big Data Mining System

Yilin Yan, Mei Ling Shyu, Qiusha Zhu

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

With the extensive use of smart devices and blooming popularity of social media websites such as Flickr, YouTube, Twitter, and Facebook, we have witnessed an explosion of multimedia data. The amount of data nowadays is formidable without effective big data technologies. It is well-acknowledged that multimedia high-level semantic concept mining and retrieval has become an important research topic; while the semantic gap (i.e., the gap between the low-level features and high-level concepts) makes it even more challenging. To address these challenges, it requires the joint research efforts from both big data mining and multimedia areas. In particular, the correlations among the classes can provide important context cues to help bridge the semantic gap. However, correlation discovery is computationally expensive due to the huge amount of data. In this paper, a novel multimedia big data mining system based on the MapReduce framework is proposed to discover negative correlations for semantic concept mining and retrieval. Furthermore, the proposed multimedia big data mining system consists of a big data processing platform with Mesos for efficient resource management and with Cassandra for handling data across multiple data centers. Experimental results on the TRECVID benchmark datasets demonstrate the feasibility and the effectiveness of the proposed multimedia big data mining system with negative correlation discovery for semantic concept mining and retrieval.

Original languageEnglish (US)
Pages (from-to)247-267
Number of pages21
JournalInternational Journal of Semantic Computing
Volume10
Issue number2
DOIs
StatePublished - Jun 1 2016

Keywords

  • Big data
  • Cassandra
  • Hadoop
  • information integration
  • MapReduce
  • Mesos
  • multimedia semantic mining and retrieval
  • negative correlation
  • Spark

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Linguistics and Language
  • Computer Science Applications
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Supporting Semantic Concept Retrieval with Negative Correlations in a Multimedia Big Data Mining System'. Together they form a unique fingerprint.

Cite this