CANEapp: A user-friendly application for automated next generation transcriptomic data analysis

Dmitry Velmeshev, Patrick Lally, Marco Magistri, Mohammad Ali Faghihi

Research output: Contribution to journalArticle

6 Scopus citations

Abstract

Background: Next generation sequencing (NGS) technologies are indispensable for molecular biology research, but data analysis represents the bottleneck in their application. Users need to be familiar with computer terminal commands, the Linux environment, and various software tools and scripts. Analysis workflows have to be optimized and experimentally validated to extract biologically meaningful data. Moreover, as larger datasets are being generated, their analysis requires use of high-performance servers. Results: To address these needs, we developed CANEapp (application for Comprehensive automated Analysis of Next-generation sequencing Experiments), a unique suite that combines a Graphical User Interface (GUI) and an automated server-side analysis pipeline that is platform-independent, making it suitable for any server architecture. The GUI runs on a PC or Mac and seamlessly connects to the server to provide full GUI control of RNA-sequencing (RNA-seq) project analysis. The server-side analysis pipeline contains a framework that is implemented on a Linux server through completely automated installation of software components and reference files. Analysis with CANEapp is also fully automated and performs differential gene expression analysis and novel noncoding RNA discovery through alternative workflows (Cuffdiff and R packages edgeR and DESeq2). We compared CANEapp to other similar tools, and it significantly improves on previous developments. We experimentally validated CANEapp's performance by applying it to data derived from different experimental paradigms and confirming the results with quantitative real-time PCR (qRT-PCR). CANEapp adapts to any server architecture by effectively using available resources and thus handles large amounts of data efficiently. CANEapp performance has been experimentally validated on various biological datasets. CANEapp is available free of charge at http://psychiatry.med.miami.edu/research/laboratory-of-translational-rna-genomics/CANE-app. Conclusions: We believe that CANEapp will serve both biologists with no computational experience and bioinformaticians as a simple, timesaving but accurate and powerful tool to analyze large RNA-seq datasets and will provide foundations for future development of integrated and automated high-throughput genomics data analysis tools. Due to its inherently standardized pipeline and combination of automated analysis and platform-independence, CANEapp is an ideal for large-scale collaborative RNA-seq projects between different institutions and research groups.

Original languageEnglish (US)
Article number49
JournalBMC genomics
Volume17
Issue number1
DOIs
StatePublished - Jan 13 2016

Keywords

  • Automated pipeline
  • Differential gene expression
  • Graphical user interface
  • Long noncoding RNAs
  • Platform-independent
  • RNA sequencing
  • User-friendly application

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Fingerprint Dive into the research topics of 'CANEapp: A user-friendly application for automated next generation transcriptomic data analysis'. Together they form a unique fingerprint.

  • Cite this