Topic Highlight
Copyright ©2014 Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Oct 7, 2014; 20(37): 13325-13342
Published online Oct 7, 2014. doi: 10.3748/wjg.v20.i37.13325
Biomarkers for pancreatic cancer: Recent achievements in proteomics and genomics through classical and multivariate statistical methods
Emilio Marengo, Elisa Robotti
Emilio Marengo, Elisa Robotti, Department of Sciences and Technological Innovation, University of Piemonte Orientale, 15121 Alessandria, Italy
Author contributions: All authors contributed equally to the drafting of the manuscript.
Correspondence to: Emilio Marengo, Professor, Department of Sciences and Technological Innovation, University of Piemonte Orientale, Viale Michel 11, 15121 Alessandria, Italy. marengoe@tin.it
Telephone: +39-0131-360259 Fax: +39-0131-360250
Received: October 28, 2013
Revised: June 4, 2014
Accepted: June 26, 2014
Published online: October 7, 2014
Abstract

Pancreatic cancer (PC) is one of the most aggressive and lethal neoplastic diseases. A valid alternative to the usual invasive diagnostic tools would certainly be the determination of biomarkers in peripheral fluids to provide less invasive tools for early diagnosis. Nowadays, biomarkers are generally investigated mainly in peripheral blood and tissues through high-throughput omics techniques comparing control vs pathological samples. The results can be evaluated by two main strategies: (1) classical methods in which the identification of significant biomarkers is accomplished by monovariate statistical tests where each biomarker is considered as independent from the others; and (2) multivariate methods, taking into consideration the correlations existing among the biomarkers themselves. This last approach is very powerful since it allows the identification of pools of biomarkers with diagnostic and prognostic performances which are superior to single markers in terms of sensitivity, specificity and robustness. Multivariate techniques are usually applied with variable selection procedures to provide a restricted set of biomarkers with the best predictive ability; however, standard selection methods are usually aimed at the identification of the smallest set of variables with the best predictive ability and exhaustivity is usually neglected. The exhaustive search for biomarkers is instead an important alternative to standard variable selection since it can provide information about the etiology of the pathology by producing a comprehensive set of markers. In this review, the most recent applications of the omics techniques (proteomics, genomics and metabolomics) to the identification of exploratory biomarkers for PC will be presented with particular regard to the statistical methods adopted for their identification. The basic theory related to classical and multivariate methods for identification of biomarkers is presented and then, the most recent applications in this field are discussed.

Keywords: Pancreatic cancer, Biomarker identification, Multivariate analysis, Principal component analysis, Ranking principal component analysis

Core tip: Biomarkers are statistically identified as significant by: (1) classical statistical tests where each biomarker is independent from the others; and (2) multivariate methods that take into consideration the correlation among the biomarkers. This last approach provides pools of biomarkers with superior diagnostic and prognostic performances. Multivariate techniques are often applied with variable selection procedures to provide the smallest set of biomarkers with the best predictive ability. The exhaustive identification is instead a valid alternative since it can provide comprehensive information about the etiology of the pathology. The most recent applications of the omics approaches to the identification of biomarkers for PC are presented, with particular regard to the statistical methods adopted.