Advanced search
1 file | 1.27 MB Add to list

Principal bicorrelation analysis: unraveling associations between three data sources

Federico Mattiello (UGent) , Olivier Thas (UGent) and Bie Verbist (UGent)
Author
Organization
Project
Abstract
In this article, we propose a statistical explorative method for data integration. It is developed in the context of early drug development for which it enables the detection of chemical substructures and the identification of genes that mediate their association with the bioactivity (BA). The core of the method is a sparse singular value decomposition for the identification of the gene set and a permutation-based method for the control of the false discovery rate. The method is illustrated using a real dataset, and its properties are empirically evaluated by means of a simulation study. Quantitative Structure Transcriptional Activity Relationship (QSTAR, www.qstar-consortium.org) is a new paradigm in early drug development that extends QSAR by not only considering data on the chemical structure of the compounds and on the compound-induced BA, but by simultaneously using transcriptomics data (gene expression). This approach enables, for example, the detection of chemical substructures that are associated with BA, while at the same time a gene set is correlated with both these substructures and the BA. Although causal associations cannot be formally concluded, these associations may suggest that the compounds act on the BA through a particular genomic pathway.
Keywords
multivariate analysis, Data integration, sparse singular value decomposition, GENE-EXPRESSION DATA, COMPONENTS

Downloads

  • (...).pdf
    • full text
    • |
    • UGent only
    • |
    • PDF
    • |
    • 1.27 MB

Citation

Please use this url to cite or link to this publication:

MLA
Mattiello, Federico, et al. “Principal Bicorrelation Analysis: Unraveling Associations between Three Data Sources.” JOURNAL OF BIOPHARMACEUTICAL STATISTICS, vol. 26, no. 3, 2015, pp. 534–51, doi:10.1080/10543406.2015.1052491.
APA
Mattiello, F., Thas, O., & Verbist, B. (2015). Principal bicorrelation analysis: unraveling associations between three data sources. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 26(3), 534–551. https://doi.org/10.1080/10543406.2015.1052491
Chicago author-date
Mattiello, Federico, Olivier Thas, and Bie Verbist. 2015. “Principal Bicorrelation Analysis: Unraveling Associations between Three Data Sources.” JOURNAL OF BIOPHARMACEUTICAL STATISTICS 26 (3): 534–51. https://doi.org/10.1080/10543406.2015.1052491.
Chicago author-date (all authors)
Mattiello, Federico, Olivier Thas, and Bie Verbist. 2015. “Principal Bicorrelation Analysis: Unraveling Associations between Three Data Sources.” JOURNAL OF BIOPHARMACEUTICAL STATISTICS 26 (3): 534–551. doi:10.1080/10543406.2015.1052491.
Vancouver
1.
Mattiello F, Thas O, Verbist B. Principal bicorrelation analysis: unraveling associations between three data sources. JOURNAL OF BIOPHARMACEUTICAL STATISTICS. 2015;26(3):534–51.
IEEE
[1]
F. Mattiello, O. Thas, and B. Verbist, “Principal bicorrelation analysis: unraveling associations between three data sources,” JOURNAL OF BIOPHARMACEUTICAL STATISTICS, vol. 26, no. 3, pp. 534–551, 2015.
@article{6845065,
  abstract     = {{In this article, we propose a statistical explorative method for data integration. It is developed in the context of early drug development for which it enables the detection of chemical substructures and the identification of genes that mediate their association with the bioactivity (BA). The core of the method is a sparse singular value decomposition for the identification of the gene set and a permutation-based method for the control of the false discovery rate. The method is illustrated using a real dataset, and its properties are empirically evaluated by means of a simulation study. Quantitative Structure Transcriptional Activity Relationship (QSTAR, www.qstar-consortium.org) is a new paradigm in early drug development that extends QSAR by not only considering data on the chemical structure of the compounds and on the compound-induced BA, but by simultaneously using transcriptomics data (gene expression). This approach enables, for example, the detection of chemical substructures that are associated with BA, while at the same time a gene set is correlated with both these substructures and the BA. Although causal associations cannot be formally concluded, these associations may suggest that the compounds act on the BA through a particular genomic pathway.}},
  author       = {{Mattiello, Federico and Thas, Olivier and Verbist, Bie}},
  issn         = {{1054-3406}},
  journal      = {{JOURNAL OF BIOPHARMACEUTICAL STATISTICS}},
  keywords     = {{multivariate analysis,Data integration,sparse singular value decomposition,GENE-EXPRESSION DATA,COMPONENTS}},
  language     = {{eng}},
  number       = {{3}},
  pages        = {{534--551}},
  title        = {{Principal bicorrelation analysis: unraveling associations between three data sources}},
  url          = {{http://doi.org/10.1080/10543406.2015.1052491}},
  volume       = {{26}},
  year         = {{2015}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: