netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

netDx : Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]. / Pai, Shraddha; Weber, Philipp; Isserlin, Ruth; Kaka, Hussam; Hui, Shirley; Shah, Muhammad Ahmad; Giudice, Luca; Giugno, Rosalba; Nøhr, Anne Krogh; Baumbach, Jan; Bader, Gary D.

In: F1000Research, Vol. 9, 1239, 2021.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Pai, S, Weber, P, Isserlin, R, Kaka, H, Hui, S, Shah, MA, Giudice, L, Giugno, R, Nøhr, AK, Baumbach, J & Bader, GD 2021, 'netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]', F1000Research, vol. 9, 1239. https://doi.org/10.12688/f1000research.26429.2

APA

Pai, S., Weber, P., Isserlin, R., Kaka, H., Hui, S., Shah, M. A., Giudice, L., Giugno, R., Nøhr, A. K., Baumbach, J., & Bader, G. D. (2021). netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]. F1000Research, 9, [1239]. https://doi.org/10.12688/f1000research.26429.2

Vancouver

Pai S, Weber P, Isserlin R, Kaka H, Hui S, Shah MA et al. netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]. F1000Research. 2021;9. 1239. https://doi.org/10.12688/f1000research.26429.2

Author

Pai, Shraddha ; Weber, Philipp ; Isserlin, Ruth ; Kaka, Hussam ; Hui, Shirley ; Shah, Muhammad Ahmad ; Giudice, Luca ; Giugno, Rosalba ; Nøhr, Anne Krogh ; Baumbach, Jan ; Bader, Gary D. / netDx : Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]. In: F1000Research. 2021 ; Vol. 9.

Bibtex

@article{76dba4b262e84d3b930a5b58e62a5059,
title = "netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]",
abstract = "Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning method to integrate multi-modal patient data and build a patient classifier. Patient data are converted into networks of patient similarity, which is intuitive to clinicians who also use patient similarity for medical diagnosis. Features passing selection are integrated, and new patients are assigned to the class with the greatest profile similarity. netDx has excellent performance, outperforming most machine-learning methods in binary cancer survival prediction. It handles missing data - a common problem in real-world data - without requiring imputation. netDx also has excellent interpretability, with native support to group genes into pathways for mechanistic insight into predictive features. The netDx Bioconductor package provides multiple workflows for users to build custom patient classifiers. It provides turnkey functions for one-step predictor generation from multi-modal data, including feature selection over multiple train/test data splits. Workflows offer versatility with custom feature design, choice of similarity metric; speed is improved by parallel execution. Built-in functions and examples allow users to compute model performance metrics such as AUROC, AUPR, and accuracy. netDx uses RCy3 to visualize top-scoring pathways and the final integrated patient network in Cytoscape. Advanced users can build more complex predictor designs with functional building blocks used in the default design. Finally, the netDx Bioconductor package provides a novel workflow for pathway-based patient classification from sparse genetic data.",
keywords = "Classification, Data integration, Genomics, Networks, Precision medicine, Supervised learning",
author = "Shraddha Pai and Philipp Weber and Ruth Isserlin and Hussam Kaka and Shirley Hui and Shah, {Muhammad Ahmad} and Luca Giudice and Rosalba Giugno and N{\o}hr, {Anne Krogh} and Jan Baumbach and Bader, {Gary D.}",
note = "Publisher Copyright: {\textcopyright} 2021 Pai S et al.",
year = "2021",
doi = "10.12688/f1000research.26429.2",
language = "English",
volume = "9",
journal = "F1000Research",
issn = "2046-1402",
publisher = "F1000Research",

}

RIS

TY - JOUR

T1 - netDx

T2 - Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks [version 2; peer review: 2 approved]

AU - Pai, Shraddha

AU - Weber, Philipp

AU - Isserlin, Ruth

AU - Kaka, Hussam

AU - Hui, Shirley

AU - Shah, Muhammad Ahmad

AU - Giudice, Luca

AU - Giugno, Rosalba

AU - Nøhr, Anne Krogh

AU - Baumbach, Jan

AU - Bader, Gary D.

N1 - Publisher Copyright: © 2021 Pai S et al.

PY - 2021

Y1 - 2021

N2 - Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning method to integrate multi-modal patient data and build a patient classifier. Patient data are converted into networks of patient similarity, which is intuitive to clinicians who also use patient similarity for medical diagnosis. Features passing selection are integrated, and new patients are assigned to the class with the greatest profile similarity. netDx has excellent performance, outperforming most machine-learning methods in binary cancer survival prediction. It handles missing data - a common problem in real-world data - without requiring imputation. netDx also has excellent interpretability, with native support to group genes into pathways for mechanistic insight into predictive features. The netDx Bioconductor package provides multiple workflows for users to build custom patient classifiers. It provides turnkey functions for one-step predictor generation from multi-modal data, including feature selection over multiple train/test data splits. Workflows offer versatility with custom feature design, choice of similarity metric; speed is improved by parallel execution. Built-in functions and examples allow users to compute model performance metrics such as AUROC, AUPR, and accuracy. netDx uses RCy3 to visualize top-scoring pathways and the final integrated patient network in Cytoscape. Advanced users can build more complex predictor designs with functional building blocks used in the default design. Finally, the netDx Bioconductor package provides a novel workflow for pathway-based patient classification from sparse genetic data.

AB - Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning method to integrate multi-modal patient data and build a patient classifier. Patient data are converted into networks of patient similarity, which is intuitive to clinicians who also use patient similarity for medical diagnosis. Features passing selection are integrated, and new patients are assigned to the class with the greatest profile similarity. netDx has excellent performance, outperforming most machine-learning methods in binary cancer survival prediction. It handles missing data - a common problem in real-world data - without requiring imputation. netDx also has excellent interpretability, with native support to group genes into pathways for mechanistic insight into predictive features. The netDx Bioconductor package provides multiple workflows for users to build custom patient classifiers. It provides turnkey functions for one-step predictor generation from multi-modal data, including feature selection over multiple train/test data splits. Workflows offer versatility with custom feature design, choice of similarity metric; speed is improved by parallel execution. Built-in functions and examples allow users to compute model performance metrics such as AUROC, AUPR, and accuracy. netDx uses RCy3 to visualize top-scoring pathways and the final integrated patient network in Cytoscape. Advanced users can build more complex predictor designs with functional building blocks used in the default design. Finally, the netDx Bioconductor package provides a novel workflow for pathway-based patient classification from sparse genetic data.

KW - Classification

KW - Data integration

KW - Genomics

KW - Networks

KW - Precision medicine

KW - Supervised learning

U2 - 10.12688/f1000research.26429.2

DO - 10.12688/f1000research.26429.2

M3 - Journal article

C2 - 33628435

AN - SCOPUS:85100967261

VL - 9

JO - F1000Research

JF - F1000Research

SN - 2046-1402

M1 - 1239

ER -

ID: 262741420