Supplementary Vignette 1 from Building Tools for Machine Learning and Artificial Intelligence in Cancer Research: Best Practices and a Case Study with the PathML Toolkit for Computational Pathology
posted on 2023-12-15, 14:20authored byJacob Rosenthal, Ryan Carelli, Mohamed Omar, David Brundage, Ella Halbert, Jackson Nyman, Surya N. Hari, Eliezer M. Van Allen, Luigi Marchionni, Renato Umeton, Massimo Loda
In this vignette, we use 15 publicly available images of a wide range of imaging modalities and file formats to demonstrate how PathML supports loading all of them under a simple, standardized syntax.
Funding
NIH
NCI
DoD
History
ARTICLE ABSTRACT
Imaging datasets in cancer research are growing exponentially in both quantity and information density. These massive datasets may enable derivation of insights for cancer research and clinical care, but only if researchers are equipped with the tools to leverage advanced computational analysis approaches such as machine learning and artificial intelligence. In this work, we highlight three themes to guide development of such computational tools: scalability, standardization, and ease of use. We then apply these principles to develop PathML, a general-purpose research toolkit for computational pathology. We describe the design of the PathML framework and demonstrate applications in diverse use cases. PathML is publicly available at www.pathml.com.