To realize data-driven reproducible cytometry analysis and accelerate the adoption of computational analysis methods by both translational and clinical researchers, the US National Center for Advancing Translational Sciences (NCATS) funded a 5-year project in 2016 to develop a web-based computational infrastructure – FlowGate – to improve the accessibility and usability of the cutting-edge data analytical approaches to cytometry data processing and analysis. The main goals include creating an advanced user interface to support both bioinformaticians and also wet-lab translational and clinical investigators, making an analysis environment that is extensible to new methods and scalable to data size, and providing education and training for the appropriate use of the resulting data analysis pipelines.
Design of the FlowGate architecture consists of a front-end user interface, a middle-tier management of data processing and analysis pipelines, and a back-end parallel computing environment. The back-end infrastructure is built upon a large cluster computer at San Diego Supercomputer Center - Comet, which has 1,944 compute nodes with 24 cores/28G DRAM/320G SSD memory on each node. Managed by the GenePattern (GP) platform, computational modules and pipelines on FlowGate can be easily expanded, with data/analysis provenance and software versioning. A GP Notebook environment, leveraging the Jupyter Notebook platform, is implemented in FlowGate to support interactive analytics, which is integrated with the front-end Java applications for customizable generation and download of data analysis results. Result visualization and statistical tables are included in a single HTML page for both on-line browsing and off-line sharing. Docker/Singularity containers are used to disseminate FlowGate software components for deployment on different computing platforms.
Compared with other flow data analysis tools or platforms, the main features of FlowGate include:
The development team of FlowGate includes:
External Advisory Board Members (alphabetical order):
Professor, Medical Genetics, Faculty of Medicine, UBC; Associate Faculty Member, Bioinformatics Centre, UBC; CEO, Cytapex Bioinformatics, Inc. Applying bioinformatics techniques (e.g., flowCore/flowUtils/flowClust) to flow cytometry data. Also involved in the development of flow cytometry ontologies and minimum information standards.
Professor of Genetics; Chair of Flow Cytometry and Genetics. Translational researcher interested in basic mechanisms that determine and regulate gene expression and cell function in the immune system. Developed new, user friendly and statistically reliable software to facilitate flow data analysis.
Director of Hematopathology and Medical Director of SCCA pathology laboratories. He was responsible for implementing the first use of 9- and 10-color flow cytometry in the clinical laboratory, as well as subsequently exploiting its potential for the identification of minimal residual disease in acute lymphoid and myeloid leukemias.
Professor of Medicine and Medical Humanities in the Division of Nephrology; Co-Director of the Clinical and Translational Science Institute; Director of the Rochester Center for Health Informatics. Research is focused on understanding how B cells respond to vaccines and organ transplants to produce antibodies, and understanding how to improve population health and healthcare delivery, using high dimensional clustering methods, graph theory, differential equation and stochastic branching process modeling.
Funding Agency: National Institutes of Health/National Center for Advancing Translational Sciences
Grant Number: U01TR001801
Funding Period: September 2016 to June 2021