Machine-learning for biopharmaceutical batch process monitoring with limited data

Abstract Commercial biopharmaceutical manufacturing comprises of multiple distinct processing steps that require effective and efficient monitoring of many variables simultaneously in real-time. This article addresses the problem of real-time statistical batch process monitoring (BPM) for biopharmaceutical processes with limited production history; herein, referred to as the ‘Low-N’ problem. In this article, we propose an approach to transition from a Low-N scenario to a Large-N scenario by generating an arbitrarily large number of in silico batch data sets. The proposed method is a combination of hardware exploitation and algorithm development. To this effect, we propose a Bayesian non-parametric approach to model a batch process, and then use probabilistic programming to generate an arbitrarily large number of dynamic in silico campaign data sets. The efficacy of the proposed solution is elucidated on an industrial process.

[1]  Salvatore Ingrassia,et al.  Neural Network Modeling for Small Datasets , 2005, Technometrics.

[2]  Thomas A. Henzinger,et al.  Probabilistic programming , 2014, FOSE.

[3]  Haichao Zhu,et al.  A New Method to Assist Small Data Set Neural Network Learning , 2006, Sixth International Conference on Intelligent Systems Design and Applications.

[4]  W. Byrne GENERALIZATION AND MAXIMUM LIKELIHOOD FROMSMALL DATA , 1993 .

[5]  W. Byrne,et al.  Generalization and maximum likelihood from small data sets , 1993, Neural Networks for Signal Processing III - Proceedings of the 1993 IEEE-SP Workshop.

[6]  R. B. Gopaluni,et al.  Parameter estimation in nonlinear chemical and biological processes with unmeasured variables from small data sets , 2011 .

[7]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[8]  R. Bhushan Gopaluni,et al.  Robust model-based delay timer alarm for non-linear processes , 2016, 2016 American Control Conference (ACC).

[9]  J. Macgregor,et al.  Monitoring batch processes using multiway principal component analysis , 1994 .

[10]  R. Bhushan Gopaluni,et al.  Design and assessment of delay timer alarm systems for nonlinear chemical processes , 2018 .

[11]  Paul I. Barton,et al.  Reachability-based fault detection method for uncertain chemical flow reactors , 2016 .

[12]  S. Wold,et al.  Multi‐way principal components‐and PLS‐analysis , 1987 .

[13]  Nola D. Tracy,et al.  Multivariate Control Charts for Individual Observations , 1992 .

[14]  Marek J. Druzdzel,et al.  Learning Bayesian network parameters from small data sets: application of Noisy-OR gates , 2001, Int. J. Approx. Reason..

[15]  Aditya Tulsyan,et al.  Advances in industrial biopharmaceutical batch process monitoring: Machine‐learning methods for small data problems , 2018, Biotechnology and bioengineering.