Partial Bayesian Co-training for Virtual Metrology

Building accurate regression models using limited data is a challenging problem in manufacturing data analysis. In this paper, we study a particular semisupervised learning problem where labeled data are limited, while unlabeled data are plentiful. In these conditions, conventional single-view learning methods are prone to overfitting. To tackle this problem, we develop a novel co-training technique, namely partial Bayesian co-training (PBCT). PBCT scales down the original set of features to create a partial view, and then exploit side information from the partial view to enhance the complete model. The PBCT model also allows integrating domain knowledge to enhance model accuracy. The proposed method is validated with experiments on industrial manufacturing data. The experimental results show that under a reduction of labeled data by up to 50%, a robust estimation is still attainable. This suggests that the PBCT model is a promising solution to a broad spectrum of applications.

[1]  Joon-Hyuk Chang,et al.  Improved Gaussian Mixture Regression Based on Pseudo Feature Generation Using Bootstrap in Blood Pressure Estimation , 2016, IEEE Transactions on Industrial Informatics.

[2]  Sungzoon Cho,et al.  Semi-supervised support vector regression based on self-training with label uncertainty: An application to virtual metrology in semiconductor manufacturing , 2016, Expert Syst. Appl..

[3]  Yixin Chen,et al.  Automatic Feature Decomposition for Single View Co-training , 2011, ICML.

[4]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[5]  Kuniaki Uehara,et al.  Graph-based Semi-Supervised Regression and Its Extensions , 2015 .

[6]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[7]  Ulf Brefeld,et al.  Multi-view Discriminative Sequential Learning , 2005, ECML.

[8]  H. Jin Kim,et al.  Target Localization in Wireless Sensor Networks Using Online Semi-Supervised Support Vector Regression , 2015, Sensors.

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  Dacheng Tao,et al.  A Survey on Multi-view Learning , 2013, ArXiv.

[11]  Fan-Tien Cheng,et al.  Application development of virtual metrology in semiconductor industry , 2005, 31st Annual Conference of IEEE Industrial Electronics Society, 2005. IECON 2005..

[12]  Thomas Gärtner,et al.  Efficient co-regularised least squares regression , 2006, ICML.

[13]  Feng Xia,et al.  Time-Location-Relationship Combined Service Recommendation Based on Taxi Trajectory Data , 2017, IEEE Transactions on Industrial Informatics.

[14]  Xiang Li,et al.  Partial co-training for virtual metrology , 2017, 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation (ETFA).

[15]  Maria-Florina Balcan,et al.  Co-Training and Expansion: Towards Bridging Theory and Practice , 2004, NIPS.

[16]  William T. Estler,et al.  Advances in Large-Scale Metrology – Review and future trends , 2016 .

[17]  Krzysztof Jemielniak,et al.  Advanced monitoring of machining operations , 2010 .

[18]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[19]  Zhi-Hua Zhou,et al.  Analyzing Co-training Style Algorithms , 2007, ECML.

[20]  D.M. Tilbury,et al.  An Approach for Factory-Wide Control Utilizing Virtual Metrology , 2007, IEEE Transactions on Semiconductor Manufacturing.

[21]  X. Li,et al.  Machine learning approach for shaft crack detection through acoustical emission signals , 2015, 2015 IEEE 20th Conference on Emerging Technologies & Factory Automation (ETFA).

[22]  James Moyne,et al.  Virtual metrology and feedback control for semiconductor manufacturing processes using recursive partial least squares , 2008 .

[23]  Jean-Olivier Plouchart,et al.  Co-Learning Bayesian Model Fusion: Efficient performance modeling of analog and mixed-signal circuits using side information , 2015, 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[24]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  R. Bharat Rao,et al.  Bayesian Co-Training , 2007, J. Mach. Learn. Res..

[26]  권홍우,et al.  Bootstrapping , 2002, ACL.

[27]  Mikhail Belkin,et al.  A Co-Regularization Approach to Semi-supervised Learning with Multiple Views , 2005 .

[28]  Florian Steinke,et al.  Semi-supervised Regression using Hessian energy with an application to semi-supervised dimensionality reduction , 2009, NIPS.

[29]  C.H. Yu,et al.  Virtual metrology: a solution for wafer to wafer advanced process control , 2005, ISSM 2005, IEEE International Symposium on Semiconductor Manufacturing, 2005..

[30]  Michelangelo Ceci,et al.  Predictive Modeling of PV Energy Production: How to Set Up the Learning Task for a Better Prediction? , 2017, IEEE Transactions on Industrial Informatics.

[31]  Koichi Fujiwara,et al.  Virtual sensing technology in process industries: Trends and challenges revealed by recent industria , 2013 .

[32]  Kunpeng Zhu,et al.  Online Tool Wear Monitoring Via Hidden Semi-Markov Model With Dependent Durations , 2018, IEEE Transactions on Industrial Informatics.