A visualized bibliometric analysis of mapping research trends of machine learning in engineering (MLE)

Abstract In this work, we conducted a visualized bibliometric analysis to map the research trends of machine learning in engineering (MLE) based on articles indexed in the Web of Science Core Collection published between 2000 and 2019. The research distributions, knowledge bases, research hotspots, and research frontiers for MLE studies are revealed by using VOSviewer software and visualization technology. The growth of the literature related to MLE averaged 24.3% in the past two decades. A total of 3057 peer-reviewed papers from 96 countries published in 1299 different journals were identified. The USA was the most productive country, with 23.73% of the overall articles and 32.25% of the overall citations. The most active research organization was MIT, with 41 publications and 1079 citations, and the Journal of Machine Learning Research had the largest number of citations in the field of MLE. In particular, our findings indicate that the research issues of “random forests”, “support vector machine”, “extreme learning machine”, “deep learning”, “statistical learning theory”, and “Python machine learning” formed the knowledge bases of MLE from 2000 to 2019, while the research hotspots focused on applications of machine learning benchmark algorithms. Burst detection analysis results showed that more burst keywords emerged and had a higher frequency of change after 2010. This study provides an insight view of the overall research trends of MLE and may help researchers better understand this research field and predict its dynamic directions.

[1]  Craig A. Coburn,et al.  Temporal dynamics of sand dune bidirectional reflectance characteristics for absolute radiometric calibration of optical remote sensing data , 2017 .

[2]  Dorin Comaniciu,et al.  Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Chen Lu,et al.  Fault diagnosis of rotary machinery components using a stacked denoising autoencoder-based health state identification , 2017, Signal Process..

[4]  Enrico Zio,et al.  Failure and reliability prediction by support vector machines regression of time series data , 2011, Reliab. Eng. Syst. Saf..

[5]  Shuming Gao,et al.  A deep learning approach to the classification of 3D CAD models , 2014, Journal of Zhejiang University SCIENCE C.

[6]  Hui Peng,et al.  Identification of the interfacial cohesive law parameters of FRP strips externally bonded to concrete using machine learning techniques , 2021 .

[7]  Michael Krauthammer,et al.  GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data , 2004, J. Biomed. Informatics.

[8]  Hojjat Adeli,et al.  A probabilistic neural network for earthquake magnitude prediction , 2009, Neural Networks.

[9]  Witold Pedrycz,et al.  Subspace learning for unsupervised feature selection via matrix factorization , 2015, Pattern Recognit..

[10]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[11]  Jia Liu,et al.  Concrete dam deformation prediction model for health monitoring based on extreme learning machine , 2017 .

[12]  D. T. Lee,et al.  Travel-time prediction with support vector regression , 2004, IEEE Transactions on Intelligent Transportation Systems.

[13]  Qijun Chen,et al.  A Deep Learning-Based Computational Algorithm for Identifying Damage Load Condition: An Artificial Intelligence Inverse Problem Solution for Failure Analysis , 2018, Computer Modeling in Engineering & Sciences.

[14]  Ajith Abraham,et al.  A Scientometric Study of Neurocomputing Publications (1992-2018): An Aerial Overview of Intrinsic Structure , 2018, Publ..

[15]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[16]  Andy Wai Kan Yeung,et al.  The Changing Landscape of Neuroscience Research, 2006–2015: A Bibliometric Study , 2017, Front. Neurosci..

[17]  Ming Yang,et al.  Entity recognition from clinical texts via recurrent neural network , 2017, BMC Medical Informatics and Decision Making.

[18]  Ludo Waltman,et al.  Software survey: VOSviewer, a computer program for bibliometric mapping , 2009, Scientometrics.

[19]  Martín Carpio,et al.  Hyper-Parameter Tuning for Support Vector Machines by Estimation of Distribution Algorithms , 2017, Nature-Inspired Design of Hybrid Intelligent Systems.

[20]  Pieter de Wilde,et al.  The gap between predicted and measured energy performance of buildings: A framework for investigation , 2014 .

[21]  Zhiwei Guo,et al.  Application of Least Squares Support Vector Machine for Regression to Reliability Analysis , 2009 .

[22]  Zhenyu Wang,et al.  Which can better predict the future success of articles? Bibliometric indices or alternative metrics , 2019, Scientometrics.

[23]  S. Brunton,et al.  Discovering governing equations from data by sparse identification of nonlinear dynamical systems , 2015, Proceedings of the National Academy of Sciences.

[24]  H. Son,et al.  Automated Color Model-Based Concrete Detection in Construction-Site Images by Using Machine Learning Algorithms , 2012, J. Comput. Civ. Eng..

[25]  Ali Abbas,et al.  Estimation of air dew point temperature using computational intelligence schemes , 2016 .

[26]  Ting-Peng Liang,et al.  Research Landscape of Business Intelligence and Big Data analytics: A bibliometrics study , 2018, Expert Syst. Appl..

[27]  Qijun Chen,et al.  Application of deep learning neural network to identify collision load conditions based on permanent plastic deformation of shell structures , 2019, Computational Mechanics.

[28]  A. Goh,et al.  Support vector machines: Their use in geotechnical engineering as illustrated using seismic liquefaction data , 2007 .

[29]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[30]  Hongjie Chen,et al.  An extreme learning machine approach for slope stability evaluation and prediction , 2014, Natural Hazards.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Shahaboddin Shamshirband,et al.  Coupling a firefly algorithm with support vector regression to predict evaporation in northern Iran , 2018 .

[33]  R. Deo,et al.  Estimation of monthly evaporative loss using relevance vector machine, extreme learning machine and multivariate adaptive regression spline models , 2016, Stochastic Environmental Research and Risk Assessment.

[34]  Jane Labadin,et al.  Ensemble model of non-linear feature selection-based Extreme Learning Machine for improved natural gas reservoir characterization , 2015 .

[35]  Zhijing Yang,et al.  Convolutional neural network extreme learning machine for effective classification of hyperspectral images , 2018, Journal of Applied Remote Sensing.

[36]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[37]  Aleksei Shkurin,et al.  Using random forests for assistance in the curation of G-protein coupled receptor databases , 2017, Biomedical engineering online.

[38]  Massimo Piccardi,et al.  Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition , 2017, J. Biomed. Informatics.

[39]  Ludo Waltman,et al.  Citation-based clustering of publications using CitNetExplorer and VOSviewer , 2017, Scientometrics.

[40]  Zaher Mundher Yaseen,et al.  An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction , 2019, Journal of Hydrology.

[41]  Nhat-Duc Hoang,et al.  Punching shear capacity estimation of FRP-reinforced concrete slabs using a hybrid machine learning approach , 2016 .

[42]  Chaomei Chen,et al.  How many ways to use CiteSpace? A study of user interactive events over 14 months , 2017, J. Assoc. Inf. Sci. Technol..

[43]  Sebastián Lozano,et al.  Complex network analysis of keywords co-occurrence in the recent efficiency analysis literature , 2019, Scientometrics.

[44]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[45]  Jui-Sheng Chou,et al.  Smart Artificial Firefly Colony Algorithm‐Based Support Vector Regression for Enhanced Forecasting in Civil Engineering , 2015, Comput. Aided Civ. Infrastructure Eng..

[46]  Juan M. Fernández-Luna,et al.  Bibliometric study of the scientific research on “Learning to Rank” between 2000 and 2013 , 2014, Scientometrics.

[47]  Yiming Zhao,et al.  Partial Discharge Pattern Recognition of Gas-Insulated Switchgear via a Light-Scale Convolutional Neural Network , 2019, Energies.

[48]  Miao Su,et al.  Selected machine learning approaches for predicting the interfacial bond strength between FRPs and concrete , 2020 .

[49]  Jing Wang,et al.  Systematic Evaluation of Research Progress on Natural Language Processing in Medicine Over the Past 20 Years: Bibliometric Study on PubMed , 2020, Journal of medical Internet research.

[50]  David Zimbra,et al.  Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network , 2013, Expert Syst. Appl..

[51]  Haiping Lu,et al.  A survey of multilinear subspace learning for tensor data , 2011, Pattern Recognit..

[52]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[53]  Moncef Gabbouj,et al.  Real-time vibration-based structural damage detection using one-dimensional convolutional neural networks , 2017 .

[54]  Zhanya Xu,et al.  Coupling logistic model tree and random subspace to predict the landslide susceptibility areas with considering the uncertainty of environmental features , 2019, Scientific Reports.

[55]  Amir Hossein Alavi,et al.  Machine learning in geosciences and remote sensing , 2016 .

[56]  Ravinesh C. Deo,et al.  Application of the Artificial Neural Network model for prediction of monthly Standardized Precipitation and Evapotranspiration Index using hydrometeorological parameters and climate indices in eastern Australia , 2015 .

[57]  Leo Egghe,et al.  Co-citation, bibliographic coupling and a characterization of lattice citation networks , 2002, Scientometrics.

[58]  Fangkun Zhao,et al.  Mapping research trends of retinal vein occlusion from 2009 to 2018: a bibliometric analysis , 2019, PeerJ.

[59]  Ed C. M. Noyons,et al.  A unified approach to mapping and clustering of bibliometric networks , 2010, J. Informetrics.

[60]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[61]  Jasmin Kevric,et al.  Evidence-based clinical engineering: Machine learning algorithms for prediction of defibrillator performance , 2019, Biomed. Signal Process. Control..

[62]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[63]  Chao Long,et al.  Comparing keywords plus of WOS and author keywords: A case study of patient adherence research , 2016, J. Assoc. Inf. Sci. Technol..

[64]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[65]  Munan Li,et al.  An exploration to visualise the emerging trends of technology foresight based on an improved technique of co-word analysis and relevant literature data of WOS , 2017, Technol. Anal. Strateg. Manag..

[66]  Zeshui Xu,et al.  A bibliometric overview of International Journal of Machine Learning and Cybernetics between 2010 and 2017 , 2018, International Journal of Machine Learning and Cybernetics.

[67]  Shahaboddin Shamshirband,et al.  Estimating building energy consumption using extreme learning machine method , 2016 .

[68]  Noorbakhsh Amiri Golilarz,et al.  A New Automatic Method for Control Chart Patterns Recognition Based on ConvNet and Harris Hawks Meta Heuristic Optimization Algorithm , 2019, IEEE Access.

[69]  Lei Wang,et al.  Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery , 2018, Trans. Inst. Meas. Control.

[70]  F. Mouriaux,et al.  International publication trends in dry eye disease research: A bibliometric analysis. , 2018, The ocular surface.

[71]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[72]  Noureddine Zerhouni,et al.  Health assessment and life prediction of cutting tools based on support vector regression , 2015, J. Intell. Manuf..

[73]  Pingfeng Wang,et al.  Failure diagnosis using deep belief learning based health state classification , 2013, Reliab. Eng. Syst. Saf..

[74]  Xin Zou,et al.  Visualization and analysis of mapping knowledge domain of road safety studies. , 2018, Accident; analysis and prevention.

[75]  Yongyi Yang,et al.  Machine Learning in Medical Imaging , 2010, IEEE Signal Processing Magazine.

[76]  Riyad Ismail,et al.  Land cover and land use mapping of the iSimangaliso Wetland Park, South Africa: comparison of oblique and orthogonal random forest algorithms , 2016 .

[77]  H. Nguyen,et al.  The Current Research Landscape on the Artificial Intelligence Application in the Management of Depressive Disorders: A Bibliometric Analysis , 2019, International journal of environmental research and public health.

[78]  Steven Verstockt,et al.  Convolutional Neural Network Based Fault Detection for Rotating Machinery , 2016 .

[79]  Zhibin Lin,et al.  Enabling Damage Identification of Structures Using Time Series–Based Feature Extraction Algorithms , 2019, Journal of Aerospace Engineering.

[80]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[81]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[82]  C. David Page,et al.  Human pluripotent stem cell-derived neural constructs for predicting neural toxicity , 2015, Proceedings of the National Academy of Sciences.

[83]  Jianzhong Wang,et al.  Urban noise recognition with convolutional neural network , 2018, Multimedia Tools and Applications.

[84]  Zongben Xu,et al.  Three improved neural network models for air quality forecasting , 2003 .

[85]  Lin Lin,et al.  Random forests-based extreme learning machine ensemble for multi-regime time series prediction , 2017, Expert Syst. Appl..

[86]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[87]  Donghong Ji,et al.  Long short-term memory RNN for biomedical named entity recognition , 2017, BMC Bioinformatics.

[88]  A. Barnard,et al.  Feature Engineering of Solid‐State Crystalline Lattices for Machine Learning , 2019, Advanced Theory and Simulations.

[89]  Daniel Roggen,et al.  Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition , 2016, Sensors.