Paper information - Evaluation of Stanford NER for extraction of assembly information from instruction manuals

Teaching industrial robots by demonstration can significantly decrease the cost of repurposing assembly lines worldwide. To achieve this goal, the robot needs to detect and track each component with high accuracy. To speed up the initial object recognition phase, the learning system can gather information from assembly manuals to identify which parts and tools are required for assembling a new product (avoiding an exhaustive search in a large model database) and, if possible, also extract the assembly order and the spatial relations between them. This paper presents a detailed analysis of the fine-tuning of the Stanford Named Entity Recognizer for this text tagging task. Starting from the recommended configuration, 91 tests were performed targeting the main features and parameters. Each test changed only a single parameter relative to the recommended configuration, in order to measure the impact of the new configuration on the precision, recall and F1 metrics. This analysis made it possible to fine-tune the Stanford NER system, achieving a precision of 89.91%, a recall of 83.51% and an F1 of 84.69%. These results were obtained on our new manually annotated dataset, which contains text describing assembly operations for alternators, gearboxes and engines, written in a register that ranges from professional to informal. The dataset can also be used to evaluate other information extraction and computer vision systems, since most assembly operations include pictures and diagrams showing the necessary product parts, their assembly order and relative spatial arrangement.
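The one-parameter-at-a-time procedure and the three metrics mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' tooling: the helper names `prf1` and `one_at_a_time` are hypothetical, the property names are only modelled on Stanford NER feature flags, and the baseline values and counts are made up for illustration. How the paper aggregates scores across entity types is not stated here, so the sketch only covers a single set of counts.

```python
from typing import Dict, List, Tuple


def prf1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Precision, recall and F1 from true-positive, false-positive
    and false-negative entity counts (0.0 when a ratio is undefined)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def one_at_a_time(
    baseline: Dict[str, object],
    alternatives: Dict[str, List[object]],
) -> List[Dict[str, object]]:
    """Build test configurations that each differ from the baseline
    in exactly one parameter."""
    configs = []
    for name, values in alternatives.items():
        for value in values:
            if value == baseline.get(name):
                continue  # identical to the baseline, nothing to test
            cfg = dict(baseline)  # copy so the baseline stays untouched
            cfg[name] = value
            configs.append(cfg)
    return configs


# Illustrative baseline and alternatives (assumed values, not from the paper).
baseline = {"useNGrams": True, "maxNGramLeng": 6, "usePrev": True}
alternatives = {"useNGrams": [True, False], "maxNGramLeng": [4, 6, 8]}

for cfg in one_at_a_time(baseline, alternatives):
    # In a real study each cfg would be used to train and evaluate a model;
    # here the counts are placeholders.
    p, r, f = prf1(tp=8, fp=2, fn=2)
    print(cfg, f"P={p:.2f} R={r:.2f} F1={f:.2f}")
```

Keeping every test configuration one change away from the baseline makes each metric difference attributable to a single parameter, at the cost of not exploring interactions between parameters.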

Armando Sousa | Carlos M. Costa | Germano Veiga | Sérgio Nunes
