Towards a Workflow Manager for Curation Technologies in the Legal Domain

We are developing a platform for the curation and further processing of documents from the legal domain. The platform is based on a legal knowledge graph, and the overall project will result in three use-case-specific prototypes for different areas of the legal domain. To determine the exact needs, demands, ideas, wishes and feature requests, we are currently collecting functional and non-functional requirements from the three use case partners. The objective of our work is the design and implementation of a generic yet customisable workflow management system for content and data curation services in the legal domain. In this article we describe and discuss how the inherent characteristics of a specific domain influence the design and development of automatic workflows of text and data processing as well as curation components. We present different techniques for collecting and analysing requirements, followed by our survey and our hybrid approach.
