Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Natural language querying of process execution data

Published: 01 June 2023 Publication History

Abstract

Process-oriented data analysis techniques allow organizations to understand how their processes operate, where modifications are needed and where enhancements are possible. A recurrent task in any process analysis technique is querying. Process data querying allows analysts to easily explore the data with the intent of getting insights about the execution of business processes. The current generation of process query languages targets data scientists. However, there is a need to a query language to support domain analysts who may be inexperienced with database technologies. This paper addresses this challenge by proposing a natural language interface that assists the end-users in querying the stored event data. The interface takes a natural language query from the user, automatically constructs a corresponding structured query to be executed over the stored event data. We use graph based storage techniques, namely labeled property graphs, which allow to explicitly model event data relationships. As an executable query language, we use the Cypher language which is widely used for querying property graphs. The approach has been implemented and evaluated using two publicly available event logs.

Highlights

We proposed a natural language interface for querying process execution data from natural language.
We presents a Labeled Graph metamodel for stroing process data.
We proposed a hybrid pipline to automatically constructing Cypher queries from natural language.
Our NLI system is hybrid and combines machine learning and rule-based approaches.
We defined a set of general intent patterns that are domain-independent.
We evaluated the proposed system with more than 530 natural language queries.

References

[1]
Van Der Aalst W., Process Mining: Data Science in Action, Vol. 2, Springer, 2016.
[2]
Benatallah B., Sakr S., Grigori D., Motahari-Nezhad H.R., Barukh M.C., Gater A., Ryu S.H., et al., Process Analytics: Concepts and Techniques for Querying and Analyzing Process Data, Springer, 2016.
[3]
Polyvyanyy A., Ouyang C., Barros A., van der Aalst W.M., Process querying: Enabling business intelligence through query-based process analytics, Decis. Support Syst. 100 (2017) 41–56.
[4]
R. Angles, The Property Graph Database Model, in: AMW, 2018.
[5]
Kobeissi M., Assy N., Gaaloul W., Defude B., Haidar B., An intent-based natural language interface for querying process execution data, in: 2021 3rd International Conference on Process Mining, ICPM, IEEE, 2021, pp. 152–159.
[6]
Polyvyanyy A., Ouyang C., Barros A., van der Aalst W.M.P., Process querying: Enabling business intelligence through query-based process analytics, Decis. Support Syst. 100 (2017) 41–56.
[7]
Kammerer K., Kolb J., Reichert M., PQL-a descriptive language for querying, abstracting and changing process models, in: Enterprise, Business-Process and Information Systems Modeling, Springer, 2015, pp. 135–150.
[8]
Delfmann P., Steinhorst M., Dietrich H., Becker J., The generic model query language GMQL - conceptual specification, implementation, and runtime evaluation, Inf. Syst. 47 (2015) 129–177.
[9]
Beeri C., Eyal A., Kamenkovich S., Milo T., Querying business processes with BP-QL, Inf. Syst. 33 (6) (2008) 477–507.
[10]
Polyvyanyy A., La Rosa M., Ter Hofstede A.H., Indexing and efficient instance-based retrieval of process models using untanglings, in: International Conference on Advanced Information Systems Engineering, Springer, 2014, pp. 439–456.
[11]
Jin T., Wang J., Rosa M.L., ter Hofstede A.H.M., Wen L., Efficient querying of large process model repositories, Comput. Ind. 64 (1) (2013) 41–49.
[12]
A. Awad, G. Decker, M. Weske, Efficient Compliance Checking Using BPMN-Q and Temporal Logic, in: Business Process Management, 6th International Conference, Vol. 5240, BPM, 2008, pp. 326–341.
[13]
C. Beeri, A. Eyal, T. Milo, A. Pilberg, Monitoring Business Processes with Queries, in: Proceedings of the 33rd International Conference on Very Large Data Bases, 2007, pp. 603–614.
[14]
Deutch D., Milo T., Type inference and type checking for queries on execution traces, Proc. VLDB Endow. 1 (1) (2008) 352–363.
[15]
S. Beheshti, B. Benatallah, H.R.M. Nezhad, S. Sakr, A Query Language for Analyzing Business Processes Execution, in: Business Process Management - 9th International Conference, Vol. 6896, 2011, pp. 281–297.
[16]
Räim M., Ciccio C.D., Maggi F.M., Mecella M., Mendling J., Log-based understanding of business processes through temporal logic query checking, in: On the Move To Meaningful Internet Systems: OTM, 2014, pp. 75–92.
[17]
K. Yongsiriwit, N.N. Chan, W. Gaaloul, Log-Based Process Fragment Querying to Support Process Design, in: 48th Hawaii International Conference on System Sciences, HICSS, 2015, pp. 4109–4119.
[18]
Fazzinga B., Flesca S., Furfaro F., Masciari E., Pontieri L., Pulice C., A framework supporting the analysis of process logs stored in either relational or NoSQL DBMSs, in: International Symposium on Methodologies for Intelligent Systems, Springer, 2015, pp. 52–58.
[19]
van der Aalst W.M., Object-centric process mining: Dealing with divergence and convergence in event data, in: International Conference on Software Engineering and Formal Methods, Springer, 2019, pp. 3–25.
[20]
IEEE standard for extensible event stream (XES) for achieving interoperability in event logs and event streams, 2016, https://xes-standard.org/.
[21]
S. Schönig, A. Rogge-Solti, C. Cabanillas, S. Jablonski, J. Mendling, Efficient and Customisable Declarative Process Mining with SQL, in: Advanced Information Systems Engineering - 28th International Conference, Vol. 9694, CAiSE, 2016, pp. 290–305.
[22]
de Murillas E.G.L., Reijers H.A., van der Aalst W.M.P., Connecting databases with process mining: A meta model and toolset, Softw. Syst. Model. 18 (2) (2019) 1209–1247.
[23]
Dijkman R.M., Gao J., Syamsiyah A., van Dongen B.F., Grefen P., ter Hofstede A.H.M., Enabling efficient process mining on large data sets: Realizing an in-database process mining operator, Distrib. Parallel Databases 38 (1) (2020) 227–253.
[24]
Resource description framework (RDF): Concepts and abstract syntax, 2004, https://www.w3.org/TR/rdf-concepts/.
[25]
Esser S., Fahland D., Multi-dimensional event data in graph databases, 2020, CoRR abs/2005.14552.
[26]
Catarci T., Leva M., Mecella M., Visual query languages for data stream, in: Liu L., Özsu M.T. (Eds.), Encyclopedia of Database Systems, second ed., Springer, 2018.
[27]
Lloret-Gazo J., A survey on visual query systems in the web era (extended version), 2017, CoRR abs/1708.00192.
[28]
Benzi F., Maio D., Rizzi S., VISIONARY: A viewpoint-based visual language for querying relational databases, J. Vis. Lang. Comput. 10 (2) (1999) 117–145.
[29]
Lee S.K., Whang K., VOQL*: A visual object query language with inductively defined formal semantics, J. Vis. Lang. Comput. 12 (4) (2001) 413–433.
[30]
Choi R.H., Wong R.K., VXQ: A visual query language for XML data, Inf. Syst. Front. 17 (4) (2015) 961–981.
[31]
Jarrar M., Dikaiakos M.D., Querying the data web: The mashql approach, IEEE Internet Comput. 14 (3) (2010) 58–67.
[32]
C. Jin, S.S. Bhowmick, X. Xiao, J. Cheng, B. Choi, GBLENDER: Towards blending visual query formulation and query processing in graph databases, in: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, 2010, pp. 111–122.
[33]
Yi P., Choi B., Bhowmick S.S., Xu J., AutoG: A visual query autocompletion framework for graph databases, VLDB J. 26 (3) (2017) 347–372.
[34]
Blunschi L., Jossen C., Kossmann D., Mori M., Stockinger K., SODA: Generating SQL for business users, Proc. VLDB Endow. 5 (10) (2012) 932–943.
[35]
Saha D., Floratou A., Sankaranarayanan K., Minhas U.F., Mittal A.R., Özcan F., ATHENA: An ontology-driven system for natural language querying over relational data stores, Proc. VLDB Endow. (2016).
[36]
Song D., Schilder F., Smiley C., Brew C., Zielund T., Bretz H., Martin R., Dale C., Duprey J., Miller T., et al., TR discover: A natural language interface for querying and analyzing interlinked datasets, in: International Semantic Web Conference, Springer, 2015, pp. 21–37.
[37]
Z. Tan, J. Zhang, X. Huang, G. Chen, S. Wang, M. Sun, H. Luan, Y. Liu, THUMT: An Open-Source Toolkit for Neural Machine Translation, in: Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, AMTA, 2020, pp. 116–122.
[38]
Zhong V., Xiong C., Socher R., Seq2SQL: Generating structured queries from natural language using reinforcement learning, 2017, CoRR abs/1709.00103.
[39]
Dong L., Lapata M., Language to logical form with neural attention, 2016, arXiv preprint arXiv:1601.01280.
[40]
W. Zheng, H. Cheng, L. Zou, J.X. Yu, K. Zhao, Natural Language Question/Answering: Let Users Talk With The Knowledge Graph, in: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM, 2017, pp. 217–226.
[41]
Affolter K., Stockinger K., Bernstein A., A comparative survey of recent natural language interfaces for databases, VLDB J. 28 (5) (2019) 793–819.
[42]
Bergamaschi S., Guerra F., Interlandi M., Lado R.T., Velegrakis Y., Combining user and database perspective for solving keyword queries over relational databases, Inf. Syst. 55 (2016) 1–19.
[43]
Shekarpour S., Marx E., Ngomo A.N., Auer S., SINA: Semantic interpretation of user queries for question answering on interlinked data, J. Web Semant. 30 (2015) 39–51.
[44]
Sun C., A Natural Language Interface for Querying Graph Databases, Massachusetts Institute of Technology, 2018.
[45]
Friedrich F., Mendling J., Puhlmann F., Process model generation from natural language text, in: International Conference on Advanced Information Systems Engineering, Springer, 2011, pp. 482–496.
[46]
R.C.B. Ferreira, L.H. Thom, M. Fantinato, A Semi-automatic Approach to Identify Business Process Elements in Natural Language Texts, in: ICEIS (3), 2017, pp. 250–261.
[47]
Elleuch M., Nassim L., Ismaili O.A., Gaaloul W., Discovering business processes and activities from messaging systems: State-of-the art, in: 2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises, WETICE, IEEE, 2020, pp. 137–142.
[48]
Elleuch M., Ismaili O.A., Laga N., Gaaloul W., Benatallah B., Discovering activities from emails based on pattern discovery approach, in: International Conference on Business Process Management, Springer, 2020, pp. 88–104.
[49]
Elleuch M., Ismaili O.A., Laga N., Assy N., Gaaloul W., Discovery of activities’ actor perspective from emails based on speech acts detection, in: 2020 2nd International Conference on Process Mining, ICPM, IEEE, 2020, pp. 73–80.
[50]
N. Laga, M. Elleuch, W. Gaaloul, O.A. Ismaili, Emails Analysis for Business Process Discovery, in: ATAED@ Petri Nets/ACSD, 2019, pp. 54–70.
[51]
Barbieri L., Madeira E.R.M., Stroeh K., van der Aalst W.M., Towards a natural language conversational interface for process mining, in: International Conference on Process Mining, Springer, Cham, 2022, pp. 268–280.
[52]
Han X., Hu L., Sen J., Dang Y., Gao B., Isahagian V., Lei C., Efthymiou V., Özcan F., Quamar A., et al., Bootstrapping natural language querying on process automation data, in: 2020 IEEE International Conference on Services Computing, SCC, IEEE, 2020, pp. 170–177.
[53]
Augusto A., Conforti R., Dumas M., Rosa M.L., Maggi F.M., Marrella A., Mecella M., Soo A., Automated discovery of process models from event logs: Review and benchmark, IEEE Trans. Knowl. Data Eng. 31 (4) (2019) 686–705.
[54]
Povalyaeva E., Khamitov I., Fomenko A., Bpic 2017: Density analysis of the interaction with clients, BPI Challenge (2017).
[55]
Ghahfarokhi A.F., Park G., Berti A., van der Aalst W.M., OCEL: A standard for object-centric event logs, in: European Conference on Advances in Databases and Information Systems, Springer, 2021, pp. 169–175.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Information Systems
Information Systems  Volume 116, Issue C
Jun 2023
174 pages

Publisher

Elsevier Science Ltd.

United Kingdom

Publication History

Published: 01 June 2023

Author Tags

  1. Process querying
  2. Process mining
  3. Natural language interface
  4. Graph database
  5. Cypher language

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media