Exploring ML testing in practice -- Lessons learned from an interactive rapid review with Axis Communications

Song, Qunying; Borg, Markus; Engström, Emelie; Ardö, Håkan; Rico, Sergio

arXiv:2203.16225 (cs)

[Submitted on 30 Mar 2022]

Abstract: There is a growing interest in industry and academia in machine learning (ML) testing. We believe that industry and academia need to learn together to produce rigorous and relevant knowledge. In this study, we initiate a collaboration between stakeholders from one case company, one research institute, and one university. To establish a common view of the problem domain, we applied an interactive rapid review of the state of the art. Four researchers from Lund University and RISE Research Institutes and four practitioners from Axis Communications reviewed a set of 180 primary studies on ML testing. We developed a taxonomy for the communication around ML testing challenges and results and identified a list of 12 review questions relevant for Axis Communications. The three most important questions (data testing, metrics for assessment, and test generation) were mapped to the literature, and an in-depth analysis of the 35 primary studies matching the most important question (data testing) was made. A final set of the five best matches was analysed, and we reflect on the criteria for applicability and relevance for the industry. The taxonomies are helpful for communication but not final. Furthermore, there was no perfect match to the case company's investigated review question (data testing). However, we extracted relevant approaches from the five studies on a conceptual level to support later context-specific improvements. We found the interactive rapid review approach useful for triggering and aligning communication between the different stakeholders.

Comments:	Accepted for publication in the Proc. of CAIN 2022 - 1st International Conference on AI Engineering - Software Engineering for AI
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2203.16225 [cs.SE]
	(or arXiv:2203.16225v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2203.16225

Submission history

From: Markus Borg
[v1] Wed, 30 Mar 2022 12:01:43 UTC (127 KB)
