Towards Guidelines for Assessing Qualities of Machine Learning Systems

Siebert, Julien; Joeckel, Lisa; Heidrich, Jens; Nakamichi, Koji; Ohashi, Kyoko; Namba, Isao; Yamamoto, Rieko; Aoyama, Mikio

doi:10.1007/978-3-030-58793-2_2

Julien Siebert⁹,
Lisa Joeckel⁹,
Jens Heidrich⁹,
Koji Nakamichi¹⁰,
Kyoko Ohashi¹⁰,
Isao Namba¹⁰,
Rieko Yamamoto¹⁰ &
…
Mikio Aoyama¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1266))

Included in the following conference series:

International Conference on the Quality of Information and Communications Technology

2042 Accesses
10 Altmetric

Abstract

Nowadays, systems containing components based on machine learning (ML) methods are becoming more widespread. In order to ensure the intended behavior of a software system, there are standards that define necessary quality aspects of the system and its components (such as ISO/IEC 25010). Due to the different nature of ML, we have to adjust quality aspects or add additional ones (such as trustworthiness) and be very precise about which aspect is really relevant for which object of interest (such as completeness of training data), and how to objectively assess adherence to quality requirements. In this article, we present the construction of a quality model (i.e., evaluation objects, quality aspects, and metrics) for an ML system based on an industrial use case. This quality model enables practitioners to specify and assess quality requirements for such kinds of ML systems objectively. In the future, we want to learn how the term quality differs between different types of ML systems and come up with general guidelines for specifying and assessing qualities of ML systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Construction of a quality model for machine learning systems

Article Open access 25 June 2021

Quality issues in machine learning software systems

Article 11 September 2024

A Framework for Managing Quality Requirements for Machine Learning-Based Software Systems

References

Wan, Z., Xia, X., Lo, D., Murphy, G.C.: How does machine learning change software development practices? IEEE Trans. Softw. Eng. 1 (2019)
Google Scholar
Sculley, D., et al.: Hidden technical debt in machine learning systems. In: Proceedings of the 28th International Conference on Neural Information Processing Systems, pp. 2503–2511 (2015)
Google Scholar
Zhang, J.M., Harman, M., Ma, L., Liu, Y.: Machine learning testing: survey, landscapes and horizons. IEEE Trans. Softw. Eng. 1 (2020)
Google Scholar
ISO/IEC 25010:2011: Systems and software engineering—Systems and software Quality Requirements and Evaluation (SQuaRE)—System and software quality models
Google Scholar
ISO/TS 8000:2011: Data Quality
Google Scholar
High-Level Expert Group on Artificial Intelligence: Ethics Guidelines for Trustworthy AI. European Commission (2019)
Google Scholar
DIN SPEC 92001-01: Künstliche Intelligenz - Life Cycle Prozesse und Qualitätsanforderungen. Teil 1: Qualitäts-Meta-Modell. Beuth Verlag GmbH, Berlin
Google Scholar
Hamada, K., Ishikawa, F., Masuda, S., Matsuya, M., Ujita, Y.: Guidelines for quality assurance of machine learning-based artificial intelligence. In: SEKE2020: the 32nd International Conference on Software Engineering & Knowledge Engineering, pp. 335–341 (2020)
Google Scholar
Trustworthy Use of Artificial Intelligence. Priorities from a Philosophical, Ethical, Legal, and Technological Viewpoint as a Basis for Certification of Artificial Intelligence. Fraunhofer Institute for Intelligent Analysis and Information Systems IAIS Schloss Birlinghoven (2019)
Google Scholar
From Principles to Practice. An interdisciplinary framework to operationalise AI ethics. VDE, Bertelsmann Stiftung (2020)
Google Scholar
Marselis, R., Shaukat, H., Gansel, T.: Testing of Artificial Intelligence. Sogeti, Paris (2017)
Google Scholar
Marselis, R., Shaukat, H.: Machine Intelligence Quality Characteristics. How to Measure the Quality of Artificial Intelligence and Robotics. Sogeti, Paris (2018)
Google Scholar
Nakajima, S.: Quality assurance of machine learning software. In: 2018 IEEE 7th Global Conference on Consumer Electronics (GCCE), 9–12 October 2018, pp. 601–604. IEEE, Piscataway (2018)
Google Scholar
Mariscal, G., Marbán, Ó., Fernández, C.: A survey of data mining and knowledge discovery process models and methodologies. Knowl. Eng. Rev. 25, 137–166 (2010)
Article Google Scholar
Martinez-Plumed, F., et al.: CRISP-DM twenty years later: from data mining processes to data science trajectories. IEEE Trans. Knowl. Data Eng. 1 (2020)
Google Scholar
Lwakatare, L.E., Raj, A., Bosch, J., Olsson, H.H., Crnkovic, I.: A taxonomy of software engineering challenges for machine learning systems: an empirical investigation. In: Kruchten, P., Fraser, S., Coallier, F. (eds.) XP 2019. LNBIP, vol. 355, pp. 227–243. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-19034-7_14
Chapter Google Scholar
Amershi, S., et al.: Software engineering for machine learning: a case study. In: 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pp. 291–300 (2019)
Google Scholar
Hossin, M., Sulaiman, M.N.: A review on evaluation metrics for data classification evaluations. IJDKP 5, 1–11 (2015)
Google Scholar
Emmons, S., Kobourov, S., Gallant, M., Börner, K.: Analysis of network clustering algorithms and cluster quality metrics at scale. PLoS ONE 11, e0159161 (2016)
Article Google Scholar
Barocas, S., Boyd, D.: Engaging the ethics of data science in practice. Commun. ACM 60, 23–25 (2017)
Article Google Scholar
Kleinberg, J., Mullainathan, S., Raghavan, M.: Inherent Trade-Offs in the Fair Determination of Risk Scores. arXiv.org (2016)
Google Scholar
Wagner, S., et al.: Operationalised product quality models and assessment: the Quamoco approach. Inf. Softw. Technol. 62, 101–123 (2015)
Article Google Scholar
Kaufman, S., Rosset, S., Perlich, C.: Leakage in data mining. In: Apte, C., Ghosh, J., Smyth, P. (eds.) Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, Ca, USA, 21–24 August 2011, p. 556. ACM, New York (2011)
Google Scholar
Kläs, M., Vollmer, A.M.: Uncertainty in machine learning applications: a practice-driven classification of uncertainty. In: Gallina, B., Skavhaug, A., Schoitsch, E., Bitsch, F. (eds.) SAFECOMP 2018. LNCS, vol. 11094, pp. 431–438. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99229-7_36
Chapter Google Scholar
Nakamichi, K., et al.: Requirements-driven method to determine quality characteristics and measurements for machine learning software and its evaluation. In: 28th IEEE International Requirements Engineering Conference (RE’20)
Google Scholar

Download references

Author information

Authors and Affiliations

Fraunhofer IESE, Kaiserslautern, Germany
Julien Siebert, Lisa Joeckel & Jens Heidrich
Fujitsu Laboratories Ltd., Kawasaki, Japan
Koji Nakamichi, Kyoko Ohashi, Isao Namba & Rieko Yamamoto
Nanzan University, Nagoya, Japan
Mikio Aoyama

Authors

Julien Siebert
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Joeckel
View author publications
You can also search for this author in PubMed Google Scholar
Jens Heidrich
View author publications
You can also search for this author in PubMed Google Scholar
Koji Nakamichi
View author publications
You can also search for this author in PubMed Google Scholar
Kyoko Ohashi
View author publications
You can also search for this author in PubMed Google Scholar
Isao Namba
View author publications
You can also search for this author in PubMed Google Scholar
Rieko Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Mikio Aoyama
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julien Siebert .

Editor information

Editors and Affiliations

Brunel University, London, UK
Martin Shepperd
Lisbon University Institute, Lisbon, Portugal
Fernando Brito e Abreu
University of Lisbon, Lisbon, Portugal
Alberto Rodrigues da Silva
University of Castilla-La Mancha, Talavera de la Reina, Spain
Ricardo Pérez-Castillo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Siebert, J. et al. (2020). Towards Guidelines for Assessing Qualities of Machine Learning Systems. In: Shepperd, M., Brito e Abreu, F., Rodrigues da Silva, A., Pérez-Castillo, R. (eds) Quality of Information and Communications Technology. QUATIC 2020. Communications in Computer and Information Science, vol 1266. Springer, Cham. https://doi.org/10.1007/978-3-030-58793-2_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-58793-2_2
Published: 31 August 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58792-5
Online ISBN: 978-3-030-58793-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Towards Guidelines for Assessing Qualities of Machine Learning Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Construction of a quality model for machine learning systems

Quality issues in machine learning software systems

A Framework for Managing Quality Requirements for Machine Learning-Based Software Systems

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Towards Guidelines for Assessing Qualities of Machine Learning Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Construction of a quality model for machine learning systems

Quality issues in machine learning software systems

A Framework for Managing Quality Requirements for Machine Learning-Based Software Systems

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation