This document provides an introduction to AI-based systems. These systems are typically complex (e.g. deep neural nets), are sometimes based on big data, can be poorly specified and can be non-deterministic, which creates new challenges and opportunities for testing them.

This document explains those characteristics which are specific to AI-based systems and explains the corresponding difficulties of specifying the acceptance criteria for such systems.

This document presents the challenges of testing AI-based systems, the main challenge being the test oracle problem, whereby testers find it difficult to determine expected results for testing and therefore whether tests have passed or failed. It covers testing of these systems across the life cycle and gives guidelines on how AI-based systems in general can be tested using black-box approaches and introduces white-box testing specifically for neural networks. It describes options for the test environments and test scenarios used for testing AI-based systems.

In this document an AI-based system is a system that includes at least one AI component.

PDF Catalog

PDF Pages	PDF Title
2	National foreword
7	Foreword
8	Introduction
9	1 Scope 2 Normative references 3 Terms, definitions and abbreviated terms 3.1 Terms and definitions
18	3.2 Abbreviated terms
19	4 Introduction to AI and testing 4.1 Overview of AI and testing 4.2 Artificial intelligence (AI) 4.2.1 Definition of ‘artificial intelligence’ 4.2.2 AI use cases
20	4.2.3 AI usage and market 4.2.4 AI technologies
23	4.2.5 AI hardware 4.2.6 AI development frameworks 4.2.7 Narrow vs general AI
24	4.3 Testing of AI-based systems 4.3.1 The importance of testing for AI-based systems 4.3.2 Safety-related AI-based systems 4.3.3 Standardization and AI
26	5 AI system characteristics 5.1 AI-specific characteristics 5.1.1 General
27	5.1.2 Flexibility and adaptability
28	5.1.3 Autonomy 5.1.4 Evolution 5.1.5 Bias
29	5.1.6 Complexity 5.1.7 Transparency, interpretability and explainability
30	5.1.8 Non-determinism 5.2 Aligning AI-based systems with human values 5.3 Side-effects
31	5.4 Reward hacking 5.5 Specifying ethical requirements for AI-based systems
32	6 Introduction to the testing of AI-based systems 6.1 Challenges in testing AI-based systems 6.1.1 Introduction to challenges testing AI-based systems 6.1.2 System specifications
33	6.1.3 Test input data 6.1.4 Self-learning systems 6.1.5 Flexibility and adaptability 6.1.6 Autonomy 6.1.7 Evolution
34	6.1.8 Bias 6.1.9 Transparency, interpretability and explainability 6.1.10 Complexity 6.1.11 Probabilistic and non-deterministic systems 6.1.12 The test oracle problem for AI-based systems
35	6.2 Testing AI-based systems across the life cycle 6.2.1 General 6.2.2 Unit/component testing 6.2.3 Integration testing
36	6.2.4 System testing 6.2.5 System integration testing 6.2.6 Acceptance testing 6.2.7 Maintenance testing 7 Testing and QA of ML systems 7.1 Introduction to the testing and QA of ML systems
37	7.2 Review of ML workflow 7.3 Acceptance criteria 7.4 Framework, algorithm/model and hyperparameter selection 7.5 Training data quality 7.6 Test data quality 7.7 Model updates 7.8 Adversarial examples and testing
38	7.9 Benchmarks for machine learning 8 Black-box testing of AI-based systems 8.1 Combinatorial testing
39	8.2 Back-to-back testing
40	8.3 A/B testing 8.4 Metamorphic testing
41	8.5 Exploratory testing 9 White-box testing of neural networks 9.1 Structure of a neural network
43	9.2 Test coverage measures for neural networks 9.2.1 Introduction to test coverage levels 9.2.2 Neuron coverage 9.2.3 Threshold coverage 9.2.4 Sign change coverage 9.2.5 Value change coverage 9.2.6 Sign-sign coverage
44	9.2.7 Layer coverage 9.3 Test effectiveness of the white-box measures 9.4 White-box testing tools for neural networks
45	10 Test environments for AI-based systems 10.1 Test environments for AI-based systems
46	10.2 Test scenario derivation 10.3 Regulatory test scenarios and test environments
47	Annex A Machine learning
56	Bibliography

Published By	Publication Date	Number of Pages
BSI	2020	60


PDF Format	Searchable	Printable	Preview Now

Status	Definitive
Pages	60
Publication Date	2020-12-07
ISBN	978 0 539 15872 4
Standard Number	PD ISO/IEC TR 29119-11:2020
Title	Software and systems engineering. Software testing – Guidelines on the testing of AI-based systems
Identical National Standard Of	ISO/IEC CD TR 29119-11
Descriptors	Testing, Systems engineering, Artificial intelligence, Computer software, Software (computers)
Publisher	BSI
Committee	IST/15
ICS Codes	35.080 - Software

BSI PD ISO/IEC TR 29119-11:2020

PDF Catalog

Related products