1162 lines
127 KiB
Plaintext
1162 lines
127 KiB
Plaintext
Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
Contents lists available at ScienceDirect
|
||
|
||
|
||
Computer Standards & Interfaces
|
||
journal homepage: www.elsevier.com/locate/csi
|
||
|
||
|
||
|
||
|
||
Quality assessment for software data validation in automotive industry: A
|
||
systematic literature review
|
||
Gilmar Pagoto a,* , Luiz Eduardo Galvão Martins a , Jefferson Seide Molléri b
|
||
a
|
||
UNIFESP – Federal University of São Paulo, São José dos Campos, SP 12.247-014, Brazil
|
||
b
|
||
Kristiania University of Applied Sciences, Kirkegata 17, Oslo 0167, Norway
|
||
|
||
|
||
|
||
|
||
A R T I C L E I N F O A B S T R A C T
|
||
|
||
Keywords: Context: The complexity of automotive systems continues to grow, making software quality assessment crucial for
|
||
Software quality assessment vehicle performance, safety, and cybersecurity.
|
||
Software quality characteristics Objectives: This study explores Quality Assessment (QA) in this context, focusing on its key characteristics,
|
||
Automotive software validation
|
||
practical implications, and expected deliverables.
|
||
Automotive software safety
|
||
Automotive software performance
|
||
Method: We performed a systematic literature review (SLR) by selecting 60 studies from digital libraries.
|
||
Automotive software cybersecurity Results: This SLR highlighted essential QA characteristics that should be incorporated into a software validation
|
||
phase. Our insights encourage the exploration of advanced techniques, such as Artificial Intelligence (AI), and
|
||
Machine Learning (ML), to support safety-critical software quality assessments in the automotive domain.
|
||
Conclusion: The QA of software data validation requires a holistic approach that combines safety, security, and
|
||
customer expectations, aligned with industry standards, requirements, and specifications. The relevance of AI
|
||
and ML in managing complex technologies is evidenced, and the traditional real-world validation dependencies
|
||
bring risks for safety-critical systems validation.
|
||
|
||
|
||
|
||
|
||
1. Introduction restrict coverage and prevent them from offering a fully comprehensive
|
||
and efficient approach to safety and cybersecurity.
|
||
Based on the IEEE 1012–2016 standard [1], software data validation In addition, aspects related to driver experience pose unique chal
|
||
involves an evaluation process, as part of a system or component, con lenges, as reference criteria are typically established by trained pro
|
||
ducted during or after the development process to verify compliance fessionals rather than end-users, whose specific use cases may not be
|
||
with specified requirements. This standard establishes the guidelines for fully addressed. To fulfil these already-known application variations,
|
||
Verification and Validation (V&V) that provide an objective assessment minimise the uncovered use cases, and align with customer quality
|
||
of products throughout the life cycle. In this work, the target product perception, more effort and resources are required [3].
|
||
level is the automotive software. Furthermore, ensuring quality in this Moreover, certain quality aspects are not directly visible to the end-
|
||
evaluation process is crucial for vehicle quality, particularly in terms of user but are still important for ensuring the product’s safety throughout
|
||
safety and security. This is a complex task, and this complexity increases its lifecycle. Poor software quality can lead to accidents, potentially
|
||
with the integration of emerging technologies. Advances such as resulting in material damage, financial loss, or even fatalities. ISO
|
||
autonomous driving, Advanced Driver Assistance Systems (ADAS), standards provide support and serve as foundational recommendations
|
||
connectivity systems, electric propulsion vehicles, battery electric ve for software data validation [4–7], contributing to the process robust
|
||
hicles (BEV), and hybrid electric vehicles (HEV) are expanding the de ness. In summary, the extensive scope and growing complexity of QA for
|
||
mands on software validation. Additionally, technologies like Artificial software data validation motivated this research.
|
||
Intelligence (AI), and Machine Learning (ML) play a key role in The primary goals of this study are to identify and categorise soft
|
||
enhancing safety system performance and improving the overall ware quality approaches used in the automotive industry according to
|
||
customer experience. their characteristics, applications, and focal areas. This study also seeks
|
||
Traditional Quality Assessment (QA) tools, as Failure Mode and Ef to enhance automotive professionals’ and researchers’ understanding of
|
||
fects Analysis (FMEA) [2], are no longer sufficient. Their limitations the critical role of quality in software data validation by outlining
|
||
|
||
|
||
* Corresponding author.
|
||
E-mail address: gilmar.pagoto@unifesp.br (G. Pagoto).
|
||
|
||
https://doi.org/10.1016/j.csi.2025.104110
|
||
Received 27 November 2024; Received in revised form 30 August 2025; Accepted 2 December 2025
|
||
Available online 4 December 2025
|
||
0920-5489/© 2025 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
practical implications, expected deliverables, and the direct effects of 2.2. Automotive software data validation challenges
|
||
QA efforts.
|
||
The contributions of this Systematic Literature Review (SLR) include Selecting and ensuring software data validation in automotive sys
|
||
the identification of the most applied interventions for QA in software tems is a complex task. Meeting end-user expectations often involves
|
||
data validation, their relevant characteristics, the impacts of applied subjective factors that are directly or indirectly influenced by software.
|
||
interventions, and state-of-the-art trends that suggest future directions Another key challenge lies in linking objective software quality metrics,
|
||
for research. as determined by QA methods, with the subjective evaluations of
|
||
This paper is organised as follows: Section 2 presents the background experienced professionals [3].
|
||
and related work. Section 3 describes the adopted research methodol Many Software Reliability Growth Models (SRGMs) rely heavily on
|
||
ogy. The results and analysis based on research questions are presented large datasets of failure information and often make unrealistic as
|
||
in Section 4. In Section 5, we presented our conclusions. sumptions about software quality estimation [11]. While SRGMs can
|
||
measure quality by tracking remaining faults, failure rates, and overall
|
||
2. Background and related work software reliability, they struggle to capture customer perspectives and
|
||
insights [12].
|
||
This section reviews relevant international standards, discusses the In Automotive Driver Assistance Systems (ADAS), validating soft
|
||
limitations of traditional QA tools, and offers insights into the current ware quality in real-world conditions is costly and time-consuming. A
|
||
state of research in software quality validation. malfunction during this phase can lead to serious accidents. To mitigate
|
||
such risks, vehicle-in-the-loop (ViL) testing can be employed. The main
|
||
2.1. Software quality standards in the automotive industry challenge with ViL is ensuring that the virtual data inputs to the vehi
|
||
cle’s sensors are consistent with real-world inputs [13].
|
||
In software engineering (SE), the ISO/IEC 25,000 series [8], known Traditional tools for QA, such as Fault Tree Analysis (FTA), Failure
|
||
as SQuaRE (Software Quality Requirements and Evaluation), provides Mode and Effect Analysis (FMEA), and current software quality stan
|
||
guidelines for managing, modelling, measuring, specifying, and evalu dards, often fall short when it comes to safety-critical systems, particu
|
||
ating software quality. It outlines a framework for developing quality larly machine learning-based systems. This occurs mainly because of the
|
||
measures, including those focused on quality in use (ISO/IEC 25,022), high complexity of data-driven systems without a widely accepted en
|
||
system and software product quality (ISO/IEC 25,023), and data quality gineering solution to deal with that. As a result, more refined, adapted,
|
||
(ISO/IEC 25,024). Specifically, ISO/IEC 25,012 addresses software data and specialised QA approaches are required [14].
|
||
validation, defining 15 quality characteristics grouped into inherent and This work aims to provide a comprehensive picture of the most
|
||
system-dependent perspectives. Furthermore, the IEEE 1012 standard critical software data characteristics observed in recent years, discuss
|
||
for System, Software, and Hardware Verification and Validation (V&V) their contributions to the aforementioned challenges during software
|
||
provides recommendations for evaluating the compliance of develop validation phases, and provide insights for handling the increasing
|
||
ment products with their intended use requirements throughout the life complexity in the software-defined vehicles domain.
|
||
cycle.
|
||
In the automotive software domain, ISO 21,434 [9] focuses on 2.3. Related work
|
||
cybersecurity assurance, addressing software defects, incorrect imple
|
||
mentations, and vulnerabilities in security data, especially for embedded Feitosa et al. [15] conducted a systematic mapping study (SMS) with
|
||
software controlling vehicle functions like stability control. Functional the main goal of “analyse existing software engineering literature for the
|
||
Safety (FuSa), outlined in ISO 26,262 [10], addresses potential hazards purpose of characterising the state of the art concerning approaches (e.
|
||
in electric and electronic safety-related systems, aligning with the g., processes, methods, and tools) for designing critical embedded sys
|
||
system-dependent data characteristics of ISO/IEC 25,012 to support tems (CES) from the point of view of researchers and practitioners in the
|
||
robust software data validation. context of software-intensive systems engineering”. The study identified
|
||
Table 1 shows a link between standards and their practical applica seven critical quality characteristics (dependability, fault-tolerance,
|
||
tion examples. performance, reliability, safety, security, and timeliness). Its most rele
|
||
vant outcome is the identification and characterization of the created
|
||
and/or used approaches to design CES.
|
||
In 2005, Bhansali [16] proposed an optimised subset for a universal
|
||
Table 1
|
||
software standard focused on safety. This paper identifies the minimum
|
||
Practical contributions of standards for software data validation.
|
||
safety-critical quality characteristics set for multi-domain applications,
|
||
Standard Short description Use-case contributions such as: “commercial, military and space aviation; medical diagnostic
|
||
ISO/ Comprehensive Structured Evaluation of software data in- and therapeutic instruments; automotive and transportation systems;
|
||
IEC25000 guidelines for in-use software use characteristics on inherent industrial process control and robotics; nuclear power plants and
|
||
Series data quality. and/or system levels (e.g.
|
||
weapons control; commercial appliances and ride electronics”. The
|
||
Efficiency, Completeness,
|
||
Accessibility, Consistency, etc.). proposed benefits provided by a common safety standard are lower cost
|
||
ISO21434 Cybersecurity in electrical and Software integrity analysis and quality software improvement, achieved through common pro
|
||
electronic systems, including against potential external cesses and tools.
|
||
software. attacks through functional Rana et al. [17] presented a study of software prediction methods
|
||
software tests, vulnerability
|
||
scanning, fuzz testing, and/or
|
||
focused on defect detection. The authors presented an overview of 8
|
||
penetration testing. prediction models and described the applicability of each method
|
||
ISO26262 Functional safety requirements Checking of software data for overall software lifecycles. The integrity of software data monitoring is a
|
||
Series management, design, safety-related systems (e.g., critical factor not only for the software’s performance but also for safety
|
||
implementation, verification, autonomous systems) submitted
|
||
and security. This study pointed out that post-release software moni
|
||
validation, and confirmation to malfunctioning behaviour
|
||
measures conditions. toring was limited in the past due to a lack of software monitoring
|
||
IEEE1012 Support for Verification and Automotive function during the in-use phase, or insufficient skills and the data retrieval
|
||
Validation (V&V), including specification analysis based on capabilities.
|
||
software. vehicle requirements before Myllyaho et al. [18] presented an SLR on reported validation
|
||
software construction phase.
|
||
methods for AI-specific systems. The study classifies the applied
|
||
|
||
2
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
methods into trial, simulation, model-centred validation, and expert 2.4. Definitions
|
||
opinion, and offers a detailed description of their application in realistic
|
||
settings. Gezici and Tarhan [19] conducted an SLR to investigate the Key terms are defined in Appendix A.
|
||
state-of-the-art practices on software quality (SQ) approaches for
|
||
AI-based systems. They identified quality characteristics, models, re 3. Methodology
|
||
ported challenges by selected studies, and suggested future work on
|
||
definition/specification, design/evaluation, and process/socio-technical The SLR was conducted followed the guidelines by Kitchenham and
|
||
categories. Furthermore, Ali et al. [20] conducted a SMS on the AI Charters [22] and Biolchini et al. [23]. Fig. 1 shows the steps of the SLR
|
||
systems, software, and components. This study identified no work on AI process, which are grouped into three main phases: planning, con
|
||
on software component quality models, a potential gap in that knowl ducting, and reporting.
|
||
edge area.
|
||
Krichen [21] explored various formal methods and validation tech
|
||
niques focused on automotive systems security. The study provides a 3.1. Planning
|
||
comprehensive overview of the state-of-the-art for systems security
|
||
validation, including the automotive lifecycle, requirements engineer As its objective and motivation (step1), this SLR aims to provide the
|
||
ing, design, implementation, and test phases. The challenges identified existing QA processes, methodologies, frameworks, models, or tech
|
||
are scalability, efficiency, and applicability in real-world situations. niques applied for the software data validation phase in the automotive
|
||
industry (Fig. 1, step 1).
|
||
|
||
|
||
|
||
|
||
Fig. 1. SLR steps and phases.
|
||
|
||
3
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
We created a research protocol (Fig. 1, step 2) using the Parsifal Table 3
|
||
online tool [24], which provides a template and facilitates interactive Research questions and related objectives.
|
||
collaboration approach for researchers carrying out that task [25]. The ID Research question Specific objectives
|
||
protocol defines the strategies and criteria adopted during the SLR
|
||
RQ1 What are the approaches involved To identify and classify the
|
||
execution, ensuring a standardized SLR process, minimizing biases, and in the quality assessment during software quality approaches based
|
||
improving reproducibility [23]. software data validation in the on proposed studies.
|
||
To elaborate the research questions (Fig. 1, step 3) we applied the automotive industry?
|
||
PICOC (population, intervention, comparison, outcomes, and context) RQ1.1 How are the approaches applied in To identify how the approach is
|
||
the software data validation? applied based and where the
|
||
approach as recommended by Kitchenham and Charters [22] and application focus is placed.
|
||
Wohlin et al. [26]. The PICOC terms are detailed in Table 2. RQ2 Which are the main quality To extract the main aspects or
|
||
This work is part of a master’s degree thesis for software validation assessment characteristics for characteristics taken into account
|
||
and electronic systems integration for the automotive domain. Based on software data validation in the in each study.
|
||
automotive industry?
|
||
that scenario and the PICOC, we formulated the research questions
|
||
RQ3 What are the impacts of such To identify and understand the
|
||
shown in Table 3. characteristics in the validation, influences of quality approaches on
|
||
The search strategy for this study followed the guidelines of Kitch releasing and using phases of the the software lifecycle.
|
||
enham et al. [27] and included the recommended databases: ACM software data validation process?
|
||
Digital Library, IEEE Digital Library, ScienceDirect, Scopus, and
|
||
Springer Link.
|
||
To identify the relevant databases and create an appropriate search Table 4
|
||
string, we conducted the Search and Selection Strategy phase (Fig. 1, Final search strings resulting from our preliminary search.
|
||
step 4). A preliminary search in Google Scholar using general terms Name Search String Results Filter
|
||
related to the topic of interest was performed to assess coverage in the
|
||
ACM Digital 26 Full year and text
|
||
research domain. We checked the results on the first four pages, looking
|
||
"automotive" AND
|
||
Library "software data" AND
|
||
for their potential relevance to our RQ’s.. (test OR validation OR
|
||
Since the automotive domain was previously defined in the PICOC verification) AND
|
||
context, the “automotive” term was included in the preliminary search ("quality assessment" OR
|
||
string. Due to a large application of terms such as “quality” and “soft "quality assurance") AND
|
||
(method OR framework OR
|
||
ware” in many unrelated domains (e.g. domestic equipment, audio, and model OR process)
|
||
video), we refined the search string to include “automotive industry”, IEEE Digital automotive AND quality 89 Full year and text
|
||
which showed more accurate results in some digital libraries. After this Library AND software data AND
|
||
initial exploration, we formulated a general search string -“automotive (test OR validation) AND
|
||
|
||
industry" AND software AND ("quality assurance" OR "quality assessment")
|
||
(method OR framework OR
|
||
model OR process)
|
||
AND (test OR validation) AND (method OR framework OR model OR pro Science@Direct "automotive industry" 69 Full text and year
|
||
cess” - using an iterative process. We conducted trial searches to identify AND "software data" AND
|
||
relevant studies and refine the search string based on their results. The quality AND (assurance
|
||
general search string was adapted to the specifics of each database, as OR assessment) AND (test
|
||
OR validation) AND
|
||
shown in Table 4. (framework OR process)
|
||
Adaptations included using database-specific operators and filtering Scopus automotive AND software 77 TITLE-ABS-KEY
|
||
options, as follows: AND (test OR validation)
|
||
AND (“quality assurance"
|
||
|
||
• Selecting the “Full text” option for most databases, except Scopus,
|
||
OR "quality assessment")
|
||
AND (method OR framework
|
||
which uses a dedicated search method for title, abstract, and OR model OR process)
|
||
keywords. Springer Link "automotive industry" 602 1. Full year,
|
||
• An initial search in Springer Link returned over 3000 papers. To AND software AND articles and
|
||
conference
|
||
refine the results, we filtered to “Articles and conference papers” ("quality assurance" OR
|
||
"quality assessment") papers
|
||
only, which reduced the results to 602 candidate studies. AND (test OR validation) 2. “Preview-only”
|
||
AND (method OR framework selected
|
||
OR model OR process)
|
||
|
||
|
||
Table 2
|
||
PICOC for research questions and rationale for each element. • In ACM and Springer Link, the original term “software data” to be too
|
||
narrow and replaced it with the broader term “software”. In the same
|
||
Population Software testing, validation, In this study, we focus on
|
||
and quality software validation, tests, and
|
||
way, we replaced “automotive industry” with the term “automotive”
|
||
quality phases within the in IEEE, ACM, and Scopus.
|
||
automotive industry. • We added the keyword “verification” in ACM, to make the query
|
||
Intervention Methodologies, processes, Refers to the environments and more specific and retrieve additional studies.
|
||
frameworks, models, methods, methods applied, such as
|
||
and techniques methodologies, processes, We did not limit the studies by publication period in the search
|
||
frameworks, models, methods, phase. We found out, in our preliminary searches, that the quality
|
||
and techniques
|
||
Comparison Not applied Not in the scope of this SLR.
|
||
assessment for safety-related software has been reported from 1990
|
||
Outcome Quality assessment Expected results include quality onwards. The first study concerned with safety in control functions is
|
||
characteristics, performance assessment characteristics, dated 1993, and out of 28 studies found before 1993, none focused on
|
||
indicators, and validation performance indicators, and software quality assessments. Thus, we applied the exclusion criteria “E2
|
||
impacts impacts on validation.
|
||
- Older than 1990s” during later screening. A complete list of our in
|
||
Context Automotive industry The application domain is
|
||
restricted to the automotive clusion and exclusion criteria is provided in Table 5.
|
||
industry.
|
||
|
||
|
||
4
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
Table 5 Table 6
|
||
Inclusion and Exclusion Criteria. Selected studies by title and abstract.
|
||
ID Inclusion Criteria Source Candidate papers Included by Included in full
|
||
(database search screening title and text screening
|
||
I1 Primary studies;
|
||
results) abstract
|
||
I2 Papers that consider software data validation/test phases in the automotive
|
||
industry AND: ACM Digital 26 2 (7.7 %) 2 (7.7 %)
|
||
I2.1 Discuss methodologies, processes, frameworks, methods, tools, or models Library
|
||
of quality assessment/assurance for software data; OR IEEE Digital 89 52 (58.4 %) 23 (25.8 %)
|
||
I2.2 Papers that address the impacts of quality assessment on software data Library
|
||
ID Exclusion Criteria: Science@Direct 69 5 (7.2 %) 1 (1.4 %)
|
||
E1 Duplicated studies; Scopus 77 35 (45.5 %) 17 (22 %)
|
||
E2 Older than 1990′s; Springer Link 602 63 (10.5 %) 30 (5 %)
|
||
E3 Papers that consider software development but not in automotive industry; Total 863 157 (18.2 %) 73 (8.4 %)
|
||
E4 Papers not related to software validation/test phases;
|
||
E6 Papers that are not articles from journals, conferences or magazines
|
||
E7 Papers that are not related to vehicle-embedded software • RLC circuits testing, verification, and validation (E7). Papers strictly
|
||
E8 Conference Proceedings
|
||
focused on hardware and without qualitative contributions for the
|
||
E9 Papers not written in the English language
|
||
E10 Not accessible in specific databases
|
||
scope of SLR.
|
||
• Hardware requirement approaches (E7 and E4). The requirements
|
||
for hardware, presented in some studies, did not have a relevant
|
||
3.2. Conducting impact on software validation approaches. We opted for selecting
|
||
software-specific studies when they cite the requirements for soft
|
||
To assess the understanding of inclusion and exclusion criteria ware in vehicle-embedded systems and present impacts on software
|
||
among the researchers, we performed the Pilot Selection (Fig. 1, step 5) - quality assessments (I2.2).
|
||
two rounds of individual selection and agreement check with a sample of • Description of software tooling quality evaluation (E7). The Software
|
||
20 and 24 random candidate papers, respectively. We held meetings Assessment (SA) approaches for test tooling are out of this SLR scope.
|
||
with all researchers aiming to identify the divergences and harmonise • Data science solutions for IT software (E7 and E4). The data science
|
||
the criteria understanding. We performed two additional discussion application is not in scope unless the focus is placed on automotive-
|
||
rounds to clarify divergences and doubts regarding study selection. In embedded software evaluations during validation phases.
|
||
the first one, we found cases within the automotive domain involving • Software quality processes applied for production plants (E7 and E4).
|
||
software validation, but not specific in-vehicle embedded software, Several production software solutions are important for the software
|
||
which is the target of our research. For example, some papers addressed quality lifecycle, and they are out of SLR scope because of their
|
||
the QA of external sound equipment installed in the vehicle, which is out contributions after software validation phases and release for series
|
||
of scope. To solve this issue, we added the exclusion criteria “E7- Papers production.
|
||
that are not related to vehicle embedded software” and “E8-Conference
|
||
Proceedings”. In the second round, we excluded extra papers based on As proposed by Kitchenham and Charters [22], we performed the
|
||
the criteria “E9- Papers not written in English language” due to the low Quality Assessment (Fig. 1, Step 7), aiming for an improved interpre
|
||
relevance of only two non-English papers identified. tation of selected studies. The quality assessment questions presented in
|
||
To assess the agreement level among the researchers and ensure a Table 7 are based on SLR objectives and aimed to identify relevance for
|
||
rigorous selection, we computed the inter-rater agreement proposed by selected research questions (RQ’s). The rationale behind those QA
|
||
Landis and Koch [28]. The results were: questions is described in Appendix A. The possible answers are 1 point
|
||
(or “Yes”), 0.5 points (or “Partially”), and zero points (or “No”).
|
||
• Pilot selection 1 (n = 20): k-factor = 0.342 (Fair). A quality assessment of the selected studies was carried out after the
|
||
• Pilot selection 2 (n = 24): k-factor = 0.750 (Substantial). full-text screening. To reach proper representativeness and quality re
|
||
sults, we adopted a cutoff score greater or equal to 6.0. Thirteen out of
|
||
After the first round, referred to as “pilot selection 1″, we reviewed 73 candidate papers below this threshold were further excluded, many
|
||
and discussed the disagreements. Out of 20 randomly sampled studies, of which do not fulfil our quality assessment questions Q6, Q7, and Q8.
|
||
five disagreements occurred due to unclear inclusion/exclusion criteria. Fig. 2 shows the distribution of studies by achieved score.
|
||
Specifically, the validation or test phases for automotive embedded
|
||
software were not always clearly distinguished, and other out-of-scope
|
||
cases, e.g., software applied in a vehicle assembly line or software for Table 7
|
||
supporting vehicle development processes. After the researcher’s dis Quality Assessment Questions.
|
||
cussion and refinement of exclusion criteria “E4” and “E7”, we per ID Question
|
||
formed the “pilot selection 2”, resulting in a substantial improvement in Q1 Is the study related to the proposed SLR objectives?
|
||
agreement. Q2 Does the study bring relevant information for answering research questions?
|
||
We then started the Studies Selection phase (Fig. 1, step 6). We Q3 Is the quality application field clearly described and applicable to vehicle
|
||
embedded software? (RQ1)
|
||
screened the candidate papers by reading the title and abstract, applying
|
||
Q4 Are the methodology, processes, models or framework clearly presented?
|
||
the inclusion and exclusion criteria listed in Table 5. In case of uncer (RQ1.1)
|
||
tainty, a full text screening was conducted. Out of 863 candidate papers, Q5 Is there any technique comparison included? If yes, is that clearly presented
|
||
we included 157 (18,2 %) based on title and abstract. Table 6 describes and described?
|
||
the distribution of included and excluded studies by electronic database. Q6 Does the study present any real use-case investigation or is applicable for?
|
||
(RQ2)
|
||
We further excluded 84 papers in full text screening that met the
|
||
Q7 Does the study bring consistent results regarding the presented technique(s)
|
||
exclusion criteria E7 - “Papers that are not related to vehicle embedded approach? (RQ3)
|
||
software” and/or E4 - “Papers not related to software validation/test Q8 Does the study present success, unsuccess examples or issues during the
|
||
phases”. Table 6 also details the distribution of final selected studies software validation phase?
|
||
Q9 Is the related study applicable to the commercial vehicles segment?
|
||
after the full text screening in the second selection round. We presented
|
||
Q10 Does the study bring any coverage of quality issues related to software
|
||
below some examples to illustrate the researchers’ rationale: validation lacks?
|
||
|
||
|
||
5
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 2. Distribution of papers based on the achieved score.
|
||
|
||
|
||
Fig. 3 shows the quality of included and excluded papers by publi • Proposed solutions (P1): The proposed solutions for software data
|
||
cation year. Papers published until 2005 accounted for <7 % (3 papers) validation were categorised in the following groups: case study,
|
||
of our dataset, whereas a significant concentration of studies emerged in framework, methodology/method, model, process, technique, and
|
||
the last decade. We observed an increase in high-quality studies in tool.
|
||
recent years, possibly linked to stricter regulatory requirements and/or • Software Assessment type (P2): We analysed the focus of studies in
|
||
the growing maturity of research fields such as safety-critical and terms of quality, safety, security, performance, or any other moti
|
||
cybersecurity domains. The growth trend identified cannot be fully vation for a proposal development solution.
|
||
confirmed for the last period (2021–2024), as the period is not yet • Software Assessment phase (P3): The objective here is to discern
|
||
complete, and the collection for the studies concluded in the first quarter the software lifecycle phase, in accordance with Sommerville [29],
|
||
of 2024. within which the software assessment is applied: concept, design
|
||
The application of the protocol is summarized in the flowchart in (including implementation), validation, development phases (except
|
||
Fig. 4. The specific data for each database is described in Appendix A. maintenance), or complete lifecycle coverage (including
|
||
We then carried out the Data Extraction phase (Fig. 1, step 8), maintenance).
|
||
considering a set of data characteristics (presented in Table 8) linked to • Involved software data source (P4): To understand the possible
|
||
the research questions, here called properties (from P1 to P8). An paths and origins of software data, we extracted existent sources
|
||
additional property, P9 was added later due to the relevance of new from which the software data is validated. Sources found dataset,
|
||
technology trends for software data assessment. dynamic tests, static tests, modelling/simulation, source code, and
|
||
specification.
|
||
|
||
|
||
|
||
|
||
Fig. 3. Quality of studies by publication year.
|
||
|
||
6
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
• Application of Artificial Intelligence (AI), Neural Network (NN)
|
||
or Machine Learning (ML) techniques (P9): We intend to check
|
||
how much the application of such techniques is cited to improve the
|
||
software quality and provide support to the developers to mitigate
|
||
the failures in the field.
|
||
|
||
The categorized groups and their definitions were as stated by the
|
||
authors of the primary studies. Our classification was based on findings,
|
||
and we did not intend to re-classify them into different definition
|
||
groups, rather, we retained those stated in the primary studies. Appen
|
||
dix A lists the extracted terms for property groups used in the data
|
||
extraction phase.
|
||
For the Data Synthesis (Fig. 1, step 9), we applied qualitative analysis
|
||
and inductive coding to analyse the findings of the studies [30]. The
|
||
inductive coding process allows the creation of codes directly from the
|
||
collected data, instead of relying on a predetermined coding framework
|
||
[31]. Each code represents similar textual contents in the studies and
|
||
groups them according to the assigned category. When necessary, a
|
||
second round of coding was conducted to merge similar codes into a
|
||
common final categorization. Fig. 5 shows an example of the coding
|
||
process.
|
||
|
||
3.3. Reporting
|
||
|
||
The Reporting phase (Fig. 1, steps 10 to 13) are presented in sections
|
||
4 (Results and Analysis), 5 (Discussion, Challenges, and Trends), and 6
|
||
(Conclusion).
|
||
Fig. 4. Summarized protocol application.
|
||
3.4. Threats to validity and limitations
|
||
|
||
Table 8 The term “quality” is largely applied to several areas of knowledge.
|
||
Extraction list for properties. In the software data validation and quality assessment process domains,
|
||
ID Property Research
|
||
we noted a large variation in how this term is used. The complexity and
|
||
Question potential restrictions of the defined string may also have impacted the
|
||
identification of relevant papers. We attempted to mitigate these risks by
|
||
P1 Proposed approaches/solutions RQ1
|
||
P2 Software Assessment (SA) type RQ1 increasing the rigour during the studies selection phase (step 6), per
|
||
P3 Software Assessment (SA) phase RQ1.1 forming meetings and rounds of discussion among the researchers to
|
||
P4 Involved software data source RQ2 reach a more assertive result. In the same way, the decision to state a
|
||
P5 Other industry areas, segments, or product types than RQ1.1 publication date limitation after the title and abstract readings
|
||
automotive also cited
|
||
P6 Main software characteristics considered RQ2
|
||
contributed to accessing all available papers and checking their real
|
||
P7 Main assessment characteristics source RQ2 relevance before any exclusion step.
|
||
P8 Main impacts of quality assessment RQ3 During the first selection phase, we noted several cases of quality
|
||
P9 Application of Artificial Intelligence (AI), Neural New technology assessment analysis focused on testing tools, which is out of our research
|
||
Network (NN) or Machine Learning (ML) techniques trends
|
||
scope. To exclude these papers, we added the exclusion-related criteria
|
||
for non-embedded software-related papers (E7 in Table 5). This solution
|
||
• Cited industry areas, segments, or product types others than mitigates but does not eliminate completely the chance of finding
|
||
automotive (P5): The Software Assessment (SA) in the automotive studies mentioning testing tools in the context of embedded software
|
||
domain can bring relevant approaches valid for many other domains. quality assessment. In such cases, we opted to keep the papers that
|
||
This fact moved us to check which are the most cited in the selected contributed to answering the research questions.
|
||
studies. The possible categories are aerospace, avionics, chemical, In Section 4 - Results and Analysis, we used classification terms as
|
||
automation, rail systems, nuclear, and software. stated by the primary studies. That means our classification was based
|
||
• Main software characteristics considered (P6): The target is to on direct findings, and we did not reclassify the studies based on our
|
||
identify the software data characteristics described in the studies judgment. We recognise that overlaps or similarities among the different
|
||
during the SA phase. We define such characteristic groups by terms may introduce biases originating in the primary studies, which
|
||
applying the coding after the data extraction, without predefined could reflect in our results.
|
||
categories. We conduct this analysis to support us in answering RQ2
|
||
properly. 4. Results and analysis
|
||
• Main Quality Assessment (QA) characteristics source (P7):
|
||
Similar to P6, but with the focus on identifying the origins and ref A total of 60 studies were included and considered for data
|
||
erences of the studies to perform a software data assessment extraction.
|
||
approach.
|
||
• Main impacts of Quality Assessment (QA) (P8): The assessment 4.1. Software quality approaches based on proposed studies (RQ1)
|
||
performed with a target to validate a software, in general, results in
|
||
positive impacts such as cost, safety, security, error mitigation, per The main software quality approaches stated in the included studies
|
||
formance, and user experience. are Techniques (25 %), Methods (21.7 %), Frameworks (18.3 %),
|
||
Models (16.7 %), Processes (6.7 %), and Case studies (5.0 %). Other
|
||
|
||
7
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 5. Coding process examples for P2.
|
||
|
||
|
||
categories (Methodology, Standard, Systematic Mapping Study, and methodologies of sensor networks and stream processing to optimise the
|
||
Topology) were mentioned just once (1.7 % each). amount of data to create a method for selectively preprocessing and
|
||
The predominance of Techniques and Methods (together over 45 %) recording measured data, which is the main intended target of their
|
||
suggests a practitioner-driven focus on implementation-ready tools. study. Along the same line, Touw [12] performed a case study merging
|
||
Frameworks and Models are academically grounded, but they might three assessment models (CMMi, TMMi and A-SPICE) to establish a new
|
||
demand higher integration or customization effort. This indicates a model taking account of the quality aspects of complex systems.
|
||
prioritization of practical solutions in safety-critical automotive envi
|
||
ronments. The complete number of studies per software quality 4.1.1. How the software quality approach is applied and where the
|
||
approach is illustrated in Fig. 6. Models. Models, as described by Tak application focus is placed (RQ1.1)
|
||
rouni et al. [32], are based on other already existing models to extend The purpose of this research question is to understand how the
|
||
the applicability and cover the complexity of modern vehicles. This is software quality approach was applied in terms of the SA type domains.
|
||
possible through the addition of modules, focused on desired coverage We found six different target groups, as illustrated in Fig. 7, many of
|
||
areas, such as, reliability, safety, cyber security, cost optimization and them focused on a single quality characteristic. Several studies consid
|
||
others. ered a combination of quality characteristics coming from multiple
|
||
Techniques. Mateen et al. [33] propose a technique for an analysis domains that bring a better compromise among performance, time, cost,
|
||
work to provide a better way to evaluate the software quality level, use of technology limits, improved user experience, and others. For
|
||
having the comparison between manual testing vs automated testing example, Sadio et al. [39] defined quality based on proper detection of
|
||
aiming the minimum error rate within the time and lower target costs. Wi-Fi power thresholds, which improve customer experience and
|
||
Techniques based on software prediction are also cited in [34–37]. satisfaction.
|
||
Ranasinghe et al. [35] emphasise that the quality of intelligent auton Fig. 8 points out Safety and Performance as the most relevant char
|
||
omous decision-making can be reached by applying techniques of the acteristic for software quality evaluation, independent of the adopted
|
||
fusion of sensor data and historical health-state information. approach. In the automotive domain, those attributes are often inter
|
||
A mix of software quality approaches could be also found in [38], related. For example, the capacity of a front radar for autonomous sys
|
||
comprising failure detection via two diagnostic protocols, vehicle data tems to maintain the coverage angle against powder, snow, exhaust
|
||
measurement, fault monitoring, and unexpected behaviour analysis. The gases, water, and fog is directly related to the safety-critical system
|
||
authors proposed an application method based on existing operation. At the same time, a safety assessment for safety-critical
|
||
|
||
|
||
|
||
|
||
Fig. 6. Distribution of studies by software quality approach.
|
||
|
||
8
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 7. Distribution of studies by SA type.
|
||
|
||
|
||
|
||
|
||
Fig. 8. Distribution of Assessment Focus (P2) by Software Quality Approaches (P1).
|
||
|
||
|
||
systems requires that the correct sensor data, processed by the software, did not describe clearly the source of their SA data. Some reasons for that
|
||
is correct and available. are their focus on (1) the effects of the adopted approach or presented
|
||
Taking account of the classification by SA phase (P3), we identified solution [40], (2) the comparison of approaches [33,36] or other similar
|
||
the software lifecycle phases in which software QA is placed (shown in targets. Other studies [41–45] reported more than one source of data.
|
||
Fig. 9). Not surprisingly, most of the studies are focused on the Valida The most relevant software quality approaches (techniques,
|
||
tion phase (75 %) but others also mentioned Concept (11.7 %), Full methods, frameworks, and models) represent 81.7 % of the total selected
|
||
lifecycle (5.0 %), Software development (3.3 %), among others. The studies. These approaches can often be implemented in combination, as
|
||
predominance of Validation phase suggests that QA is often placed be proposed by [11,32,38]. The main reported motivation for adopting
|
||
tween Design and Deployment phases. This may reflect industry prac combined approaches is the need to fit them to a specific new demand
|
||
tices where late-stage validation is still a primary gatekeeper for such as increasing systems complexity [13,44,46,47] or cost [36,48,49].
|
||
compliance and cost, despite calls for shift-left approaches. Oka et al. [50] describe another solution that integrates tools and test
|
||
We also examined data sources for software QA (P4) to identify the cases. The proposed process uses information from Application Lifecycle
|
||
origins of validation data, as illustrated in Table 9. Ten studies (16.7 %) Management (ALM) tools to configure the testing tools and, finally,
|
||
|
||
9
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 9. Distribution of studies by software QA application phase.
|
||
|
||
|
||
assessment in terms of product type segment, as illustrated in Fig. 10. In
|
||
Table 9
|
||
second place, autonomous systems (16.7 %) are increasing in relevance
|
||
Software data sources.
|
||
due to higher efforts concentrated over the last years, as pointed out by
|
||
Software data source Occurrences [7,13,14,35,37,52–56]. The electronic control system (13.3 %), and
|
||
Specification 21 (35 %) Powertrain (6.7 %) completed the most relevant product type segments.
|
||
Dynamic test 13 (21.7 %) The dominance of embedded systems suggests that a deterministic
|
||
Not clearly described 10 (16.7 %) behaviour could be more suitable for QA. The growth of sophisticated
|
||
Dynamic and Static tests 3 (5 %)
|
||
Source code 3 (5 %)
|
||
software products to attend very complex computer systems requires
|
||
Data validation set 2 (3.3 %) that QA starts as early as possible, thus reducing the correction costs [17,
|
||
Modelling 2 (3.3 %) 36,42,57,58].
|
||
Static test 2 (3.3 %) In Fig. 11, the Validation and Concept phases lead to the software
|
||
Dynamic test and Specification 1 (1.7 %)
|
||
Development phases, but their relevance is not the same from a product
|
||
Historic fault data 1 (1.7 %)
|
||
Simulation 1 (1.7 %) type perspective. The presence of the Concept phase for the Embedded
|
||
Static test and Specification 1 (1.7 %) Software, Electronic Control, and Autonomous Systems indicates that
|
||
the assessment focus is also centered on base software or code, instead of
|
||
the Powertrain. Assessment of Powertrain primarily involves evalua
|
||
collect the test results back to the ALM tools. tions of acceptance characteristics, like gear shifting software charac
|
||
Similarities with other sectors (P5), such as aerospace, avionics, rail teristics (comfort, shifting speed, gear scratches), more traditional
|
||
systems, chemical, and automation were cited by four (6.6 %) studies mechanics evaluation approaches, and driver’s feeling. Assessment of
|
||
[11,35,40,51]. The reasons for this are the increasing relevance of safety Electronic Control products focuses on microcontroller management,
|
||
in automotive software applications (e.g. autonomous driving), cyber memory, drivers, standards, and proprietary communication. In the
|
||
security requirements, high-voltage applications for battery-electric Autonomous Systems, the balance between Validation and Concept is
|
||
vehicles, and new standards linked to connectivity and over-the-air expected to increase in the next years due to the importance of safety-
|
||
applications. critical systems issues being addressed early. Rework and additional
|
||
Furthermore, embedded software (48.3 %) leads the software data costs, due to late identification of defective software, can be a no-go for
|
||
|
||
|
||
|
||
|
||
Fig. 10. Product type segments reported by the studies.
|
||
|
||
10
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 11. Concept and Validation Phases by Product Type.
|
||
|
||
|
||
the start of production releases. autonomous driving, e-mobility applications, and telematics. Further
|
||
more, other cited characteristics (code quality, misfunction, perfor
|
||
4.2. Main aspects or characteristics considered in each study (RQ2) mance, communication quality, data accuracy, data interpretation, and
|
||
software health) will also be affected by those standards and regulations.
|
||
The main assessment characteristics considered in QA for software In terms of data sources, we identified the following origins: speci
|
||
data validation are the scope of this research. Furthermore, we also fication with 22 occurrences (36.7 %); requirements with 14 (23.3 %);
|
||
intended to identify the origins of such characteristics and understand measured data with seven (11.7 %); fault data with three (5.0 %);
|
||
their relationship (P4, P6, and P7). simulated with two (3.3 %); CAN databases, emulated data, source code
|
||
Out of 60 included studies, 17 (28.3 %) mentioned code quality. The and stored project data with one (1.7 %) occurrence each one. Eight
|
||
main cited characteristics are Reliability, Maturity, Maintainability, studies (13.3 %) did not report a specific data source or did not simply
|
||
Fault Tolerance, Availability, Safety, Cybersecurity, and Security. emphasise that. This could be because some internal documents are
|
||
Despite the automotive domain’s particularities, they are mainly placed confidential or protected by non-disclosure agreements.
|
||
on the requirements specification, not in the software. This means that a Lastly, but not less important, we tracked software data sources.
|
||
QA for software data validation can be established by generic software Fig. 12 shows the distribution of the occurrences for each software data
|
||
quality standards such as ISO/IEC25010, where Functional Suitability source. The majority of studies concentrates in specification (21 occur
|
||
can also cover the Functional Safety (FuSa) automotive-specific re rences, 35 %) and dynamic and/or static tests (20 occurrences, 33.3 %).
|
||
quirements [46]. Ten (16.7 %) of selected studies did not clearly mention the applied
|
||
From the characteristics illustrated in Fig. 7, we can highlight that source for QA. The causes for this may be the same as described above
|
||
Safety and Cybersecurity have been increasingly cited in recent years. A for reported data sources.
|
||
potential explanation for this is the spread of regional regulations
|
||
implementation, such as UN-ECE R155 for Cybersecurity and UN-ECE 4.3. The impacts of QA approaches on the software lifecycle (RQ3)
|
||
R156 for software update management, and new standards, such as
|
||
ISO26262 – Functional Safety, and ISO21434 – Cybersecurity, to It may be obvious that the main impact of a well-performed software
|
||
establish a baseline for new complex upcoming technologies such as QA process is improved quality itself. As expected, the word “quality”
|
||
|
||
|
||
|
||
|
||
Fig. 12. Reported sources for software data.
|
||
|
||
11
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
was the most cited impact, it was reported by 22 studies (36.7 %). phase, AI, ML, and NN can be widely applied. Some examples of their
|
||
We aimed in RQ3 to identify other potential impacts and understand use are as follows:
|
||
how they are perceived. Thus, we analysed additional cited terms Artificial intelligence (AI). In addition to conventional vehicle
|
||
beyond “quality”. The complete overview of the reported impact dis tests, AI can be used for automated test generation and creation and
|
||
tribution is presented in Fig. 13. failure-revealing scenarios for safety-systems evaluation, as reported by
|
||
The reported techniques, frameworks, and methods are mostly [37].
|
||
applied to safety-critical systems. Autonomous systems are predominant Machine learning (ML). System-healthy assessment can be opti
|
||
(e.g., brake assist, adaptive cruise controller) as well as safety-related mised by ML models, as pointed out by Ranasinghe et al. [35]. For
|
||
embedded software products (ISO26262 compliance). These reported example, ML has been used for state of charge (SoC) battery estimation
|
||
characteristics reinforce the concern of automotive software developers in blind modelling tools [41]. Another example is the use of data-driven
|
||
in recent years. The complete distribution by approaches is presented in ML for dual-clutch transmission software, where input data - including
|
||
Fig. 14. actuator constraints and performance/comfort characteristics – are used
|
||
Table 10 presents the comparison between the SA type (P2) and the to evaluate optimal gear-shifting settings [34].
|
||
main perceived impacts on Quality Assessment (P8). As illustrated, there Neural network (NN). A variant called Recurrent Neural Network
|
||
is a clear link between the main perceived impacts (P8) and the software (RNN), described by Kollmeyer et al. [41], is a type of supervised ML
|
||
quality assessment type (P2). Ranking first in both is quality, followed method that uses short-term memory to process battery state of charge
|
||
by safety, and performance in third. The key messages here: 1) Efforts (SoC) quality metrics. It can be used for time-dependent applications
|
||
can be applied directly to collect the desired outcomes (quality, safety, with sequential datasets, where the goal is to apply the best-trained
|
||
performance, and cybersecurity); 2) Side effects are not restricted to the network in the next processing step. Another study [3] trained NNs
|
||
directly aimed SA type, the benefits extend further; and 3) QA can also with quality parameter metrics to evaluate the software quality and
|
||
have positive impacts on other aspects, such as user experience, effi produce performance ratings for Powertrain applications.
|
||
ciency, and cost. The distribution of quality assessment characteristics by these tech
|
||
The growing emphasis on cybersecurity aligns with regulatory shifts niques is as follows: Safety level with three occurrences (33 %), code
|
||
such as UNECE-R155 and UNECE-R156. This points out concerns with quality and performance with two each (22 %), data accuracy and state
|
||
public trust and liability management. Despite their lower impact of health one each (11,1 %).
|
||
ranking on QA (7th position), cybersecurity topics are now more cited These techniques are suitable for high-complexity systems during
|
||
than UX as SA types, revealing an increasing priority gap between se Verification & Validation (V&V) activities. However, their application
|
||
curity and user-centred validation. present challenges for software developers, such as higher costs due to
|
||
In terms of cost and efficiency, even though they are not directly additional required efforts, the need for advanced infrastructure and
|
||
cited in the SA phase as primary targets, they remain an integral part of expertise. Their focus on autonomous and other safety-critical systems
|
||
the entire software lifecycle and listed by studies as perceived impacts in makes sense, as these areas demand superior product quality and reli
|
||
the 5th and 6th positions, respectively. ability that traditional validation processes cannot achieve [36]. Addi
|
||
tionally, their scalability can mitigate undesired events during tests (e.
|
||
4.4. Application of new techniques and their benefits for software quality g., accidents) and minimize the need for physical tests, reserving pro
|
||
assessment totype vehicle trials in controlled or public road conditions only at
|
||
higher software maturity levels.
|
||
Out of 60 included studies, nine (15 %) mentioned at least one of the In terms of applicability, these techniques can support several soft
|
||
following techniques: artificial intelligence (AI), machine learning (ML), ware and data quality assessment tasks. For software code, they aid in
|
||
and neural networks (NN). Those upcoming trends are also relevant to specification from requirements, code creation through reuse, identifi
|
||
mention. cation of code deviations from fault storage data, test monitoring, and
|
||
In terms of product type segment, four out of nine (44 %) focused on result analysis. For software functions, especially the safety-related
|
||
autonomous systems. These studies emphasized the validation ones, benefits include driver behaviour recognition (e.g., detecting fa
|
||
complexity and the impact on fulfilling functional safety requirements tigue, misuse of cell phones while driving, etc.). They can also extend
|
||
[7,14,35,37]. Two studies (22 %) addressed Powertrain systems, high into post-production, enabling quality experts to monitor the software
|
||
lighting relevant challenges in the integration among electronic, elec quality performance in end-user applications and apply proper correc
|
||
trical, and mechanical components (e.g., automated manual tions when needed.
|
||
transmissions). Electric vehicles were cited in one study (11 %), as was However, there are constraints. AI- and ML- techniques face chal
|
||
general embedded software (11 %). lenges in meeting safety-critical requirements in real-world conditions
|
||
Regardless of the intended characteristic being evaluated in an SA [7]. This limitation prevents the exclusive application of those tech
|
||
niques without additional efforts and funding for complementary vali
|
||
dation in the “real-life” scenarios. Furthermore, variations in product
|
||
characteristics can affect system responses, requiring a worst-case
|
||
evaluation to identify critical variants for testing and ensure a proper
|
||
functional performance while mitigating safety and security risks.
|
||
|
||
5. Discussion, challenges & trends
|
||
|
||
Many studies focus on quality-specific metrics but fail to compre
|
||
hensively address all the characteristics of inherent and system-
|
||
dependent software quality assessment as defined by ISO/IEC25012.
|
||
While data accuracy, data interpretation (often referred to as “under
|
||
standability”), and efficiency are frequently cited, terms like “code
|
||
quality” can also align with this international standard as part of a more
|
||
comprehensive software quality assessment approach. This is further
|
||
supported by studies that identify safety (ISO 26,262) and cybersecurity
|
||
Fig. 13. Software QA reported impacts. (ISO21434) as key assessment characteristics, with “compliance” being
|
||
|
||
12
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
|
||
|
||
Fig. 14. Perceived QA Impacts (P8) by Software Quality Approaches (P1).
|
||
|
||
|
||
standards are further regulated by regional implementations, such as
|
||
Table 10
|
||
UN-ECE R155 for Cybersecurity and UN-ECE R156 for software update
|
||
Comparison between SA type (P2) and Main perceived impacts of QA (P8).
|
||
management. They apply particularly, though not exclusively, to safety-
|
||
P2 - SA Type P8 - Main Impacts of QA critical features such as autonomous systems, electric propulsion, and
|
||
Quality 1◦ 1◦ Advanced Driver Assistance Systems (ADAS). Therefore, to ensure
|
||
Safety 2◦ 2◦ proper code quality, all the aforementioned aspects must be considered
|
||
Performance 3◦ 3◦ from the software concept phase. If these characteristics are neglected,
|
||
User Experience (UX) - 4◦
|
||
Efficiency - 5◦
|
||
they can negatively impact not only software quality but also safety,
|
||
Cost - 6◦ security, and compliance with regional regulations.
|
||
Cybersecurity 4◦ 7◦ Impacts on the software lifecycle. Quality was identified as the
|
||
Traceability 5◦ - most significant impact throughout the software lifecycle. Ensuring
|
||
Reliability 6◦ -
|
||
software quality over its lifetime involves focusing on traceability,
|
||
applying quality tools (such as FMEA and FTA), and implementing
|
||
a particularly notable focus. Another group of studies emphasises safety approaches. The second most significant impact is Safety, which
|
||
portability, reliability, security, user experience (often referred to as arises from software quality assessments that prioritise safety and per
|
||
“usability”), and performance, all based on the principles of the product formance. This is reinforced by functional safety requirements outlined
|
||
quality model established by ISO/IEC25010. This model serves as the in ISO26262. Additionally, User Experience (UX) was reported as an
|
||
foundation for software requirements specification and quality important factor, encompassing all user perceptions of the final product,
|
||
evaluation. which can greatly influence customer satisfaction. Attention to UX has
|
||
Best practices in software development processes significantly increased in recent years with the incorporation of interactive solutions,
|
||
enhance the efficiency of quality assessment, especially given that such as digital and touch screens, connectivity features, and voice
|
||
approximately 85 % of vehicle functionalities are software-based today. interaction.
|
||
We examined quality characteristics and the impacts of these The findings of RQ1 brought an overview of applied approaches, the
|
||
assessment approaches on the software lifecycle, as detailed below: most relevant targets, and motivations for an automotive software
|
||
How each approach is applied and where the application focus quality evaluation. The contributions of RQ1.1 provided the automotive
|
||
is placed. The distribution of software quality assessment approaches software segment groups and identified the phases of software devel
|
||
was concentrated in Techniques, Methods, Frameworks, and Models. opment where the approaches are concentrated. The RQ2 comprises the
|
||
Notably, these categories are not dependent on the specific application characteristics considered, the software source data (acquisition type),
|
||
focus. The three most relevant aspects identified are: Quality, Safety, and the provided sources for those characteristics (requirements, stan
|
||
and Performance. Regarding the application phase, Validation was cited dards, and/or specifications). Finally, the RQ3 improves the under
|
||
in 75 % of the 60 studies included in this review. Other phases, such as standing of perceived impacts and results of a well-implemented
|
||
Conceptualization, Development, and Field Application, are expected to automotive software quality assessment.
|
||
increase in relevance, particularly due to the complexity of cyberse
|
||
curity and safety-critical systems that necessitate monitoring during the 5.1. Challenges & trends
|
||
process to enhance failure prediction during vehicle operation.
|
||
Quality characteristics. Code quality was the most frequently cited The studies also revealed several challenges and issues, such as:
|
||
characteristic, as the structure of the software code is critical to various
|
||
aspects, including complexity, efficiency, execution time, and reliability. • Ensuring safety-critical requirements: Addressing these chal
|
||
Additionally, code quality is intrinsically linked to other characteristics lenges in real-world conditions, particularly with the introduction of
|
||
such as malfunction, performance, communication quality, data accu new AI and ML techniques, requires identifying and clarifying safety
|
||
racy, data interpretation, and software health. and fail-operational requirements as the foundation for an effective
|
||
The other two significant topics identified were safety level and testing strategy [7]. The requirements specification should ensure all
|
||
cybersecurity level, both of which are guided by standards like intended application environments and cover the possible software
|
||
ISO26262 (Functional Safety) and ISO21434 (Cybersecurity). These variances during the entire lifecycle.
|
||
|
||
13
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
• Adapting to rapidly changing software requirements and of tools for software and systems management, such as the Engi
|
||
environments: neering Lifecycle Management (ELM) that provides support for re
|
||
• Frequent changes hinder alignment with standards and processes, quirements, tests, and design specifications. This integrated
|
||
leading to increased costs and efforts in correcting software defects. documentation environment enables the software development
|
||
To facilitate early defect detection, a risk-based approach that teams to deal with complexity, software compliance, and auditable
|
||
identifies high-potential failures is essential. This should be sup documentation.
|
||
ported by exploratory testing based on ISO 29,119 [59]. The soft
|
||
ware documentation, including requirements, code specification, While these findings are centred on the automotive domain, they can
|
||
tests, and related reports, must be maintained. Lifecycle modes, like also benefit other industries, such as aviation, electronic control sys
|
||
agile, support the software development teams in this challenging tems, automation, railway transport, and autonomous systems, by
|
||
task. This does not, by itself, prevent software issues but contributes enhancing their software quality processes through similar or adapted
|
||
to a structured basis for improved software traceability throughout approaches.
|
||
the whole development phase.
|
||
• Bridging the gap between simulated and physical systems: 6. Conclusion
|
||
Simulations are often constrained by what is known as the “reality
|
||
gap.” This issue can be mitigated by incorporating real measure Higher software quality positively influences customer acceptance of
|
||
ments into the simulation and validating the relevant components of a product. Over the past 20 years, the term ’software quality assessment’
|
||
the physics engine using modelling software [52]. Additionally, the has evolved to include new assessment criteria to support the automo
|
||
use of digital twins - virtual representations of physical systems tive software developers in managing growing systems’ complexity. The
|
||
updated with real-time data - can further narrow this gap. It allows primary reason for this evolution in the automotive industry is the rapid
|
||
continuous validation of existing physical tests using real-world expansion of complex technologies, including Advanced Driver Assis
|
||
measurements in prototype vehicles or customer fleets (under an tance Systems (ADAS), connectivity systems (such as over-the-air fea
|
||
official customer’s agreement) via upcoming over-the-air technolo tures), electric propulsion vehicles (e-mobility), battery electric vehicles
|
||
gies for data acquisition. The existing infrastructure may be insuffi (BEV), and hybrid electric vehicles (HEV). To effectively address the
|
||
cient to fulfil the requirements for the validation task, and challenges posed by this complexity, the quality assessment must align
|
||
complementary funding shall be considered only for “in-vehicle” with new requirements and standards. Additionally, a more compre
|
||
equipment, not for the acquisition of infrastructure from scratch. hensive approach can enhance understanding and better equip software
|
||
• Improving quality assessment testing procedures and metrics: developers and testers to tackle the challenges.
|
||
The absence of clear software quality assessment and testing metrics This SLR identified 60 primary studies on automotive software
|
||
in the automotive industry results in vulnerability issues. To address quality assessment published over the past 30 years. It synthesized the
|
||
this, a weight-based approach targeting the most vulnerable software findings about four main types of approaches (techniques, methods,
|
||
components is proposed [53]. The application of robust approaches frameworks, and models) and their focus across the software lifecycle,
|
||
based on quality, safety, and security standards, internal docu with Validation being the most commonly addressed phase (75 %). The
|
||
mented lessons learned, and state-of-the-art practices certainly con review also mapped QA characteristics, with the most cited being code
|
||
tributes to improved software quality. Nevertheless, the accumulated quality (28.3 %) and safety (20 %). Safety and cybersecurity (3.33 %)
|
||
knowledge, expertise, and adequate resources for continuous pro figure in alignment with standards such as ISO 26,262 and ISO 21,434. It
|
||
cesses optimization are critical success factors for the long-term for also revealed that AI and ML techniques are emerging in the field,
|
||
software quality. though currently applied in only 15 % of the studies, mostly in auton
|
||
• Reducing cost and effort in safety-critical embedded software omous and powertrain systems. Finally, our SLR offers actionable
|
||
quality assessment: A set of software defect prediction approaches guidelines (Section 5) for practitioners and researchers to improve
|
||
to mitigate these impacts is presented in [36]. Reducing the quality future software quality processes by applying standards like ISO/IEC
|
||
assessment does not mean, in principle, eliminate spending, but 25,010 and 25,012, and regional regulations such as UN-ECE R155 and
|
||
allocating it properly. As aforementioned, the implementation of a R156.
|
||
robust software validation process reduces the rework costs. In We explored the software quality approaches used in the automotive
|
||
addition to that, flexibility for corrections, modularity for the addi industry, clarified their applications, and identified their specific areas
|
||
tion of new software functions blocks, and applying the defect of focus. Our results indicate that new software developers should adopt
|
||
detection techniques as early as possible, can contribute to miti a more comprehensive quality approach throughout the entire software
|
||
gating the final software validation costs. development process, beginning with the concept phase, to enable
|
||
• Addressing problems in safety-critical machine learning sys earlier detection of non-quality issues, reduce correction costs, and
|
||
tems: Kuwajima et al. [14] emphasised the need for improved avoid showstoppers on vehicle assembly lines. The quality characteris
|
||
data-driven requirements specification and design. Both test data tics and impacts outlined in this SLR ensured through the use of quality
|
||
and training data should accurately reflect real application scenarios. standards (ISO25000 series), proper specifications and requirements,
|
||
Additionally, they propose future research to address gaps in current encompassing key pillars such as safety, cybersecurity, user experience,
|
||
quality standards, such as ISO/IEC 25,000. As pointed out for the gap and performance. Additionally, regional regulations and their impacts
|
||
between simulated and physical systems, the use of real-world data on technical specifications must be considered during the conceptuali
|
||
can stimulate and teaching machine learning systems. The combi zation phase. Quality tools remain essential for the entire software
|
||
nation of measured data and simulated environments can be pro development lifecycle. Furthermore, it is crucial to involve customers as
|
||
vided, for example, a more comprehensive scan of radars and much as possible in the conceptualization phase and to gather their
|
||
cameras from an active brake assist system. The integration of such feedback during the initial pre-series (pilot production) evaluations to
|
||
benefits can also provide a reduced development cost by shared enhance user experience (e.g., gearshift strategy for automated trans
|
||
resources. mission software). The relevance of AI, and ML in managing complex
|
||
• Overcoming poor documentation and scalability challenges: technologies is evidenced, but their integration depends on traditional
|
||
Software testing strategy decisions are often based on experience, real-world validation, which introduces safety risks. The availability of
|
||
and unclear textual specifications can negatively impact software expert workforce, infrastructure costs, and day-by-day improvements
|
||
quality. Therefore, improved documentation practices are essential will likely define which organizations will stay and which ones will
|
||
for enhancing software quality management. An example is the use perish in this domain.
|
||
|
||
14
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
The SLR outcomes can be used as a guideline for software developers through a comparative analysis of emerging studies and traditional
|
||
to implement and/or improve the automotive SA approaches by using approaches.
|
||
documentation-supported tools (e.g.: ELM), focusing on the most rele
|
||
vant characteristics highlighted in state-of-the-art practices, and aligned CRediT authorship contribution statement
|
||
with recommended quality, safety, and security standards.
|
||
We encourage the research directions focused on how the more Gilmar Pagoto: Writing – review & editing, Writing – original draft,
|
||
comprehensive real-world data analysis can contribute to mitigate the Methodology, Investigation, Conceptualization. Luiz Eduardo Galvão
|
||
existing test modelling and simulation gaps, especially for safety-critical Martins: Writing – review & editing, Validation, Supervision, Method
|
||
automotive software (e.g., autonomous systems). Second specific ology, Formal analysis, Conceptualization. Jefferson Seide Molléri:
|
||
research is a deeper investigation into the potential contributions of the Writing – review & editing, Validation, Methodology, Formal analysis,
|
||
over-the-air automotive data acquisition techniques for AI-based sys Conceptualization.
|
||
tems training activities, focused on software quality assessments.
|
||
Furthermore, the research in the aforementioned techniques – AI and
|
||
ML– to gain deeper understanding of future trends in supporting safety- Declaration of competing interest
|
||
critical quality assessments. Comparison with existing systematic re
|
||
views can support this goal. Future studies should aim for a more The authors declare that they have no known competing financial
|
||
comprehensive quality coverage that includes characteristics not fully interests or personal relationships that could have appeared to influence
|
||
addressed by traditional standards, such as the ISO/IEC 25,000 series, the work reported in this paper.
|
||
|
||
|
||
|
||
Appendix A
|
||
|
||
1. Terms and definitions
|
||
|
||
To provide a better understanding of the application context in our SLR to the readers, we described the several key terms used in this work.
|
||
The terms are alphabetically ordered:
|
||
ADAS or Advanced Driver-Assistance System. These systems are focused on improving safety driving, offering features that can warn the driver
|
||
in case of misdriving (lane departure), distracted attention, or assume vehicle control (steering, braking, accelerating).
|
||
Approach. Based on the context of this SLR, we refer to the methodologies, processes, frameworks, models, methods, and techniques focused on
|
||
software data assessment phases.
|
||
Automotive SPICE (A-SPICE). It is a reference model for automotive process assessment [12].
|
||
Capability maturity model integration (CMMi). Is a bundle of practices to evaluate product development or services [12].
|
||
Characteristics impacts. The reported effects or influences on the software quality, such as safety, performance, user experience (UX), cost,
|
||
efficiency, and cyber-security.
|
||
Critical embedded systems (CES). The systems, in case of failures, can hardly impact the safety, security, and performance causing potential
|
||
damage to the environment or to human lives [15].
|
||
FuSa or Functional Safety. Is the “absence of unreasonable risks due to hazards caused by malfunctioning behaviour of Electric/Electronic
|
||
systems” [10]. ISO26262 is a standard that offers a framework to provide guidelines for automotive safety-related Electric/Electronic systems.
|
||
Property. In this work, the extracted data that contributes to research questions answering and/or aggregates value for the results analysis step.
|
||
Quality assessment characteristics. The main assessed attributes in the software (e.g. code quality, safety level, data accuracy). They were
|
||
extracted from selected studies.
|
||
Safety. Is a system attribute with the focus on software as the critical component [16].
|
||
Software data. In this work, “software data” means any software-related information that can be used for checking, measuring, comparing, or
|
||
evaluating via static (e.g., documentation, software code) or dynamic (e.g., test benches, vehicle, simulation) tests. The term software applied in this
|
||
work is based on the IEEE 1012–2016, which includes firmware and code. Software includes documentation.
|
||
Test maturity model integration (TMMi). Is a reference model for test process assessment and improvements [12].
|
||
User experience (UX). In the context of this SLR, is the impact created by a software quality improvement on the product in terms of facility, day-
|
||
by-day use, and efficiency perceived by the final customer [3,17,34,41].
|
||
Validation. This term is based on IEEE 1012–2016, which defines “validation” as the evaluation process of the software (as part of a system or
|
||
component) during or after the development process concerning the specified requirements. Validation processes include analysis, review, inspection,
|
||
assessment, and testing.
|
||
|
||
2. Categories description
|
||
|
||
The definitions were self-stated by the authors of the primary studies. Our classification was based on findings, and we did not intend to re-classify
|
||
them into other definition groups, rather the one stated in the primary studies. We acknowledge that overlaps or similarities exist among the different
|
||
terms and now discuss them in our Section 3.4-Threats to Validity and Limitations.
|
||
Approaches
|
||
We considered the self-stated approaches integrally.
|
||
|
||
• Techniques
|
||
• Methods
|
||
• Frameworks
|
||
• Models
|
||
• Processes
|
||
|
||
|
||
15
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
• Case study
|
||
• Methodology
|
||
• Standard
|
||
• Systematic Mapping Study
|
||
• Topology
|
||
|
||
Application Focus / Target Groups
|
||
|
||
• Quality. When the studies present this general term, we understand that a bundle that was not described or split by the studies is behind. The
|
||
content of this bundle is detailed in standards, such as ISO 25,012, which support the critical characteristics for software quality evaluation.
|
||
• Safety. The term is covered by ISO 26,262 or only self-stated by studies.
|
||
• Performance. The combination of characteristics is covered by standards (ISO25012) and others, such as comfort.
|
||
• Cybersecurity. The term is covered by ISO 21,434, or only self-stated by studies.
|
||
• Traceability. The term is covered by ISO 25,012, or only self-stated by studies.
|
||
• Reliability. Term self-stated by studies.
|
||
|
||
Quality Assessment Application
|
||
|
||
• Validation. In same studies, the Validation is cited as Verification and Validation (V&V).
|
||
• Concept. Term self-stated by studies.
|
||
• Life Cycle. This classification defines the software phases from concept to decommissioning.
|
||
• Development Phases. The automotive V-Model establishes that “development phases” are Requirements & System Specification, Architecture &
|
||
Component Design, Coding or Implementation, and Tests (Component, Integration, System, and Acceptance).
|
||
• Field Applications. The end-user environment.
|
||
|
||
Product Type
|
||
|
||
• Embedded Software. The embedded software is an automotive software under evaluation and the focus of the studies’ assessment.
|
||
• Autonomous Systems. The Advanced Driver Assistance Systems are in scope. Examples: Active Brake Assist and Lane Keep Assist.
|
||
• Electronic Control Systems. The combination of embedded software and hardware.
|
||
• Powertrain. This term includes the engine, gearbox, and axles.
|
||
• Commercial Vehicles. Focused on vehicle-centered approaches.
|
||
• Connectivity. Cloud-based systems and others that use over-the-air solutions.
|
||
• Electric Vehicles. All categories of vehicles equipped with electric propulsion.
|
||
• Automation Systems. The automated gearbox is an example.
|
||
|
||
Software Data sources
|
||
|
||
• Specification. When the assessment data is based on specifications.
|
||
• Static/Dynamic Tests. When the Static/Dynamic tests provide reference data for other approaches.
|
||
• Source Code.
|
||
• Data Validation Set. When the storage data from tests is applied.
|
||
• Modelling. The result of modelling tasks is in focus.
|
||
• Historic Fault Data. When the storage fault data is applied.
|
||
• Simulation. The result of modelling tasks is in focus.
|
||
|
||
Reported Assessment Impacts
|
||
|
||
• Quality. When the studies present this general term, we understand that a bundle that was not described or split by the studies is behind. The
|
||
content of this bundle is detailed in standards, such as ISO 25,012, which support the critical characteristics for software quality evaluation.
|
||
• Safety. The term is covered by ISO 26,262 or only self-stated by studies.
|
||
• Performance. The combination of characteristics covered by standards (ISO25012) and others, such as comfort.
|
||
• User Experience (UX)
|
||
• Cost
|
||
• Efficiency. The term is covered by ISO 25,012 or is only self-stated by studies.
|
||
• Cybersecurity. The term is covered by ISO 21,434 or is only self-stated by studies.
|
||
|
||
3. Instructions for quality assessment criteria
|
||
|
||
We have created the instructions below to provide a better understanding of the quality assessment defined criteria.
|
||
Q1 - Is the study related to the proposed SLR objectives?
|
||
The target is to identify the presence of research questions topics in the studies.
|
||
|
||
• 1 point (Yes): The study contributes to all the research questions.
|
||
• 0,5 point (Partially): The study contributes at least one or more research question topic.
|
||
• 0 point (“No”): The study does not contribute to the research questions.
|
||
|
||
16
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
Q2 - Does the study bring relevant information for answering research questions?
|
||
The target is to identify the detailing level of research question topics in the studies.
|
||
|
||
• 1 point (Yes): The study contributes to all the research questions.
|
||
• 0,5 point (Partially): The study contributes at least one or more research question topic.
|
||
• 0 point (“No”): The study does not contribute to the research questions.
|
||
|
||
Q3 - Is the quality application field clearly described and applicable to vehicle embedded software? (RQ1)
|
||
The target is to identify the description level of the quality approach field.
|
||
|
||
• 1 point (Yes): The study offers a dedicated section for application description.
|
||
• 0,5 point (Partially): The application description is present and can be identified in the text.
|
||
• 0 point (“No”): The study does not contribute to the research questions.
|
||
|
||
Q4 - Are the methodology, processes, models or framework clearly presented? (RQ1.1)
|
||
The target is to identify the description level of the adopted quality approaches.
|
||
|
||
• 1 point (Yes): The study offers a dedicated section for description.
|
||
• 0,5 point (Partially): The description is present and can be identified in the text.
|
||
• 0 point (“No”): The study does not contribute to the research questions.
|
||
|
||
Q5 - Is there any technique comparison included? If yes, is that clearly presented and described?
|
||
The target is to identify the presence and description level of the technique’s comparison.
|
||
|
||
• 1 point (Yes): The study offers a dedicated section for description.
|
||
• 0,5 point (Partially): The description is present and can be identified in the text.
|
||
• 0 point (“No”): The study does not contribute to the research questions.
|
||
|
||
Q6 - Does the study present any real use-case investigation, or is it applicable for? (RQ2)
|
||
The target is to identify the presence and description level of real use-cases.
|
||
|
||
• 1 point (Yes): The study offers a real use-case investigation and description.
|
||
• 0,5 point (Partially): The study presents example(s) of use-cases without detailing.
|
||
• 0 point (“No”): The study does not present real use-cases.
|
||
|
||
Q7 - Does the study bring consistent results regarding the presented technique(s) approach? (RQ3)
|
||
The target is to identify the presence and description level of the presented techniques.
|
||
|
||
• 1 point (Yes): The study offers results and description.
|
||
• 0,5 point (Partially): The study presents results without detailing.
|
||
• 0 point (“No”): The study does not present results.
|
||
|
||
Q8 - Does the study present success, unsuccess examples or issues during the software validation phase?
|
||
The target is to identify the presence and description level of critical results evaluation and challenges faced.
|
||
|
||
• 1 point (Yes): The study presents success, unsuccess, and challenges.
|
||
• 0,5 point (Partially): The study presents at least one of them.
|
||
• 0 point (“No”): The study does not present any results.
|
||
|
||
Q9 - Is the related study applicable to the commercial vehicles segment?
|
||
The target is to identify the applicability level for the commercial vehicles segment.
|
||
|
||
• 1 point (Yes): The study states the applicability for commercial vehicles.
|
||
• 0,5 point (Partially): The study presents general applicability coverage.
|
||
• 0 point (“No”): The study does not specify the applicability coverage.
|
||
|
||
Q10 - Does the study bring any coverage of quality issues related to software validation lacks?
|
||
The target is to identify the presence of examples of software validation lacks and their consequences.
|
||
|
||
• 1 point (Yes): The study presents a root cause relationship.
|
||
• 0,5 point (Partially): The study presents a general lack and/or consequences.
|
||
• 0 point (“No”): The study does not mention them.
|
||
|
||
|
||
|
||
|
||
17
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
|
||
4-. Protocol application: candidate studies identification, screening process, and inclusion/exclusion results by database
|
||
|
||
|
||
|
||
|
||
Data availability International Conference on Range Technology (ICORT). IEEE, 2021, https://doi.
|
||
org/10.1109/ICORT52730.2021.9581624.
|
||
[12] E. Touw, Multi-faceted reliability assessment techniques: an industrial case study,
|
||
No data was used for the research described in the article. in: 2017 IEEE International Conference on Software Architecture Workshops
|
||
(ICSAW). IEEE, 2017, https://doi.org/10.1109/ICSAW.2017.66.
|
||
[13] T. Genevois, J.B. Horel, A. Renzaglia, et al., Augmented reality on lidar data: going
|
||
References
|
||
beyond vehicle-in-the-loop for automotive software validation, in: 2022 IEEE
|
||
Intelligent Vehicles Symposium (IV). IEEE, 2022, https://doi.org/10.1109/
|
||
[1] IEEE Standards Association, IEEE 1012-2016: IEEE standard for system, Software, IV51971.2022.9827351.
|
||
and hardware verification and validation (2017). [14] H. Kuwajima, H. Yasuoka, T. Nakae, Engineering problems in machine learning
|
||
[2] P.L. Goddard, Validating the safety of embedded real-time control systems using systems, Mach. Learn 109 (5) (2020) 1103–1126, https://doi.org/10.1007/
|
||
FMEA, in: Annual Reliability and Maintainability Symposium 1993 Proceedings. s10994-020-05872-w.
|
||
IEEE, 1993, https://doi.org/10.1109/RAMS.1993.296851. [15] D. Feitosa, A. Ampatzoglou, P. Avgeriou, et al., Design approaches for critical
|
||
[3] J. Zhang, Y. Lei, X. Hua, et al., Proposed shift quality metrics and experimentation embedded systems: a systematic mapping study, in: Evaluation of Novel
|
||
on AMT shift quality evaluation, in: Third International Conference on Natural Approaches to Software Engineering: 12th International Conference, ENASE 2017,
|
||
Computation (ICNC 2007) 2, 2007, https://doi.org/10.1109/ICNC.2007.582. Porto, Portugal, April 28–29, 2017.
|
||
IEEE. [16] P.V. Bhansali, Universal software safety standard, ACM SIGSOFT Softw. Eng. Notes
|
||
[4] L. Mäurer, T. Hebecker, T. Stolte, et al., On bringing object-oriented software 30 (5) (2005) 1–4, https://doi.org/10.1145/1095430.1095440.
|
||
metrics into the model-based world–Verifying ISO 26262 compliance in Simulink. [17] R. Rana, M. Staron, J. Hansson, et al., Defect prediction over software life cycle in
|
||
System analysis and modeling: models and reusability: 8th international automotive domain state of the art and road map for future, in: 2014 9th
|
||
conference, SAM 2014, Valencia, Spain, September 29-30, 2014. Proceedings 8, International Conference on Software Engineering and Applications (ICSOFT-EA).
|
||
Springer International Publishing, 2014, https://doi.org/10.1007/978-3-319- IEEE, 2014.
|
||
11743-0_15. [18] L.S. Myllyaho, et al., Systematic literature review of validation methods for AI
|
||
[5] M. Nyberg, D. Gurov, C. Lidström, et al., Formal verification in automotive systems, J. Syst. Softw. 181 (Nov. 2021) 111050, https://doi.org/10.1016/j.
|
||
industry: enablers and obstacles." leveraging applications of Formal methods, jss.2021.111050.
|
||
verification and validation. Industrial practice: 8th international symposium, ISoLA [19] B. Gezici, A.K. Tarhan, Systematic literature review on software quality for AI-
|
||
2018, Limassol, Cyprus, November 5-9, 2018, Proceedings, Part IV 8, Springer based software, Empir. Softw. Eng. 27 (3) (Mar. 2022), https://doi.org/10.1007/
|
||
International Publishing, 2018, https://doi.org/10.1007/978-3-030-03427-6_14. s10664-021-10105-2. Art. 66.
|
||
[6] I. Stürmer, E. Salecker, H. Pohlheim, Reviewing software models in compliance [20] M.A. Ali, et al., A systematic mapping of quality models for AI systems, software
|
||
with ISO 26262, in: Computer safety, reliability, and security: 31st international and components, Appl. Sci. 12 (17) (Aug. 2022), https://doi.org/10.3390/
|
||
conference, SAFECOMP 2012, Magdeburg, Germany, September 25-28, 2012. app12178700. Art. 8700.
|
||
Proceedings 31, Springer Berlin Heidelberg, 2012, https://doi.org/10.1007/978-3- [21] M. Krichen, Formal methods and validation techniques for ensuring automotive
|
||
642-33678-2_22. systems security, Information 14 (12) (Dec. 2023) 666, https://doi.org/10.3390/
|
||
[7] F. Wotawa, B. Peischl, F. Klück, et al., Quality assurance methodologies for info14120666.
|
||
automated driving, Elektrotech. Inf. 135 (4–5) (2018) 322–327, https://doi.org/ [22] B. Kitchenham, and S. Charters. "Guidelines for performing systematic literature
|
||
10.1007/s00502-018-0630-7. reviews in software engineering." 2007, 1051.
|
||
[8] ISO/IEC, ISO/IEC 25000: 2014 systems and software engineering-systems and [23] J. Biolchini, P.G. Mian, A.C.C. Natali, et al., Systematic review in software
|
||
software Quality Requirements and Evaluation (SQuaRE)-Guide to SQuaRE, Inf. engineering, Syst. Eng. Comput. Sci. Dep. COPPE/UFRJ Tech. Rep. ES 679.05
|
||
technol. stand. (2014). (2005) 45.
|
||
[9] International Organization for Standardization, ISO/SAE 21434: 2021: Road [24] V. Parsifal, 2021. Available online: https://parsif.al (accessed on 24.03.2024).
|
||
Vehicles: Cybersecurity Engineering, ISO, (2021). [25] D. Stefanovic, S. Havzi, D. Nikolic, et al., Analysis of the tools to support systematic
|
||
[10] International Organization for Standardization (ISO). ISO 26262-1: 2018–road literature review in software engineering. IOP conference series: materials science
|
||
vehicles—functional safety. Geneva, Switzerland. 2018. and engineering. Vol. 1163. No. 1, IOP Publishing, 2021.
|
||
[11] P. Kumar, L.K. Singh, C. Kumar, et al., A Bayesian belief network model for early [26] C. Wohlin, P. Runeson, M. Höst, et al., Experimentation in software engineering,
|
||
prediction of reliability for computer-based safety-critical systems, in: 2021 2nd Springer Science & Business Media, 2012.
|
||
|
||
|
||
18
|
||
G. Pagoto et al. Computer Standards & Interfaces 97 (2026) 104110
|
||
|
||
[27] Kitchenham, B.A., Budgen, D., & Brereton, P. “Evidence-based software [45] E. Bringmann, A. Krämer, Model-based testing of automotive systems, in: 2008 1st
|
||
engineering and systematic reviews.” CRC press., 2015. international conference on software testing, verification, and validation. IEEE,
|
||
[28] J.R. Landis, G.G. Koch, An application of hierarchical kappa-type statistics in the 2008, https://doi.org/10.1109/ICST.2008.45.
|
||
assessment of majority agreement among multiple observers, Biometrics (1977) [46] C. Kugler, S. Kowalewski, J. Richenhagen, et al., Metrics-based strategies for
|
||
363–374. quality assurance of automotive embedded software. 17. Internationales stuttgarter
|
||
[29] I. Sommerville, Software engineering, 9"‘ed, Engl.: Educ. Ltd. (2010). symposium: Automobil-und Motorentechnik, Springer, Fachmedien Wiesbaden,
|
||
[30] J. Saldaña, The coding manual for qualitative researchers, Sage (2015). 2017, https://doi.org/10.1007/978-3-658-16988-6_56.
|
||
[31] D.M. Bailey, J.M. Jackson, Qualitative data analysis: challenges and dilemmas [47] J. Jürjens, D. Reiß, D. Trachtenherz, Model-based quality assurance of automotive
|
||
related to theory and method, Am. J. Occup. Ther. 57 (1) (2003) 57–65. Publisher: software. International conference on model driven engineering languages and
|
||
American Occupational Therapy Association. systems, Springer Berlin Heidelberg, Berlin, Heidelberg, 2008, https://doi.org/
|
||
[32] M. Takrouni, M. Gdhaifi, A. Hasnaoui, et al., Design and implementation of a 10.1007/978-3-540-87875-9_59.
|
||
simulink DDS blockset and its integration to an active frame steering blockset [48] M. Ellims, R. Evans, K.M. Hobley, et al., When is software ready for production?
|
||
conformed to SAE ElectricVehicle, in: 2017 IEEE/ACS 14th International Parallels with automotive QS9000 methods, in: Aspects of safety management:
|
||
Conference on Computer Systems and Applications (AICCSA). IEEE, 2017, https:// proceedings of the ninth safety-critical systems symposium, Bristol, UK 2001,
|
||
doi.org/10.1109/AICCSA.2017.52. Springer London, London, 2000, https://doi.org/10.1007/978-1-4471-0713-2_9.
|
||
[33] A. Mateen, Q. Zhu, S. Afsar, Comparitive analysis of manual vs automotive testing [49] X. Li, W.E. Wong, R. Gao, et al., Genetic algorithm-based test generation for
|
||
for software quality, in: Proceedings of the 7th International Conference on software product line with the integration of fault localization techniques, Empir.
|
||
Software Engineering and New Technologies, 2018, https://doi.org/10.1145/ Softw. Eng. 23 (2018) 1–51, https://doi.org/10.1007/s10664-016-9494-9.
|
||
3330089.3330121. [50] D.K. Oka, T. Makila, R. Kuipers, Integrating application security testing tools into
|
||
[34] X. Lu, H. Chen, B. Gao, et al., Data-driven predictive gearshift control for dual- ALM tools in the automotive industry, in: 2019 IEEE 19th International Conference
|
||
clutch transmissions and FPGA implementation, IEEE Trans. Ind. Electron. 62.1 on Software Quality, Reliability and Security Companion (QRS-C). IEEE, 2019,
|
||
(2014) 599–610, https://doi.org/10.1109/TIE.2014.2312312. https://doi.org/10.1109/QRS-C.2019.00021.
|
||
[35] K. Ranasinghe, R. Sabatine, A. Gardi, et al., Advances in integrated system health [51] O. Kovalenko, E. Serral, M. Sabou, et al., Automating cross-disciplinary defect
|
||
management for mission-essential and safety-critical aerospace applications, Prog. detection in multi-disciplinary engineering environments, Knowledge engineering
|
||
Aerosp. Sci. 128 (2022), https://doi.org/10.1016/j.paerosci.2021.100758. and knowledge management: 19th international conference, EKAW 2014,
|
||
[36] M.K. Thota, F.H. Shajin, P. Rajesh, Survey on software defect prediction Linköping, Sweden, November 24-28, 2014. Proceedings 19 (2014), https://doi.
|
||
techniques, Int. J. Appl. Sci. Eng. 17 (4) (2020) 331–344, https://doi.org/10.6703/ org/10.1007/978-3-319-13704-9_19.
|
||
IJASE.202012_17(4).331. [52] J.R.V. Rivero, T. Gerbich, B. Buschardt, et al., The effect of spray water on an
|
||
[37] H. Ebadi, M.H. Moghadam, M. Borg, et al., Efficient and effective generation of test automotive LIDAR sensor: a real-time simulation study, IEEE Trans. Intell. Veh. 7.1
|
||
cases for pedestrian detection-search-based software testing of Baidu Apollo in (2021) 57–72, https://doi.org/10.1109/TIV.2021.3067892.
|
||
SVL, in: 2021 IEEE International Conference on Artificial Intelligence Testing [53] L.J. Moukahal, M. Zulkernine, M. Soukup, Vulnerability-oriented fuzz testing for
|
||
(AITest). IEEE, 2021, https://doi.org/10.1109/AITEST52744.2021.00030. connected autonomous vehicle systems, IEEE Trans. Reliab. 70 (4) (2021)
|
||
[38] H. Schweppe, A. Zimmermann, D. Grilly, Flexible in-vehicle stream processing 1422–1437, https://doi.org/10.1109/TR.2021.3112538.
|
||
with distributed automotive control units for engineering and diagnosis, in: 2008 [54] E.Y. Kang, D. Mu, L. Huang, et al., Verification and validation of a cyber-physical
|
||
International Symposium on Industrial Embedded Systems. IEEE, 2008, https:// system in the automotive domain, in: 2017 IEEE International Conference on
|
||
doi.org/10.1109/SIES.2008.4577683. Software Quality, Reliability and Security Companion (QRS-C). IEEE, 2017,
|
||
[39] O. Sadio, I. Ngom, C. Lishou, Design and prototyping of a software defined https://doi.org/10.1109/QRS-C.2017.62.
|
||
vehicular networking, IEEE Trans. Veh. Technol. 69 (1) (2019) 842–850, https:// [55] M. Mauritz, F. Howar, A. Rausch, Assuring the safety of advanced driver assistance
|
||
doi.org/10.1109/TVT.2019.2950426. systems through a combination of simulation and runtime monitoring. Leveraging
|
||
[40] M. Kläs, T. Bauer, A. Dereani, et al., A large-scale technology evaluation study: applications of formal methods, verification and validation: discussion,
|
||
effects of model-based analysis and testing, in: 2015 IEEE/ACM 37th IEEE dissemination, applications: 7th international symposium, ISoLA 2016, Imperial,
|
||
International Conference on Software Engineering. Vol. 2. IEEE, 2015, https://doi. Corfu, Greece, October 10-14, 2016, Proceedings, Part II 7, Springer International
|
||
org/10.1109/ICSE.2015.141. Publishing, 2016, https://doi.org/10.1007/978-3-319-47169-3_52.
|
||
[41] P.J. Kollmeyer, M. Naguib, F. Khanum, et al., A blind modeling tool for [56] P. Skruch, M. Długosz, P. Markiewicz, A formal approach for the verification of
|
||
standardized evaluation of battery state of charge estimation algorithms, in: 2022 control systems in autonomous driving applications, in: Trends in Advanced
|
||
IEEE Transportation Electrification Conference & Expo (ITEC). IEEE, 2022, https:// Intelligent Control, optimization and automation: proceedings of KKA 2017—the
|
||
doi.org/10.1109/ITEC53557.2022.9813996. 19th Polish control conference, Kraków, Poland, June 18–21, 2017, Springer
|
||
[42] N. Kovačić, V. Ilić, A. Korać, et al., Validation of process execution on automotive International Publishing, 2017, https://doi.org/10.1007/978-3-319-60699-6_18.
|
||
embedded devices, in: 2017 13th International Conference on Advanced [57] K. Liu, W. Kong, G. Hou, et al., A survey of formal techniques for hardware/
|
||
Technologies, Systems and Services in Telecommunications (TELSIKS). IEEE, 2017, software co-verification, in: 2018 7th International Congress on Advanced Applied
|
||
https://doi.org/10.1109/TELSKS.2017.8246318. Informatics (IIAI-AAI). IEEE, 2018, https://doi.org/10.1109/IIAI-AAI.2018.00033.
|
||
[43] R. Jiang, Required characteristics for software reliability growth models. 2009 WRI [58] N. Mellegård, M. Staron, F. Törner, A light-weight defect classification scheme for
|
||
world congress on software engineering. Vol. 4, IEEE, 2009, https://doi.org/ embedded automotive software and its initial evaluation, in: 2012 IEEE 23rd
|
||
10.1109/WCSE.2009.157. International Symposium on Software Reliability Engineering. IEEE, 2012, https://
|
||
[44] P. Herber, S. Glesner, Verification of embedded real-time systems, in: Formal doi.org/10.1109/ISSRE.2012.15.
|
||
Modeling and Verification of Cyber-Physical Systems: 1st International Summer [59] D.K. Kim, J.H. Kim, M.C. Lee, Improvement of HILS using advanced exploratory
|
||
School on Methods and Tools for the Design of Digital Systems, Bremen, Germany and optimization techniques for system qualification test, Int. J. Automot. Technol.
|
||
2015, September 2015, pp. 1–25, https://doi.org/10.1007/978-3-658-09994-7_1. 24 (3) (2023) 901–911, https://doi.org/10.1007/s12239-023-0074-x.
|
||
|
||
|
||
|
||
|
||
19
|
||
|