[CAIN 2025] A Metrics-Oriented Architectural Model to Characterize Complexity on ML-Enabled Systems

A Metrics-Oriented Architectural Model
to Characterize Complexity on
Machine Learning-Enabled Systems

https://renatocf.xyz/cain-2025-slides

2025

Renato Cordeiro Ferreira

Institute of Mathematics and Statistics (IME)
University of São Paulo (USP) – Brazil

Jheronimus Academy of Data Science (JADS)
Technical University of Eindhoven (TUe) – The Netherlands

Co-funder of CodeLab
Brazilian student group focused on stimulating technological innovation.

B.Sc. and M.Sc. at University of São Paulo (BR)

Theoretical and practical experience with machine learning and software engineering.

Scientific Programmer at JADS (NL)

Participating on European Projects using Machine Learning techniques.

Ph.D. candidate at USP

Research about intelligent software engineering, in particular metrics to build ML-enabled systems.

Renato Cordeiro Ferreira

https://renatocf.xyz/contacts

My goal is to
use metrics to identify
where complexity emerges
in the software architecture
of ML-enabled systems

Research Questions

What are the measurable dimensions of complexity
in the architecture of ML-enabled systems?

How can complexity metrics be operationalized
over the architecture of ML-enabled systems?

RQ1

RQ2

RQ3

How can complexity metrics be used to aid
the development, operation, and evolution
of real-world ML-enabled systems?

Research Questions

How can complexity metrics be used to choose between architecture proposals for an ML-enabled system?

How can complexity metrics be used to identify refactoring opportunities in an ML-enabled systems?

RQ3.1

RQ3.2

RQ3

How can complexity metrics be used to aid
the development, operation, and evolution
of real-world ML-enabled systems?

Research Methodology

State of the art

about metrics

regarding

ML-Enabled

Systems

Industry- and academic-based case study on complexity metrics for
ML-Enabled Systems

Mixed-method approach to assess the impact of complexity in development tasks for
ML-Enabled Systems

Threats to Validity

Construct Validity
The study can measure what it proposed to measure

Internal Validity
The study can produce the results it reported

External Validity
The study can be generalized to other contexts

Conclusion
The study can be replicated by other researchers

Data from

knowledge bases

Researcher

Ontology

Design

Choice of

Case Studies

Selection of

Metrics

Constructed
Examples

Sampling

Population of

Practitioners

Established

Publication

Databases

Guidelines

from

Literature

Inclusion

Criteria for

Case Studies

Exploratory +

Confirmatory

Case Studies

Sampling

Population of

Practitioners

Constructed
Examples

Reference Architecture
for ML-Enabled Systems

Case Studies

Data Collection App
Scientific Initiation 2021
Francisco Wernke

Streaming
Prediction Server
+ Client API / App
Capstone Project 2022
Vitor Tamae

Highly Availability
with Kubernetes
Capstone Project 2023
Vitor Guidi

Redesign Continuous Training Subsystem
Capstone Project 2023
Daniel Lawand

CI/CD/CD4ML on
Training Pipeline
Capstone Project 2024
Lucas Quaresma
+ Roberto Bolgheroni

Research Team
MSc Students

Innovation Team
PDEng Trainees

Ui Dev Team
Hired Developers

Core Dev Team
Scientific Programmers

Work Plan

A Metrics-Oriented Architectural Model
to Characterize Complexity on
Machine Learning-Enabled Systems

https://renatocf.xyz/phd-quali-live

2025

Renato Cordeiro Ferreira

Supervisor: Prof. Dr. Alfredo Goldman

Co-Supervisor: Prof. Dr. Damian Tamburri

IME-USP