Project Overview
The main goal of the DATA-TRUST project is to develop a framework for smart-sized benchmarking that improves the reliability and generalizability of AI systems.
Current AI systems are often trained on all available data. While large datasets can improve performance, they may also introduce bias, redundancy, and overfitting, reducing the ability of models to perform well on new unseen data.
DATA-TRUST addresses this challenge by investigating:
- 1. How to represent benchmark datasets and optimization problems.
- 2. How to select representative subsets of data.
- 3. How to measure generalization ability.
The framework will be validated across two domains:
- 1. Single-objective continuous optimization.
- 2. Time-series analysis.
Although the methods are validated in these two domains, the developed concepts are expected to be applicable across a broad range of AI applications.
The project is carried out at the Jožef Stefan Institute through collaboration between:
- 1. The Computer Systems Department.
- 2. The Department of Knowledge Technologies.
