On the Impact of Industrial Delays when Mitigating Distribution Drifts: An Empirical Study on Real-World Financial Systems

Thibault Simonetto*, Maxime Cordy, Salah Ghamizi, Yves Le Traon, Clément Lefebvre, Andrey Boystov, Anne Goujon

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

An increasing number of financial software system relies on Machine learning models to support human decision-makers. Although these models have shown satisfactory performance to support human decision-makers in classifying financial transactions, the maintenance of such ML systems remains a challenge. After deployment in production, the performance of the models tends to degrade over time due to concept drift. Methods have been proposed to detect concept drift and retrain new models upon detection to mitigate the drop in performance. However, little is known about the effectiveness of such methods in an industrial context. In particular, their evaluation fails to consider the delay between the detection of the drift and the deployment of a new model. This delay is inherent to the strict quality assurance and manual validation processes that financial (and other critical) institutions impose on their software systems. To circumvent this limitation, we formalize the problem of retraining ML models against distribution drift in the presence of delay and propose a novel protocol to evaluate drift detectors. We report on an empirical study conducted on the transaction system of our industrial partner, BGL BNP Paribas, and two publicly available datasets: Lending Club Loan Data and Electricity. We release our tool and benchmark on GitHub. 1 We demonstrate for the first time how ignoring the delays in the evaluation of the drift detectors overestimates their ability to mitigate performance drift, up to 39.86% for our industrial application.1 Code available at https://github.com/serval-uni-lu/drift-robustness.

Original languageEnglish
Title of host publicationDiscovering Drift Phenomena in Evolving Landscapes - 1st International Workshop, DELTA 2024, Proceedings
EditorsMarco Piangerelli, Bardh Prenkaj, Ylenia Rotalinti, Ananya Joshi, Giovanni Stilo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages57-73
Number of pages17
ISBN (Print)9783031823459
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event1st International Workshop on Discovering Drift Phenomena in Evolving Landscapes, DELTA 2024 - Barcelona, Spain
Duration: 26 Aug 202426 Aug 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15013 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st International Workshop on Discovering Drift Phenomena in Evolving Landscapes, DELTA 2024
Country/TerritorySpain
CityBarcelona
Period26/08/2426/08/24

Keywords

  • AI in finance
  • ML
  • distribution-drift
  • real-world system

Fingerprint

Dive into the research topics of 'On the Impact of Industrial Delays when Mitigating Distribution Drifts: An Empirical Study on Real-World Financial Systems'. Together they form a unique fingerprint.

Cite this