Online Reinforcement Learning for Self-adaptive Systems
A self-adaptive system can modify its structure and behavior at runtime based on its perception of the environment, itself, and its requirements. By adapting itself at runtime, the system can maintain its requirements in the presence of dynamic environment changes. Examples are elastic cloud systems, intelligent IoT systems as well as proactive process management systems. One key element of a self-adaptive system is its self-adaptation logic, which encodes when and how the system should adapt itself. When developing the adaptation logic, developers face the challenge of design time uncertainty. This means they have to anticipate potential environment states and the precise effect of adaptation in a given environment state, while the knowledge available at design time may not be sufficient to do so. A recent industrial survey determined design-time uncertainty as one of the most frequently observed difficulties in designing self-adaptation logic in practice. This talk will explore the opportunities but also challenges that modern machine learning algorithms offer in building the self-adaptation logic in the presence of design-time uncertainty. It will focus on online reinforcement learning as an emerging approach, which means that during operation the system learns from interactions with its environment, thereby effectively leveraging data only available at run time. In particular, the talk will focus on three different issues related to data quality and will introduce initial solutions for these issues: (1) data non-stationarity, (2) data sparsity, and (3) data intransparency. The talk will close with a critical discussion of limitations and an outlook on future research opportunities.
Andreas Metzger is an adjunct professor at the University of Duisburg-Essen and heads the group “Adaptive Systems” at paluno, the Ruhr Institute for Software Technology. His research interests include the use of machine learning in software engineering and business process management. He is the steering committee vice chair of the European Technology Platform NESSI (The Networked European Software and Services Initiative) and was deputy general secretary of the Big Data Value Association (BDVA) from 2015 to 2021. Among other leadership roles in EU projects, he was the technical coordinator of the Big Data Value PPP lighthouse project TransformingTransport.
Thu 17 NovDisplayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change
09:00 - 10:30 | |||
09:00 15mDay opening | Welcome, Objectives and Agenda SEA4DQ | ||
09:15 60mKeynote | Online Reinforcement Learning for Self-adaptive Systems SEA4DQ | ||
10:15 15mShort-paper | Data Quality Issues in Solar Panels Installations: A Case Study SEA4DQ Dumitru Roman SINTEF, Antoine Pultier SINTEF, Xiang Ma SINTEF, Ahmet Soylu Oslo Metropolitan University, Alexander G. Ulyashin SINTEF |