Adapting to Dynamic User Preferences in Recommendation Systems via Deep Reinforcement Learning

Pantea, Luca

Title

Adapting to Dynamic User Preferences in Recommendation Systems via Deep Reinforcement Learning

Author

Pantea, Luca (TU Delft Electrical Engineering, Mathematics and Computer Science)

Contributor

Oliehoek, F.A. (mentor)
Czechowski, A.T. (mentor)
Mambelli, D. (mentor)
Azizi, O. (mentor)
Tax, D.M.J. (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Computer Science and Engineering

Project

CSE3000 Research Project

Date

2022-06-24

Abstract

Recommender systems filter and prioritize relevant information, alleviating information overload and maximizing user engagement. Traditional recommender systems learn user preferences statically from logged past interactions, disregarding the sequential nature of the recommendation task and, consequently, the shifts in user preferences that occur across interactions. In this study, we formulate the recommendation task as a slate Markov Decision Process (slate-MDP) and leverage deep reinforcement learning (DRL) to learn recommendation policies through sequential interactions, with the goal of maximizing user engagement over extended horizons in non-stationary environments. We construct a simulated environment with varying degrees of preference dynamics and benchmark two DRL-based algorithms: FullSlateQ, a non-decomposed full-slate Q-learning approach based on a DQN agent, and SlateQ, which implements DQN using slate decomposition. Our findings suggest that SlateQ outperforms FullSlateQ by 10.57% in non-stationary environments and that, with a moderate discount factor, both algorithms behave myopically and fail to make an appropriate tradeoff to maximize long-term user engagement.
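
The slate decomposition mentioned in the abstract can be illustrated with a minimal sketch. It assumes the form usually associated with SlateQ, in which the value of a slate A in state s is the choice-weighted sum of per-item values, Q(s, A) = sum over i in A of P(i | s, A) * Q(s, i). The choice model, the function names, and the exhaustive search below are hypothetical illustrations and are not taken from the thesis.

```python
from itertools import combinations


def choice_probabilities(scores, no_click_score=1.0):
    """Hypothetical choice model (an assumption, not the thesis's model):
    the user picks item i with probability proportional to its score,
    with a fixed extra weight for picking nothing."""
    total = no_click_score + sum(scores)
    return [s / total for s in scores]


def slate_q_value(item_q, choice_probs):
    """Decomposed slate value: sum_i P(i | s, A) * Q(s, i)."""
    return sum(p * q for p, q in zip(choice_probs, item_q))


def best_slate(item_q, item_scores, slate_size):
    """Exhaustively score every slate with the decomposed value.
    Only feasible for tiny catalogues; used here purely for illustration."""
    best, best_value = None, float("-inf")
    for slate in combinations(range(len(item_q)), slate_size):
        probs = choice_probabilities([item_scores[i] for i in slate])
        value = slate_q_value([item_q[i] for i in slate], probs)
        if value > best_value:
            best, best_value = slate, value
    return best, best_value


# Three candidate items with assumed per-item Q-values and equal appeal.
slate, value = best_slate([1.0, 2.0, 3.0], [1.0, 1.0, 1.0], slate_size=2)
```

Under this assumption only per-item values Q(s, i) need to be learned, whereas a non-decomposed approach has to assign a value to every possible slate, whose number grows combinatorially with catalogue and slate size.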

Subject

Recommender Systems
User Modelling
Reinforcement Learning

To reference this document use:

http://resolver.tudelft.nl/uuid:9e3e4b62-1056-4d23-b48f-4acb0a708290

Part of collection

Student theses

Document type

bachelor thesis
