• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Non-Stationary Linear Bandits With Dimensionality Reduction for Large-Scale Recommender Systems
 
  • Details
  • Full
Options
2024
Journal Article
Title

Non-Stationary Linear Bandits With Dimensionality Reduction for Large-Scale Recommender Systems

Abstract
Taking advantage of contextual information can potentially boost the performance of recommender systems. In the era of Big Data, such side information often has several dimensions. Thus, developing decision-making algorithms to cope with such a high-dimensional context in real time is essential. That is specifically challenging when the decision-maker has a variety of items to recommend. In addition, changes in items' popularity or users' preferences can hinder the performance of the deployed recommender system due to a lack of robustness to distribution shifts in the environment. In this paper, we build upon the linear contextual multi-armed bandit framework to address this problem. We develop a decision-making policy for a linear bandit problem with high-dimensional feature vectors, a large set of arms, and non-stationary reward-generating processes. Our Thompson sampling-based policy reduces the dimension of feature vectors using random projection and uses exponentially increasing weights to decrease the influence of past observations with time. Our proposed recommender system employs this policy to learn the users' item preferences online while minimizing runtime. We prove a regret bound that scales as a factor of the reduced dimension instead of the original one. To evaluate our proposed recommender system numerically, we apply it to three real-world datasets. The theoretical and numerical results demonstrate the effectiveness of our proposed algorithm in making a trade-off between computational complexity and regret performance compared to the state-of-the-art.
Author(s)
Ghoorchian, Saeed
Ruhr-Universitat Bochum
Kortukov, Evgenii
Eberhard Karls Universität Tübingen
Maghsudi, Setareh
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Journal
IEEE open journal of signal processing  
Open Access
DOI
10.1109/OJSP.2024.3386490
Additional link
Full text
Language
English
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Keyword(s)
  • Decision-making

  • multi-armed bandit

  • non-stationary environment

  • online learning

  • recommender systems

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024