Convolutional Neural Bandit for Visual-aware Recommendation

Yikun Ban; Jingrui He

Online recommendation and advertising are ubiquitous in web business, and displaying images is one of the most common ways to interact with customers. Contextual multi-armed bandits have been applied successfully to advertising, where they address the exploration-exploitation dilemma that arises in the recommendation procedure. Motivated by visual-aware recommendation, in this paper we propose a contextual bandit algorithm in which a convolutional neural network (CNN) learns the reward function and an upper confidence bound (UCB) drives exploration. We also prove a near-optimal regret bound of Õ(√T) when the network is over-parameterized, and establish strong connections with the convolutional neural tangent kernel (CNTK). Finally, we evaluate the empirical performance of the proposed algorithm and show that it outperforms other state-of-the-art UCB-based bandit algorithms on real-world image data sets.
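To make the UCB idea in the abstract concrete, here is a minimal sketch of one round of arm selection. It is not the paper's algorithm: it assumes that some reward model (for example a CNN) has already produced a predicted reward and a flattened gradient vector for each candidate image, and it uses the generic gradient-based confidence bonus familiar from neural UCB methods. The names `select_arm`, `update_inverse`, `Z_inv`, and `gamma` are hypothetical and chosen only for illustration.

```python
import numpy as np


def select_arm(pred_rewards, grads, Z_inv, gamma=1.0):
    """Pick the arm with the highest UCB score.

    pred_rewards: shape (n_arms,), model's predicted reward per arm (assumed given).
    grads:        shape (n_arms, d), gradient feature per arm (assumed given).
    Z_inv:        shape (d, d), inverse of the accumulated design matrix.
    gamma:        exploration weight (hypothetical tuning parameter).
    """
    # Exploration bonus sqrt(g^T Z^{-1} g), computed for every arm at once.
    bonus = np.sqrt(np.einsum("ij,jk,ik->i", grads, Z_inv, grads))
    scores = pred_rewards + gamma * bonus
    return int(np.argmax(scores)), scores


def update_inverse(Z_inv, g):
    """Sherman-Morrison update of Z^{-1} after Z <- Z + g g^T (Z symmetric)."""
    Zg = Z_inv @ g
    return Z_inv - np.outer(Zg, Zg) / (1.0 + g @ Zg)


# Illustrative round with random stand-ins for the model outputs.
rng = np.random.default_rng(0)
n_arms, d = 4, 8
demo_rewards = rng.uniform(size=n_arms)
demo_grads = rng.normal(size=(n_arms, d))
demo_Z_inv = np.eye(d)  # corresponds to Z = I before any observation

chosen, demo_scores = select_arm(demo_rewards, demo_grads, demo_Z_inv)
demo_Z_inv = update_inverse(demo_Z_inv, demo_grads[chosen])
```

An arm that has rarely been chosen keeps a large bonus and is explored; each time an arm is played, the update shrinks the bonus along its gradient direction, so the choice gradually shifts toward the arm with the highest predicted reward.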

updated: Wed Feb 09 2022 21:24:48 GMT+0000 (UTC)

published: Fri Jul 02 2021 03:02:29 GMT+0000 (UTC)

arXiv
