Style-Agnostic Reinforcement Learning

Juyong Lee; Seokjun Ahn; Jaesik Park

スタイルにとらわれない強化学習

強化学習フレームワークでスタイル転送と敵対的学習の両方を使用して、スタイルに依存しない表現を学習する新しい方法を提示します。ここでのスタイルとは、画像の背景の色など、タスクに関係のない詳細を指します。異なるスタイルの環境間で学習したポリシーを一般化することは依然として課題です。スタイルにとらわれない表現の学習に焦点を当てて、私たちの方法は、固有の敵対的スタイルの摂動ジェネレーターから生成された多様な画像スタイルでアクターをトレーニングします。このジェネレーターは、アクターとジェネレーターの間でミニマックスゲームを行います。データの拡張や追加のクラスに関する専門知識は必要ありません。敵対的訓練のラベル。 Procgen および Distracting Control Suite ベンチマークで最先端のアプローチよりも競争力のある、または優れたパフォーマンスを達成することを検証し、モデルから抽出された機能をさらに調査して、モデルが不変条件をより適切に捉え、注意散漫にならないことを示します。ずらしたスタイルで。コードは https://github.com/POSTECH-CVLab/style-agnostic-RL で入手できます。

We present a novel method of learning style-agnostic representation using both style transfer and adversarial learning in the reinforcement learning framework. The style, here, refers to task-irrelevant details such as the color of the background in the images, where generalizing the learned policy across environments with different styles is still a challenge. Focusing on learning style-agnostic representations, our method trains the actor with diverse image styles generated from an inherent adversarial style perturbation generator, which plays a min-max game between the actor and the generator, without demanding expert knowledge for data augmentation or additional class labels for adversarial training. We verify that our method achieves competitive or better performances than the state-of-the-art approaches on Procgen and Distracting Control Suite benchmarks, and further investigate the features extracted from our model, showing that the model better captures the invariants and is less distracted by the shifted style. The code is available at https://github.com/POSTECH-CVLab/style-agnostic-RL.

updated: Wed Aug 31 2022 13:45:00 GMT+0000 (UTC)

published: Wed Aug 31 2022 13:45:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト