Probabilistic Attention for Interactive Segmentation

Prasad Gabbur; Manjot Bilkhu; Javier Movellan

インタラクティブセグメンテーションの確率的注意

注意の確率的解釈を提供し、変圧器の標準的なドット積注意が最大事後（MAP）推論の特殊なケースであることを示します。提案されたアプローチは、キーおよび値モデルパラメータのオンライン適応のための期待値最大化アルゴリズムの使用を提案します。このアプローチは、アノテーターなどの外部エージェントが、一部のピクセルのセマンティックカテゴリなど、一部のトークンの正しい値に関する推論時間情報を提供し、この新しい情報を他のトークンに伝播する必要がある場合に役立ちます。原則的な方法。アノテーターとモデルがオンラインでコラボレーションしてアノテーションの効率を向上させる、インタラクティブなセマンティックセグメンテーションタスクのアプローチを説明します。標準的なベンチマークを使用して、主要な適応が低フィードバック体制でモデルのパフォーマンス（〜10％mIoU）を高め、値の伝播が高フィードバック体制でのモデルの応答性を向上させることを観察します。確率的アテンションモデルのPyTorchレイヤー実装が公開されます。

We provide a probabilistic interpretation of attention and show that the standard dot-product attention in transformers is a special case of Maximum A Posteriori (MAP) inference. The proposed approach suggests the use of Expectation Maximization algorithms for online adaptation of key and value model parameters. This approach is useful for cases in which external agents, e.g., annotators, provide inference-time information about the correct values of some tokens, e.g, the semantic category of some pixels, and we need for this new information to propagate to other tokens in a principled manner. We illustrate the approach on an interactive semantic segmentation task in which annotators and models collaborate online to improve annotation efficiency. Using standard benchmarks, we observe that key adaptation boosts model performance (∼10% mIoU) in the low feedback regime and value propagation improves model responsiveness in the high feedback regime. A PyTorch layer implementation of our probabilistic attention model will be made publicly available.

updated: Wed Jun 23 2021 00:19:43 GMT+0000 (UTC)

published: Wed Jun 23 2021 00:19:43 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト