Space Time Recurrent Memory Network

Hung Nguyen; Fuxin Li

We propose a novel visual memory network architecture for learning and inference problems in the spatio-temporal domain. Unlike popular transformers, our memory network maintains a fixed set of memory slots, and we explore designs for writing new information into the memory, combining the information held in different memory slots, and deciding when to discard old memory slots. We benchmark the architecture on video object segmentation and video prediction. The experiments show that it achieves results competitive with the state of the art while maintaining constant memory capacity.
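To make the idea of a constant-capacity memory concrete, here is a minimal sketch in plain Python. It is not the architecture from the paper: the class name `FixedSlotMemory`, the dot-product softmax read, and the rule of overwriting the least-attended slot are all assumptions chosen for illustration, since the abstract does not specify how writing, combining, or discarding is done.

```python
import math
from dataclasses import dataclass, field


@dataclass
class FixedSlotMemory:
    """Toy memory with a constant number of slots (hypothetical, illustrative only)."""
    num_slots: int
    slots: list = field(default_factory=list)  # stored feature vectors
    usage: list = field(default_factory=list)  # attention accumulated per slot

    def read(self, query):
        """Combine slots with softmax attention over dot-product scores."""
        if not self.slots:
            return [0.0] * len(query)
        scores = [sum(q * s for q, s in zip(query, slot)) for slot in self.slots]
        peak = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(score - peak) for score in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        for i, w in enumerate(weights):
            self.usage[i] += w  # track how much each slot is attended to
        return [
            sum(w * slot[d] for w, slot in zip(weights, self.slots))
            for d in range(len(query))
        ]

    def write(self, feature):
        """Store a feature; once full, overwrite the least-attended slot.

        The eviction rule is an assumption, not the paper's discard policy.
        """
        if len(self.slots) < self.num_slots:
            self.slots.append(list(feature))
            self.usage.append(0.0)
            return
        victim = min(range(self.num_slots), key=self.usage.__getitem__)
        self.slots[victim] = list(feature)
        self.usage[victim] = 0.0
```

In use, each incoming frame feature would first be read against the memory and then written into it, so the number of slots stays at `num_slots` however long the video is. A transformer-style memory that keeps every past frame would instead grow with sequence length.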

updated: Tue Sep 14 2021 06:53:51 GMT+0000 (UTC)

published: Tue Sep 14 2021 06:53:51 GMT+0000 (UTC)

arXiv
