HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation

Yuval Nirkin; Lior Wolf; Tal Hassner

We present a novel real-time semantic segmentation network in which the encoder both encodes and generates the parameters (weights) of the decoder. Furthermore, to allow maximal adaptivity, the weights at each decoder block vary spatially. For this purpose, we design a new type of hypernetwork composed of three parts: a nested U-Net for drawing higher-level context features; a multi-headed weight generating module, which generates the weights of each decoder block immediately before they are consumed, for efficient memory utilization; and a primary network composed of novel dynamic patch-wise convolutions. Despite the use of less conventional blocks, our architecture obtains real-time performance. In terms of the runtime vs. accuracy trade-off, we surpass state-of-the-art (SotA) results on popular semantic segmentation benchmarks: PASCAL VOC 2012 (val. set), and real-time semantic segmentation on Cityscapes and CamVid. The code is available at https://nirkin.com/hyperseg.
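
To make the idea of spatially varying decoder weights concrete, the following is a minimal sketch of a patch-wise convolution in plain Python. It is not the authors' implementation: the abstract does not specify kernel sizes, grouping, or how the weights are produced, so the sketch assumes a 1x1 (pointwise) kernel, no bias, and weights that are simply passed in rather than generated by a hypernetwork. The function name `patchwise_pointwise_conv` and its parameters are hypothetical and chosen only for illustration.

```python
def patchwise_pointwise_conv(features, patch_weights, patch_h, patch_w):
    """Apply a different 1x1 convolution to each spatial patch.

    features:      nested list shaped [C_in][H][W]
    patch_weights: nested list shaped [rows][cols][C_out][C_in],
                   one weight matrix per patch (assumed to be given here;
                   in a hypernetwork they would be predicted by the encoder)
    """
    c_in = len(features)
    height, width = len(features[0]), len(features[0][0])
    if height % patch_h or width % patch_w:
        raise ValueError("feature map must divide evenly into patches")
    c_out = len(patch_weights[0][0])
    out = [[[0.0] * width for _ in range(height)] for _ in range(c_out)]
    for y in range(height):
        for x in range(width):
            # Pick the weight matrix of the patch this pixel falls into.
            w = patch_weights[y // patch_h][x // patch_w]
            for o in range(c_out):
                out[o][y][x] = sum(w[o][c] * features[c][y][x] for c in range(c_in))
    return out


# Toy input: 2 channels, 2x4 feature map, split into two 2x2 patches.
features = [
    [[1, 2, 3, 4], [5, 6, 7, 8]],
    [[10, 20, 30, 40], [50, 60, 70, 80]],
]
# The left patch copies channel 0; the right patch copies channel 1.
patch_weights = [[[[1, 0]], [[0, 1]]]]
result = patchwise_pointwise_conv(features, patch_weights, 2, 2)
```

In this toy case the single output channel takes its left half from input channel 0 and its right half from input channel 1, which shows how the same layer can behave differently in different image regions. An ordinary convolution would correspond to every patch sharing one weight matrix.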

updated: Thu Apr 08 2021 10:40:36 GMT+0000 (UTC)

published: Mon Dec 21 2020 18:58:18 GMT+0000 (UTC)

arXiv
