LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes

Aditya Kusupati; Matthew Wallingford; Vivek Ramanujan; Raghav Somani; Jae Sung Park; Krishna Pillutla; Prateek Jain; Sham Kakade; Ali Farhadi

LLC: 正確で多目的の学習済み低次元バイナリコード

インスタンスとクラスのバイナリ表現を学習することは、いくつかの潜在的なアプリケーションに伴う古典的な問題です。現代の設定では、高次元の神経表現を低次元のバイナリコードに圧縮することは困難な作業であり、多くの場合、正確であるために大きなビットコードが必要になります。この作業では、インスタンスとクラスの低次元バイナリコード (LLC) を学習するための新しい方法を提案します。私たちの方法は、注釈付き属性やラベルメタデータなどのサイド情報を必要とせず、非常に低次元のバイナリコード (ImageNet-1K では ~20 ビット) を学習します。学習したコードは、ImageNet-1K 上の ResNet50 に対してほぼ最適な分類精度を確保しながら、非常に効率的です。クラス全体の直感的な分類法を発見することにより、学習したコードがデータ内の本質的に重要な機能をキャプチャすることを示します。さらに、効率的な画像検索や配信外 (OOD) 検出の問題に適用することで、コードの品質を定量的に測定します。 ImageNet-100 検索問題の場合、学習したバイナリコードは、10 ビットのみを使用して 16 ビットの HashNet よりも優れており、10 次元の実表現と同じくらい正確です。最後に、学習したバイナリコードは、すぐに使用できる OOD 検出を、しきい値を調整するために ~3000 サンプルを必要とするベースラインと同じくらい正確に実行できますが、必要ありません。コードと事前トレーニング済みモデルは、https://github.com/RAIVNLab/LLC で入手できます。

Learning binary representations of instances and classes is a classical problem with several high potential applications. In modern settings, the compression of high-dimensional neural representations to low-dimensional binary codes is a challenging task and often require large bit-codes to be accurate. In this work, we propose a novel method for Learning Low-dimensional binary Codes (LLC) for instances as well as classes. Our method does not require any side-information, like annotated attributes or label meta-data, and learns extremely low-dimensional binary codes (~20 bits for ImageNet-1K). The learnt codes are super-efficient while still ensuring nearly optimal classification accuracy for ResNet50 on ImageNet-1K. We demonstrate that the learnt codes capture intrinsically important features in the data, by discovering an intuitive taxonomy over classes. We further quantitatively measure the quality of our codes by applying it to the efficient image retrieval as well as out-of-distribution (OOD) detection problems. For ImageNet-100 retrieval problem, our learnt binary codes outperform 16 bit HashNet using only 10 bits and also are as accurate as 10 dimensional real representations. Finally, our learnt binary codes can perform OOD detection, out-of-the-box, as accurately as a baseline that needs ~3000 samples to tune its threshold, while we require none. Code and pre-trained models are available at https://github.com/RAIVNLab/LLC.

updated: Wed Jun 02 2021 21:57:52 GMT+0000 (UTC)

published: Wed Jun 02 2021 21:57:52 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト