LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes

Aditya Kusupati; Matthew Wallingford; Vivek Ramanujan; Raghav Somani; Jae Sung Park; Krishna Pillutla; Prateek Jain; Sham Kakade; Ali Farhadi

Learning binary representations of instances and classes is a classical problem with several high-potential applications. In modern settings, compressing high-dimensional neural representations into low-dimensional binary codes is a challenging task that often requires large bit-codes to be accurate. In this work, we propose a novel method for Learning Low-dimensional binary Codes (LLC) for instances as well as classes. Our method does not require any side-information, such as annotated attributes or label meta-data, and learns extremely low-dimensional binary codes (~20 bits for ImageNet-1K). The learnt codes are highly efficient while still ensuring nearly optimal classification accuracy for ResNet50 on ImageNet-1K. We demonstrate that the learnt codes capture intrinsically important features in the data by discovering an intuitive taxonomy over classes. We further quantitatively measure the quality of our codes by applying them to efficient image retrieval and out-of-distribution (OOD) detection. On the ImageNet-100 retrieval problem, our learnt binary codes outperform 16-bit HashNet using only 10 bits and are as accurate as 10-dimensional real representations. Finally, our learnt binary codes can perform OOD detection out-of-the-box as accurately as a baseline that needs ~3000 samples to tune its threshold, whereas our method requires none. Code is open-sourced at https://github.com/RAIVNLab/LLC.
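To make the two downstream uses mentioned in the abstract concrete, here is a minimal sketch of how short binary codes could be used for retrieval and for threshold-free OOD detection. It is not taken from the LLC repository: the function names, the toy 4-bit codes, and in particular the OOD rule (flag an instance whose code matches no class code exactly) are illustrative assumptions.

```python
import numpy as np

def hamming_distances(query, database):
    """Hamming distance from one binary code to every row of a code matrix."""
    return np.count_nonzero(database != query, axis=1)

def retrieve(query, database, k):
    """Indices of the k database codes closest to the query (ties keep index order)."""
    distances = hamming_distances(query, database)
    return np.argsort(distances, kind="stable")[:k]

def is_ood(instance_code, class_codes):
    """Assumed rule: an instance is OOD if its code equals no class code exactly."""
    exact_match = np.all(class_codes == instance_code, axis=1)
    return not np.any(exact_match)

# Hypothetical 4-bit codes; real codes would come from a trained model.
class_codes = np.array([[0, 0, 0, 0],
                        [1, 1, 0, 0],
                        [0, 1, 1, 1]], dtype=np.uint8)
instance_codes = np.array([[0, 0, 0, 0],
                           [1, 1, 0, 1],
                           [0, 1, 1, 1],
                           [1, 1, 0, 0]], dtype=np.uint8)

query = np.array([1, 1, 0, 0], dtype=np.uint8)
top2 = retrieve(query, instance_codes, k=2)
unseen = np.array([1, 0, 1, 0], dtype=np.uint8)
print(top2, is_ood(query, class_codes), is_ood(unseen, class_codes))
```

Because the comparison is an exact match against the class codes, this rule has no threshold to tune, which is the property the abstract highlights. In practice the codes would be packed into integers so that Hamming distance reduces to XOR plus popcount.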

updated: Thu Oct 07 2021 02:17:11 GMT+0000 (UTC)

published: Wed Jun 02 2021 21:57:52 GMT+0000 (UTC)

arXiv
