Patch DCT vs LeNet

David Sinclair

パッチ DCT と LeNet

Patch DCT vs LeNet

この論文では、画像パッチの DCT (離散コサイン変換) の出力を取得する NN のパフォーマンスと、MNIST 手書き数字を分類するための leNet のパフォーマンスを比較します。 DCT の基礎となる基底関数は、Visual Transformer の学習された基底関数の一部に似ていますが、適用が桁違いに高速です。

This paper compares the performance of a NN taking the output of a DCT (Discrete Cosine Transform) of an image patch with leNet for classifying MNIST hand written digits. The basis functions underlying the DCT bear a passing resemblance to some of the learned basis function of the Visual Transformer but are an order of magnitude faster to apply.

updated: Fri Nov 04 2022 11:56:00 GMT+0000 (UTC)

published: Fri Nov 04 2022 11:56:00 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト