Pseudo Supervised Metrics: Evaluating Unsupervised Image to Image Translation Models In Unsupervised Cross-Domain Classification Frameworks

Firas Al-Hindawi; Md Mahfuzur Rahman Siddiquee; Teresa Wu; Han Hu; Ying Sun

疑似教師ありメトリック: 教師なしクロスドメイン分類フレームワークにおける教師なし画像から画像への変換モデルの評価

画像を正確かつ効率的に分類できるかどうかは、ラベル付けされた大規模なデータセットにアクセスできることと、モデルがトレーニングされたのと同じドメインからのデータをテストできるかどうかにかかっています。ラベル付けされた大規模なデータセットを収集し、新しい分類子を最初からトレーニングすることは、時間と費用がかかり、時には実行不可能または不可能な場合があります。クロスドメイン分類フレームワークは、教師なし画像から画像 (UI2I) 変換モデルを利用して入力画像をラベルなしドメインからラベル付きドメインに変換することにより、このデータドメインシフトの問題を処理するために開発されました。これらの教師なしモデルの問題は、教師なしの性質にあります。注釈がないため、従来の教師付きメトリックを使用してこれらの変換モデルを評価し、最適に保存されたチェックポイントモデルを選択することはできません。このホワイトペーパーでは、生成された画像の品質に関してモデルを評価するために設計された FID などの他の一般的に使用されるメトリックとは対照的に、クロスドメイン分類アプリケーションをサポートするために特別に設計された、疑似教師ありメトリックと呼ばれる新しい方法を紹介します。人間の視点から。私たちのメトリクスは、FID などの教師なしメトリクスよりも優れているだけでなく、真の教師ありメトリクスと高度に相関し、堅牢で説明可能であることを示しています。さらに、現実世界の重要な問題 (沸騰危機問題) に適用することにより、この分野の将来の研究の標準的な測定基準として使用できることを示します。

The ability to classify images accurately and efficiently is dependent on having access to large labeled datasets and testing on data from the same domain that the model is trained on. Classification becomes more challenging when dealing with new data from a different domain, where collecting a large labeled dataset and training a new classifier from scratch is time-consuming, expensive, and sometimes infeasible or impossible. Cross-domain classification frameworks were developed to handle this data domain shift problem by utilizing unsupervised image-to-image (UI2I) translation models to translate an input image from the unlabeled domain to the labeled domain. The problem with these unsupervised models lies in their unsupervised nature. For lack of annotations, it is not possible to use the traditional supervised metrics to evaluate these translation models to pick the best-saved checkpoint model. In this paper, we introduce a new method called Pseudo Supervised Metrics that was designed specifically to support cross-domain classification applications contrary to other typically used metrics such as the FID which was designed to evaluate the model in terms of the quality of the generated image from a human-eye perspective. We show that our metric not only outperforms unsupervised metrics such as the FID, but is also highly correlated with the true supervised metrics, robust, and explainable. Furthermore, we demonstrate that it can be used as a standard metric for future research in this field by applying it to a critical real-world problem (the boiling crisis problem).

updated: Tue Aug 15 2023 18:03:07 GMT+0000 (UTC)

published: Sat Mar 18 2023 02:42:18 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト