Countering Inconsistent Labelling by Google's Vision API for Rotated Images

Aman Apte; Aritra Bandyopadhyay; K Akhilesh Shenoy; Jason Peter Andrews; Aditya Rathod; Manish Agnihotri; Aditya Jajodia

回転画像用のGoogleのVision APIによる一貫性のないラベル付けに対抗する

GoogleのVision APIは画像を分析し、さまざまな出力予測を提供します。そのようなタイプの1つは、コンテキストベースのラベル付けです。この論文では、誤ったラベルの予測となりすましを引き起こす敵対的な例を、画像を回転させることで生成できることが示されています。 APIのブラックボックス化された性質により、Res-Net50モデルで構成されるモジュール型のコンテキストベースの前処理パイプラインが提案されます。これは、向きを修正するために画像を回転させる角度を予測します。パイプラインは、画像の解像度を維持しながら修正を正常に実行し、元の正しい向きの画像と同様のラベルを生成するAPIにフィードし、パーセントエラーメトリックを使用して、回転したカウンターパーツと比較した修正画像のパフォーマンスが検出されます著しく高くなります。これらの観察結果は、APIがこのような前処理パイプラインの恩恵を受けて、回転摂動に対する堅牢性を向上できることを意味しています。

Google's Vision API analyses images and provides a variety of output predictions, one such type is context-based labelling. In this paper, it is shown that adversarial examples that cause incorrect label prediction and spoofing can be generated by rotating the images. Due to the black-boxed nature of the API, a modular context-based pre-processing pipeline is proposed consisting of a Res-Net50 model, that predicts the angle by which the image must be rotated to correct its orientation. The pipeline successfully performs the correction whilst maintaining the image's resolution and feeds it to the API which generates labels similar to the original correctly oriented image and using a Percentage Error metric, the performance of the corrected images as compared to its rotated counter-parts is found to be significantly higher. These observations imply that the API can benefit from such a pre-processing pipeline to increase robustness to rotational perturbances.

updated: Sun Nov 17 2019 09:49:58 GMT+0000 (UTC)

published: Sun Nov 17 2019 09:49:58 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト