Novel transfer learning schemes based on Siamese networks and synthetic data

Dominik Stallmann; Philip Kenneweg; Barbara Hammer

シャムネットワークと合成データに基づく新しい転移学習スキーム

巨大な画像コーパスでトレーニングされたディープネットワークに基づく転移学習スキームは、コンピュータービジョンの最先端技術を提供します。ここで、教師ありおよび半教師ありのアプローチは、比較的小さなデータセットでうまく機能する効率的なテクノロジーを構成します。しかし、そのようなアプリケーションは現在、適切なディープネットワークモデルがすぐに利用できるアプリケーションドメインに限定されています。この寄稿では、データ特性が既存のドメインと非常に異なり、トレーニングされた深いネットワークを簡単に適応させることができない、マイクロ流体単一細胞培養におけるCHO-K1懸濁液成長の自動分析である、バイオテクノロジーのドメインにおける重要なアプリケーション領域に対処します。古典的な転移学習。最近導入された、現実的な合成データでトレーニングされる Twin-VAE アーキテクチャを拡張する新しい転移学習スキームを提案し、その特殊なトレーニング手順を転移学習ドメインに変更します。特定のドメインでは、多くの場合、ラベルがほとんどまたはまったく存在せず、注釈にコストがかかります。異なる顕微鏡技術からの目に見えないデータを処理することを学習しながら、不変の共有表現と適切なターゲット変数を使用して、自然データと合成データの同時再トレーニングを組み込む、新しい転移学習戦略を調査します。画像処理における最先端の転移学習方法論および従来の画像処理技術に対する Twin-VAE アーキテクチャのバリエーションの優位性を示します。これは、トレーニング時間が大幅に短縮されても持続し、満足のいく結果につながります。このドメインで。ソースコードは https://github.com/dstallmann/transfer_learning_twinvae で入手でき、クロスプラットフォームで動作し、オープンソースで無料 (MIT ライセンス) のソフトウェアです。データセットは https://pub.uni-bielefeld.de/record/2960030 で入手できます。

Transfer learning schemes based on deep networks which have been trained on huge image corpora offer state-of-the-art technologies in computer vision. Here, supervised and semi-supervised approaches constitute efficient technologies which work well with comparably small data sets. Yet, such applications are currently restricted to application domains where suitable deepnetwork models are readily available. In this contribution, we address an important application area in the domain of biotechnology, the automatic analysis of CHO-K1 suspension growth in microfluidic single-cell cultivation, where data characteristics are very dissimilar to existing domains and trained deep networks cannot easily be adapted by classical transfer learning. We propose a novel transfer learning scheme which expands a recently introduced Twin-VAE architecture, which is trained on realistic and synthetic data, and we modify its specialized training procedure to the transfer learning domain. In the specific domain, often only few to no labels exist and annotations are costly. We investigate a novel transfer learning strategy, which incorporates a simultaneous retraining on natural and synthetic data using an invariant shared representation as well as suitable target variables, while it learns to handle unseen data from a different microscopy tech nology. We show the superiority of the variation of our Twin-VAE architecture over the state-of-the-art transfer learning methodology in image processing as well as classical image processing technologies, which persists, even with strongly shortened training times and leads to satisfactory results in this domain. The source code is available at https://github.com/dstallmann/transfer_learning_twinvae, works cross-platform, is open-source and free (MIT licensed) software. We make the data sets available at https://pub.uni-bielefeld.de/record/2960030.

updated: Mon Nov 21 2022 09:48:21 GMT+0000 (UTC)

published: Mon Nov 21 2022 09:48:21 GMT+0000 (UTC)

arXiv

参考文献 (このサイトで利用可能なもの) / References (only if available on this site)

被参照文献 (このサイトで利用可能なものを新しい順に) / Citations (only if available on this site, in order of most recent)

Amazon.co.jpアソシエイト