Mohammad Sadegh Rasooli; Chris Callison-Burch; Derry Tanti Wijaya

"Wikily" Supervised Neural Translation Tailored to Cross-Lingual Tasks

We present a simple but effective approach for leveraging Wikipedia for neural machine translation as well as for the cross-lingual tasks of image captioning and dependency parsing, without using any direct supervision from external parallel data or supervised models in the target language. We show that first sentences and titles of linked Wikipedia pages, as well as cross-lingual image captions, are strong signals for seed parallel data from which to extract bilingual dictionaries and cross-lingual word embeddings for mining parallel text from Wikipedia. Our final model achieves BLEU scores that are close to, and sometimes higher than, those of strong supervised baselines in low-resource languages; for example, in English-to-Kazakh the supervised baseline reaches a BLEU of 4.0 versus 12.1 for our model. Moreover, we tailor our wikily supervised translation models to unsupervised image captioning and cross-lingual dependency parser transfer. In image captioning, we train a multi-tasking machine translation and image captioning pipeline for Arabic and English, in which the Arabic training data is a translated version of the English captioning data produced by our wikily supervised translation models. Our captioning results on Arabic are slightly better than those of the corresponding supervised model. In dependency parsing, we translate a large amount of monolingual text and use it as artificial training data in an annotation projection framework. We show that our model outperforms recent work on cross-lingual transfer of dependency parsers.
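The mining step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that cross-lingual word embeddings are already available as a plain dictionary in a shared vector space, and it scores candidate sentence pairs by cosine similarity of averaged word vectors. The function names, the toy vocabulary (`tgt_cat`, `tgt_dog` are placeholder target-language tokens), and the threshold are all hypothetical; the abstract does not specify the actual mining procedure, so this should not be read as the authors' method.

```python
import math


def sentence_vector(tokens, embeddings):
    """Average the vectors of all tokens that have an embedding."""
    vecs = [embeddings[t] for t in tokens if t in embeddings]
    if not vecs:
        return None  # no known word, so the sentence cannot be scored
    dim = len(vecs[0])
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mine_pairs(src_sents, tgt_sents, embeddings, threshold=0.9):
    """For each source sentence, keep its best-scoring target sentence
    if the similarity reaches the (assumed) threshold."""
    tgt_vecs = [(t, sentence_vector(t.split(), embeddings)) for t in tgt_sents]
    pairs = []
    for s in src_sents:
        s_vec = sentence_vector(s.split(), embeddings)
        if s_vec is None:
            continue
        best, best_score = None, -1.0
        for t, t_vec in tgt_vecs:
            if t_vec is None:
                continue
            score = cosine(s_vec, t_vec)
            if score > best_score:
                best, best_score = t, score
        if best is not None and best_score >= threshold:
            pairs.append((s, best, best_score))
    return pairs


# Toy shared embedding space (hypothetical values for illustration only).
toy_embeddings = {
    "cat": [1.0, 0.0],
    "dog": [0.0, 1.0],
    "tgt_cat": [0.9, 0.1],
    "tgt_dog": [0.1, 0.9],
}

mined = mine_pairs(["cat", "unknown"], ["tgt_dog", "tgt_cat"], toy_embeddings)
```

In this toy setup the source sentence `cat` is paired with `tgt_cat`, while the sentence with no known words is skipped. A realistic pipeline would need tokenization, a much larger vocabulary, and an efficient nearest-neighbor search rather than the exhaustive loop used here.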

updated: Fri Sep 10 2021 17:10:31 GMT+0000 (UTC)

published: Fri Apr 16 2021 21:49:12 GMT+0000 (UTC)

arXiv
