no code implementations • 15 May 2024 • Masaaki Nagata, Makoto Morishita, Katsuki Chousa, Norihito Yasuda
Using crowdsourcing, we collected more than 10, 000 URL pairs (parallel top page pairs) of bilingual websites that contain parallel documents and created a Japanese-Chinese parallel corpus of 4. 6M sentence pairs from these websites.
no code implementations • 11 Oct 2022 • Masaaki Nishino, Kengo Nakamura, Norihito Yasuda
In practical situations, it is natural to expect the input-output pairs of a machine learning model to satisfy some requirements.