Search Results for author: Yen Yu

Found 3 papers, 2 papers with code

Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages

no code implementations11 Jan 2024 Zhuoyuan Mao, Yen Yu

This article introduces contrastive alignment instructions (AlignInstruct) to address two challenges in machine translation (MT) on large language models (LLMs).

Machine Translation Translation

Boredom-driven curious learning by Homeo-Heterostatic Value Gradients

1 code implementation5 Jun 2018 Yen Yu, Acer Y. C. Chang, Ryota Kanai

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning.

Counterfactual Control for Free from Generative Models

2 code implementations22 Feb 2017 Nicholas Guttenberg, Yen Yu, Ryota Kanai

In this method, the problem of action selection is reduced to one of gradient descent on the latent space of the generative model, with the model itself providing the means of evaluating outcomes and finding the gradient, much like how the reward network in Deep Q-Networks (DQN) provides gradient information for the action generator.

counterfactual

Cannot find the paper you are looking for? You can Submit a new open access paper.