no code implementations • 16 Feb 2024 • Mohammad Hossein Amani, Nicolas Mario Baldwin, Amin Mansouri, Martin Josifoski, Maxime Peyrard, Robert West
Traditional language models, adept at next-token prediction in text sequences, often struggle with transduction tasks between distinct symbolic systems, particularly when parallel data is scarce.
no code implementations • 7 Feb 2024 • Kartik Ahuja, Amin Mansouri
Length generalization -- the ability to generalize to longer sequences than ones seen during training, and compositional generalization -- the ability to generalize to token combinations not seen during training, are crucial forms of out-of-distribution generalization in sequence-to-sequence models.
1 code implementation • 29 Oct 2023 • Amin Mansouri, Jason Hartford, Yan Zhang, Yoshua Bengio
Causal representation learning has showed a variety of settings in which we can disentangle latent variables with identifiability guarantees (up to some reasonable equivalence class).
no code implementations • 4 Oct 2023 • Kartik Ahuja, Amin Mansouri, Yixin Wang
Causal representation learning has emerged as the center of action in causal machine learning research.