Search Results for author: Léo Grinsztajn

Found 3 papers, 1 papers with code

CARTE: pretraining and transfer for tabular learning

no code implementations26 Feb 2024 Myung Jun Kim, Léo Grinsztajn, Gaël Varoquaux

The architecture - CARTE for Context Aware Representation of Table Entries - uses a graph representation of tabular (or relational) data to process tables with different columns, string embeddings of entries and columns names to model an open vocabulary, and a graph-attentional network to contextualize entries with column names and neighboring entries.

Data Integration

Vectorizing string entries for data processing on tables: when are larger language models better?

no code implementations15 Dec 2023 Léo Grinsztajn, Edouard Oyallon, Myung Jun Kim, Gaël Varoquaux

We study the benefits of language models in 14 analytical tasks on tables while varying the training size, as well as for a fuzzy join benchmark.

Why do tree-based models still outperform deep learning on tabular data?

1 code implementation18 Jul 2022 Léo Grinsztajn, Edouard Oyallon, Gaël Varoquaux

While deep learning has enabled tremendous progress on text and image datasets, its superiority on tabular data is not clear.

Benchmarking

Cannot find the paper you are looking for? You can Submit a new open access paper.