no code implementations • 25 Aug 2023 • Phil Ostheimer, Mayank Nagda, Marius Kloft, Sophie Fellenz
This suggests that LLMs could be a feasible alternative to human evaluation and other automated metrics in TST evaluation.
no code implementations • 1 Jun 2023 • Phil Ostheimer, Mayank Nagda, Marius Kloft, Sophie Fellenz
Text Style Transfer (TST) evaluation is, in practice, inconsistent.