no code implementations • 23 May 2024 • Yi-Shan Wu, Yijie Zhang, Badr-Eddine Chérief-Abdellatif, Yevgeny Seldin
While PAC-Bayes allows construction of data-informed priors, the final confidence intervals depend only on the number of points that were not used for the construction of the prior, whereas confidence information in the prior, which is related to the number of points used to construct the prior, is lost.
no code implementations • 5 Feb 2024 • Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir
We introduce Probabilistic Actor-Critic (PAC), a novel reinforcement learning algorithm with improved continuous control performance thanks to its ability to mitigate the exploration-exploitation trade-off.
no code implementations • 2 Oct 2023 • Yijie Zhang, Yi-Shan Wu, Luis A. Ortega, Andrés R. Masegosa
The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$).
1 code implementation • 1 Jun 2022 • Yi-Shan Wu, Yevgeny Seldin
We present a new concentration of measure inequality for sums of independent bounded random variables, which we name a split-kl inequality.
1 code implementation • NeurIPS 2021 • Yi-Shan Wu, Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin
The bound is based on a novel parametric form of the Chebyshev- Cantelli inequality (a. k. a.