Search Results for author: Jacob Bien

Found 26 papers, 10 papers with code

Predicting Rare Events by Shrinking Towards Proportional Odds

1 code implementation30 May 2023 Gregory Faletto, Jacob Bien

We present PRESTO, a relaxation of the proportional odds model for ordinal regression.

Marketing regression

Generalized Data Thinning Using Sufficient Statistics

no code implementations22 Mar 2023 Ameer Dharamshi, Anna Neufeld, Keshav Motwani, Lucy L. Gao, Daniela Witten, Jacob Bien

A recent paper showed that for some well-known natural exponential families, $X$ can be "thinned" into independent random variables $X^{(1)}, \ldots, X^{(K)}$, such that $X = \sum_{k=1}^K X^{(k)}$.

Inferring independent sets of Gaussian variables after thresholding correlations

no code implementations2 Nov 2022 Arkajyoti Saha, Daniela Witten, Jacob Bien

Our proposed test properly accounts for the fact that the set of variables is selected from the data, and thus is not overly conservative.

Prediction Sets for High-Dimensional Mixture of Experts Models

no code implementations30 Oct 2022 Adel Javanmard, Simeng Shao, Jacob Bien

Large datasets make it possible to build predictive models that can capture heterogenous relationships between the response variable and features.

valid Vocal Bursts Intensity Prediction

Cluster Stability Selection

no code implementations3 Jan 2022 Gregory Faletto, Jacob Bien

Stability selection (Meinshausen and Buhlmann, 2010) makes any feature selection method more stable by returning only those features that are consistently selected across many subsamples.

feature selection

Controlling the False Split Rate in Tree-Based Aggregation

no code implementations11 Aug 2021 Simeng Shao, Jacob Bien, Adel Javanmard

In many domains, data measurements can naturally be associated with the leaves of a tree, expressing the relationships among these measurements.

Tree-based Node Aggregation in Sparse Graphical Models

no code implementations29 Jan 2021 Ines Wilms, Jacob Bien

High-dimensional graphical models are often estimated using regularization that is aimed at reducing the number of edges in a network.

TAG

Selective Inference for Hierarchical Clustering

2 code implementations5 Dec 2020 Lucy L. Gao, Jacob Bien, Daniela Witten

Classical tests for a difference in means control the type I error rate when the groups are defined a priori.

Clustering

Modeling Cell Populations Measured By Flow Cytometry With Covariates Using Sparse Mixture of Regressions

2 code implementations25 Aug 2020 Sangwon Hyun, Mattias Rolf Cape, Francois Ribalet, Jacob Bien

The ocean is filled with microscopic microalgae called phytoplankton, which together are responsible for as much photosynthesis as all plants on land combined.

Testing for Association in Multi-View Network Data

1 code implementation25 Sep 2019 Lucy L. Gao, Daniela Witten, Jacob Bien

To answer this question, we extend the stochastic block model for a single network view to the two-view setting, and develop a new hypothesis test for the null hypothesis that the latent community memberships in the two data views are independent.

Stochastic Block Model

Are Clusterings of Multiple Data Views Independent?

2 code implementations12 Jan 2019 Lucy L. Gao, Jacob Bien, Daniela Witten

However, clustering the participants based on multiple data views implicitly assumes that a single underlying clustering of the participants is shared across all data views.

Clustering

Rare Feature Selection in High Dimensions

1 code implementation18 Mar 2018 Xiaohan Yan, Jacob Bien

It is common in modern prediction problems for many predictor variables to be counts of rarely occurring events.

feature selection Vocal Bursts Intensity Prediction

Estimating the error variance in a high-dimensional linear model

no code implementations6 Dec 2017 Guo Yu, Jacob Bien

In this paper, we propose the natural lasso estimator for the error variance, which maximizes a penalized likelihood objective.

Vocal Bursts Intensity Prediction

Valid Inference Corrected for Outlier Removal

1 code implementation29 Nov 2017 Shuxiao Chen, Jacob Bien

Ordinary least square (OLS) estimation of a linear regression model is well-known to be highly sensitive to outliers.

Outlier Detection valid

Interpretable Vector AutoRegressions with Exogenous Time Series

no code implementations9 Nov 2017 Ines Wilms, Sumanta Basu, Jacob Bien, David S. Matteson

The Vector AutoRegressive (VAR) model is fundamental to the study of multivariate time series.

Management Marketing +2

BigVAR: Tools for Modeling Sparse High-Dimensional Multivariate Time Series

1 code implementation23 Feb 2017 William Nicholson, David Matteson, Jacob Bien

The R package BigVAR allows for the simultaneous estimation of high-dimensional time series by applying structured penalties to the conventional vector autoregression (VAR) and vector autoregression with exogenous variables (VARX) frameworks.

Computation

The Simulator: An Engine to Streamline Simulations

no code implementations30 Jun 2016 Jacob Bien

The syntax of the simulator leads to simulation code that is easily human-readable.

Graph-Guided Banding of the Covariance Matrix

no code implementations1 Jun 2016 Jacob Bien

A highly-related, yet complementary, literature studies the specific setting in which the measured variables have a known ordering, in which case a banded population matrix is often assumed.

Learning Local Dependence In Ordered Data

no code implementations25 Apr 2016 Guo Yu, Jacob Bien

Penalized maximum likelihood estimation of this matrix yields a simple regression interpretation for local dependence in which variables are predicted by their neighbors.

regression

Non-convex Global Minimization and False Discovery Rate Control for the TREX

1 code implementation22 Apr 2016 Jacob Bien, Irina Gaynanova, Johannes Lederer, Christian Müller

The TREX is a recently introduced method for performing sparse high-dimensional regression.

Hierarchical Sparse Modeling: A Choice of Two Group Lasso Formulations

no code implementations5 Dec 2015 Xiaohan Yan, Jacob Bien

The purpose of this paper is to provide a side-by-side comparison of these two frameworks for HSM in terms of their statistical properties and computational efficiency.

Computational Efficiency Time Series Analysis +1

Sparse Partially Linear Additive Models

1 code implementation17 Jul 2014 Yin Lou, Jacob Bien, Rich Caruana, Johannes Gehrke

Thus, to make a GPLAM a viable approach in situations in which little is known $a~priori$ about the features, one must overcome two primary model selection challenges: deciding which features to include in the model and determining which of these features to treat nonlinearly.

Additive models Model Selection

Convex Banding of the Covariance Matrix

no code implementations23 May 2014 Jacob Bien, Florentina Bunea, Luo Xiao

Empirical studies demonstrate its practical effectiveness and illustrate that our exactly-banded estimator works well even when the true covariance matrix is only close to a banded matrix, confirming our theoretical results.

A lasso for hierarchical interactions

no code implementations22 May 2012 Jacob Bien, Jonathan Taylor, Robert Tibshirani

We add a set of convex constraints to the lasso to produce sparse interaction models that honor the hierarchy restriction that an interaction only be included in a model if one or both variables are marginally important.

CUR from a Sparse Optimization Viewpoint

no code implementations NeurIPS 2010 Jacob Bien, Ya Xu, Michael W. Mahoney

The CUR decomposition provides an approximation of a matrix X that has low reconstruction error and that is sparse in the sense that the resulting approximation lies in the span of only a few columns of X.

Cannot find the paper you are looking for? You can Submit a new open access paper.