no code implementations • ICML 2020 • Karthik Abinav Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein
Through novel theoretical and experimental results, we show how the neural net architecture affects gradient confusion, and thus the efficiency of training.
no code implementations • 2 May 2024 • Wei-Ning Chen, Berivan Isik, Peter Kairouz, Albert No, Sewoong Oh, Zheng Xu
We study $L_2$ mean estimation under central differential privacy and communication constraints, and address two key challenges. First, existing mean estimation schemes that simultaneously handle both constraints are usually optimized for $L_\infty$ geometry and rely on random rotation or Kashin's representation to adapt to $L_2$ geometry, resulting in suboptimal leading constants in mean squared errors (MSEs). Second, schemes achieving order-optimal communication-privacy trade-offs do not extend seamlessly to streaming differential privacy (DP) settings (e.g., tree aggregation or matrix factorization), rendering them incompatible with DP-FTRL type optimizers.
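For context, a minimal sketch of the textbook baseline for $L_2$ mean estimation under central DP, i.e., per-vector $L_2$ clipping followed by Gaussian noise on the sum (names and parameters are illustrative; the paper's schemes additionally handle communication constraints and streaming DP):

```python
import numpy as np

def dp_l2_mean(vectors, clip_norm=1.0, noise_multiplier=1.0, rng=None):
    """Gaussian-mechanism baseline for L2 mean estimation under central DP.

    Each vector is clipped to L2 norm <= clip_norm, summed, and Gaussian
    noise with std = noise_multiplier * clip_norm is added to the sum
    before dividing by the number of clients.
    """
    rng = np.random.default_rng() if rng is None else rng
    vectors = np.asarray(vectors, dtype=np.float64)
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    scale = np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    clipped_sum = (vectors * scale).sum(axis=0)
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=clipped_sum.shape)
    return (clipped_sum + noise) / len(vectors)

# Example: 100 clients, 16-dimensional updates.
x = np.random.default_rng(0).normal(size=(100, 16))
print(dp_l2_mean(x, clip_norm=1.0, noise_multiplier=0.5))
```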
no code implementations • 5 Apr 2024 • Shanshan Wu, Zheng Xu, Yanxiang Zhang, Yuanbo Zhang, Daniel Ramage
Pre-training on public data is an effective method to improve the performance of federated learning (FL) with differential privacy (DP).
no code implementations • 12 Mar 2024 • Jae Hun Ro, Srinadh Bhojanapalli, Zheng Xu, Yanxiang Zhang, Ananda Theertha Suresh
Cross-device federated learning (FL) is a technique that trains a model on data distributed across what are typically millions of edge devices, without the data leaving those devices.
no code implementations • 26 Feb 2024 • Zheng Xu, Yulu Gong, Yanlin Zhou, Qiaozhi Bao, Wenpin Qian
With the continuous expansion of the scale of cloud computing applications, artificial intelligence technologies such as Deep Learning and Reinforcement Learning have gradually become key tools for automated task scheduling in large-scale cloud computing systems.
no code implementations • 21 Feb 2024 • Da Yu, Peter Kairouz, Sewoong Oh, Zheng Xu
Service providers of large language model (LLM) applications collect user instructions in the wild and use them in further aligning LLMs with users' intentions.
no code implementations • 16 Feb 2024 • Junfei Xiao, Zheng Xu, Alan Yuille, Shen Yan, Boyu Wang
Our research undertakes a thorough exploration of the state-of-the-art perceiver resampler architecture and builds a strong baseline.
no code implementations • 12 Jan 2024 • Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi
Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data.
no code implementations • 15 Dec 2023 • Zhongyi Zhou, Jing Jin, Vrushank Phadnis, Xiuxiu Yuan, Jun Jiang, Xun Qian, Jingtao Zhou, Yiyi Huang, Zheng Xu, Yinda Zhang, Kristen Wright, Jason Mayes, Mark Sherwood, Johnny Lee, Alex Olwal, David Kim, Ram Iyengar, Na Li, Ruofei Du
Our user study (N=16) showed that InstructPipe empowers novice users to streamline their workflow in creating desired ML pipelines, reduce their learning curve, and spark innovative ideas with open-ended commands.
no code implementations • 4 Dec 2023 • Zheng Xu
Given the rapid advance of intelligent transportation system (ITS) technologies, future mobility is moving toward vehicular autonomy.
no code implementations • 13 Oct 2023 • Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu
Fine-tuning is a common and effective method for tailoring large language models (LLMs) to specialized tasks and applications.
no code implementations • 29 May 2023 • Zheng Xu, Yanxiang Zhang, Galen Andrew, Christopher A. Choquette-Choo, Peter Kairouz, H. Brendan McMahan, Jesse Rosenstock, Yuanbo Zhang
We train language models (LMs) with federated learning (FL) and differential privacy (DP) in the Google Keyboard (Gboard).
no code implementations • 20 May 2023 • Boxin Wang, Yibo Jacky Zhang, Yuan Cao, Bo Li, H. Brendan McMahan, Sewoong Oh, Zheng Xu, Manzil Zaheer
We study (differentially) private federated learning (FL) of language models.
1 code implementation • 17 Mar 2023 • Alekh Agarwal, H. Brendan McMahan, Zheng Xu
As the adoption of federated learning increases for learning from sensitive data local to user devices, it is natural to ask if the learning can be done using implicit signals generated as users interact with the applications of interest, rather than requiring access to explicit labels which can be difficult to acquire in many tasks.
1 code implementation • 1 Mar 2023 • Natalia Ponomareva, Hussein Hazimeh, Alex Kurakin, Zheng Xu, Carson Denison, H. Brendan McMahan, Sergei Vassilvitskii, Steve Chien, Abhradeep Thakurta
However, while some adoption of DP has happened in industry, attempts to apply DP to real-world, complex ML models are still few and far between.
no code implementations • 6 Feb 2023 • Yae Jee Cho, Pranay Sharma, Gauri Joshi, Zheng Xu, Satyen Kale, Tong Zhang
Federated Averaging (FedAvg) and its variants are the most popular optimization algorithms in federated learning (FL).
no code implementations • 28 Jan 2023 • Yuanxi Wu, Zhi Wu, Wei Gu, Zheng Xu, Shu Zheng, Qirun Sun
With sustained attention to carbon neutrality, the personal carbon trading (PCT) scheme has been embraced as a promising paradigm for reducing carbon emissions.
1 code implementation • CVPR 2023 • Zheng Xu, Maxwell Collins, Yuxiao Wang, Liviu Panait, Sewoong Oh, Sean Augenstein, Ting Liu, Florian Schroff, H. Brendan McMahan
Small on-device models have been successfully trained with user-level differential privacy (DP) for next word prediction and image classification tasks in the past.
no code implementations • 15 Jul 2022 • Rui Qian, Yeqing Li, Zheng Xu, Ming-Hsuan Yang, Serge Belongie, Yin Cui
Utilizing vision and language models (VLMs) pre-trained on large-scale image-text pairs is becoming a promising paradigm for open-vocabulary visual recognition.
Ranked #1 on Zero-Shot Action Recognition on HMDB51
no code implementations • 21 Jun 2022 • Rudrajit Das, Satyen Kale, Zheng Xu, Tong Zhang, Sujay Sanghavi
Most prior results on differentially private stochastic gradient descent (DP-SGD) are derived under the simplistic assumption of uniform Lipschitzness, i.e., that the per-sample gradients are uniformly bounded.
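For reference, a minimal sketch of the per-sample clipping that enforces bounded sensitivity in standard DP-SGD (a generic illustration of the algorithm under discussion, not the paper's relaxed analysis; names are hypothetical):

```python
import numpy as np

def dp_sgd_step(w, per_sample_grads, lr=0.1, clip_norm=1.0,
                noise_multiplier=1.0, rng=None):
    """One DP-SGD step: clip each per-sample gradient to L2 norm
    <= clip_norm, sum, add Gaussian noise scaled to the clip norm,
    then average and apply the update."""
    rng = np.random.default_rng() if rng is None else rng
    g = np.asarray(per_sample_grads, dtype=np.float64)
    norms = np.linalg.norm(g, axis=1, keepdims=True)
    g = g * np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    noisy_mean = (g.sum(axis=0) +
                  rng.normal(0.0, noise_multiplier * clip_norm, size=w.shape)) / len(g)
    return w - lr * noisy_mean

w = np.zeros(4)
grads = np.random.default_rng(0).normal(size=(8, 4))
print(dp_sgd_step(w, grads))
```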
2 code implementations • 18 Jun 2022 • Shanshan Wu, Tian Li, Zachary Charles, Yu Xiao, Ziyu Liu, Zheng Xu, Virginia Smith
To better answer these questions, we propose Motley, a benchmark for personalized federated learning.
no code implementations • 9 Jun 2022 • Jianyu Wang, Rudrajit Das, Gauri Joshi, Satyen Kale, Zheng Xu, Tong Zhang
Motivated by this observation, we propose a new quantity, average drift at optimum, to measure the effects of data heterogeneity, and explicitly use it to present a new theoretical analysis of FedAvg.
no code implementations • 20 Apr 2022 • Ji Liu, Zheng Xu, Yanmei Zhang, Wei Dai, Hao Wu, Shiping Chen
Since the emergence of blockchain technology, its application to financial markets has been a continuing focus of exploration for many parties.
no code implementations • 6 Oct 2021 • Hakim Sidahmed, Zheng Xu, Ankush Garg, Yuan Cao, Mingqing Chen
Through extensive experiments, we empirically show that Federated learning of Partially Trainable neural networks (FedPT) can result in superior communication-accuracy trade-offs, with up to $46\times$ reduction in communication cost, at a small accuracy cost.
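A minimal sketch of the communication-saving idea, assuming a FedPT-style split of a model into frozen and trainable tensors (parameter names and shapes are hypothetical, not the paper's implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Shared frozen parameters: fixed at a common random init on every client.
frozen = {"embed": rng.normal(size=(1000, 64))}
# Trainable parameters: the only tensors exchanged with the server.
trainable = {"head": rng.normal(size=(64, 10)) * 0.01}

def client_update(trainable, local_grads, lr=0.1):
    """Clients update and transmit only the trainable tensors."""
    return {k: v - lr * local_grads[k] for k, v in trainable.items()}

def server_average(client_states):
    """Server averages the (small) trainable tensors only."""
    return {k: np.mean([s[k] for s in client_states], axis=0)
            for k in client_states[0]}

grads = {"head": np.ones((64, 10))}
trainable = server_average([client_update(trainable, grads) for _ in range(4)])

sent_floats = sum(v.size for v in trainable.values())
total_floats = sent_floats + sum(v.size for v in frozen.values())
print(f"communicated fraction: {sent_floats / total_floats:.3f}")
```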
no code implementations • ICLR 2022 • Chen Zhu, Zheng Xu, Mingqing Chen, Jakub Konečný, Andrew Hard, Tom Goldstein
Federated learning has been deployed to train machine learning models from decentralized client data on mobile devices in practice.
2 code implementations • 14 Jul 2021 • Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, Satyen Kale, Sai Praneeth Karimireddy, Jakub Konecny, Sanmi Koyejo, Tian Li, Luyang Liu, Mehryar Mohri, Hang Qi, Sashank J. Reddi, Peter Richtarik, Karan Singhal, Virginia Smith, Mahdi Soltanolkotabi, Weikang Song, Ananda Theertha Suresh, Sebastian U. Stich, Ameet Talwalkar, Hongyi Wang, Blake Woodworth, Shanshan Wu, Felix X. Yu, Honglin Yuan, Manzil Zaheer, Mi Zhang, Tong Zhang, Chunxiang Zheng, Chen Zhu, Wennan Zhu
Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection.
no code implementations • 30 Jun 2021 • Dezhong Yao, Wanning Pan, Yutong Dai, Yao Wan, Xiaofeng Ding, Hai Jin, Zheng Xu, Lichao Sun
Federated learning enables multiple clients to collaboratively learn a global model by periodically aggregating the clients' models without transferring the local data.
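A minimal sketch of the aggregation pattern described above, using a FedAvg-style round on a toy least-squares problem (illustrative only; not the specific algorithm proposed in this paper):

```python
import numpy as np

def fedavg_round(global_w, client_data, local_steps=5, lr=0.1):
    """One round of federated averaging: each client runs a few local
    gradient steps from the current global model, and the server
    averages the resulting client models weighted by client size."""
    client_models, sizes = [], []
    for X, y in client_data:
        w = global_w.copy()
        for _ in range(local_steps):
            w -= lr * X.T @ (X @ w - y) / len(y)   # local gradient step
        client_models.append(w)
        sizes.append(len(y))
    weights = np.asarray(sizes, dtype=np.float64) / sum(sizes)
    return sum(wt * m for wt, m in zip(weights, client_models))

rng = np.random.default_rng(0)
clients = [(rng.normal(size=(20, 5)), rng.normal(size=20)) for _ in range(3)]
w = np.zeros(5)
for _ in range(10):
    w = fedavg_round(w, clients)
print(w)
```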
no code implementations • 4 Jun 2021 • Jianyu Wang, Zheng Xu, Zachary Garrett, Zachary Charles, Luyang Liu, Gauri Joshi
Popular optimization algorithms of FL use vanilla (stochastic) gradient descent for both local updates at clients and global updates at the aggregating server.
2 code implementations • 26 Feb 2021 • Peter Kairouz, Brendan McMahan, Shuang Song, Om Thakkar, Abhradeep Thakurta, Zheng Xu
We consider training models with differential privacy (DP) using mini-batch gradients.
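A minimal sketch of the classic tree-aggregation mechanism that underlies DP-FTRL-style training, here on scalar per-step values (illustrative, not the paper's exact construction):

```python
import numpy as np

def noisy_prefix_sums(values, sigma=1.0, rng=None):
    """Tree-aggregation estimate of all prefix sums of a stream.

    Each dyadic interval gets one Gaussian-noised partial sum; a prefix
    sum is reconstructed from O(log T) such nodes, so every prefix
    estimate accumulates only O(log T) noise terms."""
    rng = np.random.default_rng() if rng is None else rng
    values = np.asarray(values, dtype=np.float64)
    noisy_node = {}  # (start, length) -> noised partial sum

    def node(start, length):
        key = (start, length)
        if key not in noisy_node:
            noisy_node[key] = values[start:start + length].sum() + rng.normal(0.0, sigma)
        return noisy_node[key]

    prefixes = []
    for t in range(1, len(values) + 1):
        total, start, remaining = 0.0, 0, t
        while remaining > 0:
            length = 1 << (remaining.bit_length() - 1)  # largest dyadic block
            total += node(start, length)
            start += length
            remaining -= length
        prefixes.append(total)
    return np.array(prefixes)

print(noisy_prefix_sums(np.ones(8), sigma=0.1))
```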
2 code implementations • NeurIPS 2021 • Chen Zhu, Renkun Ni, Zheng Xu, Kezhi Kong, W. Ronny Huang, Tom Goldstein
Innovations in neural architectures have fostered significant breakthroughs in language modeling and computer vision.
Ranked #137 on Image Classification on CIFAR-10
no code implementations • 14 Oct 2020 • Chen Zhu, Zheng Xu, Ali Shafahi, Manli Shu, Amin Ghiasi, Tom Goldstein
Further, we demonstrate that the compact structure and corresponding initialization from the Lottery Ticket Hypothesis can also help in data-free training.
no code implementations • 30 May 2020 • Zheng Xu, Ali Shafahi, Tom Goldstein
Our adaptive networks also outperform larger, widened non-adaptive architectures that have 1.5 times more parameters.
8 code implementations • 10 Dec 2019 • Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D'Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, Justin Hsu, Martin Jaggi, Tara Javidi, Gauri Joshi, Mikhail Khodak, Jakub Konečný, Aleksandra Korolova, Farinaz Koushanfar, Sanmi Koyejo, Tancrède Lepoint, Yang Liu, Prateek Mittal, Mehryar Mohri, Richard Nock, Ayfer Özgür, Rasmus Pagh, Mariana Raykova, Hang Qi, Daniel Ramage, Ramesh Raskar, Dawn Song, Weikang Song, Sebastian U. Stich, Ziteng Sun, Ananda Theertha Suresh, Florian Tramèr, Praneeth Vepakomma, Jianyu Wang, Li Xiong, Zheng Xu, Qiang Yang, Felix X. Yu, Han Yu, Sen Zhao
FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches.
no code implementations • 25 Sep 2019 • Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein
Through novel theoretical and experimental results, we show how the neural net architecture affects gradient confusion, and thus the efficiency of training.
6 code implementations • NeurIPS 2019 • Ali Shafahi, Mahyar Najibi, Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, Tom Goldstein
Adversarial training, in which a network is trained on adversarial examples, is one of the few defenses against adversarial attacks that withstands strong attacks.
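A minimal sketch of the adversarial-training recipe on a logistic-regression toy model, using an FGSM-style perturbation (shown only to illustrate the general recipe described above, not the paper's specific cost-saving variant; all names are illustrative):

```python
import numpy as np

def fgsm_adversarial_step(w, X, y, lr=0.1, eps=0.1):
    """One adversarial-training step for logistic regression: perturb
    each input by eps * sign(grad_x loss), then take a gradient step
    on the perturbed batch."""
    def grads(w, X, y):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))   # sigmoid predictions
        gx = np.outer(p - y, w)              # d loss / d X (per sample)
        gw = X.T @ (p - y) / len(y)          # d loss / d w (batch mean)
        return gx, gw

    gx, _ = grads(w, X, y)
    X_adv = X + eps * np.sign(gx)            # craft adversarial examples
    _, gw = grads(w, X_adv, y)
    return w - lr * gw

rng = np.random.default_rng(0)
X, y = rng.normal(size=(32, 5)), rng.integers(0, 2, size=32)
print(fgsm_adversarial_step(np.zeros(5), X, y))
```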
no code implementations • 15 Apr 2019 • Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein
Our results show that, for popular initialization techniques, increasing the width of neural networks leads to lower gradient confusion, and thus faster model training.
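A minimal sketch of how pairwise gradient agreement can be measured, assuming the paper's notion of gradient confusion as a bound on negative inner products between per-sample gradients (function name is illustrative):

```python
import numpy as np

def gradient_confusion(per_sample_grads):
    """Estimate gradient confusion: the magnitude of the most negative
    inner product between pairs of per-sample gradients. Larger values
    mean samples pull the parameters in conflicting directions, which
    slows SGD; 0 means all pairs agree."""
    g = np.asarray(per_sample_grads, dtype=np.float64)
    inner = g @ g.T
    off_diag = inner[~np.eye(len(g), dtype=bool)]
    return -min(off_diag.min(), 0.0)

g = np.random.default_rng(0).normal(size=(32, 100))  # 32 samples, 100 params
print(gradient_confusion(g))
```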
no code implementations • 27 Nov 2018 • Ali Shafahi, Mahyar Najibi, Zheng Xu, John Dickerson, Larry S. Davis, Tom Goldstein
Standard adversarial attacks change the predicted class label of a selected image by adding specially tailored small perturbations to its pixels.
no code implementations • 16 Jul 2018 • Chaochao Yan, Jiawen Yao, Ruoyu Li, Zheng Xu, Junzhou Huang
The chest X-ray is one of the most commonly available and affordable radiological examinations in clinical practice.
1 code implementation • 28 Jun 2018 • Chen Du, Byeongkeun Kang, Zheng Xu, Ji Dai, Truong Nguyen
To overcome these problems, we propose a novel method consisting of segmentation using convolutional neural networks and a fast/robust recovery algorithm.
1 code implementation • 25 May 2018 • Zheng Xu, Michael Wilber, Chen Fang, Aaron Hertzmann, Hailin Jin
We propose a fast feed-forward network for arbitrary style transfer, which can generate stylized images for previously unseen content and style image pairs.
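As a concrete point of reference, a sketch of adaptive instance normalization, one common building block in feed-forward arbitrary style transfer; this illustrates the problem setting, not necessarily the network proposed in the paper:

```python
import numpy as np

def adain(content_feat, style_feat, eps=1e-5):
    """Adaptive instance normalization: re-normalize content features so
    their per-channel mean/std match the style features. Feature maps
    have shape (channels, height, width)."""
    c_mean = content_feat.mean(axis=(1, 2), keepdims=True)
    c_std = content_feat.std(axis=(1, 2), keepdims=True) + eps
    s_mean = style_feat.mean(axis=(1, 2), keepdims=True)
    s_std = style_feat.std(axis=(1, 2), keepdims=True) + eps
    return s_std * (content_feat - c_mean) / c_std + s_mean

rng = np.random.default_rng(0)
print(adain(rng.normal(size=(8, 4, 4)), rng.normal(size=(8, 4, 4))).shape)
```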
no code implementations • 8 May 2018 • Zheng Xu, Xitong Yang, Xue Li, Xiaoshuai Sun
We propose a novel deep neural network architecture for the challenging problem of single image dehazing, which aims to recover the clear image from a degraded hazy image.
1 code implementation • 17 Mar 2018 • Yen-Chang Hsu, Zheng Xu, Zsolt Kira, Jiawei Huang
We utilize the most fundamental property of instance labeling -- the pairwise relationship between pixels -- as the supervision to formulate the learning objective, then apply it to train a fully convolutional network (FCN) for learning to perform pixel-wise clustering.
Ranked #13 on Lane Detection on TuSimple
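A minimal sketch of pairwise supervision for pixel-wise clustering, assuming softmax cluster assignments per pixel and a binary same-instance target matrix (names and shapes are illustrative):

```python
import numpy as np

def pairwise_cluster_loss(probs, same_instance):
    """Binary cross-entropy on pairwise pixel relationships.

    probs: (n_pixels, n_clusters) softmax outputs of the network.
    same_instance: (n_pixels, n_pixels) boolean matrix, True when two
    pixels belong to the same instance. The probability that two pixels
    fall in the same cluster is the inner product of their cluster
    distributions; the loss pushes it toward 1 for same-instance pairs
    and toward 0 otherwise."""
    same_prob = np.clip(probs @ probs.T, 1e-7, 1 - 1e-7)
    target = same_instance.astype(np.float64)
    bce = -(target * np.log(same_prob) + (1 - target) * np.log(1 - same_prob))
    return bce.mean()

probs = np.full((6, 3), 1.0 / 3)                     # uniform assignments
labels = np.array([0, 0, 1, 1, 2, 2])
print(pairwise_cluster_loss(probs, labels[:, None] == labels[None, :]))
```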
11 code implementations • ICLR 2018 • Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein
Neural network training relies on our ability to find "good" minimizers of highly non-convex loss functions.
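A minimal sketch of slicing the loss surface along a normalized random direction, the kind of visualization this line of work builds on (here the direction is rescaled per parameter tensor as a stand-in for the paper's per-filter normalization; names are illustrative):

```python
import numpy as np

def loss_slice(loss_fn, params, alphas, rng=None):
    """Evaluate the loss along a 1-D slice w + alpha * d, where the
    random direction d is rescaled per parameter tensor to match the
    norm of the corresponding weights, so slices are comparable."""
    rng = np.random.default_rng() if rng is None else rng
    direction = []
    for p in params:
        d = rng.normal(size=p.shape)
        d *= np.linalg.norm(p) / (np.linalg.norm(d) + 1e-10)  # normalize direction
        direction.append(d)
    return [loss_fn([p + a * d for p, d in zip(params, direction)]) for a in alphas]

# Toy example: quadratic "loss" over two weight tensors.
params = [np.ones((3, 3)), np.ones(3)]
loss = lambda ps: sum((p ** 2).sum() for p in ps)
print(loss_slice(loss, params, alphas=np.linspace(-1, 1, 5)))
```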
no code implementations • 2 Sep 2017 • Zheng Xu, Yen-Chang Hsu, Jiawei Huang
There is increasing interest in accelerating neural networks for real-time applications.
1 code implementation • 14 Aug 2017 • Feiyun Zhu, Jun Guo, Zheng Xu, Peng Liao, Junzhou Huang
With the popularity of smartphones and wearable devices, mobile health (mHealth) technologies promise to bring broad, positive impacts on people's health.
no code implementations • ICML 2017 • Zheng Xu, Gavin Taylor, Hao Li, Mario Figueiredo, Xiaoming Yuan, Tom Goldstein
The alternating direction method of multipliers (ADMM) is commonly used for distributed model fitting problems, but its performance and reliability depend strongly on user-defined penalty parameters.
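For contrast with user-defined penalties, a sketch of the classic residual-balancing heuristic for adapting the ADMM penalty (shown for context; the paper proposes an adaptive selection scheme of its own):

```python
def update_penalty(rho, primal_res, dual_res, mu=10.0, tau=2.0):
    """Residual-balancing rule for the ADMM penalty parameter: increase
    rho when the primal residual dominates, decrease it when the dual
    residual dominates, otherwise leave it unchanged."""
    if primal_res > mu * dual_res:
        return rho * tau
    if dual_res > mu * primal_res:
        return rho / tau
    return rho

print(update_penalty(1.0, primal_res=5.0, dual_res=0.1))  # -> 2.0
```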
no code implementations • NeurIPS 2017 • Hao Li, Soham De, Zheng Xu, Christoph Studer, Hanan Samet, Tom Goldstein
Currently, deep neural networks are deployed on low-power portable devices by first training a full-precision model using powerful hardware, and then deriving a corresponding low-precision model for efficient inference on such systems.
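A minimal sketch of how a low-precision model can be derived from trained full-precision weights via uniform quantization (an illustration of the deployment pipeline described above, not the paper's analysis):

```python
import numpy as np

def quantize(weights, num_bits=8):
    """Uniform symmetric quantization of a full-precision weight tensor
    to num_bits signed integers, returning the dequantized weights, the
    integer codes, and the scale factor."""
    levels = 2 ** (num_bits - 1) - 1
    scale = np.abs(weights).max() / levels
    q = np.clip(np.round(weights / scale), -levels, levels)
    return q * scale, q.astype(np.int8), scale

w = np.random.default_rng(0).normal(size=(4, 4))
w_hat, codes, scale = quantize(w)
print(np.abs(w - w_hat).max())   # quantization error
```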
1 code implementation • ICLR 2018 • Abhay Yadav, Sohil Shah, Zheng Xu, David Jacobs, Tom Goldstein
Adversarial neural networks solve many important problems in data science, but are notoriously difficult to train.
no code implementations • CVPR 2017 • Zheng Xu, Mario A. T. Figueiredo, Xiaoming Yuan, Christoph Studer, Tom Goldstein
Relaxed ADMM is a generalization of ADMM that often achieves better performance, but its efficiency depends strongly on algorithm parameters that must be chosen by an expert user.
1 code implementation • 10 Dec 2016 • Zheng Xu, Furong Huang, Louiqa Raschid, Tom Goldstein
We propose an algorithm for the non-negative factorization of an occurrence tensor built from heterogeneous networks.
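As a simpler point of reference, a sketch of multiplicative-update non-negative matrix factorization, the matrix special case of non-negative factorization (the paper's algorithm targets an occurrence tensor and is not reproduced here):

```python
import numpy as np

def nmf(V, rank=5, iters=200, rng=None):
    """Multiplicative-update NMF: factor V ~ W @ H with W, H >= 0."""
    rng = np.random.default_rng() if rng is None else rng
    W = rng.random((V.shape[0], rank))
    H = rng.random((rank, V.shape[1]))
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + 1e-10)
        W *= (V @ H.T) / (W @ H @ H.T + 1e-10)
    return W, H

V = np.abs(np.random.default_rng(0).normal(size=(30, 20)))
W, H = nmf(V, rank=4)
print(np.linalg.norm(V - W @ H))
```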
no code implementations • 10 Dec 2016 • Zheng Xu, Soham De, Mario Figueiredo, Christoph Studer, Tom Goldstein
The alternating direction method of multipliers (ADMM) is a common optimization tool for solving constrained and non-differentiable problems.
no code implementations • 24 May 2016 • Zheng Xu, Mario A. T. Figueiredo, Tom Goldstein
The alternating direction method of multipliers (ADMM) is a versatile tool for solving a wide range of constrained optimization problems, with differentiable or non-differentiable objective functions.
2 code implementations • 6 May 2016 • Gavin Taylor, Ryan Burmeister, Zheng Xu, Bharat Singh, Ankit Patel, Tom Goldstein
With the growing importance of large network models and enormous training datasets, GPUs have become increasingly necessary to train neural networks.
no code implementations • 14 Feb 2016 • Zheng Xu, Douglas Burdick, Louiqa Raschid
To our knowledge, our proposed solutions, Dict-based NER and Rank-based ER, and the root and suffix dictionaries, are the first attempt to exploit specialized knowledge, i.e., lists of FI names, for rule-based NER and ER.