Search Results for author: Kevin Lu

Found 13 papers, 9 papers with code

Empowering Federated Learning for Massive Models with NVIDIA FLARE

no code implementations • 12 Feb 2024 • Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge.

Federated Learning

NVIDIA FLARE: Federated Learning from Simulation to Real-World

1 code implementation • 24 Oct 2022 • Holger R. Roth, Yan Cheng, Yuhong Wen, Isaac Yang, Ziyue Xu, Yuan-Ting Hsieh, Kristopher Kersten, Ahmed Harouni, Can Zhao, Kevin Lu, Zhihong Zhang, Wenqi Li, Andriy Myronenko, Dong Yang, Sean Yang, Nicola Rieke, Abood Quraini, Chester Chen, Daguang Xu, Nic Ma, Prerna Dogra, Mona Flores, Andrew Feng

Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data.

Federated Learning Privacy Preserving

540

PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation

1 code implementation • COLING 2022 • Sedrick Scott Keh, Kevin Lu, Varun Gangal, Steven Y. Feng, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy

To this end, we propose PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation.

Sentence

URLB: Unsupervised Reinforcement Learning Benchmark

1 code implementation • 28 Oct 2021 • Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel

Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks.

Continuous Control reinforcement-learning +2

321

Pretraining for Language Conditioned Imitation with Transformers

no code implementations • 29 Sep 2021 • Aaron L Putterman, Kevin Lu, Igor Mordatch, Pieter Abbeel

We study reinforcement learning (RL) agents which can utilize language inputs.

reinforcement-learning Reinforcement Learning (RL)

Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models

1 code implementation • AKBC Workshop CSKB 2021 • Steven Y. Feng, Kevin Lu, Zhuofu Tao, Malihe Alikhani, Teruko Mitamura, Eduard Hovy, Varun Gangal

We investigate the use of multimodal information contained in images as an effective method for enhancing the commonsense of Transformer models for text generation.

Concept-To-Text Generation Specificity +1

Decision Transformer: Reinforcement Learning via Sequence Modeling

16 code implementations • NeurIPS 2021 • Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch

In particular, we present Decision Transformer, an architecture that casts the problem of RL as conditional sequence modeling.

Ranked #3 on Offline RL on D4RL

Atari Games D4RL +5

2,551

Pretrained Transformers as Universal Computation Engines

4 code implementations • 9 Mar 2021 • Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch

We investigate the capability of a transformer pretrained on natural language to generalize to other modalities with minimal finetuning -- in particular, without finetuning of the self-attention and feedforward layers of the residual blocks.

239

Reset-Free Lifelong Learning with Skill-Space Planning

1 code implementation • ICLR 2021 • Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch

We propose Lifelong Skill Planning (LiSP), an algorithmic framework for non-episodic lifelong RL based on planning in an abstract space of higher-order skills.

Reinforcement Learning (RL)

Weakly supervised one-stage vision and language disease detection using large scale pneumonia and pneumothorax studies

1 code implementation • 31 Jul 2020 • Leo K. Tam, Xiaosong Wang, Evrim Turkbey, Kevin Lu, Yuhong Wen, Daguang Xu

The architectural modifications address three obstacles -- implementing a supervised vision and language detection method in a weakly supervised fashion, incorporating clinical referring expression natural language information, and generating high fidelity detections with map probabilities.

Head Detection Referring Expression

Efficient Empowerment Estimation for Unsupervised Stabilization

no code implementations • ICLR 2021 • Ruihan Zhao, Kevin Lu, Pieter Abbeel, Stas Tiomkin

We demonstrate our solution for sample-based unsupervised stabilization on different dynamical control systems and show the advantages of our method by comparing it to the existing VLB approaches.

Adaptive Online Planning for Continual Lifelong Learning

1 code implementation • 3 Dec 2019 • Kevin Lu, Igor Mordatch, Pieter Abbeel

We study learning control in an online reset-free lifelong learning scenario, where mistakes can compound catastrophically into the future and the underlying dynamics of the environment may change.

Character-Based Models for Adversarial Phone Extraction: Preventing Human Sex Trafficking

no code implementations • WS 2019 • Nathanael Chambers, Timothy Forman, Catherine Griswold, Kevin Lu, Yogaish Khastgir, Stephen Steckler

Illicit activity on the Web often uses noisy text to obscure information between client and seller, such as the seller's phone number.

Data Augmentation Language Modelling
