no code implementations • 12 Apr 2024 • Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva
A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hallucinations.
no code implementations • 16 Nov 2023 • Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik R Narasimhan, Ameet Deshpande
We facilitate systematic evaluation in this new paradigm by introducing GEO-bench, a benchmark of diverse user queries across multiple domains, coupled with sources required to answer these queries.
1 code implementation • 19 Oct 2023 • Aman Madaan, Pranjal Aggarwal, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Mausam, Manaal Faruqui
Large language models (LLMs) are now available from cloud API providers in various sizes and configurations.
1 code implementation • 19 May 2023 • Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam
A popular approach for improving the correctness of output from large language models (LLMs) is Self-Consistency: poll the LLM multiple times and output the most frequent solution.
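The majority-vote idea can be sketched in a few lines. This is an illustrative sketch only, not the paper's implementation; `query_llm` is a hypothetical callable standing in for a sampled model call.

```python
from collections import Counter

def self_consistency(query_llm, prompt, n_samples=5):
    """Poll the model n_samples times and return the most frequent answer."""
    answers = [query_llm(prompt) for _ in range(n_samples)]
    # most_common(1) yields [(answer, count)] for the modal answer.
    return Counter(answers).most_common(1)[0][0]

# Stub standing in for stochastic LLM samples, for illustration only.
_samples = iter(["42", "41", "42", "42", "17"])
result = self_consistency(lambda p: next(_samples), "What is 6*7?")
print(result)  # "42" wins the vote (3 of 5 samples)
```

Note that the fixed sample budget is what the paper revisits: polling a constant number of times spends the same cost on easy and hard queries alike.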
1 code implementation • 26 Jan 2023 • Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan
In this paper, we develop SemSup-XC, a model that achieves state-of-the-art zero-shot and few-shot performance on three XC datasets derived from legal, e-commerce, and Wikipedia data.
1 code implementation • 14 Nov 2022 • Pranjal Aggarwal, Pasupuleti Chandana, Jagrut Nemade, Shubham Sharma, Sunil Saumya, Shankar Biradar
Since personal computers became widely available in the consumer market, the amount of harmful content on the internet has significantly expanded.