Search Results for author: Shangdi Yu

Found 7 papers, 4 papers with code

Approximate Nearest Neighbor Search with Window Filters

1 code implementation1 Feb 2024 Joshua Engels, Benjamin Landrum, Shangdi Yu, Laxman Dhulipala, Julian Shun

We define and investigate the problem of $\textit{c-approximate window search}$: approximate nearest neighbor search where each point in the dataset has a numeric label, and the goal is to find nearest neighbors to queries within arbitrary label ranges.

Image Retrieval

PECANN: Parallel Efficient Clustering with Graph-Based Approximate Nearest Neighbor Search

no code implementations6 Dec 2023 Shangdi Yu, Joshua Engels, Yihao Huang, Julian Shun

In particular, we study variants of density peaks clustering, a popular type of algorithm that has been shown to work well in practice.

Clustering

AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising

no code implementations13 Nov 2023 Zhen Guo, Shangdi Yu

Under the assumption that human-written text resides outside the distribution of machine-generated text, AuthentiGPT leverages a black-box LLM to denoise input text with artificially added noise, and then semantically compares the denoised text with the original to determine if the content is machine-generated.

Denoising Language Modelling

Improving Small Language Models on PubMedQA via Generative Data Augmentation

no code implementations12 May 2023 Zhen Guo, Peiqi Wang, Yanwei Wang, Shangdi Yu

Large Language Models (LLMs) have made remarkable advancements in the field of natural language processing.

Data Augmentation Question Answering

ParChain: A Framework for Parallel Hierarchical Agglomerative Clustering using Nearest-Neighbor Chain

2 code implementations8 Jun 2021 Shangdi Yu, Yiqiu Wang, Yan Gu, Laxman Dhulipala, Julian Shun

This paper studies the hierarchical clustering problem, where the goal is to produce a dendrogram that represents clusters at varying scales of a data set.

Clustering

Fast Parallel Algorithms for Euclidean Minimum Spanning Tree and Hierarchical Spatial Clustering

1 code implementation2 Apr 2021 Yiqiu Wang, Shangdi Yu, Yan Gu, Julian Shun

Our approach is based on generating a well-separated pair decomposition followed by using Kruskal's minimum spanning tree algorithm and bichromatic closest pair computations.

Clustering

Modeling and Analysis of Tagging Networks in Stack Exchange Communities

1 code implementation6 Feb 2019 Xiang Fu, Shangdi Yu, Austin R. Benson

Large Question-and-Answer (Q&A) platforms support diverse knowledge curation on the Web.

TAG

Cannot find the paper you are looking for? You can Submit a new open access paper.