1 code implementation • 1 Feb 2024 • Joshua Engels, Benjamin Landrum, Shangdi Yu, Laxman Dhulipala, Julian Shun
We define and investigate the problem of $\textit{c-approximate window search}$: approximate nearest neighbor search where each point in the dataset has a numeric label, and the goal is to find nearest neighbors to queries within arbitrary label ranges.
no code implementations • 6 Dec 2023 • Shangdi Yu, Joshua Engels, Yihao Huang, Julian Shun
In particular, we study variants of density peaks clustering, a popular type of algorithm that has been shown to work well in practice.
no code implementations • 13 Nov 2023 • Zhen Guo, Shangdi Yu
Under the assumption that human-written text resides outside the distribution of machine-generated text, AuthentiGPT leverages a black-box LLM to denoise input text with artificially added noise, and then semantically compares the denoised text with the original to determine if the content is machine-generated.
no code implementations • 12 May 2023 • Zhen Guo, Peiqi Wang, Yanwei Wang, Shangdi Yu
Large Language Models (LLMs) have made remarkable advancements in the field of natural language processing.
2 code implementations • 8 Jun 2021 • Shangdi Yu, Yiqiu Wang, Yan Gu, Laxman Dhulipala, Julian Shun
This paper studies the hierarchical clustering problem, where the goal is to produce a dendrogram that represents clusters at varying scales of a data set.
1 code implementation • 2 Apr 2021 • Yiqiu Wang, Shangdi Yu, Yan Gu, Julian Shun
Our approach is based on generating a well-separated pair decomposition followed by using Kruskal's minimum spanning tree algorithm and bichromatic closest pair computations.
1 code implementation • 6 Feb 2019 • Xiang Fu, Shangdi Yu, Austin R. Benson
Large Question-and-Answer (Q&A) platforms support diverse knowledge curation on the Web.