1 code implementation • 2 Feb 2024 • Zach Nussbaum, John X. Morris, Brandon Duderstadt, Andriy Mulyar
This technical report describes the training of nomic-embed-text-v1, the first fully reproducible, open-source, open-weights, open-data, 8192 context length English text embedding model that outperforms both OpenAI Ada-002 and OpenAI text-embedding-3-small on short and long-context tasks.
1 code implementation • 6 Nov 2023 • Yuvanesh Anand, Zach Nussbaum, Adam Treat, Aaron Miller, Richard Guo, Ben Schmidt, GPT4All Community, Brandon Duderstadt, Andriy Mulyar
It is our hope that this paper acts as both a technical overview of the original GPT4All models as well as a case study on the subsequent growth of the GPT4All open source ecosystem.
no code implementations • ACL 2020 • Elliot Schumacher, Andriy Mulyar, Mark Dredze
We propose an approach to concept linking that leverages recent work in contextualized neural models, such as ELMo (Peters et al. 2018), which create a token representation that integrates the surrounding context of the mention and concept name.
2 code implementations • 21 Apr 2020 • Andriy Mulyar, Bridget T. McInnes
Clinical notes contain an abundance of important but not-readily accessible information about patients.
2 code implementations • 30 Oct 2019 • Andriy Mulyar, Elliot Schumacher, Masoud Rouhizadeh, Mark Dredze
Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions.
Ranked #1 on Clinical Note Phenotyping on I2B2 2006: Smoking