no code implementations • Preprint 2024 • Anthropic
We introduce Claude 3, a new family of large multimodal models – Claude 3 Opus, our most capable offering, Claude 3 Sonnet, which provides a combination of skills and speed, and Claude 3 Haiku, our fastest and least expensive model.
Ranked #3 on Multi-task Language Understanding on MMLU
no code implementations • Technical Report 2023 • Anthropic
Our work using human evaluations to test model safety is most thoroughly documented in our paper “Red-Teaming Language Models to Reduce Harms” [4], while our recent work on automated safety evaluation is “Discovering Language Model Behaviors with Model-Written Evaluations” [7].
Ranked #1 on Question Answering on QuALITY