We collect Chinese medical records online from diverse departments. The benchmark assesses the real-time interactive consultation capabilities of LLMs across three critical dimensions.
First, it examines their ability to identify patient symptoms, highlighting the importance of actively seeking relevant information. Second, it assesses the comprehensiveness of their medical examinations, specifically their adeptness in selecting and administering a suitable range of tests. Third, it measures the accuracy and professionalism of their diagnoses, checking whether they meet the standards of medical practice.