HRS-Bench is an evaluation benchmark for text-to-image (T2I) models that is designed to be Holistic, Reliable, and Scalable. It measures 13 skills grouped into five major categories: accuracy, robustness, generalization, fairness, and bias. HRS-Bench also covers 50 scenarios, including fashion, animals, transportation, food, and clothes.
Source: HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models