CVTG-2K

CVTG-2K (Chinese Visual Text Generation 2K) is a benchmark for evaluating text-to-image models on their ability to accurately render text within generated images. It measures Word Accuracy, Normalized Edit Distance (NED), and CLIPScore across 2,000 prompts.

GLM-Image from Zhipu AI currently leads the CVTG-2K leaderboard with a score of 0.912 across 1 evaluated AI models.