Social IQa
The first large-scale benchmark for commonsense reasoning about social situations. Contains 38,000 multiple choice questions probing emotional and social intelligence in everyday situations, testing commonsense understanding of social interactions and theory of mind reasoning about the implied emotions and behavior of others.
Phi-3.5-MoE-instruct from Microsoft currently leads the Social IQa leaderboard with a score of 0.780 across 9 evaluated AI models.