Skip to content

Evaluation Task and Evaluation Data Introduction

Vision Domain Evaluation (CV)

Vision domain (CV) mainly evaluates the three major capabilities of models in downstream tasks: 1) perceptual capability, including local perception and temporal perception; 2) analytical capability, including global analysis, local analysis, and temporal analysis; 3) understanding capability, including analogy, induction, and reasoning.

Currently, it includes the following evaluation tasks:

Click the task name to view the task details. The task details include task introduction, evaluation data introduction, and evaluation metrics introduction.