Breadcrumb

Introduction

The Artificial Intelligence Evaluation Center (AIEC) has been established to promote localized AI evaluation and third-party certification in Taiwan, thereby strengthening the development of trusted AI within the industry. The Center will periodically publish benchmark evaluation results for language models. In addition to adopting indicators based on the Chinese Language and Social Studies sections of the national high school entrance examination, AIEC also incorporates evaluation criteria reflecting Taiwanese values, aligning with global trends in AI sovereignty. These benchmarks serve as key references for developing locally adapted models or fine-tuning international models.

✪依開發單位區域排序,淺橘為歐洲模型;淺藍為美國模型;淺綠為本土模型;淺紫為中國模型 

✪百分比數字說明:高於50%以綠色標註 ; 低於50%以粉紅色標註

  • 語言模型基準(benchmark) / 小模型(13B以下)

 
 

  • 語言模型基準(benchmark) / 大模型(13B以上)