Skip to Main Content

[標題]最新消息

Test Results of the October 2025 Open Source Models

Sorting by Developer Region: Light orange represents European models; light blue represents U.S. models; light green represents domestic models; and light purple represents Chinese models.

Percentage Labeling: Values above 50% are highlighted in green, while values below 50% are highlighted in pink.

  • Language Model Benchmarks / Small Models (13B and below)
Language Model Benchmark / Small Models (13B and below), please refer to the “Small” worksheet in the files below, “Test Results of the October 2025 OpenSource Models(Large and Small Models).ods” or “Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx
  • Language Model Benchmarks / Large Models (above 13B)
Language Model Benchmark / Large Models (above 13B), please refer to the “Large” worksheet in the files below, “Test Results of the October 2025 OpenSource Models(Large and Small Models).ods” or “Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx 
 
 
Downloads:
Test Results of the October 2025 OpenSource Models(Large and Small Models).ods
Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx