Test Results of the October 2025 Open Source Models

[標題]最新消息

Test Results of the October 2025 Open Source Models

✪ Sorting by Developer Region: Light orange represents European models; light blue represents U.S. models; light green represents domestic models; and light purple represents Chinese models.

✪ Percentage Labeling: Values above 50% are highlighted in green, while values below 50% are highlighted in pink.

Language Model Benchmarks / Small Models (13B and below)

Language Model Benchmark / Small Models (13B and below), please refer to the “Small” worksheet in the files below, “Test Results of the October 2025 OpenSource Models(Large and Small Models).ods” or “Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx

Language Model Benchmarks / Large Models (above 13B)

Language Model Benchmark / Large Models (above 13B), please refer to the “Large” worksheet in the files below, “Test Results of the October 2025 OpenSource Models(Large and Small Models).ods” or “Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx

Downloads:

Test Results of the October 2025 OpenSource Models(Large and Small Models).ods

Test Results of the October 2025 OpenSource Models(Large and Small Models).xlsx

Latest News

Test Results of the October 2025 Open Source Models