Test Results of the February 2026 Open Source Models

✪ Models are sorted by the region of the developing organization and color-coded: light orange for European models, light blue for U.S. models, light green for local models, and light purple for Chinese models.

✪Explanation of percentage values: figures above 50% are marked in black; figures below 50% are marked in red.

✪ This test cycle adds 6 new small models.

Language Model Benchmark / Small Models (13B and below)

For the small-model results, please refer to the “Small” worksheet in either of the files below: “Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.ods” or “Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.xlsx”.

Language Model Benchmark / Large Models (above 13B)

For the large-model results, please refer to the “Large” worksheet in either of the files below: “Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.ods” or “Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.xlsx”.

Downloads:

Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.ods

Test Results of the February 2026 OpenSource Models(Large and Small Models)v0.2_1150608.xlsx
