Sample data: Phase-1 sample snapshot. Official crawling and weekly benchmark jobs are not connected yet. All price, latency and score values validate the product structure only and must be replaced by traceable production data before launch.

Model comparison

The `models=a,b,c` URL parameter already drives the comparison page; selectors and saved comparisons come next.

ModelInputOutputTTFTContextValueUpdated
R1DeepSeek R1DeepSeek · closed$0.55/1M$2.19/1M224ms128K832026-06-09Compare
DSDeepSeek V3DeepSeek · closed$0.14/1M$0.28/1M124ms128K962026-06-09Compare
R1

DeepSeek R1

DeepSeek · Reasoning task bucket sample leader for cost-sensitive usage.

Quality
90
Chinese
91
DS

DeepSeek V3

DeepSeek · Strong value baseline for coding and Chinese tasks in the sample set.

Quality
86
Chinese
93