OSS-first docs

These docs teach the open system first: contracts, generated surfaces, runtimes, governance, and incremental adoption. Studio shows up as the operating layer on top, not as the source of truth.

Run custom benchmark

Launch a custom benchmark evaluation against a model.

Key
Run custom benchmark
Version
Type
Title
Run custom benchmark
Description

Launch a custom benchmark evaluation against a model.

Tags
ai,ranking,custom,eval
Owners
Stability
public

Evaluates model performance using internal eval suites with configurable parameters.