OSS-first docs

These docs teach the open system first: contracts, generated surfaces, runtimes, governance, and incremental adoption. Studio shows up as the operating layer on top, not as the source of truth.

provider-ranking.benchmark.run-custom

Launch a custom benchmark evaluation against a specific model.

Type: operation (command)

Version: 1.0.0

Tags: custom, eval

File: packages/libs/contracts-spec/src/provider-ranking/commands/benchmarkRunCustom.command.ts

Goal

Evaluate model performance using internal eval suites.

Context

Used by operators to run proprietary benchmarks and compare models.

Source Definition

export const BenchmarkRunCustomCommand = defineCommand({
  meta: {
    key: 'provider-ranking.benchmark.run-custom',
    title: 'Run Custom Benchmark',
    version: '1.0.0',
    description:
      'Launch a custom benchmark evaluation against a specific model.',
    goal: 'Evaluate model performance using internal eval suites.',
    context:
      'Used by operators to run proprietary benchmarks and compare models.',
    domain: PROVIDER_RANKING_DOMAIN,
    owners: PROVIDER_RANKING_OWNERS,
    tags: [...PROVIDER_RANKING_TAGS, 'custom', 'eval'],
    stability: PROVIDER_RANKING_STABILITY,
    docId: [docId('docs.tech.provider-ranking.benchmark.run-custom')],
  },
  capability: {
    key: 'provider-ranking.system',
    version: '1.0.0',
  },
  io: {
    input: BenchmarkRunCustomInput,
    output: BenchmarkRunCustomOutput,
  },
  policy: {
    auth: 'user',
    pii: [],
  },
  sideEffects: {
    emits: [
      {
        ref: BenchmarkCustomCompletedEvent.meta,
        when: 'Custom benchmark evaluation finishes execution.',
      },
    ],
  },
});
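
The definition above declares three things a consumer is likely to care about: the command key and version, the auth policy, and the event emitted on completion. The sketch below shows one way to read those fields back out of a spec-shaped object. It is an illustration under assumptions: `CommandSpecLike`, `runCustomSpec`, and `summarizeCommand` are hypothetical local names, not part of the contracts library, and the event key is a placeholder because the source does not show the contents of `BenchmarkCustomCompletedEvent.meta`.

```typescript
// Hypothetical local stand-in for the fields shown in the definition above.
// This is NOT the library's real type; it only mirrors what the page displays.
interface CommandSpecLike {
  meta: { key: string; version: string; tags: string[] };
  policy: { auth: string; pii: string[] };
  sideEffects: { emits: { ref: { key: string }; when: string }[] };
}

// Values copied from the definition above. Shared tags from
// PROVIDER_RANKING_TAGS are omitted because the page does not list them.
const runCustomSpec: CommandSpecLike = {
  meta: {
    key: 'provider-ranking.benchmark.run-custom',
    version: '1.0.0',
    tags: ['custom', 'eval'],
  },
  policy: { auth: 'user', pii: [] },
  sideEffects: {
    emits: [
      {
        // Placeholder key (assumption): the real event key is not shown in the source.
        ref: { key: 'hypothetical.custom-benchmark.completed' },
        when: 'Custom benchmark evaluation finishes execution.',
      },
    ],
  },
};

// Build a one-line summary: key@version, required auth level, and emitted event keys.
function summarizeCommand(spec: CommandSpecLike): string {
  const events = spec.sideEffects.emits.map((e) => e.ref.key).join(', ');
  return `${spec.meta.key}@${spec.meta.version} auth=${spec.policy.auth} emits=[${events}]`;
}

console.log(summarizeCommand(runCustomSpec));
```

A summary like this can be handy when listing commands for review, since it surfaces the auth requirement and side effects without opening each source file.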
