OSS-first docs

These docs teach the open system first: contracts, generated surfaces, runtimes, governance, and incremental adoption. Studio shows up as the operating layer on top, not as the source of truth.

AI index

provider-ranking.benchmark.run-custom

Launch a custom benchmark evaluation against a specific model.

Type: operation (command)

Version: 1.0.0

Tags: custom, eval

File: packages/libs/contracts-spec/src/provider-ranking/commands/benchmarkRunCustom.command.ts

field.key.label: provider-ranking.benchmark.run-custom
field.version.label: 1.0.0
field.type.label: operation (command)
field.title.label: provider-ranking.benchmark.run-custom
field.description.label: Launch a custom benchmark evaluation against a specific model.
Type: operation (command)
Version: 1.0.0
Tags: custom, eval
File: packages/libs/contracts-spec/src/provider-ranking/commands/benchmarkRunCustom.command.ts
field.tags.label: custom,eval
field.owners.label
field.stability.label

Launch a custom benchmark evaluation against a specific model.

Goal

Evaluate model performance using internal eval suites.

Context

Used by operators to run proprietary benchmarks and compare models.

Source Definition

1import { ScalarTypeEnum, SchemaModel } from '@lssm-tech/lib.schema';
2import { docId } from '../../docs/registry';
3import type { DocBlock } from '../../docs/types';
4import { defineCommand } from '../../operations';
5import {
6	PROVIDER_RANKING_DOMAIN,
7	PROVIDER_RANKING_OWNERS,
8	PROVIDER_RANKING_STABILITY,
9	PROVIDER_RANKING_TAGS,
10} from '../constants';
11import { BenchmarkCustomCompletedEvent } from '../events/benchmarkCustomCompleted.event';
12
13export const BenchmarkRunCustomCommand = defineCommand({
14	meta: {
15		key: 'provider-ranking.benchmark.run-custom',
16		title: 'Run Custom Benchmark',
17		version: '1.0.0',
18		description:
19			'Launch a custom benchmark evaluation against a specific model.',
20		goal: 'Evaluate model performance using internal eval suites.',
21		context:
22			'Used by operators to run proprietary benchmarks and compare models.',
23		domain: PROVIDER_RANKING_DOMAIN,
24		owners: PROVIDER_RANKING_OWNERS,
25		tags: [...PROVIDER_RANKING_TAGS, 'custom', 'eval'],
26		stability: PROVIDER_RANKING_STABILITY,
27		docId: [docId('docs.tech.provider-ranking.benchmark.run-custom')],
28	},
29	capability: {
30		key: 'provider-ranking.system',
31		version: '1.0.0',
32	},
33	io: {
34		input: BenchmarkRunCustomInput,
35		output: BenchmarkRunCustomOutput,
36	},
37	policy: {
38		auth: 'user',
39		pii: [],
40	},
41	sideEffects: {
42		emits: [
43			{
44				ref: BenchmarkCustomCompletedEvent.meta,
45				when: 'Custom benchmark evaluation finishes execution.',
46			},
47		],
48	},
49});

Open system docs

provider-ranking.benchmark.run-custom

Goal

Context

Source Definition