Configure online evaluation, human review routing, and dataset automation rules.

Automated Rules

Access via Settings → Automated Rules in your project.

Automated Rules let you define what happens automatically as traces arrive — scoring them with evaluators, routing them to human review queues, or extracting them into datasets.

Rule Types

The Automated Rules page has three tabs:

Tab	Description
Online Evaluations	Score incoming traces with evaluators as they arrive
Human Review	Route traces to annotation queues for human scoring
Datasets	Automatically extract traces matching criteria into a dataset

Online Evaluation Rules

Online evaluation rules attach evaluators to incoming traces. When a matching trace is ingested, the rule runs the configured evaluator and stores a score.

Creating an Online Evaluation Rule

Go to Settings → Automated Rules → Online Evaluations
Click Create rule
Configure:
- Rule name — identifier shown in the rules list
- Description — optional description
- Evaluator — which evaluator to run (LLM-as-judge, RAGAS metric, code-based)
- Filters — optional trace filters (e.g., only traces tagged production, or with a specific score below a threshold)
- Sampling rate — percentage of matching traces to evaluate (0–100%)
- Concurrency — maximum parallel evaluations
Click Save and activate

Rule Status

Rules can be in three states:

Active — running, evaluating incoming traces
Paused — rule exists but is not processing traces
Inactive — rule has been disabled

Toggle status with the play/pause button in the rules list.

Sampling Rate

Set the sampling rate to evaluate a fraction of traffic. This is useful when:

Evaluation cost is high (e.g., GPT-4o-based judge)
You want statistical coverage rather than full coverage
Traffic volume is very high

A 10% sampling rate means 1 in 10 matching traces will be evaluated.

A token exhaustion health bar shows how much of your evaluation token budget has been consumed. If the budget is exhausted, online evaluations pause automatically until the next billing period or you increase the budget.

Human Review Rules

Human review rules route traces to annotation queues for manual scoring.

Creating a Human Review Rule

Go to Settings → Automated Rules → Human Review
Click Create rule
Configure:
- Rule name and description
- Queue — which annotation queue to route traces into
- Filters — trace filters (score thresholds, tags, metadata)
- Sampling rate — percentage of matching traces to route
Click Save and activate

Human review rules are useful for:

Routing low-scoring traces (flagged by online evaluators) to human review
Randomly sampling a percentage of production traffic for quality audits
Building high-quality labeled datasets from production traffic

Dataset Automation Rules

Dataset rules automatically extract traces that match specified criteria into a named dataset.

Creating a Dataset Rule

Go to Settings → Automated Rules → Datasets
Click Create rule
Configure:
- Rule name and description
- Target dataset — which dataset to add matching traces to (created automatically if it doesn't exist)
- Filters — trace filters (score ranges, tags, user ID, session ID, metadata fields)
- Sampling rate — percentage of matching traces to include
Click Save and activate

Dataset rules are useful for:

Continuously building a dataset of production failures (low-score traces)
Sampling a representative slice of production traffic for regression testing
Capturing traces for specific users or sessions

Permissions

Action	Required Role
View rules	MEMBER and above
Create / edit / delete rules	MEMBER and above
Pause / resume rules	MEMBER and above

Automated Rules

Automated Rules

Rule Types

Online Evaluation Rules

Creating an Online Evaluation Rule

Rule Status

Sampling Rate

Token Exhaustion

Human Review Rules

Creating a Human Review Rule

Dataset Automation Rules

Creating a Dataset Rule

Permissions

On this page