eval-report-workflow

github.com/UKGovernmentBEIS/inspect_evals

Verdict: Generally safe

0 critical · 1 high · 1 medium

SCORE 75 / 100

$ skillox install eval-report-workflow (coming soon)

Why grade B?

score · 75 / 100

The current grade reflects 1 high-severity finding (any HIGH → B).

0 CRIT · 1 HIGH · 1 MED · 0 LOW

To reach a higher grade

Reach A (target score 95): resolve the 1 HIGH finding.

Thresholds are documented at /docs/grading. Source-of-truth is the grade() function in @skillox/scanner.
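The two rules this report states ("any HIGH → B" and a target score of 95 for A) can be written down as a small sketch. This is not the `grade()` function from @skillox/scanner, whose thresholds are not reproduced on this page. The function name, its parameters, and the choice to return `None` whenever the stated rules do not decide the grade are all assumptions made for illustration.

```python
from typing import Optional

# "target score 95" is taken from this report; every other threshold is unknown here.
A_TARGET_SCORE = 95


def grade_from_stated_rules(score: int, critical: int, high: int) -> Optional[str]:
    """Hypothetical helper: apply only the rules quoted in this report.

    Returns None when those rules do not determine a grade
    (for example, when critical findings are present).
    """
    if critical > 0:
        return None  # handling of critical findings is not described on this page
    if high > 0:
        return "B"  # "any HIGH → B"
    if score >= A_TARGET_SCORE:
        return "A"  # no HIGH findings and the target score is reached
    return None  # grades between these cases are not described on this page


# The scan shown on this page: score 75, 0 critical, 1 high.
print(grade_from_stated_rules(75, critical=0, high=1))
```

Returning `None` instead of guessing keeps the sketch limited to what the page actually says; the documented thresholds at /docs/grading remain the reference.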

Latest scan findings

Scan crawl-ad611g27iu7umexa3guv2uqq · Thu, 28 May 2026 15:25:59 GMT · 1ms

high

Dangerous shell pattern: eval backtick

The skill contains a shell command pattern (`eval backtick`) commonly used in destructive or supply-chain attacks.

rule: dangerous-shell · line: 8 · CWE-78

6   # Make an Evaluation Report

8   This workflow drives [`tools/evaluation_report.py`](../../../tools/evaluation_report.py), which reads a per-eval `report_config.yaml` and produces a full reproducible `report.md` (results table, reference comparison, per-category breakdowns, token totals, approximate cost) plus header-only JSON copies of the input logs under `results/`. The `report_config.yaml`, regenerated `report.md`, and `results/` folder are committed alongside the eval's `eval.yaml`.
    ← eval backtick: common in destructive or supply-chain attacks

10  ## Report Formatting
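To see where a pattern-based rule of this kind could match on line 8, here is a minimal sketch. The regular expression is a hypothetical stand-in (the word `eval`, optional whitespace, then a backtick); the actual `dangerous-shell` rule is not shown in this report and may differ. The excerpt is a shortened copy of line 8 from the snippet above.

```python
import re

# Hypothetical pattern, not the scanner's real rule:
# "eval", optional whitespace, then a backtick.
EVAL_BACKTICK = re.compile(r"eval\s*`")

# Shortened excerpt of line 8 of the scanned skill file.
excerpt = (
    "which reads a per-eval `report_config.yaml` "
    "and produces a full reproducible `report.md`"
)

match = EVAL_BACKTICK.search(excerpt)
if match:
    # Show the matched text with a few characters of context on each side.
    start = max(match.start() - 4, 0)
    print(repr(excerpt[start:match.end() + 19]))
```

Under this assumed pattern, the match falls on the prose phrase "per-eval" followed by an inline-code backtick, not on a shell command, which is worth checking when reviewing the finding.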

med

No capability manifest declared

The skill ships without a `manifest.yaml` or `capabilities` block in its frontmatter. Without a manifest, the runtime cannot enforce what this skill is permitted to do.

rule: no-manifest
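A check of this kind can be sketched as follows. It assumes the skill file opens with YAML frontmatter delimited by `---` lines and that the block is introduced by a top-level `capabilities:` key. The function name is hypothetical, the schema of the block is not described on this page, and the sketch does not look for a separate `manifest.yaml`.

```python
def has_capabilities_block(skill_text: str) -> bool:
    """Hypothetical check: is there a top-level `capabilities:` key in the frontmatter?"""
    lines = skill_text.splitlines()
    # Frontmatter must start on the very first line with a `---` delimiter.
    if not lines or lines[0].strip() != "---":
        return False
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of frontmatter reached without finding the key
        if line.startswith("capabilities:"):
            return True
    return False


without_block = "---\nname: eval-report-workflow\n---\n# Make an Evaluation Report\n"
with_block = "---\nname: eval-report-workflow\ncapabilities: {}\n---\n# Body\n"

print(has_capabilities_block(without_block))
print(has_capabilities_block(with_block))
```

A plain line scan avoids a YAML dependency in the sketch; a real check would more likely parse the frontmatter properly.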

skillox.io/c/eval-report-workflow

Skill facts

Latest grade: B · 75
Total scans: 1
Latest version: —
Last scanned: 2026-05-28
Source: UKGovernmentBEIS/inspect_evals

AIBOM

What this skill accesses

No manifest

Declared (from capabilities)

No capabilities block in frontmatter — the skill doesn't state what it should access.

Observed (from scan findings)

Subprocess invocations

This workflow drives [`tools/evaluation_report.py`](../../../tools/evaluation_re

AIBOM (Application Skills Bill of Materials) — see /docs/concepts/aibom for the format.

Version history

Only this scan

No other scans on record for this skill name. New scans appear here as the catalog re-crawls or creators request a re-scan.

Embed badge

Show this grade in your README.

Tracks the latest grade automatically. Updates within five minutes of every re-scan.

Markdown

[![SkillOx grade](https://api.skillox.io/badge/eval-report-workflow.svg)](https://skillox.io/c/eval-report-workflow)

Markdown · score

[![SkillOx score](https://api.skillox.io/badge/score/eval-report-workflow.svg)](https://skillox.io/c/eval-report-workflow)

HTML

<a href="https://skillox.io/c/eval-report-workflow"><img src="https://api.skillox.io/badge/eval-report-workflow.svg" alt="SkillOx grade"/></a>

reST

.. image:: https://api.skillox.io/badge/eval-report-workflow.svg
   :target: https://skillox.io/c/eval-report-workflow
   :alt: SkillOx grade
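
The grade-badge snippets above follow one URL pattern, so they can be generated from the skill name. The sketch below assumes the same pattern holds for other skill names, which this page does not confirm; the function name `badge_snippets` is hypothetical.

```python
from typing import Dict


def badge_snippets(slug: str) -> Dict[str, str]:
    """Build grade-badge embed snippets for a skill name (hypothetical helper)."""
    page = f"https://skillox.io/c/{slug}"
    badge = f"https://api.skillox.io/badge/{slug}.svg"
    return {
        "markdown": f"[![SkillOx grade]({badge})]({page})",
        "html": f'<a href="{page}"><img src="{badge}" alt="SkillOx grade"/></a>',
        "rest": f".. image:: {badge}\n   :target: {page}\n   :alt: SkillOx grade",
    }


snippets = badge_snippets("eval-report-workflow")
print(snippets["rest"])
```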

Coming soon

Creator verification badge, expert-review status, and Cosign-signed release attestation appear here as those layers ship. AIBOM + version history are live; the others follow once the creator portal completes its identity-proof + signing surface.