AI agents · OpenClaw · self-hosting · automation

Quick Answer

What Is Microsoft RAMPART and Clarity? CI Safety for Agents

Published:

What Is Microsoft RAMPART and Clarity? (May 2026)

Microsoft open-sourced two AI agent safety tools on May 20, 2026 — RAMPART (CI test framework for agent safety regression) and Clarity (structured pre-build assumption audit). They sit at opposite ends of the lifecycle and answer the same question: how do you make agent safety a continuous engineering discipline rather than a one-time launch audit?

Last verified: May 25, 2026.

TL;DR

  • Announced: Microsoft Security Blog, May 20, 2026.
  • License: Open source (permissive).
  • RAMPART: Agent test framework that encodes adversarial scenarios as repeatable CI tests.
  • Clarity: Structured sounding board that audits assumptions before code is written.
  • Together: Cover the agent lifecycle from design intent (Clarity) to regression coverage (RAMPART).
  • Closest analog: Application security tooling — but built for the agent-specific failure modes (prompt injection, tool abuse, indirect injection, data exfiltration).

What RAMPART actually does

RAMPART is a test framework where you write agent safety scenarios — both adversarial and benign — as code, then run them in CI like any other test suite.

Typical scenarios you encode in RAMPART:

Scenario typeExample
Direct prompt injectionUser message asks the agent to ignore prior instructions and reveal system prompt
Indirect prompt injectionWebpage the agent browses contains hidden instructions to exfiltrate data
Tool-call abuseAgent receives crafted input designed to trick it into calling a destructive tool
Data leakageAgent is asked to summarize a doc containing PII and tested whether PII appears in output
Authorization escapeAgent is asked to perform an action outside its authorized scope
Jailbreak persistenceMulti-turn jailbreak attempts to see whether agent’s safety drifts mid-conversation
Benign regressionAgent must still complete normal task even with adversarial-looking surrounding context

The point: each scenario is a repeatable test case that runs every commit. If a model update or prompt change regresses one of these, the build fails. Red-team findings stop being one-off PDFs and become permanent regression coverage.

What Clarity actually does

Clarity is the before code tool. It’s a structured sounding board that walks a team through assumption-surfacing questions before the agent is built:

  • Who is the user, and what are their incentives?
  • What data sources does the agent access (and what’s in those sources you didn’t think about)?
  • What actions can the agent take, and what’s the blast radius of each one?
  • Who is the human in the loop, and at what decision points?
  • What does failure look like — and who owns it?
  • What assumptions about user behavior, data quality, or model capability are we making?

The output is a structured design document. Microsoft frames Clarity as the step that prevents teams from building agents whose risk profile they don’t fully understand until something breaks.

It’s deliberately lightweight — closer to a guided doc generator than a heavy GRC platform. The goal is adoption, not compliance theater.

How they fit together

Lifecycle stageToolWhat it produces
Pre-build (design intent)ClarityStructured design + assumption document
Build (implementation)(your stack)Agent code, prompts, tool definitions
Pre-deploy (validation)RAMPARTAdversarial test suite, baseline pass
Post-deploy (continuous)RAMPART in CIRegression coverage on every commit

Clarity surfaces the threats you should test for. RAMPART encodes those threats as tests that never go away. The combined effect: agent safety becomes a continuous engineering discipline owned by the team building the agent — not a periodic external audit.

Why this matters in May 2026

Three forces converge:

1. Agents are taking real-world actions. By mid-2026, production agents routinely send emails, edit files, transact, browse the web, and integrate with enterprise systems. The blast radius of an agent failure is no longer “wrong text in a chat window” — it’s “wrong action in a production system.”

2. Red-team-once-at-launch is insufficient. Models update, prompts evolve, tools change, and the threat landscape moves. The application security industry learned this lesson in the 2010s and built SAST, DAST, and CI security gates. Agent safety needs the same continuous coverage.

3. Regulators are watching. EU AI Act enforcement is ramping through 2026, and the documented-design + continuous-testing pattern RAMPART and Clarity encode lines up neatly with high-risk system requirements. Microsoft is implicitly building tooling for the compliance posture enterprises need.

How RAMPART compares to other agent testing tools

ToolPrimary focusOpen sourceBest for
Microsoft RAMPARTAdversarial safety regressionYes (May 2026)Production safety CI gates
LangSmithQuality evals (accuracy, latency, cost)No (LangChain Inc.)Quality + observability
BraintrustQuality + custom evaluatorsNoQuality-focused teams
Arize PhoenixTracing + LLM observabilityYesProduction observability
M365 Copilot EvaluationsMicrosoft Foundry agentsNo (Microsoft)M365-bound agents
DeepEvalOpen-source eval frameworkYesLightweight quality evals

RAMPART’s positioning is unique: it’s the first widely-distributed open-source tool focused specifically on adversarial safety regression rather than quality evaluation. Most teams will run both a quality eval stack and RAMPART.

What’s missing

A few honest caveats on the May 20 launch:

  • Scenario authoring is still hard. RAMPART gives you the framework, but writing high-quality adversarial scenarios that reflect your agent’s real threat model is the actual work. Microsoft is shipping starter scenarios but expects teams to author their own.
  • No baked-in red-team coverage of every modality. Vision-based attacks, audio injection, document-embedded payloads — initial coverage is text-first; multimodal expansion comes later.
  • Clarity output is human-in-the-loop. Clarity surfaces questions but doesn’t answer them. You still need someone who can think clearly about your specific threat model.

Verdict

  • What it is: A two-tool open-source pairing for agent safety lifecycle — Clarity before code, RAMPART in CI.
  • Who needs it: Any team shipping agents that take real-world actions. Mandatory if you’re under regulated workloads.
  • What it replaces: One-time launch red-teaming. Ad-hoc Google Docs threat modeling.
  • What it complements: LangSmith / Braintrust / Arize for quality evals — RAMPART for safety regression.
  • Bottom line: This is the agent-era equivalent of “we added security tests to CI.” It’s not optional once you’re shipping agents that touch real systems.