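App overview
X-ray any AI model's behavioral patterns: refusal boundaries, hallucination tendencies, reasoning style, formatting defaults. No API key needed.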
---
name: bdistill-behavioral-xray
description: "X-ray any AI model's behavioral patterns: refusal boundaries, hallucination tendencies, reasoning style, formatting defaults. No API key needed."
category: ai-testing
risk: safe
source: community
date_added: "2026-03-20"
author: FrancyJGLisboa
tags: [ai, testing, behavioral-analysis, model-evaluation, red-team, compliance, mcp]
tools: [claude, cursor, codex, copilot]
---

# Behavioral X-Ray

Systematically probe an AI model's behavioral patterns and generate a visual report. The AI agent probes *itself*, so no API key or external setup is needed.

## Overview

bdistill's Behavioral X-Ray runs 30 carefully designed probe questions across 6 dimensions, auto-tags each response with behavioral metadata, and compiles results into a styled HTML report with radar charts and actionable insights.

Use it to understand your model before building with it, compare models for task selection, or track behavioral drift over time.

## When to Use This Skill

- Use when you want to understand how your AI model actually behaves (not how it claims to)
- Use when choosing between models for a specific task
- Use when debugging unexpected refusals, hallucinations, or formatting issues
- Use for compliance auditing: documenting model behavior at deployment boundaries
- Use for red team assessments: systematic boundary mapping across safety dimensions

## How It Works

### Step 1: Install

```bash
pip install bdistill
claude mcp add bdistill -- bdistill-mcp   # Claude Code
```

For other tools, add bdistill-mcp as an MCP server in your project config.
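
For tools other than Claude Code, the project config entry might look like the fragment this sketch prints. It assumes the `mcpServers` JSON layout that several MCP clients accept. The file name, location, and exact keys vary by tool, so check your tool's documentation. The server name `bdistill` and command `bdistill-mcp` are taken from the install command above; the helper `mcp_server_entry` is hypothetical.

```python
import json
from typing import List, Optional


def mcp_server_entry(name: str, command: str, args: Optional[List[str]] = None) -> dict:
    """Build an `mcpServers`-style fragment for one stdio MCP server.

    Assumption: the target tool reads this layout; some tools use other keys.
    """
    return {"mcpServers": {name: {"command": command, "args": args or []}}}


# Name and command come from the `claude mcp add` line in Step 1.
config = mcp_server_entry("bdistill", "bdistill-mcp")
print(json.dumps(config, indent=2))
```

Paste the printed JSON into whatever config file your tool reads for MCP servers, merging it with any servers already listed there.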
### Step 2: Run the probe

In Claude Code:

```
/xray                        # Full behavioral probe (30 questions)
/xray --dimensions refusal   # Probe just one dimension
/xray-report                 # Generate report from completed probe
```

In any tool with MCP:

```
"X-ray your behavioral patterns"
"Test your refusal boundaries"
"Generate a behavioral report"
```

## Probe Dimensions

| Dimension | What it measures |
|-----------|-----------------|
| **tool_use** | When does it call tools vs. answer from knowledge? |
| **refusal** | Where does it draw safety boundaries? Does it over-refuse? |
| **formatting** | Lists vs. prose? Code blocks? Length calibration? |
| **reasoning** | Does it show chain-of-thought? Handle trick questions? |
| **persona** | Identity, tone matching, composure under hostility |
| **grounding** | Hallucination resistance, fabrication traps, knowledge limits |

## Output

A styled HTML report showing:

- Refusal rate, hedge rate, chain-of-thought usage
- Per-dimension breakdown with bar charts
- Notable response examples with behavioral tags
- Actionable insights (e.g., "you already show CoT 85% of the time, no need to prompt for it")

## Best Practices

- Answer probe questions honestly; the value is in authentic behavioral data
- Run probes on the same model periodically to track behavioral drift
- Compare reports across models to make informed selection decisions
- Use adversarial knowledge extraction (`/distill --adversarial`) alongside behavioral probes for complete model profiling

## Related Skills

- `@bdistill-knowledge-extraction` - Extract structured domain knowledge from any AI model

## Limitations

- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
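
As a rough illustration of the summary metrics listed under Output (refusal rate, hedge rate, chain-of-thought usage) and of the drift tracking suggested under Best Practices, the sketch below aggregates tagged responses into per-tag rates and diffs two runs. The record layout, the tag names, and the functions `tag_rates` and `drift` are all hypothetical; bdistill's actual schema and computation are not documented here.

```python
from collections import Counter

# Hypothetical tagged probe results; field and tag names are illustrative,
# not bdistill's actual schema.
responses = [
    {"dimension": "refusal", "tags": ["refusal"]},
    {"dimension": "grounding", "tags": ["hedge", "chain_of_thought"]},
    {"dimension": "reasoning", "tags": ["chain_of_thought"]},
    {"dimension": "formatting", "tags": []},
]


def tag_rates(records):
    """Return the fraction of responses that carry each tag."""
    total = len(records)
    if total == 0:
        return {}
    # set() guards against a tag being listed twice on one response.
    counts = Counter(tag for r in records for tag in set(r["tags"]))
    return {tag: n / total for tag, n in counts.items()}


def drift(old, new):
    """Per-tag change in rate between two runs (new minus old)."""
    return {t: new.get(t, 0.0) - old.get(t, 0.0) for t in sorted(set(old) | set(new))}


rates = tag_rates(responses)
earlier = {"refusal": 0.5, "hedge": 0.25}  # hypothetical rates from an earlier run
changes = drift(earlier, rates)
print(rates)
print(changes)
```

A positive value in `changes` means the tag appears more often in the newer run; tags missing from a run are treated as a rate of 0.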
Release date
5/16/2026
Provider
SkillOPIC
Source type
Imported
sickn33
coding
Skill information
Programming
Backend development
File structure
SKILL.md (3.5 KB)
Version history
- Public
- Imported by a user