orq.ai

用于在 orq.ai 平台上构建、部署、评估与监控 LLM 流水线的 agent skill。

orqllmagentspromptsdeploymentsevaluationmonitoringragexperimentstraces

6 条规则

<skills>

You have additional SKILLs documented in directories containing a "SKILL.md" file.

These skills are:
 - analyze-trace-failures -> "skills/analyze-trace-failures/SKILL.md"
 - build-agent -> "skills/build-agent/SKILL.md"
 - build-evaluator -> "skills/build-evaluator/SKILL.md"
 - compare-agents -> "skills/compare-agents/SKILL.md"
 - generate-synthetic-dataset -> "skills/generate-synthetic-dataset/SKILL.md"
 - invoke-deployment -> "skills/invoke-deployment/SKILL.md"
 - optimize-prompt -> "skills/optimize-prompt/SKILL.md"
 - run-experiment -> "skills/run-experiment/SKILL.md"
 - setup-observability -> "skills/setup-observability/SKILL.md"

IMPORTANT: You MUST read the SKILL.md file whenever the description of the skills matches the user intent, or may help accomplish their task.

<available_skills>

build-agent: `Design, create, and configure an orq.ai Agent with tools, instructions, knowledge bases, and memory — includes model selection, KB management, and memory store setup`

build-evaluator: `Create validated LLM-as-a-Judge evaluators following best practices — binary Pass/Fail judges with TPR/TNR validation for measuring specific failure modes`

analyze-trace-failures: `Read production traces, identify what's failing, build failure taxonomies, and categorize issues using open coding and axial coding methodology`

invoke-deployment: `Invoke orq.ai deployments, agents, and models via the Python SDK or HTTP API — pass prompt variables, stream responses, handle multi-turn agent conversations, and generate integration code`

run-experiment: `Create and run orq.ai experiments — compare configurations against datasets using evaluators, analyze results, with specialized methodology for agent, conversation, and RAG evaluation`

generate-synthetic-dataset: `Generate and curate evaluation datasets — structured generation, quick from description, expansion from existing data, plus dataset maintenance and quality improvement`

optimize-prompt: `Analyze and optimize system prompts using a structured prompting guidelines framework — AI-powered analysis and rewriting`

compare-agents: `Run cross-framework agent comparisons using evaluatorq — compares any combination of agents (orq.ai, LangGraph, CrewAI, OpenAI Agents SDK, Vercel AI SDK) head-to-head on the same dataset with LLM-as-a-judge scoring. Do NOT use when comparing only orq.ai configurations with no external agents (use run-experiment instead).`

setup-observability: `Set up orq.ai observability for LLM applications — AI Router proxy, OpenTelemetry, tracing setup, and trace enrichment. Use when setting up tracing, adding the AI Router proxy, integrating OpenTelemetry, auditing existing instrumentation, or enriching traces with metadata. Do NOT use when traces already exist and you need to debug failures (use analyze-trace-failures).`

</available_skills>

Paths referenced within SKILL folders are relative to that SKILL. For example the build-evaluator `resources/judge-prompt-template.md` would be referenced as `skills/build-evaluator/resources/judge-prompt-template.md`.

</skills>

<commands>

You have slash commands available in the `commands/` directory:

 - quickstart -> "commands/quickstart.md" — Interactive onboarding guide for new users
 - workspace -> "commands/workspace.md" — Show workspace overview (agents, deployments, prompts, datasets, experiments)
 - traces -> "commands/traces.md" — Query and summarize traces with filters (debugging entry point)
 - models -> "commands/models.md" — List available AI models and capabilities
 - analytics -> "commands/analytics.md" — Show workspace analytics (requests, cost, tokens, errors, trends)

Commands are quick-action tools. Skills are multi-step workflows. Use `/orq:quickstart` for onboarding, `/orq:workspace`, `/orq:traces`, `/orq:models`, or `/orq:analytics` for fast operations. Use skills when the user needs guided, multi-step work.

</commands>

Future AGI↓ 1

AI 评估、可观测性与优化平台，可通过自然语言运行评估、管理数据集、检索 trace、优化 prompt 并模拟 AI agent。

Oi↓ 6

在 Cursor 中使用 Oi 发现并复用团队的上下文与工作流，为每项任务匹配合适的上下文，并在后续工作中保持其有效。

Deep Learning↓ 5.1k

Deep Learning 在 Cursor 中的规则与最佳实践。

prompt-optimizer↓ 133

Prompt Optimizer：六维评分迭代优化 Prompt 与 Rules，自动识别模式并匹配评分标准。

Gitlab MCP Server↓ 11

将 1000 个 GitLab REST API 与 GraphQL 操作封装为 MCP 工具，含基础版 32 个、企业版 47 个元工具，以及 24 个资源、38 个 prompt 等多项能力，供 AI 助手使用。

MCP Bundles↓ 9

托管式 MCP server，提供对 500+ SaaS API 的认证访问，支持 MCP Apps、代码执行与审计日志。

paygent↓ 7

面向 Paygent 的 Cursor 插件，提供 AI agent 的用量计费与成本追踪；捆绑与官方 Python SDK 对齐的规则、skill、agent 预设与命令，帮助开发者一次性正确接入。

god-of-debugger↓ 7

假设驱动的并行调试，生成相互竞争的假设，在并行子 agent 中运行实验，仅当恰好剩一个假设成立时才提交修复。

相关插件