Best for
- Teams testing prompts, agents, and RAG systems
- Developers adding AI evaluations to CI/CD
- Builders doing red-team or vulnerability checks on AI workflows
promptfoo
Open-source tool for testing prompts, agents, RAG systems, and AI security behavior.
npx promptfoo@latest init
What is promptfoo?
promptfoo is an MIT-licensed testing and red-teaming tool for prompts, agents, RAG pipelines, and AI application behavior, with declarative configs and CI/CD-friendly workflows.
Automation
promptfoo surfaces automation as a core capability in its published project metadata and source links.
This gives readers a starting point for evaluating whether the project fits their workflow before visiting the source repository or docs.
Workflow
promptfoo surfaces workflow as a core capability in its published project metadata and source links.
RAG
promptfoo surfaces RAG as a core capability in its published project metadata and source links.
One command to start
npx promptfoo@latest init
What teams use it for
Tags & capabilities
How it stacks up
When to choose promptfoo
Compare it with nearby tools by looking at hosting model, integration surface, license, and whether the official docs show the workflow you need.
Questions
What should I check before using promptfoo?
Add one regression case to a real prompt or RAG workflow, then verify the result can run again in CI or review.
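As a rough illustration of what "one regression case" can look like, here is a minimal Python sketch. It is not promptfoo's API (promptfoo uses its own declarative configs); `RegressionCase`, `run_case`, and `fake_model` are hypothetical names, and the model call is stubbed so the example runs offline.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class RegressionCase:
    """One pinned prompt/expectation pair (hypothetical structure)."""
    name: str
    prompt: str
    must_contain: str


def run_case(generate: Callable[[str], str], case: RegressionCase) -> bool:
    """Return True when the output still contains the expected text."""
    output = generate(case.prompt)
    return case.must_contain.lower() in output.lower()


def fake_model(prompt: str) -> str:
    """Stand-in for a real model or RAG call, so the sketch runs offline."""
    return "Refunds are accepted within 30 days of purchase."


case = RegressionCase(
    name="refund-window",
    prompt="How long do customers have to request a refund?",
    must_contain="30 days",
)

# Run the same case twice: a stable verdict is the property to confirm
# before wiring the case into CI or review.
first = run_case(fake_model, case)
second = run_case(fake_model, case)
print(first, second)
```

With a real model the two runs may disagree, which is exactly the kind of instability worth finding before the case gates a merge.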
Is promptfoo open source?
promptfoo is listed on OpenAgent.bot with an MIT license, based on the current resource metadata. Re-check the official repository, docs, and license before production use.
Should you use promptfoo?
Less suited for
- Teams that need only production tracing
- Users who want a benchmark score without writing test cases
- Verified 2026-06-02
- License: MIT
- Repo: promptfoo/promptfoo
- Open-source signal
self hosted, cloud
memory
No extra signals recorded
Structured decision data for promptfoo
This packet is the compact machine-readable view agents should use before following source links or taking action.
- Capabilities: automation, workflow, rag
- Source model: open source
- Deployment: self hosted, cloud
- memory
- Stack fit: Evaluation and observability, Reusable skill workflow
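One way an agent might hold these values in memory is sketched below. The field names are assumptions made for illustration (the page does not publish a schema), and only the values are taken from the profile above. `supports_deployment` is a hypothetical helper.

```python
# Field names are hypothetical; values are copied from the profile above.
decision_packet = {
    "name": "promptfoo",
    "capabilities": ["automation", "workflow", "rag"],
    "source_model": "open source",
    "deployment": ["self hosted", "cloud"],
    # The page lists "memory" without a label, so it is kept unlabeled here.
    "unlabeled": ["memory"],
    "stack_fit": ["Evaluation and observability", "Reusable skill workflow"],
}


def supports_deployment(packet: dict, mode: str) -> bool:
    """Case-insensitive check against the recorded deployment modes."""
    recorded = [m.lower() for m in packet.get("deployment", [])]
    return mode.lower() in recorded


print(supports_deployment(decision_packet, "cloud"))
```

A lookup like this only reflects the directory's metadata; the official repository remains the authority on what is actually supported.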
What promptfoo does
What it is
promptfoo is listed on OpenAgent.bot as a tools resource for open AI builders.
Why it matters
Agent teams need repeatable tests before shipping changes. promptfoo gives builders a practical way to compare prompts, models, providers, and safety behavior without relying only on manual review.
How to evaluate it
Start from the official source links, then validate the project against your deployment needs, license requirements, and maintenance expectations.
Known metadata and operating surface
These fields are separated from editorial interpretation so agents can reason over facts and missing checks.
Where promptfoo fits in an agent stack
Evaluation and observability
promptfoo has multiple signals for evaluation and observability, including matching tags, capabilities, category, or positioning.
- Add one repeatable test case and confirm results can run again in review or CI.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
Reusable skill workflow
promptfoo has multiple signals for reusable skill workflow, including matching tags, capabilities, category, or positioning.
- Run one skill end to end and check whether it produces evidence or structured output.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
Browser automation
promptfoo has at least one signal for browser automation, but should be checked against a real task before adoption.
- Run one non-sensitive website task and inspect clicks, waits, retries, and changed URLs.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
Coding agent workflow
promptfoo has at least one signal for coding agent workflow, but should be checked against a real task before adoption.
- Run a small repository change and inspect the diff, tests, and rollback path.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
Local or private AI stack
promptfoo has at least one signal for local or private AI stack, but should be checked against a real task before adoption.
- Verify hardware requirements, data path, storage, and whether all calls stay in your environment.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
Memory or RAG workflow
promptfoo has at least one signal for memory or RAG workflow, but should be checked against a real task before adoption.
- Create, update, retrieve, correct, and delete memory or retrieval objects with real data.
- Confirm official docs, current maintenance, license, and runtime constraints before production use.
What an agent should inspect
Likely inputs
- Repositories, files, issues, terminal output, and test results
- Documents, user facts, entities, context, or retrieval queries
- Official setup instructions and a small real workflow
Likely outputs
- Diffs, commits, explanations, test results, or review notes
- Retrieved context, memory updates, graph relations, or citations
- Scores, traces, regression results, dashboards, or failure cases
- A decision on whether this resource fits the target workflow
Sources, claims, and missing checks
Claims are marked separately from source links so future crawlers and reviewers can update them without rewriting the page.
Sources
- Repository: source for code, license, issues, releases, and implementation details.
- Homepage: official or project-controlled source for this resource profile.
Claims
- promptfoo is listed as open source.
  License metadata: MIT
- promptfoo has a recorded GitHub repository: promptfoo/promptfoo.
  Resource facts and GitHub source link.
- promptfoo supports these recorded deployment modes: self hosted, cloud.
  OpenAgent decision signal metadata.
- promptfoo is tagged with automation, workflow, rag capabilities.
  OpenAgent capability taxonomy.
Missing checks
- Dedicated docs link is missing.
- Repository freshness has not been recorded.
How to start evaluating promptfoo
Inspect repository
Check license, recent activity, issues, examples, and security-sensitive code paths.
Open Homepage
Start from the official source before adopting third-party instructions.
Install or run
Run only after checking the official source and local environment assumptions.
npx promptfoo@latest init
Alternatives and nearby resources
Use related resources to compare category fit, license, deployment model, and first-workflow behavior.
Common questions about promptfoo
What is promptfoo used for?
promptfoo is used to test and red-team prompts, agents, and RAG pipelines. The most relevant recorded capabilities are automation, workflow, and RAG.
Is promptfoo open source?
promptfoo is listed as open source with MIT license metadata. Re-check the official repository or source link before production use.
Can agents use promptfoo directly?
promptfoo has recorded interfaces such as the repository and docs. Agents should prefer the JSON or Markdown profile first, then follow official docs for real execution.
What should I check before production use?
Check source confidence (high), risk level (low), license, maintenance freshness, permission surface, required credentials, and whether the first workflow succeeds in a sandbox.
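The checklist above could be tracked as a small structure in which anything not yet verified stays empty. This is a sketch with hypothetical field names: the recorded values (high, low, MIT) come from this page, and every other item is left as `None` until someone checks it.

```python
# Hypothetical field names; None means "not yet verified".
pre_production = {
    "source_confidence": "high",      # recorded on this page
    "risk_level": "low",              # recorded on this page
    "license": "MIT",                 # recorded; re-check in the repository
    "maintenance_freshness": None,    # listed as not recorded
    "permission_surface": None,
    "required_credentials": None,
    "sandbox_workflow_passed": None,
}


def open_checks(checks: dict) -> list:
    """Return the names of items that still need verification."""
    return [name for name, value in checks.items() if value is None]


print(open_checks(pre_production))
```

An empty list from `open_checks` would mean every item has a recorded answer, not that the answers are good; the values still need review.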