PromptFoo
Eliminate risk with AI red-teaming and evals used by 75,000+ developers. Find and fix vulnerabilities, maximize output quality, catch regressions.
Promptfoo
Docs
- Api Reference
- Tags
- Changelog
- Custom
- Features
- Multi Turn
- Red Team
- Releases
- Strategies
- Updates
- Configuration
- Integrations
- Usage
- Code Scanning
- Cli
- Github Action
- Vscode Extension
- Caching
- Chat
- Datasets
- Expected Outputs
- Classifier
- Deterministic
- Guardrails
- Javascript
- Model Graded
- Agent Rubric
- Answer Relevance
- Context Faithfulness
- Context Recall
- Context Relevance
- Conversation Relevance
- Factuality
- G Eval
- Llm Rubric
- Max Score
- Model Graded Closedqa
- Pi
- Search Rubric
- Select Best
- Moderation
- Python
- Ruby
- Similar
- Guide
- Huggingface Datasets
- Modular Configs
- Outputs
- Parameters
- Prompts
- Rate Limits
- Reference
- Scenarios
- Telemetry
- Test Cases
- Testing Llm Chains
- Tools
- Contributing
- Enterprise
- Audit Logging
- Authentication
- Findings
- Guardrails
- Red Teams
- Remediation Reports
- Service Accounts
- Teams
- Webhooks
- Faq
- Getting Started
- Guides
- Azure Vs Openai
- Censored Vs Uncensored Ollama
- Chatbase Redteam
- Choosing Best Gpt Model
- Compare Open Source Models
- Deepseek Benchmark
- Evaling With Harmbench
- Evaluate Coding Agents
- Evaluate Crewai
- Evaluate Elevenlabs
- Evaluate Google Adk
- Evaluate Json
- Evaluate Langgraph
- Evaluate Llm Temperature
- Evaluate Openai Agents Python
- Evaluate Openai Assistants
- Evaluate Osworld With Inspect
- Evaluate Rag
- Factuality Eval
- Google Cloud Model Armor
- Gpt Mmlu Comparison
- Gpt Vs Claude Vs Gemini
- Gpt Vs Reasoning Model
- Hle Benchmark
- Langchain Prompttemplate
- Llm As A Judge
- Llm Redteaming
- Mixtral Vs Gpt
- Multimodal Red Team
- Prevent Llm Hallucinations
- Qwen Benchmark
- Sandboxed Code Evals
- Test Agent Skills
- Testing Guardrails
- Text To Sql Evaluation
- Installation
- Agent Skill
- Aws Codecommit
- Azure Pipelines
- Bitbucket Pipelines
- Burp
- Ci Cd
- Circle Ci
- Github Action
- Gitlab Ci
- Google Sheets
- Helicone
- Jenkins
- Jest
- Langfuse
- Looper
- Mcp Server
- Mcp
- Mocha Chai
- N8n
- Portkey
- Python
- Sharepoint
- Sonarqube
- Splunk
- Travis Ci
- Intro
- Model Audit
- Ci Cd
- Scanners
- Usage
- Providers
- A2a
- Abliteration
- Ai21
- Aimlapi
- Alibaba
- Anthropic
- Atlascloud
- Aws Bedrock
- Azure
- Bedrock Agents
- Browser
- Cerebras
- Claude Agent Sdk
- Cloudera
- Cloudflare Ai
- Cloudflare Gateway
- Cohere
- Cometapi
- Custom Api
- Custom Script
- Databricks
- Deepseek
- Docker
- Echo
- Elevenlabs
- Envoy
- F5
- Fal
- Fireworks
- Github
- Go
- Groq
- Helicone
- Http
- Huggingface
- Hyperbolic
- Ibm Bam
- Jfrog
- Litellm
- Llama.cpp
- LlamaApi
- Llamafile
- Localai
- Manual Input
- Mcp
- Minimax
- Mistral
- Mlflow Gateway
- Modelslab
- N8n
- Novita
- Nscale
- Nvidia
- Ollama
- Openai Agents
- Openai Chatkit
- Openai Codex App Server
- Openai Codex Sdk
- Openai
- Openclaw
- Opencode Sdk
- Openllm
- Openrouter
- Orcarouter
- Perplexity
- Python
- Quiverai
- Replicate
- Ruby
- Sagemaker
- Sequence
- Simulated User
- Slack
- Snowflake
- Text Generation Webui
- Togetherai
- Transformers
- Truefoundry
- Vercel
- Vertex
- Vllm
- Voyage
- Watsonx
- Webhook
- Websocket
- Xai
- Red Team
- Agents
- Architecture
- Coding Agents
- Configuration
- Discovery
- Dod Ai Ethics
- Eu Ai Act
- Foundation Models
- Gdpr
- Guides
- Iso 42001
- Llm Supply Chain
- Llm Vulnerability Types
- Mcp Security Testing
- Mitre Atlas
- Model Drift
- Multi Input
- Nist Ai Rmf
- Owasp Agentic Ai
- Owasp Api Top 10
- Owasp Llm Top 10
- Plugins
- Aegis
- Age Bias
- Ascii Smuggling
- Beavertails
- Bfla
- Bias
- Bola
- Coding Agent
- Competitors
- Context Compliance Attack
- Contracts
- Coppa
- Cross Session Leak
- Custom
- Cyberseceval
- Data Exfil
- Debug Access
- Disability Bias
- Divergent Repetition
- Donotanswer
- Ecommerce
- Excessive Agency
- Ferpa
- Financial
- Gender Bias
- Goal Misalignment
- Hallucination
- Harmbench
- Harmful
- Hijacking
- Imitation
- Indirect Prompt Injection
- Insurance
- Intent
- Malicious Code
- Mcp
- Medical
- Memory Poisoning
- Model Identification
- Off Topic
- Overreliance
- Pharmacy
- Pii
- Pliny
- Policy
- Politics
- Prompt Extraction
- Race Bias
- Rag Document Exfiltration
- Rag Poisoning
- Rag Source Attribution
- Rbac
- Realestate
- Reasoning Dos
- Religion
- Shell Injection
- Special Token Injection
- Sql Injection
- Ssrf
- System Prompt Override
- Teen Safety
- Telecom
- Tool Discovery
- Toxic Chat
- Unsafebench
- Unverifiable Claims
- Vlguard
- Vlsu
- Wordplay
- Xstest
- Quickstart
- Rag
- Risk Scoring
- Strategies
- Audio
- Authoritative Markup Injection
- Base64
- Basic
- Best Of N
- Citation
- Composite Jailbreaks
- Custom Strategy
- Custom
- Gcg
- Goat
- Hex
- Homoglyph
- Hydra
- Image
- Indirect Web Pwn
- Iterative
- Jailbreak Templates
- Layer
- Leetspeak
- Likert
- Math Prompt
- Meta
- Mischievous User
- Multi Turn
- Other Encodings
- Prompt Injection
- Retry
- Rot13
- Tree
- Video
- Attack Generation
- Best Practices
- Connecting To Targets
- Data Handling
- False Positives
- Grading Results
- Inference Limit
- Linking Targets
- Multi Turn Sessions
- Multiple Response Types
- Overview
- Remote Generation
- Releases
- Tracing
- Command Line
- Node Api Examples
- Node Api Quick Reference
- Node Api Reference
- Node Package
- Prompt Optimization
- Self Hosting
- Sharing
- Troubleshooting
- Web Ui
- Write For Promptfoo
Meet the modern standard for public facing documentation. Beautiful out of the box, easy to maintain, and optimized for user engagement.
Search through billions of items for similar matches to any object, in milliseconds. It’s the next generation of search, an API call away.
Build and deploy reliable background jobs with no timeouts and no infrastructure to manage.
Get the simple developer experience of SQLite in production, and scale your multi-tenant backend with unlimited databases.
Upstash is a serverless data platform providing low latency and high scalability for real-time applications.
One-click deployments built for teams, tuned for Laravel, loaded with tools and goodies you're going to love.