The official WordPress AI benchmark. Evaluate how well language models understand WordPress development, from core APIs and coding standards to plugin architecture and security best practices.
WP-Bench measures AI model capabilities across two dimensions:
- Knowledge — Multiple-choice questions testing WordPress concepts, APIs, and best practices
- Execution — Code generation tasks graded by a real WordPress runtime for correctness and quality
The benchmark uses WordPress itself as the grader, running generated code in a sandboxed environment with static analysis and runtime assertions.
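To make the two dimensions concrete, here is a minimal sketch of how per-dimension scores could be computed. It is an illustration only, not WP-Bench's actual scoring code: the names `ExecutionResult`, `knowledge_score`, `execution_score`, and the `issue_penalty` weighting are all hypothetical, and the real benchmark may combine static analysis and runtime assertions quite differently.

```python
from dataclasses import dataclass


@dataclass
class ExecutionResult:
    """Hypothetical outcome of grading one generated snippet."""
    static_issues: int      # findings reported by static analysis
    assertions_passed: int  # runtime assertions that held
    assertions_total: int   # runtime assertions that were run


def knowledge_score(answers: list[str], key: list[str]) -> float:
    """Fraction of multiple-choice answers that match the answer key."""
    if len(answers) != len(key):
        raise ValueError("answers and key must have the same length")
    if not key:
        return 0.0
    correct = sum(a == k for a, k in zip(answers, key))
    return correct / len(key)


def execution_score(result: ExecutionResult, issue_penalty: float = 0.25) -> float:
    """Assertion pass rate minus a flat penalty per static-analysis issue.

    The penalty scheme is an assumption made for this sketch; the score is
    clamped so it never drops below zero.
    """
    if result.assertions_total == 0:
        return 0.0
    pass_rate = result.assertions_passed / result.assertions_total
    return max(0.0, pass_rate - issue_penalty * result.static_issues)
```

For example, `knowledge_score(["a", "b", "c", "d"], ["a", "b", "d", "d"])` counts three of four answers as correct, and an `ExecutionResult` with one static issue and three of four assertions passing is scored as the pass rate minus one penalty. Keeping the two scores separate mirrors the split between knowledge and execution described above.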