AI code reviews grounded in 12 classic engineering books — decay risk diagnostics with book citations, severity labels, and 6 analysis modes including full-sweep auto-fix
-
Updated
Jun 13, 2026 - JavaScript
AI code reviews grounded in 12 classic engineering books — decay risk diagnostics with book citations, severity labels, and 6 analysis modes including full-sweep auto-fix
🧪 Intelligent test analysis tool that finds duplicate and redundant tests using coverage analysis. Improve test suite quality with detailed reports and pytest integration.
AI-powered GitHub Action that analyzes test quality, provides scores, identifies issues, and suggests improvements through OpenAI's GPT models. Posts detailed reports directly to your pull requests.
Automated chaos testing for Python — fault injection, property assertions, and stateful exploration
Static analysis tool that scores TypeScript tests across 6 quality dimensions. Works with Jest, Vitest, Playwright, Cypress, and Mocha.
AI agent testing toolkit for Playwright and Cypress: generate E2E tests from scratch, review existing specs against 24 anti-patterns (P0/P1/P2 silent-always-pass smells), and debug flaky failures from playwright-report/ or cypress/reports/. Agent Skills for Claude Code and Codex.
Side Project for Quality Assurance.
Verification system that catches coding agents falsely claiming task completion. Runs 4 parallel checks (file integrity, test quality, scope narrowing, optional LLM judge) over task+claim+diff and returns a weighted 0-100 confidence score with evidence.
A smart test quality gate CLI. Lint your test suite for anti-patterns, flaky selectors, hard waits, and bad practices before they merge. Like ESLint, but for your tests.
ESLint plugin to flag weak tests: snapshot-only coverage, mock-call-only tests, unawaited assertions, no negative path, and empty async tests.
A Claude Code skill that challenges tests to actually catch bugs. Detects test theater, weak assertions, and missing edge cases.
Hermes Agent plugin that runs weekly static analysis on Python test suites, flagging 7 types of test debt: skipped tests, commented asserts, TODOs, missing assertions, fixed sleeps, abusive retries, and duplicates. Produces prioritised Markdown reports and notifies the QA channel. Stdlib only.
Catch theater, weak, and missing unit tests in VS Code. Local-first detective with line-anchored evidence and an optional anti-hallucination AI second opinion (OpenAI / Claude / Gemini).
Open-source plugins for Claude Code — AI-powered developer tools
Detects test files that don't actually test real implementations
Stop AI agents from writing useless tests. Decision trees for mocking, test layering, and requirement-driven testing that catches real bugs.
Pragma is a Claude Code plugin (and a small CLI) that watches every test file your AI assistant writes and blocks the edit when the test is gamed — assertions that pass without actually verifying anything.
AI agent that detects flaky tests from CI history and proposes fixes via Codex
Hermes Agent plugin that runs mutation testing on Python code by wrapping mutmut. Exposes the mutation_test tool and returns a structured JSON with mutation score, breakdown by category (killed/survived/timeout/...) and a bounded sample of surviving mutants.
Mutation testing for Go source code
Add a description, image, and links to the test-quality topic page so that developers can more easily learn about it.
To associate your repository with the test-quality topic, visit your repo's landing page and select "manage topics."