marketerswiki
Home
Resources
marketerswiki

The open playbook for performance marketers who build with AI.

Resources

  • SimpleCRM
  • Ads OS
  • Ledgeros
  • Tag Manager Engine
  • Skills
  • Playbooks
  • Vibecoding

Company

  • About
  • Privacy Policy
  • Terms of Service

© 2026 marketers wiki. All rights reserved.

Built withClaude Code
Two passes. All 49 skills at 100%.

Autoresearch Results

Two passes. 49 skills. All at 100%.

Every skill in the Vibecoding pipeline was put through two rounds of systematic testing and improvement: a functional autoresearch pass (Karpathy method) and a progressive disclosure audit. Both ended at 100%.

49
Skills tested
100%
Functional score
49/49 at 4/4
Disclosure score
120
Total experiments
84.2%
Avg baseline
40–70%
Token reduction

Two Passes, One Standard: 100%

Karpathy autoresearch method

Pass 1 — Functional Autoresearch

5 real-world test inputs × 5 binary evals = 25 points per skill. One mutation per experiment — the change that fixes the most failures. Loop until 100% for 3 consecutive stability runs.

Tests whether the skill produces correct, complete output across real scenarios: edge cases, missing files, wrong stack, direct activation, handoffs.

4-criteria binary audit per skill

Pass 2 — Progressive Disclosure Audit

Each skill scored on 4 binary checks. Any skill below 4/4 received a targeted fix — either a jump directive or structural rewrite.

Tests token efficiency: does Claude load only the relevant section, or does it read the whole skill every time?

  • Entry router at the top routes by context before any content loads
  • Sections are clearly labeled so Claude can jump directly
  • Heavy content (code blocks, configs) gated behind named sections
  • No wall of code appears before context is established

Pass 1 — Functional Autoresearch

All 49 skills at 100%. 120 total experiments. Average baseline 84.2%.

Top Failure Patterns Fixed

×12 skills
Missing preview + approval prompt
Added mandatory approval step before every file save in all doc generator skills
×11 skills
JS/TypeScript-only — Python stacks broken
Per-skill Python adapters: pip, venv, Alembic, gunicorn, pytest, Makefile
×8 skills
No fallback for missing prerequisite files
Explicit fallback chains when PRD.md, DESIGN_SYSTEM.md, or progress.txt absent
×5 skills
No direct-activation handler
Added "read progress.txt first; if missing, ask user" to skills designed for BUILD/SHIP only
×3 skills
No severity levels in audit output
MUST FIX / SHOULD FIX and CRITICAL / HIGH / MEDIUM / LOW added to design-guard, code-review, security-review

All 49 Skills — Functional Results

SkillBaselineFinalExperimentsKey Fix
vibe-coding-orchestrator80%100%3Bug intercept before routing; routing announcements; step status on resume
vibe-coding-state80%100%3Concrete WRITE STATE format; VALIDATE error messages; RESUME missing-file handler
vibe-coding-ideate68%100%4Mid-session fast-track switch; progress.txt on activation; handoff announcement
vibe-coding-document84%100%4Resume skip-approved-docs; revision handler; PRD import gap-filling
vibe-coding-doc-prd76%100%4Thin data handler; existing PRD check; preview + approval
vibe-coding-doc-appflow72%100%3Preview + approval; missing PRD fallback; SPA handler
vibe-coding-doc-techstack72%100%2Preview + approval; stack-adaptive setup commands
vibe-coding-doc-design76%100%3Preview + approval; UI_UX_SELECTIONS lookup; validation checklist
vibe-coding-doc-backend72%100%3Preview + approval; ORM adapter; API style adapter
vibe-coding-doc-frontend72%100%2Preview + approval; Vite + Next.js Pages Router structure variants
vibe-coding-doc-implplan72%100%3Preview + approval; no-auth handler; Python stack variants
vibe-coding-doc-claudemd56%100%2Specific gotchas derivation; Python stack adapter for state/testing
vibe-coding-doc-review84%100%3globals.css path discovery; requirements.txt fallback; missing-file handler
vibe-coding-build96%100%1Explicit error routing to vibe-coding-build-fix on any compile failure
vibe-coding-css-setup96%100%1Non-Tailwind handler; Tailwind v4 setup with @import and @theme block
vibe-coding-design-templates88%100%3Type detection scoring; Vite variant; read APP_FLOW.md screen inventory
vibe-coding-design-guard64%100%3MUST FIX vs SHOULD FIX severity; Svelte + Vue check sections added
vibe-coding-ui-ux88%100%3Style-audience mismatch warning; skip path defaults; custom color expansion
vibe-coding-ui-review92%100%2DESIGN_SYSTEM.md missing handler; framework-aware fix suggestions
vibe-coding-code-review96%100%1Language adaptations for Python, Go, Ruby, PHP added per check
vibe-coding-debug96%100%1Vague bug handler — clarifying questions before reproducing
vibe-coding-impact-analysis92%100%2Python/Go/Ruby/PHP import pattern detection; deletion always HIGH risk
vibe-coding-reverse-engineer92%100%1Minimum codebase check — warn if fewer than 5 code files
vibe-coding-re-scan80%100%1Save to progress.txt before displaying scan output
vibe-coding-re-analyze100%100%—Already perfect at baseline
vibe-coding-re-generate96%100%1Wave 1 partial failure handler
vibe-coding-ship88%100%2Platform deploy commands; Python pre-flight; Railway/Fly.io/Render support
vibe-coding-explore100%100%—Already perfect at baseline
vibe-coding-recall100%100%—Already perfect at baseline
vibe-coding-security-review100%100%—Already perfect at baseline
vibe-coding-tdd92%100%1RSpec + Rust #[test] templates added; direct-activation handler
vibe-coding-build-fix76%100%1Go + Rust error sections added; Python module resolution section
vibe-coding-api-connect80%100%1Python httpx + requests client; Python OAuth2; Flask webhook handler
vibe-coding-cli-runner68%100%1Python CLI: venv, Alembic, Django, uvicorn/gunicorn, pytest, Makefile
vibe-coding-mcp-setup84%100%1JSON merge guidance; settings.json create-if-missing step; security checklist
vibe-coding-local-runner100%100%—Go, Rails, Rust, PHP sections added in rewrite pass
vibe-coding-deploy-vercel80%100%1Python on Vercel limitation note + serverless function config
vibe-coding-deploy-netlify80%100%1Python persistent server limitation — redirect to Railway/Render/Fly.io
vibe-coding-deploy-digitalocean100%100%—Already perfect at baseline
vibe-coding-review-react100%100%—React 19 patterns added in rewrite pass
vibe-coding-react-native100%100%—Expo New Architecture check added in rewrite pass
vibe-coding-web-design-guidelines100%100%—Already perfect at baseline
vibe-coding-self-improve40%100%1Fixed handoff claims; progress.txt format; orchestrator registration; results file created
vibe-coding-db80%100%1Non-canonical PHASE: DB_SETUP replaced with BUILD phase append format
vibe-coding-db-sqlite80%100%1Progress.txt update section added with BUILD phase append format
vibe-coding-db-bettersqlite80%100%1Progress.txt update section added with BUILD phase append format
vibe-coding-db-postgres80%100%1Progress.txt update section added with connection_pooling flag
vibe-coding-db-duckdb80%100%1Progress.txt update section added with primary_db field
vibe-coding-db-convex80%100%1Progress.txt update section added with auth_configured and file_storage_configured flags

49 skills. Both passes. All at 100%.

The full pipeline is free and open source on GitHub.

Get the Skills on GitHub