Yotam Jacob
Portrait of Yotam Jacob

  AI Quality & Automation Engineer

I build the verification layer for AI-assisted software.

  • Ship AI-written code with confidence. Eval gates catch the bad changes before they merge.
  • Bug triage on autopilot. Agentic pipelines classify, fix, and open PRs for production errors.
  • Test coverage that grows itself. Every bug we close becomes a new regression test.
  • Senior from day one. 7 yrs, zero-to-CI suites, FDA-grade rigor, fully remote.

Selected builds

Exactius · 2024–

Self-triaging incident pipeline

Production errors classify themselves, draft their own fixes, and open confidence-gated pull requests. There's no human in the first loop. The gate decides what's safe to put up for review.

Impact: many hours saved on every issue triage raised by Sentry

Sentry Claude triage ClickUp GitHub PR
Exactius · 2024–

Coverage that grows itself

Every closed production bug becomes a Playwright regression test via Claude Code, driven by a tag-based state machine on a weekly schedule. The suite widens on its own, so yesterday's incident can't reappear unnoticed.

Impact: several hours saved for each new bug

Claude Code Playwright Scheduled CI
Exactius · 2024–

Parallel E2E on shared state

Parallelized Playwright across a single shared-state environment using tag-based serial lanes (@readonly / @stateful) and per-worker storage state. That cleared the race conditions on a non-isolated backend, and it did it without a fresh fixture for every test.

Impact: 50% reduction in run times on parallelized tests

Playwright TypeScript
Exactius · 2024–

Quality, built from zero

Stood up the frontend unit-testing practice from scratch with Jest + RTL + MSW, enforced in CI so a failing test blocks the merge instead of slipping into a release. Wrote the testing standards the team builds against today, as the first QA voice on the product.

Impact: increased confidence in code quality

Jest RTL MSW GitHub Actions
Exactius · 2024–

QA that writes itself, on demand

Authored custom Claude skills and slash commands that generate bug-fix test coverage across frontend and backend. The repetitive QA chores are now a one-command workflow anyone on the team can run.

Impact: hours saved on each new test suite written

Claude Code Custom Skills FE + BE
Nanox Vision · 2019–24

FDA-grade V&V

Owned formal validation, verification, and traceability for FDA-submitted SaaS algorithms: STD / STR / RTM documentation backed by automated GitLab CI/CD tests. Designed the Java and Selenium framework that a distributed team of eight built on.

Impact: maximized confidence in product quality, aligned with customer and regulatory needs

Selenium GitLab CI Java

Toolbox

AI & PipelinesAgentic triage/fix pipelineseval-style confidence gatingClaude · Codex · Copilot integrationcustom Claude skillsprompt engineeringPipedream
Test AutomationPlaywrightSeleniumWebDriverIOCypressJestReact Testing LibraryMSWpytestTestNG / JUnitCucumber BDD
Languages & WebTypeScriptJavaScriptPythonJavaBashReactNext.js
Backend & APIFastAPIREST / RESTfulPostmanSQLPostgreSQLMongoDB
DevOps & CloudGitHub ActionsGitLab CI/CDGoogle Cloud RunAWS (EC2, S3)Linux / macOS / Windows
Monitoring & PMSentryClickUpJiraXrayZephyr

Bonus

Side project · on Google Play

Egg Smash Adventures

A mindless-fun mobile game where you play a monkey, smash eggs, win rewards, and climb the stages. I built it solo with Claude and VS Code, with Supabase handling auth, user accounts, and cloud save. And yes, GitHub Actions runs daily sanity tests on it, because apparently I can't ship anything without a pipeline.

Claude Supabase Daily CI