agents-orchestrator/agent-orchestrator - Hibryda Git

agents-orchestrator/agent-orchestrator

Author	SHA1	Message	Date
Hibryda	f339c5918d	test: increase WebDriverIO timeout for LLM-judged E2E tests Increase global mocha timeout from 60s to 180s in wdio.conf.js to accommodate longer-running LLM judge tests that evaluate agent responses and code generation. Add explicit per-test overrides for Phase B scenarios B4 and B5 to ensure adequate time for agent startup, execution, and LLM verification. - wdio.conf.js: global timeout 60_000 → 180_000ms - phase-b.test.ts: explicit 180_000ms timeout for B4 and B5 scenarios	2026-03-12 11:10:50 +01:00
Hibryda	070ef3bf48	test: update WebDriverIO configuration with improved fixture setup and logging	2026-03-12 11:10:50 +01:00
Hibryda	9a90c2499a	test: add Phase C E2E tests and fix pre-existing test failures - Add phase-c.test.ts: 27 new E2E tests across 11 scenarios covering hardening sprint features (command palette, search overlay, notification center, keyboard navigation, settings panel, project health, metrics tab, context tab, files tab, LLM-judged settings/status bar) - Fix 3 pre-existing failures in bterminal.test.ts: update stale CSS selectors (.group-name → .cmd-label, .palette-item.active → .selected) - Register phase-c.test.ts in wdio.conf.js specs array - Update test counts: 444 vitest + 151 cargo + 109 E2E = 704 total	2026-03-12 11:10:50 +01:00
Hibryda	90c997d3e9	feat(e2e): add Phase B scenarios with LLM-judged assertions and multi-project tests Adds 6 new E2E scenarios in phase-b.test.ts covering multi-project grid rendering, independent tab switching, status bar fleet state, and LLM-judged agent response quality evaluation via Claude API. Includes llm-judge.ts helper (raw Anthropic API fetch, haiku-4-5, structured verdicts with confidence thresholds).	2026-03-12 11:10:50 +01:00
Hibryda	8bc8a1a33d	feat(e2e): add Phase A scenarios, fixtures, and results store 7 human-authored test scenarios (22 tests) using data-testid selectors. Test fixture generator for isolated environments. JSON results store (no native deps). WebDriverIO config updated with TCP readiness probe and multi-spec support.	2026-03-12 11:10:50 +01:00
Hibryda	2eb323fba8	test(e2e): expand coverage from 25 to 48 tests across 8 describe blocks	2026-03-08 22:27:51 +01:00
Hibryda	d12cbffda7	fix(e2e): consolidate specs into single file and fix WebDriver click issues Tauri creates one app session per spec file; multiple files caused invalid session id on subsequent specs. WebDriver clicks on Svelte 5 components inside scrollable panels dont trigger onclick handlers via WebKit2GTK/tauri-driver - use browser.execute() JS clicks. Also removed tauri-plugin-log (redundant with telemetry::init()).	2026-03-08 21:58:23 +01:00
Hibryda	bfbdb2cc18	fix(e2e): resolve wdio v9 BiDi + tauri-driver compatibility issues	2026-03-08 21:32:16 +01:00
Hibryda	3c3a8ab54e	test(e2e): scaffold WebdriverIO + tauri-driver E2E testing infrastructure	2026-03-08 21:13:38 +01:00