fix(computer): make every action invocable (schema field gaps) + element_at/error-mapping fixes by 1jehuang · Pull Request #351 · 1jehuang/jcode

1jehuang · 2026-06-09T04:47:46Z

Summary

Exhaustively exercised every computer tool action against the live running server and fixed every failure so the tool is production-ready. The handlers were already implemented; the bugs were in the function-calling contract and a few AppleScript/error-mapping/perf details. All fixes are validated live after build + reload.

Fixes #350.

Bugs fixed

1. Schema omitted half the input fields → 8+ actions were impossible to call

ComputerInput deserializes dx, dy, to_x, to_y, w, h, menu_path, window_id, ax_action, contains, timeout_ms, region, level and dispatch() requires them, but parameters_schema() only declared a subset. A model can only emit parameters present in the tool schema, so those fields could never be sent and each dependent action dead-ended in its own requires X guard:
scroll, drag, resize_window, select_menu, window_screenshot, wait_for, set_brightness, perform_action, plus the ocr region variant.

Fix: declare all input fields (action specs stay in discover; fields must be declared). Trimmed descriptions so the always-on schema stays ~880 tokens. Added schema_declares_every_input_field regression test.

2. `element_at` used the AppleScript reserved word `result`

result holds the last command's value, so element_at always failed with The variable result is not defined. Renamed to bestHit.

3. `element_at` walked the entire AX tree → 20s timeout on big apps

Now prunes to only descend subtrees whose frame contains the point (a child's frame ⊆ its parent's), with a depth bound + explicit 12s timeout. Full-tree walk → path-length walk. Live: System Settings used to time out (20s); a comparable large tree (Messages) now returns in ~1.6s.

4. `-1719` misclassified as "Accessibility permission required"

-1719 (errAEIllegalIndex) / -1728 (errAENoSuchObject) mean the target reference didn't resolve (app not running / no front window / AX path out of range), not a permission problem. Extracted a pure, unit-tested classify_osa_error: permission text first, then map index errors to an accurate "target not found" message. Use -25211 (errAXAPIDisabled) for the real permission code.

5. `wait_for` ignored its own `timeout_ms`

Each poll dumped the AX tree (depth 10) under the 20s default scripting timeout, so a single poll could take ~20s and a timeout_ms=2000 call wouldn't return until ~20s. Now each poll is bounded by the time remaining (capped 3s) and the dump depth is reduced. Live: a 2000ms wait_for now returns in ~2.2s.

Validation

cargo test -p jcode-app-core --lib tool::computer: 26 passed, 0 failed (10 new tests across the fixes).
Built selfdev, installed, reloaded the running server through 4 iterations, and re-ran the full action matrix live. Every previously-broken action now works:
scroll ✓ drag ✓ resize_window ✓ select_menu ✓ (View > Actual Size in Preview) window_screenshot ✓ (captured an occluded window) wait_for ✓ (honors timeout) set_brightness ✓ (clean message) perform_action ✓ (AXPress) ocr region ✓ element_at ✓ (fast + accurate not-found errors).

Notes / known limitations

wait_for/ui/element_at operate on front window of frontApp; apps on another Space or with non-AX content (Catalyst/Electron/rich-text) expose limited text. This is an inherent AX limitation, not a tool bug. Follow-up tracked in computer tool: truthful observability animation for background AX actions (highlight when visible, notice when occluded) #348.

Test plan for reviewers

cargo test -p jcode-app-core --lib tool::computer
# live (mutates desktop; needs GUI + Accessibility/Screen Recording):
cargo test -p jcode-app-core tool::computer::coverage -- --ignored --nocapture --test-threads=1

…served word The computer tool implements every action and ComputerInput field, but parameters_schema() (progressive disclosure) only declared a subset of properties. Models emit only parameters present in the tool's JSON schema, so the undeclared fields could never be sent, making 8 actions impossible to invoke despite being fully implemented: - scroll (needs dx/dy) - drag (needs to_x/to_y) - resize_window (needs w/h) - select_menu (needs menu_path) - window_screenshot (needs window_id) - wait_for (needs contains, timeout_ms) - set_brightness (needs level) - perform_action (needs ax_action) - ocr region variant(needs region) Declare all input fields in the schema. Action *specs* stay in `discover` (progressive disclosure), but the *fields* must be declared or the action is dead on arrival. Trim field descriptions so the always-on schema stays small (~880 tokens) and bump the schema_is_compact bound accordingly. Also fix ax.rs::element_at: the hit-test AppleScript assigned to a local named `result`, which is reserved in AppleScript (holds the last command's value), so element_at always failed with "The variable result is not defined." Rename to `bestHit`. Add a schema_declares_every_input_field regression test so this class of bug (handler requires a field the schema never exposes) can't recur. Fixes #350.

…cation Two production-quality issues found during exhaustive live testing: 1) element_at walked the ENTIRE AX tree, so on large apps (System Settings) it blew past the scripting timeout (~20s) and failed. A child's AX frame is contained in its parent's, so a parent that doesn't contain the point can't have a matching descendant: prune to only descend into subtrees whose frame contains the point, add a depth bound, and give it an explicit 12s timeout. This turns a full-tree walk into a path-length walk. 2) osascript errors -1719 (errAEIllegalIndex / "Invalid index") were misclassified as "Accessibility permission required", sending users to grant a permission they already had when the real cause was "app not running / no front window / AX path out of range". Extract the mapping into a pure `classify_osa_error`, check permission text first, then map -1719/-1728/"invalid index" to an accurate "target not found" message. Use -25211 (errAXAPIDisabled) for the genuine permission code. Add unit tests covering each branch. Refs #350.

Each wait_for poll dumped the AX tree (depth 10) under run_applescript's 20s default timeout. On a large app a single poll could take ~20s, so a wait_for with timeout_ms=4000 didn't return until ~20s — it ignored its own deadline. Bound each poll's scripting timeout to the time remaining (capped at 3s) and shrink the dump depth to 6 so polls are cheap and the call honors timeout_ms. Refs #350.

… matching

Add a keymap discovery layer in jcode-setup-hints that normalizes both macOS system shortcuts (com.apple.symbolichotkeys) and terminal emulator bindings (Ghostty +list-keybinds) into a shared KeyChord and persists them to ~/.jcode/keymap-snapshot.json. This is the data layer for detecting when a terminal/OS shortcut intercepts a key jcode wants to use. - chord: normalized (cmd/ctrl/alt/shift + key) with cross-source key spelling - macos_hotkeys: decode [ascii, keycode, modmask] triples (pure + tested) - terminal: parse Ghostty effective keybinds (pure + tested) - snapshot: collect/save/load JSON; tolerant of string/int plist params

Build the conflict-detection and reporting layer on top of the keymap snapshot, and surface it through a new /keys command. - conflicts: enumerate jcode's KeybindingsConfig as chords and diff against the machine snapshot, reporting each overlap tied to its config field (deduplicated per field/source) - report: human-readable diagnostics + a compact status line - mod: snapshot_cached_or_refresh() reuses a <1 day old snapshot, else rescans - /keys (alias /keybindings): show diagnostics; /keys refresh forces a rescan - registered in command list, help overlay, and both dispatch chains

Surface keybinding conflicts proactively at launch, not only via /keys. - conflict_signature(): stable, order-independent signature of a conflict set - SetupHintsState.keymap_conflict_signature: debounce so users are warned once per distinct conflict set and never nagged on every launch - maybe_show_keymap_conflict_hint(&KeybindingsConfig): config-aware startup notice, gated on a real TTY and on the signature changing - wired into the client launch path, deferring to existing setup hints so it never clobbers an early-launch alignment/welcome/terminal tip

Split the warn-once-per-change policy out of maybe_show_keymap_conflict_hint into a pure conflict_hint_decision() so it is unit-testable without I/O. Covers: unchanged, newly-conflicting, resolved-silently, and changed-set.

Extract keymap_conflict_hint_for() so the decision + state-update path is testable without TTY/disk I/O, and add an integration test exercising first-warn, debounce, and resolved-silently transitions.

JadenSWang · 2026-06-24T20:39:02Z

use openrelay

…sion rows Surfaces the credential sources jcode probed during first-run detection but did not find, beneath the import/login decision rows, with PgUp/PgDn scrolling when the list overflows the welcome card. - jcode-app-core: add external_auth_search_report()/external_auth_search_targets() keyed on presence (not consent) so already-trusted logins never read as 'not found' - onboarding flow carries login_not_found; welcome kinds gain not_found rows + scroll - ui_onboarding renders a scrollable 'Searched, not found' panel - PgUp/PgDn scroll, clamped, disjoint from the Yes/No h/l/j/k keys; reset on row advance - tests: core family coverage, welcome-kind rows, scroll clamp, render assertions

…anel

Adds a one-time 'Set up ScrollWM?' onboarding step (macOS, local sessions) shown before the suggestion cards when ScrollWM isn't already installed and the user hasn't answered. On Yes it runs the ScrollWM web installer in the background and surfaces progress; the Accessibility grant is ScrollWM-owned. Default highlight is No and the timeout never installs. - OnboardingPhase::ScrollWmOptIn + ScrollWmInstallProgress render kind - gate: macOS + local + not-installed + unanswered (pure, unit-tested) - async install via web-install.sh (download-to-temp then bash, 180s cap) - result delivered via Bus::ScrollWmInstallCompleted -> advances to suggestions - persisted in setup_hints.json (scrollwm_optin_answered / install_started) - Yes/No keys mirror the existing decision rows; tick auto-skips to No - tests: gate logic, enter/answer/skip, Yes->Running, install-completed, render

The Rust counterpart to ScrollWM's Swift ControlClient: a small, blocking, best-effort client for ScrollWM's Unix control socket. This is the code-level link between the two repos, used to detect ScrollWM and drive it (arrange, focus columns, focus-by-title, read strip status). - ScrollWm client: discover()/with_socket(), is_running (ping), hello (version handshake with status fallback), status (serde over controlStatusJSON shape), arrange, focus_index/next/prev, focus_title - socket path resolution honors SCROLLWM_CONTROL_SOCK (sandbox/tests) and defaults to ~/Library/Application Support/ScrollWM/control.sock - ENOENT/ECONNREFUSED map to NotRunning; error: replies -> Protocol; non-unix stub - pure column_for_title helper for title-keyed focus - tests: path resolution, JSON round-trips, NotRunning, and loopback tests that exercise the real connect/write/read protocol via a temp UnixListener

When agents.scrollwm.enabled and ScrollWM is running, focus each freshly spawned headed agent's strip column by matching its unique session name in the window title. Best-effort and fully detached: spawn never blocks on socket or window I/O, and it is a quiet no-op when ScrollWM is absent/not managing. - AgentsConfig.scrollwm: { enabled=false, focus_active=true, arrange_on_spawn=false } - env overrides JCODE_SCROLLWM / _FOCUS_ACTIVE / _ARRANGE_ON_SPAWN + known-vars - maybe_reconcile_scrollwm_after_spawn() hooked after register_visible_spawned_member, polls status up to ~3s for the agent column, never calls arrange unless opted in - documented in the default config; config env-override test added

…st flake - docs/scrollwm-integration/CONTRACT.md points at the canonical scrollwm/docs/INTEGRATION.md and records jcode's client/config/onboarding surface - PLAN.md marks all five workstreams shipped - jcode-scrollwm loopback tests: use short /tmp paths + atomic counter so parallel binds never collide (macOS sun_path 104-byte limit) — 9/9 stable

…ded swarms

jeremy added 4 commits June 8, 2026 21:27

fix(computer): wait_for depth 8 + include description text for better…

c84ad17

… matching

1jehuang mentioned this pull request Jun 9, 2026

computer tool: schema omits half its input fields (scroll/drag/resize/select_menu/window_screenshot/wait_for/set_brightness/perform_action unusable) + element_at AppleScript reserved-word bug #350

Closed

jeremy added 6 commits June 8, 2026 22:43

test(keymap): cover full conflict-hint path (warn/debounce/resolve)

043aeae

Extract keymap_conflict_hint_for() so the decision + state-update path is testable without TTY/disk I/O, and add an integration test exercising first-warn, debounce, and resolved-silently transitions.

docs: document keybinding conflict detection and /keys

0fde2dc

jcode added 8 commits June 25, 2026 19:44

docs(scrollwm): integration exploration + synthesis plan

69a7202

test(onboarding): inject welcome kind in render tests for not-found p…

d1f61b3

…anel

docs(readme): document ScrollWM window-management integration for hea…

1677d56

…ded swarms

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(computer): make every action invocable (schema field gaps) + element_at/error-mapping fixes#351

fix(computer): make every action invocable (schema field gaps) + element_at/error-mapping fixes#351
1jehuang wants to merge 18 commits into
feat/macos-computer-toolfrom
fix/computer-tool-schema-and-element-at

1jehuang commented Jun 9, 2026 •

edited

Loading

Uh oh!

JadenSWang commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

1jehuang commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Bugs fixed

1. Schema omitted half the input fields → 8+ actions were impossible to call

2. element_at used the AppleScript reserved word result

3. element_at walked the entire AX tree → 20s timeout on big apps

4. -1719 misclassified as "Accessibility permission required"

5. wait_for ignored its own timeout_ms

Validation

Notes / known limitations

Test plan for reviewers

Uh oh!

JadenSWang commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1jehuang commented Jun 9, 2026 •

edited

Loading

2. `element_at` used the AppleScript reserved word `result`

3. `element_at` walked the entire AX tree → 20s timeout on big apps

4. `-1719` misclassified as "Accessibility permission required"

5. `wait_for` ignored its own `timeout_ms`