[FEATURE] Use AppleScripts for MacOS Computer Use when possible

## Summary

Screenshots are expensive.

I need to rethink the approach of using Vision models for Computer Use, the LLMs can also interact with the Accessibility Tree instead of purely by vision. This will make the system more compatible and token efficient.

Not all OS supports this type of structured data, so the task is to explore the opportunity to make this more efficient.

Perhaps even providing a tool to read the tree and select an element with a fallback that when it fails, it will consume a screenshot (GetLatestScreenshot).

### Acceptance Criteria

- [ ] A pre-step tool call for reading and using the Accessibility tree exists to make the operations more efficient
- [ ] Less tokens ingestion is needed
- [ ] It's documented
- [ ] It's tested


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Use AppleScripts for MacOS Computer Use when possible #371

Summary

Acceptance Criteria

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[FEATURE] Use AppleScripts for MacOS Computer Use when possible #371

Description

Summary

Acceptance Criteria

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions