Repository Details
Shared by


HelloGitHub Rating
10.0
1 ratings
Intelligent Agent Framework for Operating Computers
Past 7 days Received 285 stars ✨
Free•Apache-2.0
Claim
Discuss
Collect
Share
4.9k
Stars
No
Chinese
Python
Language
Yes
Active
13
Contributors
11
Issues
Yes
Organization
0.2.3
Latest
476
Forks
Apache-2.0
License
More

This is an AI Agent framework that enables AI to operate computers (such as macOS, Windows, Linux, Android) like humans and automatically complete complex GUI operation tasks. It adopts a 'general body + expert body' combined architecture and supports active hierarchical planning. By integrating large models (LLM) and visual multimodal models, it can understand inputs such as screenshots and interface structures and generate operation instructions to achieve automatic clicking, inputting, window switching, searching and other operations.
Comments
Rating:
No comments yet