Repository Details
Shared by
HelloGitHub Rating
10.0
1 ratings
Intelligent Agent Framework for Operating Computers
Past 6 days Received 217 stars ✨
Free•Apache-2.0
Claim
Discuss
Collect
Share
7.9k
Stars
No
Chinese
Python
Language
Yes
Active
19
Contributors
12
Issues
Yes
Organization
0.3.1
Latest
854
Forks
Apache-2.0
License
More

This is an AI Agent framework that enables AI to operate computers (such as macOS, Windows, Linux, Android) like humans and automatically complete complex GUI operation tasks. It adopts a 'general body + expert body' combined architecture and supports active hierarchical planning. By integrating large models (LLM) and visual multimodal models, it can understand inputs such as screenshots and interface structures and generate operation instructions to achieve automatic clicking, inputting, window switching, searching and other operations.
Comments
Rating:
No comments yet