diff --git a/documents/docs/index.md b/documents/docs/index.md
index 2ca81f32..bac79968 100644
--- a/documents/docs/index.md
+++ b/documents/docs/index.md
@@ -24,7 +24,7 @@
 - AppAgent 👾, responsible for iteratively executing actions on the selected applications until the task is successfully concluded within a specific application.
-- Control Interaction 🎮, is tasked with translating actions from HostAgent and AppAgent into interactions with the application and its UI controls. It's essential that the targeted controls are compatible with the Windows **UI Automation** or **Win32** API.
+- Application Automator 🎮, is tasked with translating actions from HostAgent and AppAgent into interactions with the application and through UI controls, native APIs or AI tools. Check out more details [here](./automator/overview.md).
 Both agents leverage the multi-modal capabilities of Visual Language Model (VLM) to comprehend the application UI and fulfill the user's request. For more details, please consult our [technical report](https://arxiv.org/abs/2402.07939).