The AI assistant is moving from "just talking" to "doing things for you".
Recently,
Test Experience: Fully "Driverless", But Requires Some Patience
In a test case disclosed by
Background Operations: The AI will automatically open the app, identify interface elements, fill out forms, select options, and confirm the order.
Asynchronous Execution: During execution, the bottom of the screen will scroll in real-time with messages like "Selecting destination". The coolest part is that you can switch to watching videos or replying to emails, while the AI continues running in the background until the task is completed.
Speed Bottleneck: The current drawback is "slowness". Since the AI needs to recognize the interface frame by frame and perform cloud-based reasoning, a task that takes 2 minutes manually may take up to 9 minutes for the AI.
Technical Breakthrough: Breaking the Ten-Year Ceiling of "Information Query"
Over the past decade, from Siri to
Ecological Limitations: Still in the "Concept Product" Stage
Although the prospects are promising, the current automation features still face many challenges:
Narrow Adaptation Scope: Currently, it only supports highly standardized apps such as Uber and DoorDash.
Need for Improved Error Tolerance: Interface recognition errors or security restrictions in the payment process remain major obstacles to its widespread adoption.
Major Players' Battle: The Year of "AI Agent" Begins in 2026
With the recent efforts of
Although the current
