Agent-1 can “operate software like a human”



Summary

Agent-1 may soon be able to operate any software. Initially, the AI model will launch as part of a browser extension.

Matt Shumer, CEO of HyperWriteAI and OthersideAI, has announced a foundation model called “Agent-1”. It is supposed to be able to operate software like a human. No scientific paper has been published yet, but he demonstrated the model's capabilities in a video.

Current AI models such as GPT-4, and even next-generation models that have not yet been released, are unable to reliably operate software and programming interfaces, Shumer said.

Cost and speed are also major issues given the complexity of operating software, he said. “Agent-1” aims to solve these problems. Shumer promises a lot: “We’re already well above previous state-of-the-art, and we’re improving massively each week.”


Chrome plugin integration

Agent-1 is expected to be integrated into HyperWrite’s Personal Assistant in the next few updates. Personal Assistant is a browser extension that provides a website-independent AI text generator.

Image: Screenshot/THE DECODER

At the end of June, Shumer unveiled a very early version of Personal Assistant, which can perform simple browser tasks such as sending an email or ordering food.

Agent-1, however, could take on much more complex tasks. In Shumer’s demo video, you can see Agent-1 controlling a Google Cloud dashboard.

Image: screenshot/HyperWrite

“Dynamic thinking”

“Current models store lots of knowledge, leaving fewer parameters for reasoning,” he explains. “Instead, we aim to put all of the model’s horsepower to work on dynamic reasoning.”

This “dynamic reasoning” approach, he says, allows the model to handle situations for which it has not been trained. Shumer has set the bar high for Agent-1.

A related system is Gorilla, a large language model that has been trained on 1,600 programming interfaces and is capable of operating software.


