· 2h · on MSN
OpenAI’s new Operator AI agent can do things on the web for you
· 2h
Operator: OpenAI’s Next Step Toward the ‘Agentic’ Future
The company says the CUA’s reasoning technique, which they call an “inner monologue,” helps the model understand intermediate steps and adapt to unexpected input. Under the hood, CUA takes screenshots of web pages and uses a virtual mouse and keyboard to navigate.
· 3h · on MSN
OpenAI announces Operator AI agent that can browse the web for you
The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision mode to "see" what's on the user's screen through screenshots with graphical user interfaces (GUIs) that enable Operator to interact with the screen (clicking buttons, typing, scrolling, etc.).
Some results have been hidden because they may be inaccessible to you
Show inaccessible results