OpenAI has announced the launch of ChatGPT Agent, a newly upgraded agentic version of ChatGPT. This integrated autonomous AI system can not only understand language and analyze information but also take the initiative to act: it operates web pages, processes files, and generates presentations, turning ideas into practical results.

ChatGPT Agent is now available

ChatGPT Agent is an AI system that can think, act, and choose its own tools. More than a chatbot, it can operate websites, fill out forms, build presentations, or analyze competitors through a virtual computer, greatly simplifying tedious tasks.

It integrates three major capabilities:

  • Operator: Web operation expert

  • Deep research: multi-step reasoning and information synthesis

  • ChatGPT conversational capabilities: natural and smooth human-computer interaction

Users only need to briefly describe what they need, and ChatGPT Agent will choose and use the best tool to complete the task. For example: "Please put together my client briefing based on recent news" or "Analyze competitors and turn the results into a PowerPoint deck."

ChatGPT Agent chains tools to complete complex workflows

ChatGPT Agent is equipped with a range of web tools, including a graphical browser, a text browser, and a module that connects directly to APIs. It switches between them according to the task:

  • To retrieve data, it calls an API

  • To operate a website, it uses the browser to simulate clicks and typing

  • To carry out integrated tasks, it works in a virtual environment that preserves the full task context
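
As a rough illustration of this kind of switching, the sketch below routes one step of a task to one of the three tools. It is a hypothetical heuristic, not OpenAI's implementation: the `Tool` enum, the `choose_tool` function, and both of its parameters are names assumed here for illustration only.

```python
from enum import Enum


class Tool(Enum):
    """The three web tools named in the article (identifiers are assumed)."""
    API = "api"
    GRAPHICAL_BROWSER = "graphical_browser"
    TEXT_BROWSER = "text_browser"


def choose_tool(needs_interaction: bool, has_api: bool) -> Tool:
    """Pick a tool for one step of a task (hypothetical routing rule)."""
    if needs_interaction:
        # Clicking and typing on a site calls for the graphical browser.
        return Tool.GRAPHICAL_BROWSER
    if has_api:
        # Plain data retrieval goes through a direct API connection.
        return Tool.API
    # Otherwise fall back to reading pages as text.
    return Tool.TEXT_BROWSER


if __name__ == "__main__":
    # A booking step needs clicks; a data lookup with an API does not.
    print(choose_tool(needs_interaction=True, has_api=False).value)
    print(choose_tool(needs_interaction=False, has_api=True).value)
```

A real agent would presumably make this choice with the model itself rather than fixed rules; the point is only that each step is matched to the lightest tool that can do the job.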

It also supports real-time interaction and correction: users can change direction at any time during a task, or interrupt it and take over the browser themselves, which makes it highly flexible.

ChatGPT Agent breaks industry records in multiple benchmark tests

OpenAI ran a number of standardized benchmarks on ChatGPT Agent, and the results were impressive:

1. Humanity’s Last Exam (Expert Questions and Answers)

  • ChatGPT Agent set a new record of 43.1% accuracy, ahead of other tool-augmented models.

2. DSBench (Data Science Task Test)

  • Data analysis accuracy: 89.9%, far better than GPT-4o (34.1%) and humans (64.1%)

  • Data modeling performance: 85.5%, leading in all aspects

3. SpreadsheetBench (spreadsheet editing)

  • 45.5% accuracy in editing Excel spreadsheets, almost twice Copilot's score

4. Investment banking model building tasks

  • Outperforms deep research and the OpenAI o3 model by a wide margin

5. WebArena and BrowseComp (web tasks and hard-to-find information)

  • ChatGPT Agent set new records with 78.2% and 68.9% accuracy respectively, ahead of comparable products in the industry

Whether for enterprise, personal, or educational use, ChatGPT Agent can be highly practical. Application scenarios include:

  • Automatically turn dashboard data into presentations

  • Reschedule your trip or meeting

  • Edit and update financial spreadsheets

  • Travel planning and booking

  • Search and book services, restaurants and other personal matters

You can also schedule recurring tasks, for example, automatically generating a KPI report every Monday.

How to enable ChatGPT Agent?

To use the agent feature, just select "agent mode" in ChatGPT and describe the task. The system opens a task execution window and shows progress and narration in real time. If necessary, you can:

  • Stop the task

  • Provide new instructions

  • Take over the operation yourself

The feature is rolling out gradually to Pro, Plus, Team, Enterprise, and Education plan users. Pro users also get almost unlimited task quotas.

How does ChatGPT Agent handle security?

For the first time, ChatGPT can actually operate websites. OpenAI has designed multiple security mechanisms to keep users in control and protect their privacy:

  • Explicit authorization is required before actions such as shopping, making appointments, or filling out forms

  • Sensitive tasks require "watch mode": step-by-step approval of each action

  • High-risk actions, such as financial transactions and legal matters, are actively refused

  • Prompt injection attacks and abuse are guarded against

  • Browsing data is not stored; users can delete cookies and log out at any time
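
The first three rules above can be read as a simple decision gate applied before each action. The sketch below is one way to picture it, under stated assumptions: the `Decision` enum, the `gate` function, and the action labels are hypothetical names invented for this example, and nothing here reflects how OpenAI actually implements these checks.

```python
from enum import Enum


class Decision(Enum):
    REFUSE = "refuse"   # the agent declines outright
    WATCH = "watch"     # user approves each step as it happens
    ASK = "ask"         # user gives explicit authorization first
    ALLOW = "allow"     # no extra confirmation needed


# Hypothetical action labels, taken from the examples in the article.
HIGH_RISK = {"financial_transaction", "legal_matter"}
NEEDS_AUTHORIZATION = {"shopping", "appointment", "form_submission"}


def gate(action: str, sensitive: bool = False) -> Decision:
    """Decide how much user control an action needs (illustrative only)."""
    if action in HIGH_RISK:
        # High-risk actions are refused regardless of other flags.
        return Decision.REFUSE
    if sensitive:
        # Sensitive tasks run with step-by-step approval.
        return Decision.WATCH
    if action in NEEDS_AUTHORIZATION:
        # These actions require explicit authorization before they run.
        return Decision.ASK
    return Decision.ALLOW


if __name__ == "__main__":
    for name in ("financial_transaction", "shopping", "read_page"):
        print(name, "->", gate(name).value)
```

Checking the strictest rule first means a request that matches several categories always gets the most conservative treatment.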

Although the agent can already handle presentation production and task integration, some functions (such as designing polished presentations from scratch) are still in beta, and the formatting and aesthetics may be a bit rough.

In the future, a new generation of presentation features will be launched to improve layout, content quality, and template use, and to further optimize how data is read and presented.

OpenAI said that this is only the first step in integrating an autonomous agent system into ChatGPT. It will continue to update the agent and add more tools and capabilities, building ChatGPT into a professional, reliable, and efficient digital work partner.

This article, "ChatGPT Agent is officially launched! AI can operate web pages autonomously, you can do it just by thinking about it", first appeared in Chain News ABMedia.