OpenAI Operator is a groundbreaking AI agent that can autonomously perform web-based tasks through a browser interface, just like a human would.
View in browser
AI Bytes

Hello, 

 

To continue reading, you don’t need to select all squares with traffic lights.😊

 

This week’s AI tip is about: OpenAI Operator  
 

Yesterday OpenAI announced their new approach to AI agents called Operator.

OpenAI Operator is a groundbreaking AI agent that can autonomously perform web-based tasks through a browser interface, just like a human would.

 

Released on January 23, 2025, it’s currently available exclusively to ChatGPT Pro subscribers at operator.chatgpt.com.

 

Core Capabilities


Operator can handle various everyday tasks including:

  • Booking restaurant reservations
  • Ordering groceries
  • Planning vacations
  • Filling out online forms
  • Purchasing tickets

Technical Foundation 


The system is powered by Computer-Using Agent (CUA), built on top of OpenAI’s GPT-4o multimodal model. CUA operates by:

 

  • Taking screenshots to analyze the screen
  • Interacting with graphical user interfaces through clicking, typing and scrolling
  • Breaking down complex tasks into manageable steps
  • Self-correcting when it encounters errors

Key Features 

 

Remote Operation

Unlike similar tools from competitors, Operator runs on OpenAI’s servers rather than the user’s local computer, enabling it to handle multiple tasks simultaneously.

 

Collaborative Approach


When Operator encounters challenges like CAPTCHAs, login requirements or payment details, it automatically pauses and requests user intervention. Users can take control of the remote browser at any point during task execution.
 

Business Integration
 

OpenAI has established partnerships with several major platforms including:

  • DoorDash
  • Instacart
  • OpenTable
  • StubHub
  • Uber

Current Limitations 

 

The system is still in research preview and has some restrictions:

  • Cannot reliably handle complex tasks like detailed slideshow creation
  • Has limitations with calendar management
  • Will not perform certain high-stakes tasks such as financial transactions or email sending

    This week’s batch of AI news 

    1. Chinese startup DeepSeek released an open AI model called DeepSeek-R1 on January 20, which matches OpenAI’s o1 in reasoning capabilities, while being significantly cheaper to use. The model is available under the MIT license, enabling researchers to study and build upon it, though training data remains private. It costs about 1/30th of what o1 does to use.

    Read more: 
    https://www.nature.com/articles/d41586-025-00229-6

     

    2. The Stargate Project, a major $500 billion AI infrastructure initiative was announced, led by SoftBank and OpenAI, with partners including Oracle, NVIDIA and Microsoft. The project aims to strengthen US AI capabilities and create hundreds of thousands of jobs. 

    Read more:

    https://openai.com/index/announcing-the-stargate-project/ 

     

     

    Chatbot soon, 

    Damian Mazurek 

    Chief Innovation Officer 

    DM

    Interested in learning about our AI experience and capabilities? Get in touch with us and learn how our generative AI development services and machine learning expertise can help your organization.  

    SM podstawowy v21 JPG

    About Software Mind 

    Software Mind engineers software that reimagines tomorrow, by providing companies with autonomous development teams who manage software life cycles from ideation to release and beyond. For over 25 years we’ve been enriching organizations with the talent they need to boost scalability, drive dynamic growth and bring disruptive ideas to life. Our top-notch engineering teams combine ownership with leading technologies, including cloud, AI and data science to accelerate digital transformations and boost software delivery.

    Software Mind, Jana Pawła II 43b Avenue, Kraków, Lesser Poland 31-864, Poland

    Unsubscribe Manage preferences