According to Cointelegraph, OpenAI is scaling back its contracts with Scale AI, a data labeling startup recently acquired by Meta. This decision comes shortly after Meta announced a $14.8 billion deal for a 49% ownership stake in Scale AI, marking Meta's second-largest acquisition. As part of the deal, Scale CEO Alexandr Wang will join Meta’s experimental AI project, with the companies having announced the agreement on June 12.
Scale AI, founded in 2016 and backed by over 100 investors, provides labeled data crucial for training and enhancing artificial intelligence models. The startup has been a supplier to prominent AI companies such as Anthropic, Cohere, and Adept. In 2019, Scale AI raised $100 million in a Series C funding round, according to PitchBook. However, OpenAI is now phasing out its reliance on Scale AI's data, seeking more specialized data sources for its AI models. An OpenAI spokesperson revealed that the company began reducing its contracts with Scale over the past year, noting that Scale accounted for only a small portion of OpenAI’s data requirements.
Google is reportedly another company moving away from contracts with Scale AI, driven by concerns that Meta's acquisition could provide insights into competitors’ AI advancements. Reuters reported that this strategic shift is motivated by the potential competitive implications of Meta's involvement with Scale AI. Despite these changes, Scale interim CEO Jason Droege emphasized that the startup remains an independent entity, asserting that their commitment to protecting customer data remains unchanged.
OpenAI is now exploring alternative data suppliers, including emerging companies like Mercor, to support its operations. Bloomberg highlighted that Scale AI initially employed a large number of contractors to label images and text for early AI systems. Over time, the company transitioned to hiring more educated contractors to contribute to the development of advanced AI models. This evolution reflects the growing complexity and sophistication required in AI data labeling processes.