Key Points
Last week, after briefly deposed CEO Sam Altman was reinstalled at OpenAI, two reports claimed that a top-secret project at the company had rattled some researchers there with its potential to solve intractable problems in a powerful new way...
Combining a close read of the initial reports with consideration of the hottest problems in AI right now suggests it may be related to a project that OpenAI announced in May, claiming powerful new results from a technique called process supervision...
The project showed how this could help LLMs, which often make simple errors on elementary math questions, tackle such problems more effectively.. Andrew Ng, a Stanford University professor who led AI labs at both Google and Baidu and who introduced many people to machine learning through his classes on Coursera, says that improving large language models is the next logical step in making them more useful..
Subbarao Kambhampati, a professor at Arizona State University who is researching the reasoning limitations of LLMs, thinks that Q* may involve using huge amounts of synthetic data, combined with reinforcement learning, to train LLMs to specific tasks such as simple arithmetic..
The TLDR version is that Q* could be an effort to use reinforcement learning and a few other techniques to improve a large language models ability to solve tasks by reasoning through steps along the way..
You might be interested in
Now That ChatGPT Is Plugged In, Things Could Get Weird
28, Mar, 23Letting the chatbot interact with the live internet will make it more useful—and more problematic, too.
OpenAI used YouTube data to train some of its models: Report
15, Jun, 23The outlet also reported that Google, which owns YouTube, has been using the video sharing platform’s data to train its own model Gemini. Read more on The Hindu
Sam Altman once said Indians can’t build OpenAI and compete with it, but can he do it now?
20, Nov, 23During his visit to India, the fired OpenAI CEO Sam Altman said that he did not think that Indians would be able to compete with a company like OpenAI because they would not be able to create something like ChatGPT. Now that he has been fired the question is: Can he compete with OpenAI?
OpenAI Offers a Peek Inside the Guts of ChatGPT
09, Jun, 24Days after former employees said the company was being too reckless with its technology, OpenAI released a research paper on a method for reverse engineering the workings of AI models.
ChatGPT owner OpenAI to open first foreign office in UK
28, Jun, 23The Microsoft-backed company says the London office will allow it to 'attract world-class talent'.
OpenAI says ChatGPT users can now turn off chat histories
25, Apr, 23The company has allowed users to withhold their ChatGPT conversations from being used in training the artificial intelligence models
OpenAI, Google, Meta or Anthropic? A Guide to the Best AI for Your Business
09, Mar, 24Different flavors work best for different business needs
Greg Brockman, OpenAI co-founder, quits hours after CEO Sam Altman sacked
18, Nov, 23Barely a few hours after OpenAI, the company that created ChatGPT sacked CEO Sam Altman, co-founder and president Greg Brockman announced that he had quit.
OpenAI to enable more customizations for enterprise and individual users
10, Mar, 23The Microsoft-backed company is working with enterprise clients to train its models in particular domains.
Behind EU lawmakers’ challenge to rein in ChatGPT and generative AI
29, Apr, 23EU lawmakers are working quickly to regulate technology like ChatGPT and generative AI.