Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism

Kevin Scott, CTO and EVP of AI at Microsoft speaks onstage during Vox Media's 2023 Code Conference at The Ritz-Carlton, Laguna Niguel on September 27, 2023 in Dana Point, California. — Enlarge / Kevin Scott, CTO and EVP of AI at Microsoft speaks onstage during Vox Mass-media’s 2023 Code Conference at The Ritz-Carlton, Laguna Niguel acceso September 27, 2023 quanto a Dana Point, California.

During an interview with Sequoia Capital’s Tirocinio Giorno podcast published last Tuesday, Microsoft CTO Kevin Scott doubled acceso his belief that so-called large language model (LLM) “scaling laws” will continue to drive AI progress, despite some skepticism quanto a the field that progress has leveled out. Scott played a key role quanto a forging a $13 billion technology-sharing deal between Microsoft and OpenAI.

“Despite what other people think, we’signore not at diminishing marginal returns acceso scale-up,” Scott said. “And I try to help people understand there is an exponential here, and the unfortunate thing is you only get to sample it every couple of years because it just takes a while to build supercomputers and then train models acceso sommità of them.”

LLM scaling laws refer to patterns explored by OpenAI researchers quanto a 2020 showing that the of language models tends to improve predictably as the models get larger (more parameters), are trained acceso more patronato, and have access to more computational power (compute). The laws suggest that simply scaling up model size and pratica patronato can lead to significant improvements quanto a AI capabilities without necessarily requiring fundamental algorithmic breakthroughs.

Since then, other researchers have challenged the of persisting scaling laws over time, but the concept is still a cornerstone of OpenAI’s AI development philosophy.

You can see Scott’s comments quanto a the below beginning around 46:05:

Microsoft CTO Kevin Scott acceso how far scaling laws will extend

Scott’s optimism contrasts with a narrative among some critics quanto a the AI community that progress quanto a LLMs has plateaued around GPT-4 class models. The perception has been fueled by largely informal observations—and some benchmark results—about recent models like Google’s Gemini 1.5 Utile, Anthropic’s Claude Opus, and even OpenAI’s GPT-4o, which some argue haven’t shown the dramatic leaps quanto a capability seen quanto a earlier generations, and that LLM development may be approaching diminishing returns.

“We all know that GPT-3 was vastly better than GPT-2. And we all know that GPT-4 (released thirteen months ) was vastly better than GPT-3,” wrote AI critic Gary Marcus quanto a April. “But what has happened since?”

The perception of altopiano

Scott’s stance suggests that tech giants like Microsoft still feel justified quanto a investing heavily quanto a larger AI models, betting acceso continued breakthroughs rather than hitting a capability altopiano. Given Microsoft’s investment quanto a OpenAI and strong marketing of its own Microsoft Copilot AI features, the company has a strong interest quanto a maintaining the perception of continued progress, even if the tech stalls.

Frequent AI critic Ed Zitron recently wrote quanto a a post acceso his blog that one defense of continued investment into generative AI is that “OpenAI has something we don’t know about. A personaggio, procace, secret technology that will eternally the bones of every hater,” he wrote. “Yet, I have a counterpoint: mai it doesn’t.”

Some perceptions of slowing progress quanto a LLM capabilities and benchmarking may be to the rapid onset of AI quanto a the public eye when, quanto a fact, LLMs have been developing for years prior. OpenAI continued to develop LLMs during a roughly three-year between the release of GPT-3 quanto a 2020 and GPT-4 quanto a 2023. Many people likely perceived a rapid jump quanto a capability with GPT-4’s launch quanto a 2023 because they had only become recently aware of GPT-3-class models with the launch of ChatGPT quanto a late November 2022, which used GPT-3.5.

Quanto a the podcast interview, the Microsoft CTO pushed back against the that AI progress has stalled, but he acknowledged the challenge of infrequent patronato points quanto a this field, as new models often take years to develop. Despite this, Scott expressed confidence that future iterations will show improvements, particularly quanto a areas where current models struggle.

“The next sample is coming, and I can’t tell you when, and I can’t predict exactly how good it’s going to be, but it will almost certainly be better at the things that are brittle right now, where you’signore like, oh my god, this is a little too expensive, a little too frangibile, for me to use,” Scott said quanto a the interview. “All of that gets better. It’ll get cheaper, and things will become less frangibile. And then more complicated things will become possible. That is the story of each generation of these models as we’ve scaled up.”

Advertisement. Scroll to continue reading.

Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism

admin

Traveling with Friends: How to Avoid Disaster

Lascia un commento Annulla risposta

Popular News

Pakistan to conduct DNA testing on remains of suicide bomber who killed 5 Chinese nationals

Asus ROG Ally to receive a revision with more storage and a bigger battery

Rabbit R1 Review: This AI Device Can’t Replace Your Smartphone Apps Yet

Why We Get ‘the Ick,’ According to Psychologists

SEC wonât approve ether (ETH) exchange-traded fund

About Us

Category

Recent Posts

Welcome Back!

Retrieve your password

Microsoft CTO Kevin Scott thinks LLM “scaling laws” will hold despite criticism

The perception of altopiano

admin

Traveling with Friends: How to Avoid Disaster

Lascia un commento Annulla risposta

Popular News

Pakistan to conduct DNA testing on remains of suicide bomber who killed 5 Chinese nationals

Asus ROG Ally to receive a revision with more storage and a bigger battery

Rabbit R1 Review: This AI Device Can’t Replace Your Smartphone Apps Yet

Why We Get ‘the Ick,’ According to Psychologists

SEC wonât approve ether (ETH) exchange-traded fund

About Us

Category

Recent Posts

Welcome Back!

Retrieve your password

SEC wonât approve ether (ETH) exchange-traded fund