
Apple releases eight small AI language models aimed at on-device use

by admin
April 26, 2024
in Tech


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

In the world of AI, what might be called “small language models” have been growing in popularity recently because they can be run on a local device instead of requiring data center-grade computers in the cloud. On Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly on a smartphone. They’re mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “Open-source Efficient Language Models,” are currently available on the Hugging Face Hub under an Apple Sample Code License. Since there are some restrictions in the license, it may not fit the commonly accepted definition of “open source,” but the source code for OpenELM is available.
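For readers who want to experiment, loading one of the checkpoints should follow the standard Hugging Face transformers flow sketched below. The repository name, the trust_remote_code requirement, and the use of an external Llama tokenizer are assumptions drawn from the public release rather than details confirmed in this article; check the model card before relying on them.

```python
# A minimal sketch, assuming the models live at apple/OpenELM-* on the
# Hugging Face Hub and ship custom modeling code (hence trust_remote_code).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "apple/OpenELM-270M-Instruct"  # assumed repository name
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# The release reportedly pairs OpenELM with a separate Llama tokenizer; this
# identifier is an assumption (and the repo is gated), so verify it first.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

prompt = "Small language models are useful because"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```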

On Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing performance in small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters across eight distinct models.

For comparison, the largest model yet released in Meta’s Llama 3 family includes 70 billion parameters (with a 400 billion version on the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused on making smaller AI language models as capable as larger ones were a few years ago.

The eight OpenELM models come in two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is better suited to developing AI assistants and chatbots). The four sizes, each released in both flavors, are 270 million, 450 million, 1.1 billion, and 3 billion parameters.


OpenELM features a 2,048-token maximum context window. The models were trained on the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says total around 1.8 trillion tokens of data. Tokens are fragmented representations of data used by AI language models for processing.
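To make the token idea concrete, the short sketch below fragments a sentence with an off-the-shelf tokenizer and checks it against a 2,048-token window. The GPT-2 tokenizer is a stand-in for illustration only; OpenELM’s own tokenizer would produce different fragments.

```python
# Illustration only: tokens are the fragments a model actually processes,
# and the context window caps how many fit in one pass.
from transformers import AutoTokenizer

MAX_CONTEXT = 2048  # OpenELM's maximum context window, per Apple

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in tokenizer
token_ids = tokenizer.encode("Small language models can run directly on a smartphone.")

print(tokenizer.convert_ids_to_tokens(token_ids))  # the sentence, fragmented
print(f"{len(token_ids)} tokens; fits in the window: {len(token_ids) <= MAX_CONTEXT}")
```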

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, not only saving computational resources but also improving the model’s performance while it is trained on fewer tokens. According to Apple’s white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement in accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.
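The gist of layer-wise scaling is that not every transformer layer gets the same width. A toy sketch of the idea follows; the linear interpolation and the constants are invented for illustration and are not Apple’s published OpenELM configuration.

```python
# Toy sketch of layer-wise scaling: per-layer attention-head counts and
# feed-forward multipliers grow with depth instead of staying uniform,
# so the parameter budget is spent unevenly across layers.
def layerwise_dims(num_layers, heads_min=4, heads_max=16, ffn_min=1.0, ffn_max=4.0):
    dims = []
    for i in range(num_layers):
        t = i / max(num_layers - 1, 1)  # 0.0 at the first layer, 1.0 at the last
        dims.append({
            "layer": i,
            "attention_heads": round(heads_min + t * (heads_max - heads_min)),
            "ffn_multiplier": round(ffn_min + t * (ffn_max - ffn_min), 2),
        })
    return dims

for layer_cfg in layerwise_dims(num_layers=8):
    print(layer_cfg)
```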

A table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM, and it also included reproducible training recipes that allow the weights (neural network files) to be replicated, which is so far unusual for a major tech company. As Apple says in its OpenELM paper abstract, transparency is a key goal for the company: “The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and enabling investigations into data and model biases, as well as potential risks.”

By releasing the source code, model weights, and training materials, Apple says it aims to “empower and enrich the open research community.” However, it also cautions that since the models were trained on publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, or objectionable in response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed in June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user privacy, though the company may potentially hire Google or OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

Advertisement. Scroll to continue reading.


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

ADVERTISEMENT


An illustration of a robot hand tossing an apple to a human hand.

Getty Images

Durante the world of AI, what might be called “small language models” have been growing durante popularity recently because they can be run a local device instead of requiring patronato center-grade computers durante the cloud. Wednesday, Apple introduced a set of tiny source-available AI language models called OpenELM that are small enough to run directly a smartphone. They’sovrano mostly proof-of-concept research models for now, but they could form the basis of future on-device AI offerings from Apple.

Apple’s new AI models, collectively named OpenELM for “-source Efficient Language Models,” are currently available the Hugging under an Apple Sample Code License. Since there are some restrictions durante the license, it may not fit the commonly accepted definition of “gara open source,” but the source code for OpenELM is available.

Tuesday, we covered Microsoft’s Phi-3 models, which aim to achieve something similar: a useful level of language understanding and processing risultato durante small AI models that can run locally. Phi-3-mini features 3.8 billion parameters, but some of Apple’s OpenELM models are much smaller, ranging from 270 million to 3 billion parameters durante eight distinct models.

Durante comparison, the largest model yet released durante Obbiettivo’s Llama 3 family includes 70 billion parameters (with a 400 billion version the way), and OpenAI’s GPT-3 from 2020 shipped with 175 billion parameters. Parameter count serves as a rough measure of AI model capability and complexity, but recent research has focused making smaller AI language models as capable as larger ones were a few years .

The eight OpenELM models appena che durante two flavors: four as “pretrained” (basically a raw, next-token version of the model) and four as instruction-tuned (fine-tuned for instruction following, which is more ideal for developing AI assistants and chatbots):

Advertisement

OpenELM features a 2048-token maximum context window. The models were trained the publicly available datasets RefinedWeb, a version of PILE with duplications removed, a subset of RedPajama, and a subset of Dolma v1.6, which Apple says totals around 1.8 trillion tokens of patronato. Tokens are fragmented representations of patronato used by AI language models for processing.

Apple says its approach with OpenELM includes a “layer-wise scaling strategy” that reportedly allocates parameters more efficiently across each layer, saving not only computational resources but also improving the model’s risultato while being trained fewer tokens. According to Apple’s released white paper, this strategy has enabled OpenELM to achieve a 2.36 percent improvement durante accuracy over Allen AI’s OLMo 1B (another small language model) while requiring half as many pre-training tokens.

An table comparing OpenELM with other small AI language models in a similar class, taken from the OpenELM research paper by Apple.
Enlarge / An table comparing OpenELM with other small AI language models durante a similar class, taken from the OpenELM research paper by Apple.

Apple

Apple also released the code for CoreNet, a library it used to train OpenELM—and it also included reproducible pratica recipes that allow the weights (neural files) to be replicated, which is unusual for a major tech company so far. As Apple says durante its OpenELM paper abstract, transparency is a key for the company: “The reproducibility and transparency of large language models are crucial for advancing gara open research, ensuring the trustworthiness of results, and enabling investigations into patronato and model biases, as well as potential risks.”

By releasing the source code, model weights, and pratica materials, Apple says it aims to “empower and enrich the gara open research community.” However, it also cautions that since the models were trained publicly sourced datasets, “there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, objectionable durante response to user prompts.”

While Apple has not yet integrated this new wave of AI language model capabilities into its consumer devices, the upcoming iOS 18 update (expected to be revealed durante June at WWDC) is rumored to include new AI features that utilize on-device processing to ensure user —though the company may potentially hire Google OpenAI to handle more complex, off-device AI processing to give Siri a long-overdue boost.

