GPT, Claude, Llama? How to tell which AI model is best

Beware model-makers marking their own homework

Illustration: George Wylesol

Jul 31st 2024

When Meta, the parent company of Facebook, announced its latest open-source large language model (LLM) on July 23rd, it claimed that the most powerful version of Llama 3.1 had “state-of-the-art capabilities that rival the best closed-source models” such as GPT-4o and Claude 3.5 Sonnet. Meta’s announcement included a table, showing the scores achieved by these and other models on a series of popular benchmarks with names such as MMLU, GSM8K and GPQA.

Explore more

Science & technology August 3rd 2024

Reuse this content

GPT, Claude, Llama? How to tell which AI model is best

Beware model-makers marking their own homework

Continue with a free trial

Explore all our independent journalism for free for one month. Cancel any time

Explore more

Science & technology August 3rd 2024

More from Science & technology

How to reduce the risk of developing dementia

How America built an AI tool to predict Taliban attacks

Gene-editing drugs are moving from lab to clinic at lightning speed

How Ukraine’s new tech foils Russian aerial attacks

The deep sea is home to “dark oxygen”

Augmented reality offers a safer driving experience

GPT, Claude, Llama? How to tell which AI model is best

Beware model-makers marking their own homework

Explore more

Science & technology August 3rd 2024

Handpicked stories, in your inbox

More from Science & technology

How to reduce the risk of developing dementia

How America built an AI tool to predict Taliban attacks

Gene-editing drugs are moving from lab to clinic at lightning speed

How Ukraine’s new tech foils Russian aerial attacks

The deep sea is home to “dark oxygen”

Augmented reality offers a safer driving experience