May 15, 2024· 38 min read

Large language models are pre-trained on large datasets and use natural language processing to perform linguistic tasks such as text generation, code completion, paraphrasing, and more.

The initial release of ChatGPT sparked the rapid adoption of generative AI, which has led to large language model innovations and industry growth.

List of LLMs (Updated)

This table lists the leading large language models in 2026.

LLM Name	Developer	Release Date	Access	Parameters
Gemini 3.1 Pro	Google DeepMind	Feb 19, 2026	API	Unknown
Claude Sonnet 4.6	Anthropic	Feb 17, 2026	API	Unknown
Claude Opus 4.6	Anthropic	Feb 5, 2026	API	Unknown
Gemini 3 Flash	Google DeepMind	Dec 17, 2025	API	Unknown
Nemotron 3	Nvidia	Dec 15, 2025	Open Source	Nano 30B, Super 100B, Ultra 500B
GPT-5.2	OpenAI	Dec 11, 2025	API	Unknown
Mistral Large 3	Mistral AI	Dec 2, 2025	API, Open Source	41B active (MoE)
DeepSeek-V3.2	DeepSeek	Dec 1, 2025	API, Open Source	Unknown
Claude Opus 4.5	Anthropic	Nov 24, 2025	API	Unknown
Grok 4.1	xAI	Nov 17, 2025	API	Unknown
Gemini 3 Pro	Google DeepMind	Nov 18, 2025	API	Unknown
GPT-5.1	OpenAI	Nov 12, 2025	API	Unknown
Claude Sonnet 4.5	Anthropic	Sep 29, 2025	API	Unknown
DeepSeek-V3.1	DeepSeek	Aug 2025	API, Open Source	Unknown
GPT-5	OpenAI	August 7, 2025	API	Unknown
Claude 4.1	Anthropic	August 5, 2025	API	Unknown
Grok 4	xAI	July 9, 2025	API	Unknown
Claude Sonnet 4	Anthropic	May 22, 2025	API	Unknown
Claude Opus 4	Anthropic	May 22, 2025	API	Unknown
Qwen 3	Alibaba	April 29, 2025	API, Open Source	235B
GPT-o4-mini	OpenAI	April 16, 2025	API	Unknown
GPT-o3	OpenAI	April 16, 2025	API	Unknown
GPT-4.1	OpenAI	April 14, 2025	API	Unknown
Llama 4 Scout	Meta AI	April 5, 2025	API	17B
Llama 4 Maverick	Meta AI	April 5, 2025	Open Source	400B (17B active, MoE)
Gemini 2.5 Pro	Google DeepMind	Mar 25, 2025	API	Unknown
GPT-4.5	OpenAI	Feb 27, 2025	API	Unknown
Claude 3.7 Sonnet	Anthropic	Feb 24, 2025	API	Unknown (est. 200B+)
Grok-3	xAI	Feb 17, 2025	API	Unknown
Gemini 2.0 Flash-Lite	Google DeepMind	Feb 5, 2025	API	Unknown
Gemini 2.0 Pro	Google DeepMind	Feb 5, 2025	API	Unknown
GPT-o3-mini	OpenAI	Jan 31, 2025	API	Unknown
Qwen 2.5-Max	Alibaba	Jan 29, 2025	API	Unknown
DeepSeek R1	DeepSeek	Jan 20, 2025	API, Open Source	671B (37B active)
DeepSeek-V3	DeepSeek	Dec 26, 2024	API, Open Source	671B (37B active)
Gemini 2.0 Flash	Google DeepMind	Dec 11, 2024	API	Unknown
Nova	Amazon	Dec 3, 2024	API	Unknown
Claude 3.5 Sonnet	Anthropic	Oct 22, 2024	API	Unknown
GPT-o1	OpenAI	Sept 12, 2024	API	Unknown (o1-mini est. ~100B)
DeepSeek-V2.5	DeepSeek	Sept 5, 2024	API, Open Source	Unknown
Grok-2	xAI	Aug 13, 2024	API	Unknown
Mistral Large 2	Mistral AI	July 24, 2024	API	123B
Llama 3.1	Meta AI	July 23, 2024	Open Source	405B
GPT-4o mini	OpenAI	July 18, 2024	API	~8B (est.)
Nemotron-4	Nvidia	July 14, 2024	Open Source	340B
Claude 3.5 Sonnet	Anthropic	June 20, 2024	API	~175-200B (est.)
GPT-4o	OpenAI	May 13, 2024	API	~1.8T (est.)
DeepSeek-V2	DeepSeek	May 6, 2024	API, Open Source	Unknown
Phi-3	Microsoft	April 23, 2024	API, Open Source	Mini 3B, Small 7B, Medium 14B
Mixtral 8x22B	Mistral AI	April 10, 2024	Open Source	141B (39B active)
Jamba	AI21 Labs	Mar 29, 2024	Open Source	52B (12B active)
DBRX	Databricks' Mosaic ML	Mar 27, 2024	Open Source	132B
Command R	Cohere	Mar 11, 2024	API, Open Source	35B
Inflection-2.5	Inflection AI	Mar 7, 2024	Proprietary	Unknown (predecessor ~400B)
Gemma	Google DeepMind	Feb 21, 2024	API, Open Source	2B, 7B
Gemini 1.5	Google DeepMind	Feb 15, 2024	API	~1.5T Pro, ~8B Flash (est.)
Stable LM 2	Stability AI	Jan 19, 2024	Open Source	1.6B, 12B
Grok-1	xAI	Nov 4, 2023	API, Open Source	314 billion
Mistral 7B	Mistral AI	Sept 27, 2023	Open Source	7.3 billion
Falcon 180B	Technology Innovation Institute	Sept 6, 2023	Open Source	180 billion
XGen-7B	Salesforce	July 3, 2023	Open Source	7 billion
PaLM 2	Google	May 10, 2023	API	340 billion
Alpaca 7B	Stanford CRFM	Mar 13, 2023	Open Source	7 billion
Pythia	EleutherAI	Mar 13, 2023	Open Source	70 million to 12 billion

Context Windows and Knowledge Boundaries

LLMs with a larger context window size can handle longer inputs and outputs. The context window, therefore, determines how much information an LLM processes before its performance starts to degrade. The knowledge cutoff date determines the end date of the data used in training.

(It's worth noting that context windows are not the be-all-and-end-all of LLMs. Practicing context engineering on models with smaller windows can even produce better results.)

LLM Name	Context Window (Tokens)	Knowledge Cutoff Date	Release Date
Llama 4 Scout	10,000,000	August 2024	Apr 2025
Grok 4.1	2,000,000	November 2024	Late 2025
Gemini 3.1 Pro	1,000,000	January 2025	Feb 2026
Gemini 3 Pro	1,000,000	January 2025	Nov 2025
Gemini 3 Flash	1,000,000	January 2025	Late 2025
Llama 4 Maverick	1,000,000	August 2024	Apr 2025
Claude Sonnet 4	1,000,000 (upgraded from 200K)	March 2025	May 2025
Gemini 2.5 Flash	1,000,000	January 2025	2025
GPT-5.2 (Instant/Thinking/Pro)	400,000	August 2025	Dec 2025
GPT-5.1	400,000	September 2024	Nov 2025
GPT-5	400,000	September 2024	Aug 2025
Grok 4	256,000	November 2024	Jul 2025
Claude 4.6 Opus	200,000	August 2025 (training) / May 2025 (reliable)	Feb 2026
Claude 4.6 Sonnet	200,000	January 2026 (training) / August 2025 (reliable)
Claude 4.5 Opus	200,000	August 2025 (training) / May 2025 (reliable)	Nov 2025
Claude 4.5 Sonnet	200,000	July 2025 (training) / January 2025 (reliable)	Sep 2025
Claude 4.5 Haiku	200,000	July 2025 (training) / February 2025 (reliable)	Oct 2025
Claude 4 Opus	200,000	March 2025	May 2025
Kimi K2	1,000,000,000 (1T MoE)	~Mid-2025	Jul 2025
DeepSeek-V3-0324	128,000	~Early 2025	Mar 2025
DeepSeek R1	131,072	January 2025	Jan 2025
Qwen 3 (235B-A22B)	128,000	Unknown	Apr 2025
GPT-4.1	1,047,576	June 2024	Apr 2025
GPT-o3	200,000	June 2024	Jan 2025
GPT-o4-mini	200,000	June 2024	Apr 2025
Gemini 2.5 Flash-Lite	1,000,000	January 2025	2025
Grok 3	131,072	November 2024	Feb 2025

As adoption continues to grow, so does the LLM industry.

The global large language model market is projected to grow from $6.5 billion in 2024 to $140.8 billion by 2033
92% of Fortune 500 firms have started using generative AI in their workflows
Generative AI is disrupting the SEO industry and changing the way we find information online
LLM search will drive 75% of search revenue by 2028

Here's a deeper dive on some of the most important models over the last 3 years.

1. GPT-5.2

Introducing GPT-5 - Resource | OpenAI Academy

Developer: OpenAI

Release date: December 2025

Number of Parameters: Unknown

Context Window (Tokens): 400,000

Knowledge Cutoff Date: August 2025

What is it? GPT-5.2 is an iteration of the is the largest OpenAI model to date: GPT-5. In benchmarking, it outperforms earlier OpenAI models in most tests.

The ChatGPT website continues to be one of the world's most popular sites, receiving more than 5.5 billion visitors from organic search in February 2026.

Unlike earlier models that relied solely on unsupervised learning, GPT-5.2 incorporates advanced multimodal and reasoning capabilities, enabling more accurate and context-aware interactions.

It also has superior agentic capabilities and tool-calling than earlier versions. The GPT-5 lineup hallucinates less than older models, but some benchmarks show GPT-5.2 has a higher hallucination rate at 39% than GPT-5 (18%).

OpenAI has positioned GPT-5 as a major step forward, offering improved performance in tasks requiring logic, planning, and real-time understanding.

Pro users began gaining access to GPT-5 in mid-August 2025. Access will expand to Team, Enterprise, and Education users in early September.

ChatGPT is an important platform for marketing as well. As one of the first major LLMs, it has one of the largest base of users.

This is why companies want to boost AI visibility of their products and brand in ChatGPT to win voice of share and discovery opportunities in AI-powered search.

You can benchmark your brand performance in AI search by using this free AI visibility checker.

2. Gemini 3.1 Pro

undefined

Developer: Google DeepMind

Release date: February 19, 2026

Number of Parameters: Unknown

Context Window (Tokens): 1,000,000

Knowledge Cutoff Date: January 2025

What is it? Gemini 3.1 offer is the latest Gemini model, offering a one million-token context window on release.

This model has outstanding reasoning performance, scoring 77.1% on a test that measures the ability of an AI recognize novel patterns not fed during training. The second best model is Claude Opus 4.6 when measured on this benchmark, but it scores only 68.8%, putting Gemini well ahead of all competition right now.

With Nano Bano also rolled in for image generation and editing, the current iteration of Gemini stands as one of the most versatile AI tools with all-around strong capabilities.

Gemini also boasts stunning SVG animations that run directly in the chat interface. It's one of differentiator that's not yet available to the same level in competitor tools yet.

3. DeepSeek-V3.2

DeepSeek counting Rs in strawberry

Developer: DeepSeek

Release date: December 1, 2025

Number of Parameters: 685B total (MoE)

Context Window (Tokens): 163,840 (input) / 65,536 (output)

Knowledge Cutoff Date: Unknown (estimated mid-2025)

What is it? DeepSeek-V3.2 is a reasoning model that excels in math and coding. DeepSeek's earliest models outperformed mainstread models like OpenAI o1 at first launch.

However, models like GPT-5.2, Claude Opus 3.6, and Gemini 3.1 and beyond have surpassed DeepSeek in several intelligence benchmarks.

(We've done a thorough DeepSeek vs ChatGPT comparison, where we put the R1 model to the test.)

That said, there's one area where DeepSeek still outshines its competitors: handling massive context windows far more cheaply.

DeepSeek V3.2 costs $0.25 per million tokens for input and $0.40 per million tokens for output. It's only a fraction of what Google, OpenAI, or Anthropic charge for comparable performance .

This is critical advantage for enterprise deployment.

Another notable feature is DeepSeek's tool-use abilities which integrates with both thinking and non-thinking modes. The ability to think and reason while using external tools makes DeepSeek a formidable competitor in the AI space.

Prominently, DeepSeek also has lower hallucination rates than leading LLMs like Gemini 3.1 and Claude Opus 4.6

On its release, DeepSeek immediately hit headlines due to the low cost of training compared to most major LLMs. Traffic to the DeepSeek website exploded in early 2025.

According to Semrush, DeepSeek gets over 262 million visits per month from more than 42.9 million unique visitors.

4. Claude 4.6 Opus

undefined

Developer: Anthropic

Release date: February 5, 2026

Number of Parameters: Unknown (Anthropic does not disclose parameter counts)

Context Window (Tokens): 200,000 (1,000,000 in beta) / 128,000 max output

Knowledge Cutoff Date: May 2025 (reliable) / August 2025 (training data)

What is it? Claude Opus 4.6 is Anthropic's most intelligent model. Anthropic released 4.6 only a few months after Opus 4.5, seeking to expand features of the 4.5 model.

On the web, it receives 219.9+ visits from organic search each month.

Claude remains a popular model for creative tasks and coding.

Since Claude was the first LLM to introduce MCP, it became a popular choice for developers, designers, and marketers.

There's a good number of tools that support MCPs now, so marketers can have Claude perform keyword research, backlink analysis, and other SEO tasks within the chat interface.

The latest upgrade in the flagship Opus line of models is adaptive thinking.

Simply put, Claude has the ability to dynamically decide the amount of thinking effort it should put in on a task in order to maximize speed, results, and computational efficiency.

Opus 4.6 introduces agent teams, a system where multiple AI agents specializing in different tasks handle different parts of a problem in team-work style.

Each agent gets its own context window (up to 1 million tokens), and they can communicate peer-to-peer through the "Mailbox Protocol".

For complex tasks (coding, math, data analysis etc.) requiring deep context knowledge work, Claude 4.6 Opus is right up there with the best LLM tools.

5. Grok-4

undefined

Developer: xAI

Release date: July 9, 2025

Number of Parameters: Unknown (Grok-1: 314 billion)

Context Window (Tokens): 256,000

Knowledge Cutoff Date: None (uses real-time information)

What is it? Grok-4 is the newest flagship model from xAI, building on the capabilities of Grok-3 with major improvements in reasoning, speed, and real-time awareness. It’s fully integrated into X (formerly Twitter) for Premium+ subscribers.

As of launch, Grok now serves 42.7 million active users, with daily visits averaging 6.85 million since Grok-4 became available.

6. Mistral Large 2

undefined

Developer: Mistral AI

Release date: December 2, 2025

Number of Parameters: 675 billion total / 41 billion active (Sparse MoE)

Context Window (Tokens): 256,000

Knowledge Cutoff Date: Unknown

What is it? Mistral Large 3 uses a mixture-of-experts model with an impressive context window size. While it doesn't measure up to the reasoning and coding capabilities of LLMs like GPT-5.2, Claude 4.6, Gemini 3.1, and DeepSeek, it's a powerful general model with impressive multi-lingual performance.

Mistral's main utility comes from its open-source nature and its ability to be self-hosted. That, combined with its token efficiency makes Mistral a good enterprise-level LLM even if it lags behind GPT and Claude in reasoning tasks.

7. Falcon 3

undefined

Developer: Technology Innovation Institute (TII)

Release date: December 17, 2024

Number of Parameters: 10 billion (largest variant); family includes 1B, 3B, 7B, and 10B models

Context Window (Tokens): 32,768

Knowledge Cutoff Date: Unknown

What is it? Falcon 3 is a smart LLM that reflects where the open-source AI ecosystem is heading, with a focus on small, efficient, and accessible range of AI models.

It's not a match for leading models like Claude, GPT, and Gemini, since it's a fairly small model.

That said Falcon 3-10B outperforms some Llama variants in Hugging Face leaderboard.

In February 2024, the UAE-based Technology Innovation Institute (TII) committed $300 million in funding to the Falcon Foundation.

8. Llama 4

Meta's LLaMA 4: Scout, Maverick, and Behemoth — A New Era in Scalable Multimodal AI | by Don Moon | Byte-Sized AI | Medium

Developer: Meta AI

Release date: April 5, 2025

Number of Parameters: 109 billion total / 17 billion active (Scout); 400 billion total / 17 billion active (Maverick)

Context Window (Tokens): 10,000,000 (Scout); 1,000,000 (Maverick)

Knowledge Cutoff Date: August 2024

What is it? Llama 4 is another mixture-of-models LLMs consisting of Llama 4 Scout (~109B total) abd Llama 4 Maverick (~400B total). Meta has also expanded its multilingual capabilities, adding support for eight more languages. This model now stands as the largest open-source release from Meta to date.

That said, there was significant controversy where Llama 4 Maverick benchmark results were discovered to have been manipulated by Meta to exaggerate its performance.

With independent testing revealing that Llama 4 performed worse than several models that were already months old at the time of Llama 4's release, Meta delayed the release of LLama 4 Behemoth that still hasn't been made publicly available.

The substandard results of Meta Llama make it suitable for casual tasks at best and not the best tool for jobs involving technical coding, development, and analysis tasks.

9. Inflection-3.0

undefined

Developer: Inflection AI

Release date: March 7, 2024

Number of Parameters: Unknown

Context Window (Tokens): 32,768

Knowledge Cutoff Date: Mid 2023

What is it? Inflection-2.5 was developed by Inflection AI to power its conversational AI assistant, Pi. Significant upgrades have been made, as the model currently achieves over 94% of GPT-4’s average performance while only having 40% of the training FLOPs.

Pi differentiates itself by being an empathetic AI. It doesn't measure up to flagship AI tools in technicals tasks, focusing instead on being an emotional support and displaying human-like kindness and diplomacy in its responses.

However, the company has since pivoted from its user-centric AI chatbot and is now prioritizing enterprise use.

The Microsoft-backed startup reached 1+ million daily active users on Pi in 2021, Q1.

10. Jamba

undefined

Developer: AI21 Labs

Release date: March 6, 2025 (Jamba 1.6); October 8, 2025 (Jamba Reasoning 3B); January 2026 (Jamba 2 Mini)

Number of Parameters: 398B total / 94B active (Large); 52B total / 12B active (Mini); 3B (Reasoning 3B) — all MoE

Context Window (Tokens): 256,000

Knowledge Cutoff Date: Early March 2024 (Jamba 1.5 Mini confirmed); newer versions likely later 2024

What is it? AI21 Labs created Jamba, the world's first production-grade Mamba-style large language model. It integrates SSM technology with elements of a traditional transformer model to create a hybrid architecture. The model is efficient and highly scalable, with a context window of 256K and deployment support of 140K context on a single GPU.

Jamba's core competency is maintaining high speed and efficiency when processing answers with long contexts.

However, benchmarks show that Jamba is one of the least intelligent LLMs, especially for today's standards. And while it is faster than some counterparts like DeepSeek, Claude, and Grok, its low intelligence leaves it behind leading LLMs, more so considering that it's not the cheapest either.

11. Command A

undefined

Developer: Cohere

Release date: March 13, 2025

Number of Parameters: 111 billion (Command A)

Context Window (Tokens): 256,000 (Command A)

Knowledge Cutoff Date: Unknown

What is it? Command A is a series of scalable LLMs from Cohere that support ten languages and 256,000-context length. This model primarily excels at retrieval-augmented generation for enterprise use.

Cohere has moved from the Command R era to Command A and its family (Reasoning, Vision, Translate) in a single year. In doing so, it doubled the context window to 256K and achieved superior inference efficiency.

They're one of the few companies building an enterprise stack (North + Embed + Rerank + Command) rather than just shipping a model.

12. Gemma 3

undefined

Developer: Google DeepMind

Release date: August 14, 2025 (Gemma 3 270M)

Number of Parameters: 270M, 1B, 4B, 12B, and 27B (Gemma 3)

Context Window (Tokens): 128,000 (Gemma 3, 4B and above); 32,000 (Gemma 3 1B & 270M; Gemma 3n)

Knowledge Cutoff Date: Unknown

What is it? Gemma is a series of lightweight open-source language models developed and released by Google DeepMind. The Gemma models are built with similar tech to the Gemini models, but Gemma is limited to text inputs and outputs only.

It doesn't compete with frontier closed models on factual accuracy or hard reasoning, but for the open-source / local-deployment community, Gemma 3 is one of the most versatile and well-supported options available.

13. Phi-4

undefined

Release date: December 12, 2024 (Phi-4 base); February 26, 2025 (Phi-4-mini & Phi-4-multimodal); April 30, 2025 (Phi-4-reasoning family); March 4, 2026 (Phi-4-reasoning-vision)

Number of Parameters: 3.8B (Phi-4-mini), 5.6B (Phi-4-multimodal), 14B (Phi-4 base / reasoning / reasoning-plus), 15B (Phi-4-reasoning-vision)

Context Window (Tokens): 128,000

Knowledge Cutoff Date: June 2024 (Phi-4-multimodal); February 2025 (Phi-4-mini-reasoning)

What is it? Classified as a small language model (SLM), Phi-4 is Microsoft's latest release with 3.8 billion parameters. Despite the smaller size, it's been trained on 3.3 trillion tokens of data to compete with Mistral 8x7B and GPT-3.5 performance on MT-bench and MMLU benchmarks.

These models are fundamentally limited by size for certain tasks. They simply don't have the capacity to store too much factual knowledge, so users may experience factual incorrectness.

Microsoft has only just very recently released Phi-4-reasoning-vision-15B. This model combines vision understanding with structured reasoning.

In fact, it uses a mixed reasoning/non-reasoning approach, switching automatically depending on the nature of task. For perception-based problems (OCR, captioning etc.), it uses direct inference.

When a scientific or math problem is given, the models applies chain-of-thought reasoning for more thoughtful responses.

14. XGen

undefined

Developer: Salesforce AI Research

Release date: May 2, 2025 (xGen-small); April 2025 (xLAM-2 series)

Number of Parameters: 4B–9B (xGen-small); 1B–70B (xLAM-2 series)

Context Window (Tokens): 128,000 (xGen-small); 32,000–128,000 (xLAM)

Knowledge Cutoff Date: Unknown (estimated mid-2025)

What is it? XGen series include a small model as well as a Large Action Model (LAM).

LAMs are specialized, compact language models that focus on predict the next action rather than the next word like traditional LLMs do.

They're purpose-built for AI agents that can trigger workflows, call functions, and execute tasks autonomously.

While these models aren't in competition with the top LLMs, it's part of Salesforce design philosophy to keep these models powerful in narrow applications like agentic workflows for enterprises.

15. DBRX

undefined

Developer: Databricks' Mosaic ML

Release date: March 27, 2024

Number of Parameters: 132 billion

Context Window (Tokens): 32,768

Knowledge Cutoff Date: December 2023

DBRX has now been retired and no longer receives any updates.

What is it? DBRX is an open-source LLM built by Databricks and the Mosaic ML research team. The mixture-of-experts architecture has 36 billion (of 132 billion total) active parameters on an input. DBRX has 16 experts and chooses 4 of them during inference, providing 65 times more expert combinations compared to similar models like Mixtral and Grok-1.

16. Pythia

undefined

Developer: EleutherAI

Release date: February 13, 2023

Number of Parameters: 70 million to 12 billion

Context Window (Tokens): 2,048

Knowledge Cutoff Date: Mid 2022

What is it? Pythia is a series of 16 large language models developed and released by EleutherAI, a non-profit AI research lab. There are eight different model sizes: 70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B. Because of Pythia's open-source license, these LLMs serve as a base model for fine-tuned, instruction-following LLMs like Dolly 2.0 by Databricks.

17. Alpaca 7B

undefined

Developer: Stanford CRFM

Release date: March 27, 2024

Number of Parameters: 7 billion

Context Window (Tokens): 32,768

Knowledge Cutoff Date: Unknown

Defunct as a model. No successor model released.

What is it? Alpaca is a 7 billion-parameter language model developed by a Stanford research team and fine-tuned from Meta's LLaMA 7B model. Users will notice that, although being much smaller, Alpaca performs similarly to text-DaVinci-003 (ChatGPT 3.5). However, Alpaca 7B is available for research purposes, and no commercial licenses are available.

18. Nemotron-3

undefined

Developer: NVIDIA

Release date: December 15, 2025

Number of Parameters: 3.6B active (Nemotron 3 Nano); ~100B and ~500B (Nemotron 3 Super/Ultra, upcoming)

Context Window (Tokens): 128,000 (Llama Nemotron, Nemotron Nano 2) / 1,000,000 (Nemotron 3)

Knowledge Cutoff Date: Mid-2025

What is it? The Nemotron 3 models introduce a breakthrough hybrid latent mixture-of-experts method, featuring a native 1M-token context window.

On some benchmarks, the Nemotron 3 family is more accurate than GPT-OSS-20B and Qwen3-30B-A3B.

Nemotron goes beyond language models and includes an ecosystem capable of reasoning, vision, speech, RAG models for document retrieval, and safety models for real-time content filtering.

19. PaLM 2

undefined

Developer: Google

Release date: May 10, 2023

Number of Parameters: 340 billion

Context Window (Tokens): 8,192

Knowledge Cutoff Date: February 2023

PaLM 2 has been decommissioned was originally used to power Google's first generative AI chatbot, Bard (rebranded to Gemini in February 2024).

What is it? PaLM 2 is an advanced large language model developed by Google. As the successor to the original Pathways Language Model (PaLM), it’s trained on 3.6 trillion tokens (compared to 780 billion) and 340 billion parameters (compared to 540 billion).

Wrapping Up

New breakthroughs and innovations are emerging at an unprecedented pace.

We will keep this list regularly updated with new models. If you liked learning about these LLMs, check out our lists of generative AI startups and AI startups.