Gemma 4: US news

The wider picture

Gemma 4 is a family of state-of-the-art open models launched by Google DeepMind. The models are designed to run efficiently on a range of hardware, including Android devices, laptop GPUs, and developer workstations. Their release extends what on-device AI can do, giving developers models that are both capable and efficient across that hardware.

Among the most notable features of Gemma 4 are advanced reasoning and multi-step planning, along with improved results on math and instruction-following benchmarks. This makes it a robust option for developers who want to extend what their applications can do. The models also natively support function calling, structured JSON output, and system instructions, which are essential for building autonomous agents.
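To make the function-calling idea concrete, here is a minimal sketch of how an application might handle a structured JSON function call on its own side. The JSON shape (`name` and `arguments` keys), the `get_weather` tool, and the `dispatch` helper are all hypothetical and chosen for illustration; they are not Gemma 4's documented output format, and the sample string is a stand-in rather than real model output.

```python
import json

# Hypothetical local tool; the name and signature are illustrative only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

# Registry mapping tool names the model may request to local functions.
TOOLS = {"get_weather": get_weather}

def dispatch(model_output: str) -> str:
    """Parse a JSON function call and run the matching local tool."""
    call = json.loads(model_output)
    name = call["name"]
    args = call.get("arguments", {})
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name}")
    return TOOLS[name](**args)

# Stand-in for text a model might emit; not real Gemma 4 output.
sample = '{"name": "get_weather", "arguments": {"city": "Austin"}}'
print(dispatch(sample))
```

Checking the requested name against a fixed registry keeps the model from triggering arbitrary code, which matters once an agent runs unattended on a device.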

Gemma 4 also supports high-quality offline code generation, so it can act as a local-first AI code assistant. This is particularly useful for developers working in environments with limited internet connectivity. The models are optimized for NVIDIA GPUs, where NVIDIA Tensor Cores accelerate AI inference workloads and deliver higher throughput and lower latency for local execution.

Gemma 4 models have large context windows: 128K tokens on the edge models and up to 256K on the larger models. This lets them take in more data at once and perform better on complex tasks. All models also natively process video and images at variable resolutions, supporting tasks such as optical character recognition (OCR) and chart understanding.
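As a rough illustration of what those context sizes mean in practice, the sketch below checks whether a prompt is likely to fit before it is sent to a model. It assumes the 128K and 256K figures are token counts with K = 1024, and it uses a four-characters-per-token rule of thumb that is not a property of Gemma 4's tokenizer. The names `CONTEXT_TOKENS`, `estimate_tokens`, and `fits` are hypothetical.

```python
# Context sizes from the article, read as token counts with K = 1024 (an assumption).
CONTEXT_TOKENS = {"edge": 128 * 1024, "large": 256 * 1024}

# Rough rule of thumb for English text; a real tokenizer will give different counts.
CHARS_PER_TOKEN = 4

def estimate_tokens(text: str) -> int:
    """Estimate the token count by ceiling-dividing the character count."""
    return -(-len(text) // CHARS_PER_TOKEN)

def fits(text: str, tier: str, reserve_for_output: int = 1024) -> bool:
    """Return True if the prompt plus reserved output room fits the tier's window."""
    return estimate_tokens(text) + reserve_for_output <= CONTEXT_TOKENS[tier]

# A document of about 200,000 estimated tokens.
long_document = "a" * (4 * 200_000)
print(fits(long_document, "edge"), fits(long_document, "large"))
```

An application that needs an exact answer would count tokens with the model's own tokenizer; the estimate here is only good enough for an early sanity check.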

The models are trained on more than 140 languages, which helps developers build inclusive applications for a global audience with diverse user bases. The 26B and 31B models are optimized for high-performance reasoning and developer workflows, making them well suited to complex application development.

Gemma 4 is available under the Apache 2.0 license, allowing developers to build on-device AI applications freely. This open-source approach encourages innovation and collaboration within the developer community. Additionally, LiteRT-LM enables Gemma 4 to run with a minimal memory footprint on constrained devices, making it accessible for a wider range of applications.

As AI development continues to evolve, Gemma 4 signals a shift toward more agentic experiences on-device. Observers say it could change how applications are built and deployed, particularly in environments where local processing is crucial. The excitement around the release reflects a growing recognition that on-device AI can improve user experiences and streamline workflows.

In summary, Gemma 4 represents a significant leap forward in on-device AI technology, providing developers with a powerful toolkit for creating innovative applications. With its advanced features and capabilities, Gemma 4 is set to transform the way AI is integrated into everyday technology.
