Technologies

Google Gemma Overview: New AI Models For Developers

By PlaysDev
Published: Jan 20, 2024

On February 21, Google announced Gemma 2B and 7B, open-source artificial intelligence models based on Gemini. Gemini line – neural networks used for various purposes: Gemini Nano, Gemini Pro and Gemini Ultra. Gemini 1.5 was also recently announced, being the faster version, but so far only for enterprises and developers.

Gemma
Thus, Gemini Ultra can recognize, analyze and generate texts, images, audio and video. With Gemini Pro, developers can work in their preferred environment: SDKs are available for Python, Android (Kotlin), Node.js, Swift and JavaScript.

Unlike Gemini, accessible through the API or Vertex AI, Gemma aims to attract a wider range of developers.

Let’s move on to the key points of the presentation:

  • Gemma is a family of models. There will be two models in sizes 2B and 7B.
  • Gemma will use the new Responsible Generative AI Toolkit to help prioritize the creation of AI apps.
  • The multi-framework Keras 3.0 provides compatibility with JAX, PyTorch and TensorFlow, which will allow developers to quickly switch platforms depending on their tasks.
  • Gemma is equipped with popular tools such as Hugging Face, MaxText, NVIDIA NeMo and TensorRT-LLM, and also uses Colab and Kaggle notebooks.
  • Gemma’s pre-trained and tuned models can run on your laptop, PC, or Google Cloud with easy deployment on Vertex AI and Google Kubernetes Engine (GKE).
  • Gemma boasts improved performance in a modest size, including NVIDIA GPUs and Google Cloud TPUs. Vertex AI provides a variety of MLOps tools with one-click configuration and deployment using built-in output optimizations.
  • Google says its Terms of Service will allow all organizations, regardless of size, to use Gemma. However, the tool is currently only suitable for English-speaking use.
  • The models will be free to use on the Kaggle platform, and new Google Cloud customers will be able to get a $300 discount on their deployment. For researchers, its size reaches $500 thousand.

Google emphasizes that Gemma becomes the best due to its size, the ability to run directly on a laptop or PC, and high key indicators. You can read more details about its performance in Google’s technical report.

Developers actively use neural networks when writing code, and in addition to Google tools, we found several more popular neural assistants: Copylot on the OpenAI Codex model, the well-known ChatGPT, Fig – a very useful tool for beginners who have not yet mastered all the functionality of programming languages and development patterns, Mintlify – that helps write code documentation.

The emergence of new technologies and neural tools always contributes to the evolution of development, and Gemma is no exception, offering developers new perspectives and opportunities.

You may also like

Technologies
2024-03-26
PlaysDev
MLOps as a methodology: how is it different from DevOps and DataOps?
Let's talk about the features of MLOps. What specialists use MLOps practices in their work and what are the responsibilities of ML engineers? As well as bringing up the main differences between DevOps, DataOps and MLops.
Читать
Expertise
2024-05-08
PlaysDev
8 Best Tech Podcasts of 2024: what to listen to stay up-to-date?
Look through a list of Top 8 Tech Podcast in 2024 to find something useful for yourself. Topics covered: Cloud, DevOps, Software Development, Project Management, HR, business in IT.
Читать
Expertise
2024-03-22
PlaysDev
Books for self-development – what to read for self-discipline
What to read for self-development: a list of useful books that are suitable for everyone. These books will help you develop self-discipline, expand your knowledge in the field of business and reach new heights in your professional activities, provided that you are striving for this! Suitable for employees, managers and students.
Читать
ServicesTechnologies
2023-11-21
PlaysDev
Datadog: A Brief Overview of Monitoring Platform
In this article I will look at the Datadog platform: its advantages and disadvantages, entry threshold, types and monitoring systems, and much more...
Читать
Expertise
2024-07-24
PlaysDev
DevSecOps: How is it different from DevOps?
What is DevSecOps? Huge Overview: best practices, lifecycle, main tools. Why security should be built into the development process?
Читать
Industries
2024-05-03
PlaysDev
Grades in IT – How can a DevOps engineer evaluate his grade?
How do IT specialists evaluate their experience and why is grade a very vague concept? We talk about grading using the examples of Google, Meta, Amazon.
Читать
Expertise
2024-07-31
PlaysDev
OKR vs. KPI – Which metrics should you choose for IT projects?
Guide to choosing metrics for IT projects: we talk about different approaches to managing achievements and results. Will be useful for Project Manager.
Читать
Technologies
2024-09-12
PlaysDev
Tech News 2024: Top 5 Interesting Releases
What's new in 2024: what digital solutions might you have missed? Open the article to learn about Microsoft Places, NVIDIA Superchip, and the updated AI assistant Copilot X on GitHub.
Читать
Industries
2024-03-20
PlaysDev
Mobile development trends in 2024: market overview and popular technologies
Spending on mobile apps has been growing steadily over the past 5 years, according to a report by Statista, while the number of new mobile users is also increasing. The main trends of 2024 were blockchain technology, multi-platform development, the use of biometric data, iBeacon.
Читать
Technologies
2024-06-28
PlaysDev
Mobile development: Should You Choose Native or Cross-platform?
Find out the advantages and disadvantages of each approach and how they impact the performance, user experience, and cost of mobile app development.
Читать