
Google Gemma Overview: New AI Models For Developers

By PlaysDev
Published: Jan 20, 2024

On February 21, Google announced Gemma 2B and 7B, open artificial intelligence models built on the same research and technology as Gemini. The Gemini line is a family of neural networks for different purposes: Gemini Nano, Gemini Pro and Gemini Ultra. Gemini 1.5, a faster version, was also announced recently, but so far it is available only to enterprises and developers.

Gemini Ultra, for example, can recognize, analyze and generate text, images, audio and video. With Gemini Pro, developers can work in their preferred environment: SDKs are available for Python, Android (Kotlin), Node.js, Swift and JavaScript.

Gemini is accessible only through the API or Vertex AI. Gemma, by contrast, is released as downloadable models and aims to attract a wider range of developers.

Here are the key points of the announcement:

  • Gemma is a family of models. It comes in two sizes, 2B and 7B.
  • Gemma is accompanied by the new Responsible Generative AI Toolkit, which provides guidance and tools for building safer AI apps.
  • Through the multi-framework Keras 3.0, Gemma is compatible with JAX, PyTorch and TensorFlow, so developers can quickly switch frameworks depending on their tasks.
  • Gemma integrates with popular tools such as Hugging Face, MaxText, NVIDIA NeMo and TensorRT-LLM, and comes with ready-to-use Colab and Kaggle notebooks.
  • Gemma’s pre-trained and tuned models can run on your laptop, PC, or Google Cloud with easy deployment on Vertex AI and Google Kubernetes Engine (GKE).
  • Gemma is optimized for several AI hardware platforms, including NVIDIA GPUs and Google Cloud TPUs, and delivers strong performance for its modest size. Vertex AI provides a variety of MLOps tools with one-click deployment using built-in inference optimizations.
  • Google says its terms of use allow all organizations, regardless of size, to use Gemma. However, the models are currently suited mainly to English-language tasks.
  • The models are free to use on the Kaggle platform, and new Google Cloud customers can get $300 in credits toward deployment. Researchers can apply for credits of up to $500 thousand.
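To make the Keras 3.0 point concrete, here is a minimal sketch of how switching frameworks might look. Keras 3 reads the `KERAS_BACKEND` environment variable before it is first imported. The helper names `select_backend` and `generate_with_gemma` are hypothetical and introduced only for illustration. The sketch also assumes that the `keras-nlp` package is installed, that Kaggle credentials for downloading the weights are configured, and that the `gemma_2b_en` preset is the one you want.

```python
import os

# Backend names accepted by Keras 3 for the frameworks mentioned in the list above.
SUPPORTED_BACKENDS = {"jax", "torch", "tensorflow"}


def select_backend(name: str) -> str:
    """Hypothetical helper: choose the Keras backend via KERAS_BACKEND.

    Must be called before Keras is imported for the first time,
    because Keras reads the variable at import time.
    """
    if name not in SUPPORTED_BACKENDS:
        raise ValueError(f"Unsupported backend: {name!r}")
    os.environ["KERAS_BACKEND"] = name
    return name


def generate_with_gemma(prompt: str, max_length: int = 64) -> str:
    """Hypothetical helper: load Gemma 2B through KerasNLP and generate text.

    Assumes keras-nlp is installed and the weights can be downloaded.
    The import is done lazily so the backend chosen above takes effect.
    """
    import keras_nlp

    model = keras_nlp.models.GemmaCausalLM.from_preset("gemma_2b_en")
    return model.generate(prompt, max_length=max_length)


if __name__ == "__main__":
    select_backend("jax")  # swap for "torch" or "tensorflow" as needed
    print(generate_with_gemma("Keras is a"))
```

Only the string passed to `select_backend` changes between frameworks; the model-loading code stays the same, which is the practical benefit the announcement describes.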

Google claims that Gemma is among the best models of its size: it can run directly on a laptop or PC and scores well on key benchmarks. You can read more details about its performance in Google’s technical report.
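A rough back-of-the-envelope calculation shows why models of this size can fit on a laptop or PC. The sketch below estimates only the memory needed to hold the weights. It assumes the nominal parameter counts implied by the model names (2 billion and 7 billion), and it ignores activations, caches and runtime overhead, so real requirements will be higher. The function name and the list of precisions are illustrative assumptions, not figures from Google.

```python
# Assumption: nominal sizes taken from the model names, not exact parameter counts.
NOMINAL_PARAMS = {
    "Gemma 2B": 2_000_000_000,
    "Gemma 7B": 7_000_000_000,
}

# Bytes needed to store one parameter at common numeric precisions.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gib(num_params: int, dtype: str) -> float:
    """Return the approximate weight-only memory footprint in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for model_name, num_params in NOMINAL_PARAMS.items():
        for dtype in BYTES_PER_PARAM:
            gib = weight_memory_gib(num_params, dtype)
            print(f"{model_name} @ {dtype}: ~{gib:.1f} GiB")
```

Under these assumptions, the 2B model needs under 4 GiB for its weights in bfloat16, while the 7B model needs roughly 13 GiB at the same precision, which is why lower-precision formats matter on consumer hardware.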

Developers actively use neural networks when writing code. In addition to Google’s tools, we found several other popular AI assistants: GitHub Copilot, built on the OpenAI Codex model; the well-known ChatGPT; Fig, a very useful tool for beginners who have not yet mastered all the functionality of programming languages and development patterns; and Mintlify, which helps write code documentation.

New technologies and AI tools always contribute to the evolution of software development, and Gemma is no exception: it offers developers new perspectives and opportunities.
