Google Gemma Overview: New AI Models For Developers

By PlaysDev
Published: Jan 20, 2024

On February 21, Google announced Gemma 2B and 7B, open artificial intelligence models built on the same research and technology as Gemini. The Gemini line is a family of neural networks for different purposes: Gemini Nano, Gemini Pro and Gemini Ultra. Gemini 1.5, a faster version, was also announced recently, but so far it is available only to enterprises and developers.

Gemini Ultra, for example, can recognize, analyze and generate text, images, audio and video. With Gemini Pro, developers can work in their preferred environment: SDKs are available for Python, Android (Kotlin), Node.js, Swift and JavaScript.

Unlike Gemini, which is accessible only through the API or Vertex AI, Gemma ships as downloadable models and is meant to attract a wider range of developers.

Let’s move on to the key points of the announcement:

  • Gemma is a family of models released in two sizes: 2B and 7B (roughly 2 and 7 billion parameters).
  • Gemma comes with the new Responsible Generative AI Toolkit, which provides guidance and tools for building safer AI applications.
  • Multi-framework support through Keras 3.0 provides compatibility with JAX, PyTorch and TensorFlow, so developers can quickly switch frameworks depending on their tasks.
  • Gemma integrates with popular tools such as Hugging Face, MaxText, NVIDIA NeMo and TensorRT-LLM, and comes with ready-to-use Colab and Kaggle notebooks.
  • Gemma’s pre-trained and instruction-tuned models can run on a laptop, a workstation or Google Cloud, with easy deployment on Vertex AI and Google Kubernetes Engine (GKE).
  • Gemma is optimized for several hardware platforms, including NVIDIA GPUs and Google Cloud TPUs, and delivers strong performance for its modest size. Vertex AI provides a variety of MLOps tools with one-click configuration and deployment using built-in inference optimizations.
  • Google says its terms of use allow all organizations, regardless of size, to use Gemma, including commercially. However, the models are currently suited mainly to English-language tasks.
  • The models are free to use on the Kaggle platform, and new Google Cloud customers can get $300 in credits toward deployment. Researchers can apply for Google Cloud credits of up to $500,000.
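
To illustrate the Keras 3.0 point from the list above, here is a minimal sketch of switching frameworks. It relies on the KERAS_BACKEND environment variable, which Keras 3 reads at import time. The helper names select_keras_backend and generate_with_gemma are our own, and the KerasNLP preset name "gemma_2b_en" is an assumption not taken from the announcement; loading it also assumes keras-nlp is installed and Kaggle access to the Gemma weights is configured.

```python
import os

# Backend names Keras 3 accepts for the three frameworks named above.
SUPPORTED_BACKENDS = ("jax", "torch", "tensorflow")


def select_keras_backend(name: str) -> str:
    """Set KERAS_BACKEND. Must run before the first `import keras`."""
    if name not in SUPPORTED_BACKENDS:
        raise ValueError(
            f"unsupported backend {name!r}; expected one of {SUPPORTED_BACKENDS}"
        )
    os.environ["KERAS_BACKEND"] = name
    return name


def generate_with_gemma(prompt: str, preset: str = "gemma_2b_en") -> str:
    """Load a Gemma preset through KerasNLP and generate a completion.

    Hypothetical helper: needs keras-nlp installed and downloads the
    weights on first use, so it is not called in this sketch.
    """
    # Imported lazily so the backend chosen above takes effect.
    import keras_nlp

    model = keras_nlp.models.GemmaCausalLM.from_preset(preset)
    return model.generate(prompt, max_length=64)


# Pick the framework first; swap "jax" for "torch" or "tensorflow".
select_keras_backend("jax")
```

After the backend is selected, a call such as generate_with_gemma("What is Keras?") would run the same model code on whichever framework was chosen, which is the kind of quick switch the announcement describes.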

Google emphasizes that Gemma delivers best-in-class performance for its size, can run directly on a laptop or desktop computer, and scores well on key benchmarks. You can read more details about its performance in Google’s technical report.
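
A rough back-of-the-envelope calculation shows why models of this size can fit on a laptop. The sketch below estimates the memory needed for the weights alone at a few common numeric precisions. It assumes the nominal parameter counts implied by the names 2B and 7B (the real counts differ somewhat) and ignores activations and other runtime overhead, so treat the figures as lower bounds rather than measurements.

```python
# Bytes needed to store one parameter at common precisions.
BYTES_PER_PARAM = {"float32": 4.0, "bfloat16": 2.0, "int8": 1.0, "int4": 0.5}

# Nominal sizes taken from the model names; an assumption, not exact counts.
NOMINAL_PARAMS = {"Gemma 2B": 2e9, "Gemma 7B": 7e9}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the memory needed for the weights alone, in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for model, params in NOMINAL_PARAMS.items():
    for dtype in BYTES_PER_PARAM:
        print(f"{model} @ {dtype}: {weight_memory_gb(params, dtype):.1f} GB")
```

Under these assumptions the 2B model needs about 4 GB for its weights in bfloat16, and the 7B model about 14 GB, dropping to about 3.5 GB if quantized to 4 bits.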

Developers actively use neural networks when writing code. Besides Google's tools, we found several other popular AI assistants: GitHub Copilot, built on the OpenAI Codex model; the well-known ChatGPT; Fig, a very useful tool for beginners who have not yet mastered all the functionality of programming languages and development patterns; and Mintlify, which helps write code documentation.

New technologies and AI tools always push software development forward, and Gemma is no exception: it offers developers new perspectives and opportunities.
