Google Gemini: Learn about the new innovation by Google

21 Nov 2023

Google's Gemini, celebrated by CEO Sundar Pichai as a groundbreaking innovation, has been creating ripples since its announcement. Following the impactful launch of ChatGPT in November 2023, Google took decisive steps, investing substantially to keep pace with the generative AI trend. This focused effort not only resulted in the introduction of Google Bard but also the revelation of Google Gemini. Pichai expressed his enthusiasm during the APEC CEO Conference, stating that they were building their next generation of models with Gemini. He mentioned being extraordinarily excited about the innovation ahead, expecting it to be a golden age of innovation. Pichai also expressed eagerness to bring all the innovations to more people.

What is Google Gemini?

Gemini is a suite of large language models (LLMs) utilizing training methodologies similar to those employed in AlphaGo, incorporating reinforcement learning and tree search techniques. It holds the potential to challenge ChatGPT's position as the leading generative AI solution globally. It emerged shortly after Google merged its Brain and DeepMind AI labs to form a new research entity called Google DeepMind. Following closely on the heels of Bard's launch and the introduction of its advanced PaLM 2 LLM, expectations suggest a potential release of Google Gemini in the autumn of 2023, although comprehensive details about its capabilities remain elusive.

In May 2023, Sundar Pichai posted a blog article providing a comprehensive introduction to Gemini. He mentioned that Gemini was meticulously developed to be multimodal, highlighting its efficient tool and API integrations. Additionally, it was designed to pave the way for future advancements, including innovations in memory and planning. Pichai emphasized that despite being in its early stages, Gemini already exhibits remarkable multimodal capabilities not seen in earlier models. Once fine-tuned and rigorously assessed for safety, Gemini will be available in various sizes and functionalities, similar to PaLM 2. Since then, official disclosures about its release have been limited. In a Wired interview, Demis Hassabis, the CEO of Google DeepMind, alluded to the capabilities of Gemini. He highlighted its fusion of AlphaGo's strengths with the remarkable language capabilities of large models. A source linked to the product, as reported by Android Police, hinted that Gemini is designed to generate text in conjunction with contextual images, leveraging sources like YouTube video transcripts.

Challenges for the future

Google's extensive effort to catch up with OpenAI, the creators of ChatGPT, appears more challenging than initially anticipated. In the early part of 2023, Google notified certain cloud clients and business partners that they would be granted access to the company's latest conversational AI, Gemini, by November 2023. However, the company recently notified them to expect it in the first quarter of 2024, revealing a significant challenge for Google, particularly amid the slowdown in its cloud sales growth. This contrasts with the accelerated growth of its larger rival, Microsoft, with a portion of Microsoft's success attributed to selling OpenAI's technology to its customer base.