This blog post is co-authored by our great contributor Thomas Vitale. OpenAI provides specialized models for speech-to-text and text-to-speech conversion, recognized for their performance and cost-efficiency. Spring AI integrates these capabilities via Voice-to-Text and Text-to-Speech (TTS). The new Audio Generation feature (gpt-4o-audio-preview) goes further, enabling mixed input and output modalities. Audio inputs can contain richer data than text alone. Audio can convey nuanced information like tone and inflection, and together with the audio outputs it enables asynchronous speech-to-speech…
This blog post is co-authored by our great contributor Thomas Vitale. Ollama now supports all GGUF models from Hugging Face, allowing access to over 45,000 community-created models through Spring AI's Ollama integration, runnable locally. spring-ai-ollama-huggingface-gguf2 We'll explore using this new feature with Spring AI. The Spring AI Ollama integration can automatically pull unavailable models for both chat completion and embedding models. This is useful when switching models or deploying to new environments. Setting Up Spring AI with Ollama Install Ollama on your system: https://ollama…
In the rapidly evolving world of artificial intelligence, developers are constantly seeking ways to enhance their AI applications.
Spring AI, a Java framework for building AI-powered applications, has introduced a powerful feature: the Spring AI Advisors. The advisors can supercharge your AI applications, making them more modular, portable and easier to maintain. If reading the post isn't convenient, you can listen to this experimental podcast, AI-generated from blog's content: What are Spring AI Advisors? At their core, Spring AI Advisors are components that intercept and potentially modify…
Spring AI now supports NVIDIA's Large Language Model API, offering integration with a wide range of models. By leveraging NVIDIA's OpenAI-compatible API, Spring AI allows developers to use NVIDIA's LLMs through the familiar Spring AI API. SpringAI-NVIDIA-API-5 We'll explore how to configure and use the Spring AI OpenAI chat client to connect with NVIDIA LLM API. The demo application code is available in the nvidia-llm GitHub repository. The SpringAI / NVIDIA integration documentation. Prerequisite Create NVIDIA account with sufficient credits. Select your preferred LLM model from NVIDIA's…
OpenAI recently introduced a powerful feature called Structured Outputs, which ensures that AI-generated responses adhere strictly to a predefined JSON schema. This feature significantly improves the reliability and usability of AI-generated content in real-world applications. Today, we're excited to announce that Spring AI (1.0.0-SNAPSHOT) has fully integrated support for OpenAI's Structured Outputs, bringing this capability to Java developers in a seamless, Spring-native way. Following diagram shows how the new Structured Outputs feature extends the OpenAI Chat API: Restored Spring AI (…
Faster information processing not only informs - it transforms how we perceive and innovate. Spring AI, a powerful framework for integrating AI capabilities into Spring applications, now offers support for Groq - a blazingly fast AI inference engine with support for Tool/Function calling. Leveraging Groq's OpenAI-compatible API, Spring AI seamlessly integrates by adapting its existing OpenAI Chat client.
This approach enables developers to harness Groq's high-performance models through the familiar Spring AI API. spring-ai-groq-integration We'll explore how to configure and use the Spring AI…
Earlier this week, Ollama introduced an exciting new feature: tool support for Large Language Models (LLMs). Today, we're thrilled to announce that Spring AI (1.0.0-SNAPSHOT) has fully embraced this powerful feature, bringing Ollama's function calling capabilities to the Spring ecosystem. Ollama's tool support allows models to make decisions about when to call external functions and how to use the returned data. This opens up a world of possibilities, from accessing real-time information to performing complex calculations. Spring AI takes this concept and integrates it seamlessly with the…
UPDATE: (04.06.2024) Adde snippets for using structured output with the new, fluent ChatClient API . UPDATE: (17.05.2024) Generic Types support for BeanOutputConverter added. Science works with chunks and bits and pieces of things with the continuity presumed, and Art works only with the continuities of things with the chunks and bits and pieces presumed. - Robert M. Pirsig The ability of LLMs to produce structured outputs is important for downstream applications that rely on reliably parsing output values.
Developers want to quickly turn results from an AI model into data types, such as…
UPDATE 20.07.2024 : Update Message API hierarchy diagram and update the model names supporting multimodality UPDATE 02.06.2024 : Add an additional code snippet showing how to use the new ChatClient API. Humans process knowledge, simultaneously across multiple modes of data inputs. The way we learn, our experiences are all multimodal. We don't have just vision, just audio and just text. These foundational principles of learning were articulated by the father of modern education John Amos Comenius, in his work, "Orbis Sensualium Pictus", dating back to 1658. orbis-sensualium-pictus2 "All things…
UPDATE: As of March 13, 2024, Mistral AI has integrated support for parallel function calling into their large model, a feature that was absent at the time of this blog's initial publication. Mistral AI, a leading developer of open-source large language models, unveiled the addition of Function Calling support to its cutting-edge models. Function Calling is a feature that facilitates the integration of LLM with external tools and APIs.
It enables the language model to request the execution of client-side functions, allowing it to access necessary run-time information or perform tasks…