
Follow the Stream | GenAI Issue 4.0 - February 2025 | Your Dedicated Community Content Scroll

Exploring AI Innovation in 2025 with Data Streaming

Welcome to the fourth issue of Follow the Stream GenAI! This edition explores the latest trends and predictions for GenAI and data streaming in 2025, focusing on the industry’s vision and expert insights. Covering topics like data quality, strategic alignment, and AI skilling, we unpack common industry challenges and consider emerging data streaming solutions.
See Industry Updates

GenAI & Data Streaming Fundamentals

Real-Time GenAI Crash Course


GenAI Sector Snapshot: General Updates

Available in German, French, and Spanish!

3 Data Engineering Trends Riding Apache Kafka®, Apache Flink®, and Apache Iceberg

The Apache Kafka, Flink, and Iceberg communities are continuously evolving, offering innovative ways for engineers to manage and process data at scale. From re-envisioning microservices with Flink streaming applications to enabling real-time AI model applications, these tools are reshaping data integration. With strong community contributions, especially to Iceberg, data governance and real-time analytics are set to accelerate, revolutionizing how businesses manage their data infrastructure.
Read Article

Highlights from Confluent AI Day 2024

Confluent AI Day 2024, held in San Francisco and virtually, brought together over 200 AI developers, tech leaders, and innovators to explore how data streaming enhances GenAI applications. Key highlights included a panel discussion on visionary AI, workshops from AWS and MongoDB, and a hackathon spotlighting AI applications for 3D customer service agents, churn prevention, and developer productivity tools.
Read Blog

Confluent Introduces Enterprise Data Streaming to MongoDB’s AI Applications Program

Confluent has joined MongoDB’s AI Applications Program (MAAP), enabling enterprises to leverage its real-time data streaming platform to enhance GenAI applications. Through seamless integration with MongoDB and tools like Apache Kafka® and Flink®, businesses can build a robust, up-to-date data foundation that powers sophisticated AI solutions, such as real-time chatbots, and rapidly deploy them using Confluent’s quickstart guide for AWS.
Read Blog

The Future for Software in 2025

The integration of AI and data streaming will continue to shape software development in 2025, as organizations seek practical, real-world applications for AI while navigating its challenges as both enabler and disruptor. This fusion emphasizes the growing importance of roles like data streaming engineers and marks the beginning of "Software 2.0," where applications will adapt and learn from usage, transforming workflows and productivity without constant recoding.
Read Article

Executive's Brief: AI Vision & Strategy

Predictive Analytics: How GenAI & Data Streaming Work Together to Forecast the Future

Predictive analytics, powered by generative AI and data streaming, is revolutionizing decision-making across industries. By applying machine learning models to historical data, predictive analytics helps businesses forecast future outcomes with greater accuracy. Generative AI enhances these predictions by generating diverse scenarios, filling data gaps, and adapting to real-time data for dynamic forecasting.
Read Blog

Generative AI and Kafka: Confluent’s Vision for Data Streaming

"Data streaming is a must-have technology, and if you're not using it for generative AI, you're doing it wrong," says Will LaForest, Field CTO at Confluent. While discussing the challenges and innovations in cloud-native Apache Kafka® services, LaForest and Rohit Vyas, Regional Vice President at Confluent, emphasize the importance of real-time data processing for powering real-time AI and transforming industries with seamless data flows that drive smarter decisions and operational efficiency.
Read Article

Don't Let Sensitive Data Become an Achilles’ Heel

As AI continues to shape the future of business, sensitive data can become a major vulnerability, jeopardizing both security and reputation. Confluent's Andrew Foo stresses the need for strong data governance frameworks to protect customer privacy and maintain regulatory compliance, ensuring that businesses can harness the power of real-time data and AI while safeguarding against potential risks.
Read Article

Why Great AI Products Are All About the Data

Great AI products rely on the quality and relevance of data to drive accurate and impactful insights. As Shaun Clowes discusses on Lenny’s Podcast, it’s crucial to view data not as the definitive answer but as a tool to disprove hypotheses and refine decision-making. The key lies in having the right data at the right time, which can significantly enhance AI-driven innovation for businesses.
Listen to Podcast

Unlocking the Full Potential of Real-Time GenAI with Confluent and Google Cloud

Although Google is at the forefront of the global AI transformation, the full potential of GenAI can only be harnessed through real-time data integration. Confluent's cloud-native data streaming platform addresses this challenge by seamlessly integrating with Google's GenAI to enhance accuracy, scalability, and reliability. This integration drives real-world applications like personalized experiences, fraud detection, and operational efficiency.
Watch Online Talk

GenAI Ethics & Impact

Align AI by Design: Harnessing AI with Data Governance to Minimize Risks and Maximize Impact

The issue of “AI misalignment,” although new to many business leaders, is predicted to make headlines in the near future because of the diverse business risks it presents. Misalignment occurs when AI is trained on data that doesn’t accurately reflect real-world objectives, leading to unintended consequences. The “align by design” approach addresses these issues by emphasizing proactive AI design, transparency, and data governance.
Listen to Podcast

Building Trust in AI Means Building Trust in Data

The 2023 Executive Order on Safe, Secure, and Trustworthy AI emphasizes building trust in AI through better data management, risk frameworks, and data stewardship. Discussions at the Public Sector Summit highlighted the role of data streaming in maintaining real-time risk assessment and preventing data bias, ultimately helping organizations create trustworthy AI solutions.
Read Blog

Architect's Blueprint: AI-Driven Design

Integrating OpenAI with BigQuery

Integrating OpenAI with BigQuery unlocks powerful possibilities for AI-powered data analysis and advanced insight generation. OpenAI BigQuery integration enables businesses to transform their data warehouses into intelligent analytics powerhouses. By connecting OpenAI to BigQuery, organizations can seamlessly incorporate cutting-edge AI capabilities into their existing data infrastructure, allowing them to uncover patterns, trends, and predictions that were previously difficult or impossible to identify.
Read Blog

Data Streaming in Action at Lufthansa | IT Modernization, Data Sharing, AI

Lufthansa, one of the largest airlines in Europe, leverages data streaming for diverse use cases in real-time analytics, AI, and machine learning applications. Their Kafka Unified Streaming Cloud Operations (KUSCO) project, powered by stream processing, ensures that real-time, high-quality data is available across the enterprise. This enables real-time analytics and AI to enhance customer experiences, “from predictive maintenance to personalized travel experiences,” and supports ML models for efficient fleet management. 
Read the full story, and many more, in Kai Waehner's new book, The Ultimate Data Streaming Guide, available as a free download.
Download Ebook
Available in German, French, and Spanish!

Supercharging Ad Campaign Performance with Generative AI + Confluent

Success in advertising depends on making quick, data-driven decisions. As advertising platforms generate massive amounts of real-time data, organizations need sophisticated tools to analyze and act on this information effectively. GenAI and data streaming offer a powerful solution for automating campaign tasks, creating personalized content, and adjusting in real time based on performance data.
Read Blog

Real-Time Model Inference with Apache Kafka and Flink for Predictive AI and GenAI

Artificial intelligence and machine learning are transforming business operations by enabling systems to learn from data and make intelligent decisions for predictive and generative AI use cases. Confluent’s Field CTO Kai Waehner covers the basics of model inference, comparing different approaches like remote and embedded inference, and explores how data streaming with Apache Kafka® and Flink® enhances the performance and reliability of these predictions.
Read Blog

Stop Treating Your LLM Like a Database

LLMs, unlike databases, are extroverts that thrive on engaging, synthesizing, and proactively contributing based on real-time data. Batch processing limits their potential by holding information back until asked, whereas real-time data empowers LLMs to continuously adapt and deliver dynamic, context-rich insights.
Read Article

Developer's Desk: AI Toolkit

Building AI and Real-Time Applications Using Confluent and SingleStore: Cybersecurity Use Case Demo

Real-time threat detection analyzes network logs using a machine learning model trained to classify traffic as benign or malicious. Streaming Apache Kafka® data is compared against a million pre-labeled records stored in SingleStore, using embedding vectors and a Euclidean distance metric, to classify threats in real time.
Watch on YouTube
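
For readers who want a feel for the distance-based classification described above, here is a minimal sketch in plain Python. It is not the demo's code: the reference vectors, the labels, the event shape, and the function names `classify` and `classify_stream` are all assumptions made for illustration, and a small in-memory list stands in for the labeled records that the demo keeps in SingleStore.

```python
import math

# Hypothetical pre-labeled reference embeddings. In the demo these would be
# the labeled records stored in the database; here they are toy 3-d vectors.
LABELED = [
    ((0.10, 0.20, 0.10), "benign"),
    ((0.20, 0.10, 0.00), "benign"),
    ((0.90, 0.80, 0.95), "malicious"),
    ((0.85, 0.90, 0.80), "malicious"),
]


def classify(embedding, labeled=LABELED):
    """Return the label of the nearest reference vector (1-nearest-neighbor)."""
    # math.dist computes the Euclidean distance between two points.
    _, label = min(labeled, key=lambda record: math.dist(record[0], embedding))
    return label


def classify_stream(events):
    """Classify each incoming event as it arrives.

    `events` is any iterable of dicts with hypothetical keys "id" and
    "embedding"; in a real pipeline it would be fed by a Kafka consumer.
    """
    for event in events:
        yield event["id"], classify(event["embedding"])


if __name__ == "__main__":
    incoming = [
        {"id": "e1", "embedding": (0.15, 0.15, 0.05)},
        {"id": "e2", "embedding": (0.90, 0.90, 0.90)},
    ]
    for event_id, label in classify_stream(incoming):
        print(event_id, label)
```

A linear scan like this only works for toy data. At the scale of a million records, the nearest-neighbor lookup would normally be delegated to a vector-capable database or index.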

GenAI Demo with Kafka, Flink, LangChain, and OpenAI

GenAI is transforming industries by automating tasks and driving innovation. Kai Waehner, Confluent’s Global Field CTO, explores a simple but powerful architecture that combines Python, LangChain with OpenAI's LLM, Apache Kafka®, and Apache Flink®. “The use case shows how data streaming and GenAI help to correlate data from Salesforce CRM, searching for lead information in public datasets like Google and LinkedIn, and recommending ice-breaker conversations for sales reps.”

Generative AI Meets Data Streaming (Part I) – Data as the Engine: Building the AI Fundamentals

While GenAI focuses on generating new content from vast, unstructured data, data streaming platforms enable the real-time flow of data and enhance data governance, ensuring AI models receive the most current and relevant information. This integration addresses key AI challenges, such as data fragmentation and inconsistent quality, and unlocks new potential for both predictive and generative AI.
Read Blog

Event-Driven AI: Building a Research Assistant with Kafka and Flink

Although the rise of agentic AI has sparked interest in agents that autonomously execute complex workflows, real-world implementations face challenges like scalability and maintenance. The solution, demonstrated by the PodPrep AI research assistant, lies in adopting an event-driven architecture (EDA), which decouples system components and allows them to interact more flexibly, promoting scalability and responsiveness.
Read Article
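
To illustrate the decoupling that an event-driven architecture provides, the following sketch uses a tiny in-memory event bus. It is an illustration only, not PodPrep AI's implementation: the `EventBus` class, the topic names, and the two handlers are hypothetical, and the bus delivers events synchronously, whereas a broker such as Kafka delivers them asynchronously and durably.

```python
from collections import defaultdict


class EventBus:
    """Minimal synchronous publish/subscribe bus (a stand-in for a broker)."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        # Register a callable to receive every event published to `topic`.
        self._subscribers[topic].append(handler)

    def publish(self, topic, event):
        # Deliver the event to each subscriber; the publisher never needs
        # to know who is listening.
        for handler in list(self._subscribers[topic]):
            handler(event)


bus = EventBus()
briefs = []


def research_agent(event):
    # Hypothetical agent: reacts to a request and emits its findings as a
    # new event instead of calling the next component directly.
    notes = f"background on {event['guest']}"
    bus.publish("research.done", {"guest": event["guest"], "notes": notes})


def brief_writer(event):
    # Hypothetical downstream component: only knows the event's shape.
    briefs.append(f"Brief for {event['guest']}: {event['notes']}")


bus.subscribe("research.requested", research_agent)
bus.subscribe("research.done", brief_writer)
bus.publish("research.requested", {"guest": "Ada"})
print(briefs)
```

Because the two handlers share only topic names and event shapes, either one can be replaced, scaled out, or joined by additional subscribers without touching the other.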

Three AI Trends Developers Need to Know in 2025

Interest in AI has surged since 2020, with 81% of IT leaders prioritizing AI and machine learning in their 2024 budgets. However, the widespread success of AI depends on whether businesses can equip engineers with the right skills, tools, and trustworthy data. One notable trend of 2025 is agentic AI, which is becoming more capable of independent decision-making and thereby presenting new challenges for developers.
Read Blog

Learn About Data Streaming With Apache Kafka® and Apache Flink®

Apache Kafka®: A high-throughput, low-latency distributed event streaming platform. Available locally or fully managed via Apache Kafka on Confluent Cloud.

Apache Flink®: High-performance stream processing at any scale. Available via Confluent Cloud for Apache Flink.

Explore Developer Hub

Request Flink Workshop or Tech Talk


Innovation & Research: AI Revolution

AI and Data Sovereignty Set to Lead Business Innovations in 2025

The AI landscape is shifting toward innovations that address the challenges of building LLMs. A promising development in 2025 is agentic AI, which goes beyond generative AI by offering proactive, automated solutions, such as autonomously stabilizing IT systems. Data sovereignty is another key focus, leading to the adoption of sovereign clouds and private data centers to ensure local data compliance and security.
Read Article

Join the Community

Sign up for updates below and check out previous issues!

Share content suggestions and new use cases in the comments section.