News Froggy
newsfroggy
HomeTechReviewProgrammingGamesHow ToAboutContacts
newsfroggy

Your daily source for the latest technology news, startup insights, and innovation trends.

More

  • About Us
  • Contact
  • Privacy Policy
  • Terms of Service

Categories

  • Tech
  • Review
  • Programming
  • Games
  • How To

© 2026 News Froggy. All rights reserved.

TwitterFacebook
Tech

Definity Embeds Agents in Spark Pipelines to Prevent AI System

Definity, a Chicago-based startup, secured $12M in Series A funding to advance its unique data pipeline reliability solution. By embedding agents directly within Spark pipelines, Definity proactively identifies and prevents failures, bad data, and inefficiencies during execution, crucial for the integrity of agentic AI systems.

PublishedApril 30, 2026
Reading Time5 min
Definity Embeds Agents in Spark Pipelines to Prevent AI System

Definity, a Chicago-based data pipeline operations startup, announced on Wednesday, April 29, 2026, it has secured $12 million in Series A funding. The investment, led by GreatPoint Ventures with participation from Dynatrace, StageOne Ventures, and Hyde Park Venture Partners, will fuel Definity's mission to revolutionize data pipeline reliability. The company's innovative approach embeds intelligent agents directly within Spark and DBT pipelines, proactively catching and preventing failures, bad data, and inefficiencies during execution—a critical advancement for ensuring the integrity of data feeding increasingly vital agentic AI systems.

Why Existing Pipeline Monitoring Falls Short

Traditional data pipeline monitoring tools typically operate from outside the execution layer, gathering metrics only after a job has completed. Solutions from companies like Datadog (which acquired Metaplane), Databricks system tables, Unravel Data, and Acceldata provide valuable insights, but often after the damage is done. According to Roy Daniel, CEO and co-founder of Definity, this "after-the-fact" approach means that by the time a problem is identified, the pipeline has already run, potentially propagating bad data downstream, wasting compute resources, and ultimately breaking AI systems reliant on timely, clean input. This reactive posture is no longer sufficient for the demands of modern, AI-driven enterprises where data quality and availability are paramount.

Definity's In-Execution Intelligence

Definity differentiates itself by integrating its proprietary agents directly into the pipeline's execution layer. This is achieved through inline instrumentation, where a JVM agent is installed with a single line of code, operating below the platform layer to pull real-time execution data directly from Spark.

These agents capture a comprehensive range of critical metrics as the pipeline runs, including query execution behavior, memory pressure, data skew, shuffle patterns, and infrastructure utilization. Crucially, the system dynamically infers data lineage between pipelines and tables without requiring a predefined data catalog, providing a full-stack, real-time, and production-aware context.

Beyond mere observation, Definity's agents can actively intervene during a pipeline run. This includes modifying resource allocation dynamically, stopping a job before corrupt data can propagate further, or preempting a pipeline based on detected upstream data conditions. Daniel cited an instance where an agent prevented a downstream pipeline from starting because an upstream job had been preempted, leading to stale input data. While detection and prevention occur in real-time, comprehensive root cause analysis and optimization recommendations are generated on-demand when an engineer queries the assistant, utilizing the already-assembled execution context. The agent's overhead is minimal, adding approximately one second of compute to an hour-long run, and supports full on-premises deployment for sensitive environments by only transmitting metadata externally.

Real-World Impact at Nexxen

Nexxen, an ad tech platform that manages large-scale, on-premises Spark pipelines for mission-critical advertising workloads, has already experienced the tangible benefits of Definity's platform. Dennis Meyer, Director of Data Engineering at Nexxen, explained that their primary challenge wasn't frequent pipeline failures, but rather the cumulative cost of inefficiencies within a non-elastic, on-premises environment where waste directly impacts costs.

Existing monitoring tools provided fragmented visibility, making systematic optimization difficult. Upon deploying Definity without requiring any pipeline code changes, Nexxen quickly gained full-stack visibility. Meyer reported that his team identified 33% of its optimization opportunities within the first week, leading to a remarkable 70% reduction in engineering effort spent on troubleshooting and optimization. This operational efficiency freed up infrastructure capacity, enabling Nexxen to support increasing workload demands without additional hardware investments. Meyer underscored the shift: "The key shift was moving from reactive troubleshooting to proactive, continuous optimization. At scale, the biggest gap often isn't tooling — it's actionable visibility."

Implications for Enterprise Data Teams

Definity's approach signifies a crucial evolution for enterprise data teams, particularly those operating production Spark environments. As data pipelines increasingly underpin agentic AI workloads with direct business dependencies, the consequences of failures escalate from mere inconvenience to blocking critical AI delivery. This transformation elevates pipeline operations into a fundamental AI infrastructure challenge.

The proven ability to significantly reduce troubleshooting and optimization effort, as demonstrated by Nexxen's 70% reduction, highlights a substantial recoverable cost. For lean data engineering teams, reclaiming this time to focus on strategic roadmap initiatives presents a compelling immediate case for evaluating in-execution intelligence solutions like Definity. This paradigm shift from reactive post-mortem analysis to proactive, in-run intervention is set to redefine data reliability and operational efficiency in the era of pervasive AI.

FAQ

Q: How does Definity's approach differ from traditional data pipeline monitoring tools?

A: Traditional tools typically monitor pipelines externally and report issues after a job has completed. Definity embeds intelligent agents inside the pipeline's execution layer (via a JVM agent), allowing for real-time capture of execution data and proactive intervention, such as stopping a job or modifying resources, before failures or bad data propagate downstream.

Q: What specific benefits have early Definity users, like Nexxen, reported?

A: Nexxen, an ad tech platform, identified 33% of its optimization opportunities within the first week of deployment. They also saw a 70% reduction in engineering effort dedicated to troubleshooting and optimization, significantly freed up infrastructure capacity, and could support workload growth without additional hardware investment. Definity also claims customers resolve complex Spark issues up to 10x faster.

Q: Why is Definity's solution particularly important for agentic AI systems?

A: Agentic AI systems critically depend on a continuous supply of clean, accurate, and timely data. A data pipeline that delivers stale or faulty data, or fails silently, directly impairs or breaks the AI system relying on it. Definity's ability to prevent these issues in real-time ensures the foundational data integrity required for reliable and effective AI operations.

#Orchestration#Infrastructure#Data#Security#AI#Spark

Related articles

Proton CEO on AI Privacy: Possible, But Agents Keep Him Up
Review
ZDNetApr 30

Proton CEO on AI Privacy: Possible, But Agents Keep Him Up

Quick Verdict In an era where Artificial Intelligence (AI) and Big Tech are increasingly eroding personal privacy, Proton CEO Andy Yen presents a nuanced yet optimistic view: privacy in the AI era is indeed possible.

Sniffies Secures $100M Match Group Investment for Sex-Positive Tech
Tech
GeekWireApr 29

Sniffies Secures $100M Match Group Investment for Sex-Positive Tech

Seattle’s Sniffies lands $100M investment from Match Group in major bet on sex-positive tech Seattle-based Sniffies, a prominent meetup platform for gay, bisexual, and sexually curious men, has secured a substantial

Ubuntu Linux to Integrate AI Features Through 2026
Tech
The VergeApr 28

Ubuntu Linux to Integrate AI Features Through 2026

Canonical has revealed its strategy to integrate AI features into Ubuntu Linux throughout 2026. The plan includes enhancing existing OS functions with background AI models and introducing new AI-native tools, such as advanced accessibility features and agentic AI. Canonical emphasizes model transparency and local inference, aiming to make Linux more accessible without transforming Ubuntu into an "AI product."

DeepMind’s David Silver Just Raised $1.1B for AI That Learns Without
Tech
TechCrunch AIApr 28

DeepMind’s David Silver Just Raised $1.1B for AI That Learns Without

DeepMind veteran David Silver has secured an unprecedented $1.1 billion in funding for his new British AI lab, Ineffable Intelligence, at a $5.1 billion valuation. The company aims to build a "superlearner" AI that acquires knowledge and skills purely through reinforcement learning, without relying on human data, a radical departure from current large language models.

Philips Hue Sync Box 8K Slashed by 30% in 'Bright Days' Sale
Tech
The VergeApr 27

Philips Hue Sync Box 8K Slashed by 30% in 'Bright Days' Sale

Smart home enthusiasts and gamers can rejoice as the Philips Hue Play HDMI Sync Box 8K is now available at a significant 30 percent discount, bringing its price down to $269.49. This substantial offer, part of Philips

Google Expands Gradient Icon Redesign to More Key Apps
Tech
The VergeApr 26

Google Expands Gradient Icon Redesign to More Key Apps

Google is rolling out its new gradient icon design to more apps like Sheets, Slides, and Keep. This update, which started in late 2025 with apps like Gemini, features softer gradients, rounder corners, and a more vibrant, varied aesthetic. It marks a shift from flat designs and uniform circles, with the new look also reportedly signaling the presence of AI-powered features.

Back to Newsroom

Stay ahead of the curve

Get the latest technology insights delivered to your inbox every morning.