Toward Real-Time Medallion Architecture

One of the biggest advantages of open table formats like Delta Lake and Apache Iceberg is that you can query the same data with multiple engines. No vendor lock-in, just pick the best tool for each job.

Case in point: We’re working with a customer who built their Medallion architecture using Apache Spark and Delta Lake. But as their business evolved, so did user expectations—especially around latency. Spark’s batch processing couldn’t keep up with the demand for fresh insights.

The solution? We replaced periodic Spark jobs with the Feldera Incremental View Maintenance (IVM) engine. Feldera picks up changes to lower-tier tables and updates higher-tier tables in real-time, reducing the end-to-end latency of the pipeline from hours to minutes.

Prior to open table formats, in the world of legacy databases, such a migration would require completely re-architecting the entire data stack in order to use a new query engine. But thanks to the use of Delta Lake, the transition was completely seamless to the rest of the company:

✅ The data hasn't moved and can be read using Spark, Trino, etc., just like before the migration.
✅ Same ingestion pipeline from Kafka to the Bronze layer.
✅ Existing BI dashboards reading the Gold layer stayed untouched—only now they show near-real-time data!

Want to see how it works?
Watch Gerd Zellweger's webinar on replacing batch analytics with Feldera.

Toward Real-Time Medallion Architecture

Other articles you may like

Database computations on Z-sets

Implementing Batch Processes with Feldera

Feldera: three tools for the price of one