Upserting Data using Spark and Iceberg

25.05.2023

•

Jonathan Merlevede

Use Spark and Iceberg’s MERGE INTO syntax to efficiently store daily, incremental snapshots of a mutable source table.

Latest

Stop loading bad quality data

Ingesting all data without quality checks leads to recurring issues. Prioritize data quality upfront to prevent downstream problems.

A 5-step approach to improve data platform experience

Boost data platform UX with a 5-step process:gather feedback, map user journeys, reduce friction, and continuously improve through iteration

From Good AI to Good Data Engineering. Or how Responsible AI interplays with High Data Quality

Responsible AI depends on high-quality data engineering to ensure ethical, fair, and transparent AI systems.

What we do

Resources

Cases

About us

Belgium

Vismarkt 17, 3000 Leuven - HQ
Borsbeeksebrug 34, 2600 Antwerpen

Vat. BE.0667.976.246

Germany

Spaces Kennedydamm,
Kaiserswerther Strasse 135, 40474 Düsseldorf, Germany

What we do

Resources

Cases

About us

Belgium

Vismarkt 17, 3000 Leuven - HQ
Borsbeeksebrug 34, 2600 Antwerpen

Vat. BE.0667.976.246

Germany

Spaces Kennedydamm, Kaiserswerther Strasse 135, 40474 Düsseldorf, Germany

What we do

Resources

Cases

About us

Belgium

Vismarkt 17, 3000 Leuven - HQ
Borsbeeksebrug 34, 2600 Antwerpen

Vat. BE.0667.976.246

Germany

Spaces Kennedydamm, Kaiserswerther Strasse 135, 40474 Düsseldorf, Germany

What we do

Resources

Cases

About us

Select Language