Data Engineering Hub
GitHub Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode

Idempotence

Idempotence in the context of data engineering means that if you execute a data pipeline multiple times with the same input, the output will stay the same.

Idempotence Advantages

  • Keeps data duplicate-free
  • Can remove stale data
  • Saves on storage and cost