Why You Should Prefer MERGE INTO Over INSERT OVERWRITE in Apache Iceberg
The article argues for preferring MERGE INTO with Merge-on-Read in Apache Iceberg over INSERT OVERWRITE for many update workloads. It explains COW vs MOR, shows how MOR reduces I/O and compute costs (with an EMR+S3 example), warns about INSERT OVERWRITE pitfalls when partitioning evolves, and outlines necessary compaction and maintenance best practices. It also highlights Iceberg v3 deletion vectors as an improvement for high-churn update workloads.