The original pandas script worked great in the notebook. In production it now takes 6 hours and OOMs on the larger partitions, and the engineer who wrote it has left. The team is staring at Polars, DuckDB, Spark, and Dask, wondering which to migrate to and whether a rewrite is worth the cost compared with throwing bigger machines at the problem.