Reduce peak memory usage during release builds to fix OOM on manylinux runners by kevinjqliu · Pull Request #1445 · apache/datafusion-python

kevinjqliu · 2026-03-27T01:18:39Z

Which issue does this PR close?

Follow up to #1443
Closes #1429

Rationale

As the dependency tree has grown (DataFusion + Substrait + Arrow + object_store with aws/gcp/azure/http features), the release build's peak memory during LTO linking has exceeded what the GitHub runner can provide.

Fixes OOM (Killed process ... (rustc) total-vm:25086084kB, anon-rss:15361808kB) during manylinux x86_64 release builds, where rustc consumed ~15 GB and exhausted the runner's memory.

What changes are included in this PR?

Cargo profile (Cargo.toml):

Switch from fat LTO (lto = true) to thin LTO (lto = "thin") -- this is the biggest win, reducing peak memory by ~50-70% since LLVM no longer needs to merge all bitcode into a single module
Increase codegen-units from 1 to 2 -- splits LLVM's workload, further reducing peak RSS

CI workflow (.github/workflows/build.yml):

Add 8 GB swap to the build-manylinux-x86_64 job as a safety net (matching the existing pattern in the aarch64 job)
Reduce build-manylinux-aarch64 swap from 16 GB to 8 GB for consistency

Tradeoffs

Thin LTO + codegen-units=2 may produce binaries that are ~1-4% slower in micro-benchmarks vs fat LTO + codegen-units=1. In practice, this is unlikely to be measurable for a Python extension where the Python-Rust FFI boundary and PyArrow serialization dominate execution time.

kevinjqliu · 2026-03-27T01:20:15Z

.github/workflows/build.yml

          sudo swapoff -a || true
          sudo rm -f /swapfile
-          sudo fallocate -l 16G /swapfile || sudo dd if=/dev/zero of=/swapfile bs=1M count=16384
+          sudo fallocate -l 8G /swapfile || sudo dd if=/dev/zero of=/swapfile bs=1M count=8192


we dont need all 16GB, take less disk space

kevinjqliu added 2 commits March 26, 2026 18:14

adjust swap to 8gb

83138ef

modify profile.release

634e32d

kevinjqliu commented Mar 27, 2026

View reviewed changes

kevinjqliu mentioned this pull request Mar 27, 2026

main branch has errors #1429

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce peak memory usage during release builds to fix OOM on manylinux runners#1445

Reduce peak memory usage during release builds to fix OOM on manylinux runners#1445
kevinjqliu wants to merge 2 commits intoapache:mainfrom
kevinjqliu:kevinjqliu/more-ci-optimizations

kevinjqliu commented Mar 27, 2026 •

edited

Loading

Uh oh!

kevinjqliu Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

kevinjqliu commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale

What changes are included in this PR?

Tradeoffs

Uh oh!

kevinjqliu Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kevinjqliu commented Mar 27, 2026 •

edited

Loading