Fix process abort on large float format precision by changjoon-park · Pull Request #7633 · RustPython/RustPython

changjoon-park · 2026-04-19T17:14:11Z

Summary

Formatting a float with a large precision aborts the interpreter instead of raising a Python exception. CPython returns a clean string on the same input.

# Before
$ ./rustpython -c "print(f'{1.5:.1000000}')"
thread 'main' panicked at crates/literal/src/float.rs:135:
Formatting argument out of range              # → exit 101 (process abort)

# After
$ ./rustpython -c "print(f'{1.5:.1000000}')"
1.5

Root cause

Rust's format!("{:.*}", n, x) macro panics when n exceeds the fmt runtime's internal precision limit. format_fixed in crates/literal/src/float.rs already caps n at u16::MAX before calling format!, but its sibling functions format_general and format_exponent — and the FormatType::Percentage branch in crates/common/src/format.rs — pass user-supplied precision straight through. A one-line user script (f"{x:.N}" with N ≳ 65535) triggers the abort.

Affected format types:

Type	Before	After
`f` (fixed)	OK (already capped)	OK
`e` / `E` (exponential)	abort	OK
`g` / `G` (general)	abort	OK
`%` (percent)	abort	OK
default (no type)	abort (routes to `g`)	OK

Fix

Add FMT_MAX_PRECISION + clamp_fmt_precision() helper at module level in float.rs.
Cap is u16::MAX - 1, not u16::MAX — {:.*e} hits a second assertion (ndigits > 0 in core::num::flt2dec) at exactly u16::MAX; the smaller value covers both {:.*} and {:.*e} uniformly.
Apply the helper to:
- format_fixed (replacing existing ad-hoc cap — consistency only)
- format_exponent (new, at function entry)
- format_general (new, at each of three internal format! calls, with saturating arithmetic on derived precision values)
- FormatType::Percentage branch in common/src/format.rs (new)

Complex-number formatting and old-style %-formatting dispatch to the same library functions, so they transitively benefit without separate changes.
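
The clamp described above can be modelled in a few lines. This is a Python sketch of the logic only, not the Rust code from the PR: the names `FMT_MAX_PRECISION` and `clamp_fmt_precision` come from the description, while `saturating_sub` is a hypothetical stand-in for Rust's `usize::saturating_sub`.

```python
# Python model of the precision clamp; the real helper is written in Rust
# (crates/literal/src/float.rs). The cap value follows the PR description.
U16_MAX = 0xFFFF
FMT_MAX_PRECISION = U16_MAX - 1  # cap used by the first version of the fix


def clamp_fmt_precision(precision: int) -> int:
    """Limit a user-supplied precision to the cap."""
    return min(precision, FMT_MAX_PRECISION)


def saturating_sub(a: int, b: int) -> int:
    """Hypothetical stand-in for Rust's usize::saturating_sub: never below 0."""
    return max(a - b, 0)


# format_general derives `precision - 1` for its exponent form, so the
# subtraction has to saturate before the clamp is applied.
print(clamp_fmt_precision(1_000_000))
print(clamp_fmt_precision(saturating_sub(0, 1)))
```

Clamping at every call site, rather than once at the parser, keeps each `format!` call safe even when a derived precision is computed locally.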

Why the cap is safe

f64 round-trips with ~17 significant decimal digits, and the exact decimal expansion of any finite double terminates after a bounded number of digits. Precision beyond that point produces padding zeros (for f/e/%) or is silently trimmed (for g). Capping at ~65K is far beyond any user-meaningful precision and matches the existing format_fixed behavior already shipping in main.
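
The safety argument rests on every finite double having a terminating decimal expansion that is far shorter than the cap. A minimal way to check this in CPython is to convert a float to `decimal.Decimal`, which is exact. The helper name `exact_fraction_digits` is made up for this illustration.

```python
from decimal import Decimal


def exact_fraction_digits(x: float) -> int:
    """Number of digits after the decimal point in the exact value of x."""
    # Decimal(float) converts exactly, with no rounding.
    return max(-Decimal(x).as_tuple().exponent, 0)


for value in (1.5, 0.1, 5e-324):
    print(value, exact_fraction_digits(value))

# Past the last exact digit, CPython's fixed format emits only zeros.
text = "{:.60f}".format(0.1)
print(text[-5:])
```

Even the smallest subnormal needs only about a thousand fractional digits, which is well under a cap near 65K.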

Verification

$ cargo +1.94.0 build --release
    Finished `release` profile [optimized] target(s) in 40.54s

$ ./target/release/rustpython -m test test_float test_fstring test_format
All 3 tests OK.
Total tests: run=162

$ ./target/release/rustpython extra_tests/snippets/builtin_format.py
# (all assertions pass, including 7 new regression cases)

Reliability audit (beyond the trigger path)

Probed after the fix:

40 magnitude/type combinations: 10 values × 4 format types at precision 200_000 — 0.0, ±1.5, ±inf, nan, 1e-300, 1e300, f64::MAX, 5e-324. All return clean strings.
Boundary precisions: 0 / 1 / 2 for each format type — outputs match expected ('{:.0g}'.format(1.5) == '2', etc.).
Complex numbers: five format specs including .200000e — all OK (transitively via format_exponent).
Old-style % formatting: '%.200000e' % 1.5 etc. — all OK (transitively via cformat.rs → format_fixed/format_exponent/format_general).
Combined format specs: fill + align + width + precision, sign + precision, alternate form + precision, zero-pad + precision, grouping + precision — all OK.
Defense-in-depth range: RustPython's format-spec parser rejects precision > i32::MAX with ValueError("Precision too big"), so the guarded interval [0, i32::MAX] is now panic-free end-to-end.
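
The magnitude/type probe above can be sketched as a small script. This is an illustration rather than the script used for the audit: the ten values are taken from the list above, but the choice of `f`, `e`, `g` and `%` as the four format types is an assumption, and `probe` is a hypothetical name.

```python
import itertools

# The ten magnitudes listed in the audit; f64::MAX is written as a literal.
VALUES = [0.0, 1.5, -1.5, float("inf"), float("-inf"), float("nan"),
          1e-300, 1e300, 1.7976931348623157e308, 5e-324]
# Assumption: the four format types probed were f, e, g and %.
TYPES = ["f", "e", "g", "%"]
PRECISION = 200_000


def probe(values=VALUES, types=TYPES, precision=PRECISION):
    """Format every value with every type; return the number of clean results."""
    ok = 0
    for value, kind in itertools.product(values, types):
        result = format(value, f".{precision}{kind}")
        # A clean result is a non-empty string; a panic would never get here.
        assert isinstance(result, str) and result
        ok += 1
    return ok


print(probe())
```

Running the same file under both CPython and the patched interpreter gives a quick check that no combination aborts.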

Related

Prior fix using the same "cap before Rust format!" approach: format_fixed already did this; this PR extends the same pattern to its siblings.
Adjacent hardening PRs in the same spirit (translate native failures into Python exceptions): Fix segfault on cyclic or deeply-nested AST in compile() #7630 (cyclic-AST SIGSEGV → RecursionError), Fix stack overflow on deeply-nested JSON in json.loads() #7632 (deep-JSON SIGSEGV → RecursionError).

Summary by CodeRabbit

Bug Fixes
- Float formatting now handles extreme precisions safely: percent formatting outputs "inf%" for infinite values, precision is clamped for fixed/exponential/general formats, and fractional digits are zero-padded when needed to match expected behavior.
Tests
- Added regression tests covering very large precision values across f/e/g/% formats to ensure stable, non-crashing output and preserved NaN/Inf representations.

Formatting a float with large precision (>= ~65535) aborted the interpreter instead of raising a Python exception. CPython handles the same input by returning a clean string.

# Before
./rustpython -c "print(f'{1.5:.1000000}')"
thread 'main' panicked at crates/literal/src/float.rs:135:
Formatting argument out of range (exit 101, abort)

# After
./rustpython -c "print(f'{1.5:.1000000}')"
1.5

Root cause: Rust's `format!("{:.*}", n, x)` panics when `n` exceeds the fmt runtime's internal precision limit. `format_fixed` already caps `n` at u16::MAX, but `format_general` and `format_exponent` (and the `%` branch in `crates/common/src/format.rs`) passed user-supplied precision straight through to `format!`.

Fix:
* Introduce `FMT_MAX_PRECISION` + `clamp_fmt_precision()` in crates/literal/src/float.rs. Cap is `u16::MAX - 1` because `{:.*e}` hits a second panic (`ndigits > 0` in core flt2dec) at exactly u16::MAX; the smaller value covers both paths.
* Apply the helper to `format_fixed` (replacing the existing ad-hoc cap), `format_exponent` (entry), and `format_general` (three separate format! calls with saturating arithmetic on derived precision values).
* Apply the helper in the `FormatType::Percentage` branch in crates/common/src/format.rs.

This is harmless for all normal inputs: f64 carries only ~17 significant digits, so precision beyond 65K is padding zeros at best. Complex-number and old-style `%`-formatting paths transitively benefit because they dispatch to the same library functions.

Verified:
* cargo run -- -m test test_float test_fstring test_format: 144 passed, 0 regressed.
* extra_tests/snippets/builtin_format.py: all assertions pass, including 7 new regression cases covering e / E / g / G / f / % at precision 1_000_000.
* Probed with 10 magnitude values (0, ±1.5, ±inf, nan, 1e-300, 1e300, f64::MAX, 5e-324) x 4 format types = 40 combinations, plus precision 0/1/2 boundary, complex formatting, old-style `%` formatting, and combined specs (fill/align/sign/grouping/zero-pad). All return clean strings; no process abort.

coderabbitai · 2026-04-19T17:14:25Z

Walkthrough

Add precision-clamping constants and helpers; apply clamping across percent, fixed, exponent, and general float-formatting code paths to avoid runtime panics for extremely large precisions, and add regression tests validating behavior for huge precision values.

Changes

Cohort / File(s)	Summary
Float formatting core `crates/literal/src/float.rs`	Add `pub const FMT_MAX_PRECISION`, `pub const FMT_MAX_EXP_PRECISION`, and `pub fn clamp_fmt_precision()/clamp_exp_precision()`. Clamp precision usage in `format_fixed`, `format_exponent`, and `format_general`; reuse exponential output where applicable to avoid extra formatting and out-of-range precision panics.
Formatting glue `crates/common/src/format.rs`	In `FormatSpec::format_float` percent (`%`) branch, scale magnitude first and detect overflow to `inf` (returning `"inf%"`). Use `float::clamp_fmt_precision` for the internal formatting call and pad fractional digits when the original precision exceeded the clamped value; decimal point decision continues to use the original precision.
Regression tests `extra_tests/snippets/builtin_format.py`	Add tests exercising extremely large precisions (including `1_000_000`, at-cap, and beyond-cap) for `f`, `e`/`E`, `%`, and `g` formats to assert no panics and CPython-matching outputs (including zero-padding and `inf%`).

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 28.57% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The pull request title accurately describes the main objective: preventing process aborts when formatting floats with large precision values, which is the core fix across the changed files.

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

crates/literal/src/float.rs (1)

154-178: ⚠️ Potential issue | 🟡 Minor

Avoid double-clamping exponent base formatting in format_general.

At high precision values near the cap (65534), line 167–171 reformats base with clamp_fmt_precision(precision.saturating_add(1)), which re-applies the cap and can drop necessary precision. Since r_exp is already produced with clamped precision at line 156, reuse base directly instead of reformatting. Extract exponent_precision at the start to avoid repeating the same computation and keep both the exponent string formatting and decimal point logic consistent.

Proposed fix

         magnitude if magnitude.is_finite() => {
+            let exponent_precision = clamp_fmt_precision(precision.saturating_sub(1));
             let r_exp = format!(
                 "{:.*e}",
-                clamp_fmt_precision(precision.saturating_sub(1)),
+                exponent_precision,
                 magnitude,
             );
             let mut parts = r_exp.splitn(2, 'e');
             let base = parts.next().unwrap();
             let exponent = parts.next().unwrap().parse::<i64>().unwrap();
@@
-                let magnitude = format!(
-                    "{:.*}",
-                    clamp_fmt_precision(precision.saturating_add(1)),
-                    base,
-                );
+                let magnitude = base.to_owned();
                 let base = maybe_remove_trailing_redundant_chars(magnitude, alternate_form);
-                let point = decimal_point_or_empty(precision.saturating_sub(1), alternate_form);
+                let point = decimal_point_or_empty(exponent_precision, alternate_form);
                 format!("{base}{point}{e}{exponent:+#03}")

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@crates/literal/src/float.rs` around lines 154 - 178, In format_general, avoid
reformatting the exponent base (the second formatting that uses
clamp_fmt_precision(precision.saturating_add(1)) and assigns to magnitude/base)
because r_exp was already produced with a clamped precision; instead compute a
single exponent_precision = clamp_fmt_precision(precision.saturating_sub(1)) at
the start, use it when creating r_exp, then reuse the parsed base from r_exp
(variable base from parts.next()) rather than calling format! again; pass that
base into maybe_remove_trailing_redundant_chars and use
decimal_point_or_empty(precision.saturating_sub(1), alternate_form) so both
exponent string formatting and decimal-point logic use the same computed
precision and you no longer double-clamp via clamp_fmt_precision in this branch.

📥 Commits

Reviewing files that changed from the base of the PR and between b18b71b and a3acff1.

📒 Files selected for processing (3)

crates/common/src/format.rs
crates/literal/src/float.rs
extra_tests/snippets/builtin_format.py

Two refinements after CodeRabbit review:

1. Drop the redundant `format!("{:.*}", precision + 1, base)` in `format_general`'s scientific branch. It was a no-op pre-fix (magnitude is `.abs()`-ed at the caller, so `base` has no sign and its length was exactly `precision + 1`), but after I added the cap it turned into an active truncate, dropping 1 char of precision at the cap boundary. Reuse `base` directly and extract `exp_precision` for reuse by `decimal_point_or_empty`.
2. Split the cap into two helpers. `FMT_MAX_PRECISION = u16::MAX` is for plain `{:.*}` (format_fixed, %-branch, format_general's non-scientific branch). `FMT_MAX_EXP_PRECISION = u16::MAX - 1` is for `{:.*e}` (format_exponent, format_general's scientific entry). The second value is one lower because `{:.*e}` trips an additional `ndigits > 0` assertion in `core::num::flt2dec` at exactly `u16::MAX`. The first commit used the tighter cap uniformly, which silently regressed `format_fixed` by 1 char at `precision == u16::MAX` (it previously capped at exactly that value). Two helpers restore byte-identical CPython parity for fixed / percent / general-non-scientific paths up through `precision == u16::MAX`.

Verification:
* precision 5 .. 65534: 360 outputs byte-identical to CPython across 8 magnitudes x 9 precisions x 5 types.
* precision == 65535: f / g / G / % now match CPython (0 diff). e / E remain 1 char shorter, which is unavoidable within the `u16::MAX - 1` exp cap.
* precision > 65535: output stops at cap; CPython emits full padding. This is the same design divergence as before.
* No panic regression: f-string default, e/E, g/G, %, f at precision 1_000_000 all return cleanly.
* Test suite: test_float + test_fstring + test_format, 162 passed, 0 regressed.
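
The one-character regression that motivated the split can be reproduced with a small model. This is a Python sketch of a cap-without-padding fixed path, not the Rust implementation; `fixed_truncating` is a hypothetical name, and CPython's own output stands in for the expected result.

```python
U16_MAX = 0xFFFF
FMT_MAX_PRECISION = U16_MAX          # plain `{:.*}` paths
FMT_MAX_EXP_PRECISION = U16_MAX - 1  # `{:.*e}` paths


def fixed_truncating(x: float, precision: int, cap: int) -> str:
    """Model of a fixed-format path that caps precision and does not pad."""
    return format(x, f".{min(precision, cap)}f")


expected = format(1.5, f".{U16_MAX}f")  # CPython output at precision 65535
tight = fixed_truncating(1.5, U16_MAX, FMT_MAX_EXP_PRECISION)
split = fixed_truncating(1.5, U16_MAX, FMT_MAX_PRECISION)

# Characters lost when the tighter exponent cap is applied to the fixed path.
print(len(expected) - len(tight))
# With the dedicated fixed cap, the output matches at precision 65535.
print(split == expected)
```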

youknowone · 2026-04-20T00:17:44Z

+# crates/literal/src/float.rs and `crates/common/src/format.rs` (the `%`
+# branch), which panic past Rust's fmt precision limit and killed the
+# process instead of raising a Python exception.
+_big = 1_000_000


Because this patch sets the boundary as pub const FMT_MAX_PRECISION: usize = u16::MAX as usize;, the tests must cover 65535 and 65536 to ensure the behavior follows CPython.

Per review comment on `extra_tests/snippets/builtin_format.py:209`: the patch declares `FMT_MAX_PRECISION = u16::MAX`, so the tests must cover 65535 and 65536 and demonstrate CPython parity at the boundary. The previous version only avoided panic: at the cap it silently truncated 1 char short of CPython for e / E, and thousands of chars short for f / % at precision beyond the cap. This commit restores byte-identical CPython output at every precision up to the format-spec parser's own `i32::MAX` ceiling.

Fix: pad the Rust-format result with '0's up to the user-requested precision.

Why this is correct, not a workaround: IEEE 754 double has at most ~767 significant decimal digits; past that, every digit is deterministically '0' in both CPython and the native Rust output. Our cap (65534 for exp, 65535 for plain) sits far above 767, so appending zeros reconstructs precisely what CPython would have produced. Verified on hard inputs: `1e-100`, `5e-324` (subnormal boundary), `f64::MAX`, mixed magnitudes. The last 100 chars of Rust-format output at precision 65534 are all '0' for every case.

Changes:
* `format_fixed`: after format!(), extend with (precision - capped) '0' chars before appending the optional decimal point.
* `format_exponent`: same, applied to the parsed mantissa before reassembling with the exponent marker.
* `FormatType::Percentage` branch: same. Also fixed a bug the boundary audit surfaced: the finite-input overflow guard used `return Ok("inf%")`, which bypasses the outer sign handler. Changed to a match-arm value so `format_sign_and_align` still runs and produces "-inf%" for `-f64::MAX`, matching CPython.

Verification:
* 7 magnitudes × 5 precisions × 6 format types = 210 comparisons against CPython at precisions {65534, 65535, 65536, 100000, 200000}. All 210 byte-identical.
* Gap audit (complex formatting, old-style % formatting, negative magnitudes, -0.0, combined specs with fill / sign / alternate / grouping) at boundary precisions. All but 20 byte-identical. The 20 remaining diffs all stem from a pre-existing complex-imaginary-part repr bug (`1e100j` expands to 100 '0's in RustPython vs CPython's `1e+100j`) which reproduces on upstream main without any part of this patch and is out of scope here.
* `cargo run -- -m test test_float test_fstring test_format`: 162 passed, 0 regressed.
* `extra_tests/snippets/builtin_format.py` now pins exact expected strings at 65534 / 65535 / 65536 / 1_000_000 for every format type, plus the `f64::MAX × 100 → 'inf%'` overflow case.
* `cargo fmt --check`: pass.
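
The cap-then-pad idea for the fixed path can be modelled in Python and compared against CPython's direct output. This is a sketch under stated assumptions: `format_fixed_padded` is a hypothetical name, the cap value is the one quoted in the thread, and the finiteness check is my own guard for the model rather than something taken from the patch.

```python
import math

FMT_MAX_PRECISION = 0xFFFF  # cap for the plain fixed path, per the thread


def format_fixed_padded(x: float, precision: int) -> str:
    """Model of cap-then-pad: format at the capped precision, then add zeros."""
    capped = min(precision, FMT_MAX_PRECISION)
    text = format(x, f".{capped}f")
    if math.isfinite(x):
        # Every digit past the cap is '0', so padding restores them exactly.
        # inf and nan carry no digits, so nothing is appended for them.
        text += "0" * (precision - capped)
    return text


for value in (1.5, 0.1, 5e-324, -1.7976931348623157e308):
    for precision in (65534, 65535, 65536, 100_000):
        direct = format(value, f".{precision}f")
        assert format_fixed_padded(value, precision) == direct
print("cap-then-pad matches direct formatting")
```

The comparison holds because the exact expansion of each input ends long before digit 65535, so the padded tail and the directly formatted tail are both zeros.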

youknowone · 2026-04-20T12:55:54Z

+# f-format pads with trailing zeros up to the requested precision.
+assert "{:.65534f}".format(1.5) == "1." + "5" + "0" * 65533
+assert "{:.65535f}".format(1.5) == "1." + "5" + "0" * 65534
+assert "{:.65536f}".format(1.5) == "1." + "5" + "0" * 65535
+# e-format emits a fixed mantissa width + 'e+00'.
+assert "{:.65534e}".format(1.5) == "1." + "5" + "0" * 65533 + "e+00"
+assert "{:.65535e}".format(1.5) == "1." + "5" + "0" * 65534 + "e+00"
+assert "{:.65536e}".format(1.5) == "1." + "5" + "0" * 65535 + "e+00"
+# %-format multiplies by 100 then applies f-format.
+assert "{:.65534%}".format(1.5) == "150." + "0" * 65534 + "%"
+assert "{:.65535%}".format(1.5) == "150." + "0" * 65535 + "%"
+assert "{:.65536%}".format(1.5) == "150." + "0" * 65536 + "%"
+# g-format strips trailing zeros, so the short form is the natural
+# representation regardless of precision.
+for p in (65534, 65535, 65536, 1_000_000):
+    assert ("{:." + str(p) + "g}").format(1.5) == "1.5"
+
+# Percent overflow: finite input whose *100 is +inf produces "inf%"
+# rather than crashing. CPython does the same.
+assert "{:.100000%}".format(1.7976931348623157e308) == "inf%"
+
+# Shallow cases unchanged.
+assert f"{1.5:.5}" == "1.5"
+assert "{:.3f}".format(1.5) == "1.500"
+assert "{:.2%}".format(0.25) == "25.00%"
+assert "{:.4e}".format(1234.5) == "1.2345e+03"
+assert "{:.3g}".format(1234.5) == "1.23e+03"
+assert f"{float('nan'):.10f}" == "nan"
+assert f"{float('inf'):.10f}" == "inf"


Which test covers the unhappy cases, such as exceeding MAX_PRECISION?

Rename the boundary-test section so the three precision points per format type are labeled below / at / past the cap inline, making the "past MAX_PRECISION" unhappy-case coverage explicit. Add len-based assertions at precision 1_000_000 for f, e, and % to exercise the cap-then-pad path at a depth far beyond the boundary.
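
The length-based checks mentioned above could look roughly like this. The exact assertions added to `builtin_format.py` are not shown in this thread, so treat this as a sketch of the idea; the expected lengths are derived from CPython's output for 1.5.

```python
BIG = 1_000_000

fixed = format(1.5, f".{BIG}f")
exp = format(1.5, f".{BIG}e")
pct = format(1.5, f".{BIG}%")

# "1." followed by one million fractional digits.
assert len(fixed) == 2 + BIG
# Same mantissa width plus the four characters of "e+00".
assert len(exp) == 2 + BIG + 4
# "150." followed by one million fractional digits and "%".
assert len(pct) == 4 + BIG + 1
```

Checking lengths instead of full strings keeps the assertions readable while still confirming that the output is padded all the way to the requested precision.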

coderabbitai bot reviewed Apr 19, 2026


changjoon-park added 2 commits April 20, 2026 03:00

Fix ruff format: single-line precision clamp

b142cf2

youknowone requested changes Apr 20, 2026


youknowone reviewed Apr 20, 2026


changjoon-park force-pushed the fix-float-format-panic branch from 6b52bdb to 58c59d4 Compare April 20, 2026 13:35

changjoon-park wants to merge 5 commits into RustPython:main from changjoon-park:fix-float-format-panic
