gh-143214: Add the wrapcol parameter in binascii.b2a_base64() and base64.b64encode() #143216

serhiy-storchaka · 2025-12-27T11:55:39Z

Issue: Support multiline output in binascii.b2a_base64() and base64.b64encode() #143214

📚 Documentation preview 📚: https://cpython-previews--143216.org.readthedocs.build/

…nd base64.b64encode()

Lib/ssl.py

Modules/binascii.c

serhiy-storchaka · 2025-12-27T13:14:42Z

As a side effect -- the calculation of the size for the output buffer is now more accurate. Encoding more than 1GiB is now potentially supported on 32-bit systems (up to 1.5GiB if there is enough memory).

picnixz

About the warning in memoryobject.c, this was my fault but it's fixed on main and backports are pending.

Doc/whatsnew/3.15.rst

cmaloney

👍 overall, didn't review the implementation in depth but went through the docs + overall API shape.

Doc/library/base64.rst

Lib/base64.py

Modules/binascii.c

gpshead · 2025-12-27T19:14:12Z

Modules/binascii.c

+        goto toolong;
+    }
+    if (wrapcol && out_len) {
+        out_len += (out_len - 1) / wrapcol;


Do we want to support unreasonable wrapcol values? Values that are not multiples of 4 really do not make sense and feel unlikely to be used for any practical use of base64. As do low values such as 1-3. (small values are inefficient with our post-processing implementation anyways)

Do we know of any practical actual need for odd-multiple wrapping?

If we restrict the allowed wrappings to start with, we could open them up wider later if anyone has a need. I doubt there is demand for such strange for base64's use case wrapping sizes.

Odd-wrapping: 4n - 1 to really have a space on the right if there is a hard wrap at 4n. Only for rendering purposes I'd say and likely not useful. But 4n - 2 may also be an option. Or odd-wrapping could be useful.

More generally, email's policy allows to specify its own max line length and we currently have: "maxlen = policy.max_line_length or sys.maxsize". So it's important to support all inputs. However, we can specialize the case sys.maxsize to be equivalent to "no wrapping".

I considered this. We can round wrapcol down to the multiplier of 4 and set some minimal non-zero limit. plistlib does this, with the minimum 16. a85encode() sets the limit to 2 if adobe=True. quopri also needs some minimum (not less than 3, plus an escaped newline).

For Base 4 the reason can be that some third-party decoders can not support groups split between lines. See also #76672. But I am not sure whether we should force this or left on the user responsibility? What should we do for values from 1 to 3? Raise an exception? Round up to 4?

For 1, 2, 3, if it's not efficient, then so be it. It's the user responsibility at this point. For 1, it's the same as having no wrap and then put \n in between each values. So they could just do their own b"\n.".join(list(encode(s))) in this case which may be faster than encode(s, wrapcol=1). For 2-3, if it works but is not efficient, just leave it as is I guess (though the case n=2 may be interesting and we could try to optimize it, but I don't think people really want to group b64 encoded content by 2; the number of lines is likely to be too large to be visually interesting).

noone will intentionally pass such pointless tiny values anyways - it is our choice if we bother to disallow them or not. if we do not I'd suggest a ValueError rather than rounding up so that anyone with an actual need (prediction: nobody) can come file a bug explaining why.

a85encode(data, adobe=True, wrapcol=1) wraps the output in lines of length 2, so <~ and ~> will not split.

Modules/binascii.c

gpshead · 2025-12-27T19:33:20Z

Modules/binascii.c

+    if (state == NULL) {
+        return NULL;
+    }
+    PyErr_SetString(state->Error, "Too much data for base64 line");


this error is raised even when no newlines are involved. i'd just leave it as "too much data" without trying to explain why. it's effectively a MemoryError in a sense as the only reason this would happen is if output would cross Py_ssize_t bounds which is unrealistic to ever happen on any system.

This is realistic on 32-bit systems.

I'd just get rid of the word "line" then as this isn't line specific.

pythongh-143214: Add the wrapcol parameter in binascii.b2a_base64() a…

b7196ba

…nd base64.b64encode()

serhiy-storchaka requested review from a team, AA-Turner, gpshead and picnixz as code owners December 27, 2025 11:55

bedevere-app bot mentioned this pull request Dec 27, 2025

Support multiline output in binascii.b2a_base64() and base64.b64encode() #143214

Open

bedevere-app bot added the awaiting core review label Dec 27, 2025

picnixz reviewed Dec 27, 2025

View reviewed changes

Lib/ssl.py Show resolved Hide resolved

picnixz reviewed Dec 27, 2025

View reviewed changes

Modules/binascii.c Show resolved Hide resolved

serhiy-storchaka mentioned this pull request Dec 27, 2025

gh-101178: Add Ascii85, base85, and Z85 support to binascii #102753

Open

serhiy-storchaka added 2 commits December 27, 2025 14:36

Fix errors.

50eb52d

Add tests.

7c7f2dc

Update the name of Hauke Dämpfling.

e429f03

serhiy-storchaka mentioned this pull request Dec 27, 2025

gh-143103: Added pad parameter to base64.z85encode #143106

Merged

picnixz reviewed Dec 27, 2025

View reviewed changes

Doc/whatsnew/3.15.rst Show resolved Hide resolved

cmaloney reviewed Dec 27, 2025

View reviewed changes

Doc/library/base64.rst Outdated Show resolved Hide resolved

Lib/base64.py Outdated Show resolved Hide resolved

gpshead reviewed Dec 27, 2025

View reviewed changes

serhiy-storchaka added 4 commits December 27, 2025 21:35

Address review comments (signature, docstrings).

46c6a25

Add more comments nd tests.

5062ae5

Mark CPython specific tests as such.

cd2e330

Add more tests for a85encode().

cb4af0e

Uh oh!

gh-143214: Add the wrapcol parameter in binascii.b2a_base64() and base64.b64encode() #143216

Are you sure you want to change the base?

gh-143214: Add the wrapcol parameter in binascii.b2a_base64() and base64.b64encode() #143216

Conversation

serhiy-storchaka commented Dec 27, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

serhiy-storchaka commented Dec 27, 2025

Uh oh!

picnixz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cmaloney left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

picnixz Dec 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

serhiy-storchaka commented Dec 27, 2025 •

edited by github-actions bot

Loading

picnixz Dec 27, 2025 •

edited

Loading