[CBOR] - Implement RFC compliant BigInteger bytes encoding & decoding #578

iifawzi · 2025-04-26T23:39:25Z

Hello, I discovered this issue #431 while working on the async parser for CBOR.
This PR adds feature flags to control compliant encoding & decoding behavior as described in the RFC https://datatracker.ietf.org/doc/html/rfc8949#section-3.4.3

For encoding: Changed calculation to -1 - n for negative values
For decoding:
- Applied -1 - n formula
- Updated to treat bytes as signed using new BigInteger(1, _binaryValue)

Changes are controlled by feature flags:

CBORGenerator.Feature.CORRECT_CBOR_NEGATIVE_BIGINT_ENCODING
CBORParser.Feature.CORRECT_CBOR_NEGATIVE_BIGINT_DECODING

Both flags default to false for backward compatibility.

Since changes are feature-flag controlled, I think they should be safe for 2.19.1 and 2.20 releases (I'm not sure about versioning after the release) @cowtowncoder I'll adjust base branch/comments based on your feedback.

Signed-off-by: Fawzi Essam <[email protected]>

cowtowncoder · 2025-04-27T02:09:13Z

First of all: thank you for working on this!

Second of all: Rats! With SemVer, we can't really merge that in 2.19 in a patch as that changes API.
So needs to go in 2.20 -- so 2.x branch is correct.

iifawzi · 2025-04-27T11:39:38Z

First of all: thank you for working on this!

Second of all: Rats! With SemVer, we can't really merge that in 2.19 in a patch as that changes API. So needs to go in 2.20 -- so 2.x branch is correct.

Thank you. so I understand even if it's managed by feature flags, we consider it an API change.

I will update the comments, and will also update the tests to a more realistic case, as I realized even if it's possible to have BigInteger(-1), it's not really a big integer, and might be confusing given checking any online encoder of CBOR won't use big integer tag for -1 by default. I will include another test with an actual big integer for clarity.

edit: marking it as a draft temporarily until I do more verifications and add more tests.

Signed-off-by: Fawzi Essam <[email protected]>

iifawzi · 2025-04-27T14:45:37Z

The only point to comment on is that we're not following the preferred serialization as described https://www.rfc-editor.org/rfc/rfc8949.html#name-bignums, so encoding -340282366920938463463374607431768211456 using jackson results in:

byte[] expectedBytes = {
                (byte) 0xC3,
                (byte) 0x51, // 17 bytes - leading zero, 16 for the number
                (byte) 0x00, // LEADING Zero
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF,
                (byte) 0xFF
        };

which's still considered fine encoding as per the RFC as long as we're able to decode it Decoders that understand these tags MUST be able to decode bignums that do have leading zeroes.

Tests added in the mapper to verify we're able to decode both with/without leading zeros to the same correct value, while for backward compatibility, it remains the same, it returns different incorrect values with/without leading zeros.

EDIT: we're fine with not following the preferred way anyway, it wasn't mentioned in the initial RFC (https://www.rfc-editor.org/rfc/rfc7049#section-2.4.2), only mentioned point is that decoder should be able to decode with/without leading zeros, which's what has been achieved through this PR.

Signed-off-by: Fawzi Essam <[email protected]>

cowtowncoder · 2025-04-28T01:10:25Z

Thank you. so I understand even if it's managed by feature flags, we consider it an API change.

More specifically: Addition of said Feature flags is an API change (functionality addition). Something to do in a minor release, but not in patch.

cbor/src/main/java/com/fasterxml/jackson/dataformat/cbor/CBORParser.java

cbor/src/main/java/com/fasterxml/jackson/dataformat/cbor/CBORGenerator.java

cowtowncoder · 2025-04-30T02:57:10Z

Will make some minor changes wrt naming, then merge. Thank you.

One follow-up question: should we consider changing defaults for Jackson 3.0.0? I have mixed feelings about that -- on one hand, we would ideally use correct standard encoding/decoding. But then again this probably breaks handling by users unaware of changes.
But not changing it will perpetuate incorrect handling.

cowtowncoder

LGTM!

cowtowncoder · 2025-04-30T03:51:10Z

Big thank you @iifawzi for solving this! I merged it in 2.20 & forward to 3.x.
We can consider changing defaults as a follow-up step.

iifawzi · 2025-04-30T11:20:16Z

Will make some minor changes wrt naming, then merge. Thank you.

One follow-up question: should we consider changing defaults for Jackson 3.0.0? I have mixed feelings about that -- on one hand, we would ideally use correct standard encoding/decoding. But then again this probably breaks handling by users unaware of changes. But not changing it will perpetuate incorrect handling.

I've been thinking about while working on the async parser, on whether it should have same experience or not.

In my opinion, we should change defaults for 3.0.0, major releases meant to break compatibility (if needed), and I think in that case it's needed as current encoding leads to entirely incorrect values. However, given the consistency we had for encoding/decoding, users using Jackson for roundtrip wouldn't even notice the fix (unless they have plain assertions in binary). it will break only for users with partial usage of jackson (encoding using jackson and plain binary for decoding, or vise-versa), so I think better to default to true for 3.0.0.

If you agree, I can create a separate PR targeting 3.x

cowtowncoder · 2025-04-30T18:26:46Z

@iifawzi Yes, I concur. PR for change would be welcome. I will create Issue to match PR with.

EDIT: #582

iifawzi added 2 commits April 27, 2025 01:36

[CBOR] - Implement RFC compliant binary BigInteger encoding & decoding

85a5c56

Signed-off-by: Fawzi Essam <[email protected]>

unify documentation

2d42c5a

Signed-off-by: Fawzi Essam <[email protected]>

update version comment

4cff9c4

Signed-off-by: Fawzi Essam <[email protected]>

iifawzi marked this pull request as draft April 27, 2025 12:04

iifawzi added 2 commits April 27, 2025 16:27

simplify conditions and limit it to negative big integers

76373fb

Signed-off-by: Fawzi Essam <[email protected]>

update referenced decoder to use the variant of leading zero

b07c95a

Signed-off-by: Fawzi Essam <[email protected]>

iifawzi marked this pull request as ready for review April 27, 2025 14:34

iifawzi added 2 commits April 27, 2025 16:41

testing we're able to decode with/without leading zeros

6d0767a

Signed-off-by: Fawzi Essam <[email protected]>

reference cbor.me in tests for clarity

c69c23c

Signed-off-by: Fawzi Essam <[email protected]>

rename variable

71f8cdc

Signed-off-by: Fawzi Essam <[email protected]>

cowtowncoder reviewed Apr 30, 2025

View reviewed changes

cbor/src/main/java/com/fasterxml/jackson/dataformat/cbor/CBORParser.java Outdated Show resolved Hide resolved

cowtowncoder reviewed Apr 30, 2025

View reviewed changes

cbor/src/main/java/com/fasterxml/jackson/dataformat/cbor/CBORGenerator.java Outdated Show resolved Hide resolved

Merge branch '2.x' into fix-cbor-negative-bigInteger

138c1cf

cowtowncoder added 2 commits April 29, 2025 20:09

Add release notes, minor renaming, refactoring

8220346

Make tests more 3.0-proof (wrt configuration)

4727239

cowtowncoder approved these changes Apr 30, 2025

View reviewed changes

cowtowncoder merged commit 29c171a into FasterXML:2.x Apr 30, 2025
4 checks passed

cowtowncoder mentioned this pull request Apr 30, 2025

Negative BigInteger values not encoded/decoded correctly #431

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[CBOR] - Implement RFC compliant BigInteger bytes encoding & decoding #578

[CBOR] - Implement RFC compliant BigInteger bytes encoding & decoding #578

Uh oh!

iifawzi commented Apr 26, 2025 •

edited

Loading

Uh oh!

cowtowncoder commented Apr 27, 2025

Uh oh!

iifawzi commented Apr 27, 2025 •

edited

Loading

Uh oh!

iifawzi commented Apr 27, 2025 •

edited

Loading

Uh oh!

cowtowncoder commented Apr 28, 2025

Uh oh!

Uh oh!

Uh oh!

cowtowncoder commented Apr 30, 2025

Uh oh!

cowtowncoder left a comment

Uh oh!

Uh oh!

cowtowncoder commented Apr 30, 2025

Uh oh!

iifawzi commented Apr 30, 2025

Uh oh!

cowtowncoder commented Apr 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

[CBOR] - Implement RFC compliant BigInteger bytes encoding & decoding #578

[CBOR] - Implement RFC compliant BigInteger bytes encoding & decoding #578

Uh oh!

Conversation

iifawzi commented Apr 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cowtowncoder commented Apr 27, 2025

Uh oh!

iifawzi commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

iifawzi commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cowtowncoder commented Apr 28, 2025

Uh oh!

Uh oh!

Uh oh!

cowtowncoder commented Apr 30, 2025

Uh oh!

cowtowncoder left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cowtowncoder commented Apr 30, 2025

Uh oh!

iifawzi commented Apr 30, 2025

Uh oh!

cowtowncoder commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

iifawzi commented Apr 26, 2025 •

edited

Loading

iifawzi commented Apr 27, 2025 •

edited

Loading

iifawzi commented Apr 27, 2025 •

edited

Loading

cowtowncoder commented Apr 30, 2025 •

edited

Loading