fix: correct type mismatch in avro zstd decompression. #2128

zhongyujiang · 2025-06-20T07:39:17Z

Rationale for this change

The return type of the decompress method in ZStandardCodec should be bytes, but it currently returns a bytearray, which causes an exception when reading Avro files compressed with zstd.

def new_decoder(b: bytes) -> BinaryDecoder:
        try:
            from pyiceberg.avro.decoder_fast import CythonBinaryDecoder

>           return CythonBinaryDecoder(b)
E           TypeError: Argument 'input_contents' has incorrect type (expected bytes, got bytearray)

Are these changes tested?

Yes, test_write_manifest

Are there any user-facing changes?

No.

Fokko

I'm able to reproduce it locally, great catch @zhongyujiang


    def new_decoder(b: bytes) -> BinaryDecoder:
        try:
            from pyiceberg.avro.decoder_fast import CythonBinaryDecoder
    
>           return CythonBinaryDecoder(b)
E           TypeError: Argument 'input_contents' has incorrect type (expected bytes, got bytearray)

../../pyiceberg/avro/decoder.py:181: TypeError

# Rationale for this change The return type of the decompress method in `ZStandardCodec` should be `bytes`, but it currently returns a `bytearray`, which causes an exception when reading Avro files compressed with zstd. ```text def new_decoder(b: bytes) -> BinaryDecoder: try: from pyiceberg.avro.decoder_fast import CythonBinaryDecoder > return CythonBinaryDecoder(b) E TypeError: Argument 'input_contents' has incorrect type (expected bytes, got bytearray) ``` # Are these changes tested? Yes, `test_write_manifest` # Are there any user-facing changes? No.

fix: correct type mismatch in avro zstd decompression.

0b1d932

zhongyujiang force-pushed the yuj/fix-zstd-decompress branch from d09539c to 0b1d932 Compare June 20, 2025 07:39

Fokko approved these changes Jun 20, 2025

View reviewed changes

Fokko merged commit c27028f into apache:main Jun 20, 2025
10 checks passed

zhongyujiang deleted the yuj/fix-zstd-decompress branch June 21, 2025 02:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: correct type mismatch in avro zstd decompression. #2128

fix: correct type mismatch in avro zstd decompression. #2128

Uh oh!

zhongyujiang commented Jun 20, 2025

Uh oh!

Fokko left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: correct type mismatch in avro zstd decompression. #2128

fix: correct type mismatch in avro zstd decompression. #2128

Uh oh!

Conversation

zhongyujiang commented Jun 20, 2025

Rationale for this change

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Fokko left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants