@Alvaro-Kothe
Member

This is an exploratory PR that uses simdjson in pandas, partially following #58278.

  • Uses simdjson through Meson wraps.
  • Does not use nanoarrow yet; I don't know how that would be possible without knowing the JSON schema beforehand.
  • Deletes the vendored ujson decoder.
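
To illustrate the schema concern in the nanoarrow item, here is a minimal sketch in plain Python. It is not code from this PR: `infer_schema` is a hypothetical helper, and the standard-library `json` module stands in for the decoder. It shows that the type of each column is only known after every record has been scanned, which is awkward for a columnar builder that wants typed buffers up front.

```python
import json


def infer_schema(records):
    """Collect, per key, the set of Python type names seen across all records.

    Hypothetical helper for illustration only; pandas does not expose this.
    """
    schema = {}
    for record in records:
        for key, value in record.items():
            schema.setdefault(key, set()).add(type(value).__name__)
    return schema


# Column "a" mixes int and float, "b" contains a null, and "c" only
# appears in the last record, so no column type is known from the first row.
payload = '[{"a": 1, "b": "x"}, {"a": 2.5, "b": null}, {"a": 3, "c": true}]'
records = json.loads(payload)
schema = infer_schema(records)

for column, types in sorted(schema.items()):
    print(column, sorted(types))
```

A two-pass approach (infer first, then build) or a builder that can promote column types mid-stream would be needed; which one fits nanoarrow best is an open question here.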

I also explored pysimdjson, which would be viable and would do exactly what this PR does. However, it only decodes, whereas writing our own extension would also allow building an encoder on top of simdjson.

Compared to ultrajson, the current PR shows no performance improvement, and memory consumption remains similar, while pysimdjson seems to increase memory consumption slightly.
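
For anyone who wants to reproduce this kind of comparison, a rough sketch of a timing and peak-memory harness follows. It is an assumption about how one might measure, not the benchmark used for this PR: `measure` is a hypothetical helper, and `json.loads` is only a stand-in for the decoder under test. Note that `tracemalloc` sees only allocations made through Python's allocators, so memory a C extension allocates directly would not be counted.

```python
import json
import time
import tracemalloc


def measure(decode, payload, repeat=5):
    """Return (best_seconds, peak_bytes) for calling decode(payload).

    Timing and memory are measured in separate runs because tracemalloc
    adds overhead that would distort the timings.
    """
    best = float("inf")
    for _ in range(repeat):
        start = time.perf_counter()
        decode(payload)
        best = min(best, time.perf_counter() - start)

    tracemalloc.start()
    result = decode(payload)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    del result
    return best, peak


# Synthetic record-oriented payload; swap in a representative file as needed.
payload = json.dumps(
    [{"id": i, "value": i * 0.5, "name": f"row{i}"} for i in range(10_000)]
)

seconds, peak_bytes = measure(json.loads, payload)
print(f"json.loads: {seconds * 1e3:.2f} ms, peak {peak_bytes / 1e6:.2f} MB")
```

Passing another decoder callable in place of `json.loads` gives a like-for-like comparison on the same payload.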
