Skip to content

Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas #189

@Sirohi97vaibhav

Description

@Sirohi97vaibhav

Title of the talk

Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas

Description

Abstract:
As data scientists, we rely on Pandas for data preprocessing, but when dealing with large datasets, it struggles with performance. To overcome this, I explored various high-performance alternatives like DuckDB, Polars, and cuDF. While these libraries offer speed, they come with a learning curve, requiring new syntax and concepts. Then I discovered FireDucks—a library that is fully compatible with Pandas, meaning no new functions to learn, just a simple import change. FireDucks delivers impressive speed improvements over Pandas and even outperforms many other alternatives in large-scale data processing. In this session, I’ll share my experience comparing these tools and demonstrate why FireDucks is a game-changer for handling big data effortlessly.

Table of contents

Key Takeaways:
✅ Understanding Pandas' limitations with large datasets
✅ Exploring alternatives like DuckDB, Polars, and cuDF
✅ Why FireDucks stands out: seamless integration & high speed
✅ Best practices for using FireDucks efficiently in your workflow
This session is ideal for data professionals, analysts, and engineers looking to enhance their workflow efficiency with large datasets.

Duration (including Q&A)

15 miniutes

Prerequisites

No response

Speaker bio

https://www.linkedin.com/in/vaibhav-sirohi-ba65771a2?utm_source=share&utm_campaign=share_via&utm_content=profile&utm_medium=android_app

The talk/workshop speaker agrees to

Metadata

Metadata

Assignees

No one assigned

    Labels

    talk-proposalNew talk of Python Pune meetup

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions