Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas

### Title of the talk

Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas

### Description

Abstract:
As data scientists, we rely on Pandas for data preprocessing, but when dealing with large datasets, it struggles with performance. To overcome this, I explored various high-performance alternatives like DuckDB, Polars, and cuDF. While these libraries offer speed, they come with a learning curve, requiring new syntax and concepts. Then I discovered FireDucks—a library that is fully compatible with Pandas, meaning no new functions to learn, just a simple import change. FireDucks delivers impressive speed improvements over Pandas and even outperforms many other alternatives in large-scale data processing. In this session, I’ll share my experience comparing these tools and demonstrate why FireDucks is a game-changer for handling big data effortlessly.


### Table of contents

Key Takeaways:
✅ Understanding Pandas' limitations with large datasets
✅ Exploring alternatives like DuckDB, Polars, and cuDF
✅ Why FireDucks stands out: seamless integration & high speed
✅ Best practices for using FireDucks efficiently in your workflow
This session is ideal for data professionals, analysts, and engineers looking to enhance their workflow efficiency with large datasets.


### Duration (including Q&A)

15 miniutes

### Prerequisites

_No response_

### Speaker bio

[https://www.linkedin.com/in/vaibhav-sirohi-ba65771a2?utm_source=share&utm_campaign=share_via&utm_content=profile&utm_medium=android_app](https://apc01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fvaibhav-sirohi-ba65771a2%3Futm_source%3Dshare%26utm_campaign%3Dshare_via%26utm_content%3Dprofile%26utm_medium%3Dandroid_app&data=05%7C02%7Cvaibhav.sirohi%40india.nec.com%7C92ff5bdb9457453a07bc08dd4cbd5467%7Ccc4532b813fc4545981297eef82e110c%7C0%7C0%7C638751098256232220%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=CJNuY5lufk3ih7vq7IoH9RpH%2FB%2BTSgvbrSjmZA4iWg4%3D&reserved=0)

### The talk/workshop speaker agrees to

- [x] Share the slides, code snippets and other material used during the talk
- [x] If the talk is recorded, you grant the permission to release
the video on [PythonPune's YouTube
channel](https://www.youtube.com/channel/UCWjk7oGWV9eknuOzC20dyiQ)
under [CC-BY-4.0
license](https://creativecommons.org/licenses/by/4.0/)

- [x] Not do any hiring pitches during the talk and follow the [Code
of
Conduct](https://github.com/pythonpune/meetup-talks#code-of-conduct)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas #189

Title of the talk

Description

Table of contents

Duration (including Q&A)

Prerequisites

Speaker bio

The talk/workshop speaker agrees to

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Optimizing Large-Scale Data Processing: A Deep Dive into FireDucks vs. Pandas #189

Description

Title of the talk

Description

Table of contents

Duration (including Q&A)

Prerequisites

Speaker bio

The talk/workshop speaker agrees to

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions