From 18bc6fa770c0bf8f920cebf8c3899b5997ca6dff Mon Sep 17 00:00:00 2001 From: Aditya Date: Sat, 17 Jan 2026 15:25:06 +0530 Subject: [PATCH 1/3] Add docling to Data Analysis --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index df3ed0048..a2d7baa0c 100644 --- a/README.md +++ b/README.md @@ -330,6 +330,8 @@ _Libraries for data analysis._ - [pandas](http://pandas.pydata.org/) - A library providing high-performance, easy-to-use data structures and data analysis tools. - [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows. - [polars](https://github.com/pola-rs/polars) - A fast DataFrame library implemented in Rust with a Python API. +- [docling](https://github.com/docling-project/docling) - Library for converting documents into structured data. + ## Data Validation From b9811358febeb07a4880e0a6228e475147c58804 Mon Sep 17 00:00:00 2001 From: Jinyang Wang Date: Sat, 17 Jan 2026 18:57:17 +0400 Subject: [PATCH 2/3] Fix entry order --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index a2d7baa0c..b6ac4f305 100644 --- a/README.md +++ b/README.md @@ -326,11 +326,11 @@ _Libraries for data analysis._ - [aws-sdk-pandas](https://github.com/aws/aws-sdk-pandas) - Pandas on AWS. - [datasette](https://github.com/simonw/datasette) - An open source multi-tool for exploring and publishing data. - [desbordante](https://github.com/desbordante/desbordante-core/) - An open source data profiler for complex pattern discovery. +- [docling](https://github.com/docling-project/docling) - Library for converting documents into structured data. - [optimus](https://github.com/hi-primus/optimus) - Agile Data Science Workflows made easy with PySpark. - [pandas](http://pandas.pydata.org/) - A library providing high-performance, easy-to-use data structures and data analysis tools. - [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows. - [polars](https://github.com/pola-rs/polars) - A fast DataFrame library implemented in Rust with a Python API. -- [docling](https://github.com/docling-project/docling) - Library for converting documents into structured data. ## Data Validation From 017cdec13e7290f73eaf30c673a10545f8afb12c Mon Sep 17 00:00:00 2001 From: Jinyang Wang Date: Sat, 17 Jan 2026 19:58:54 +0400 Subject: [PATCH 3/3] format --- README.md | 1 - 1 file changed, 1 deletion(-) diff --git a/README.md b/README.md index b6ac4f305..55fe951fc 100644 --- a/README.md +++ b/README.md @@ -332,7 +332,6 @@ _Libraries for data analysis._ - [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows. - [polars](https://github.com/pola-rs/polars) - A fast DataFrame library implemented in Rust with a Python API. - ## Data Validation _Libraries for validating data. Used for forms in many cases._