Fix o2m id field handling rebased #140

bosd · 2025-09-28T15:25:22Z

No description provided.

gemini-code-assist · 2025-09-28T15:25:56Z

Summary of Changes

Hello @bosd, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the data import capabilities by fortifying its resilience against common data quality issues, especially type mismatches and encoding problems. It introduces intelligent pre-processing and error recovery mechanisms that automatically detect and correct problematic data types before they reach Odoo, and provides more informative feedback when issues arise. Furthermore, it refines the handling of relational fields by supporting diverse ID formats and improving the efficiency and logging of relational updates.

Highlights

Enhanced Type Conversion for Odoo Imports: Introduced a new _safe_convert_field_value function to pre-process data, converting float strings (e.g., "1.0") to integers for Odoo's integer fields and handling empty numeric values, preventing common type-related errors during record creation.
Robust Error Handling and Fallback: Implemented specific error detection for 'tuple index out of range' and 'does not seem to be an integer' errors during both bulk load and individual create operations. If bulk load fails due to these type errors, the system now gracefully falls back to processing records individually.
Improved CSV Reading with Encoding Fallback: The _read_data_file function now includes an automatic fallback mechanism to try multiple common encodings (utf-8, latin-1, cp1252, iso-8859-1) if a UnicodeDecodeError occurs, making CSV parsing more resilient.
Pre-flight Type Correction using Polars: Added a new _auto_correct_field_types function that uses Polars to proactively identify and correct float-like string values in integer columns within the source CSV, saving a corrected file for import.
Flexible External ID Resolution: The _resolve_related_ids function now supports resolving both numeric database IDs (passed directly) and traditional XML IDs (e.g., module.name) for relational fields, improving flexibility and reducing resolution failures.
Refined Relational Import Logic: The run_write_tuple_import function has been refactored for better ID normalization, explicit handling of /id suffixes in column names, and delegation of Odoo write operations to a new _execute_write_tuple_updates helper, enhancing clarity and error reporting.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces significant improvements to data type handling and error resilience, particularly for relational fields and numeric conversions. The changes correctly handle external IDs suffixed with /id by treating them as strings during the import process, which is a crucial fix. The introduction of _safe_convert_field_value and pre-flight type correction using Polars are excellent additions for preventing common type-related errors. However, the implementation has introduced considerable code duplication, especially in import_threaded.py for retrieving model field metadata and performing type conversions. Refactoring these duplicated blocks into centralized helper functions would greatly improve the code's maintainability and readability. Additionally, there are a few areas where logic could be simplified or made more robust, such as in CSV reading and redundant connection fetching.

src/odoo_data_flow/import_threaded.py

src/odoo_data_flow/importer.py

src/odoo_data_flow/import_threaded.py

src/odoo_data_flow/lib/preflight.py

src/odoo_data_flow/lib/relational_import.py

…ed branch onto latest master\n- Resolved merge conflicts in import_threaded.py, importer.py, and relational_import.py\n- Maintained enhanced debug logging and field handling functionality\n- Preserved fixes for direct database ID handling and connection management

…ith problematic bytes like 0x9d

…o int)

…empty strings to prevent IndexError

…lues to prevent bulk import failures

…rors that were being masked by generic IndexError handler

…ve UX

…ors by safely converting field values before sending to Odoo

…ex out of range errors occur in bulk load operations

…ers for integer fields to prevent tuple index out of range errors

…hen _fields is a function/method instead of a dictionary

…n _fields is a method vs dictionary property, and add safer IndexError catching in both batch and individual record processing

…console to avoid flooding main console logs

…ex out of range errors caused by float strings in integer fields

… errors: broaden detection to catch 'does not seem to be an integer' errors alongside 'tuple index out of range'

…atch and add proper IndexError handling with fallback to individual record processing

…oat strings in integer fields before import using Polars casting

…s.console.print calls

…y checking model._fields before accessing it

…s in load batch processing and fall back to individual record creation

…(float to int)" This reverts commit cee856c.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

…ed branch onto latest master\n- Resolved merge conflicts in import_threaded.py, importer.py, and relational_import.py\n- Maintained enhanced debug logging and field handling functionality\n- Preserved fixes for direct database ID handling and connection management

…ers for integer fields to prevent tuple index out of range errors

…hen _fields is a function/method instead of a dictionary

…n _fields is a method vs dictionary property, and add safer IndexError catching in both batch and individual record processing

…console to avoid flooding main console logs

…ex out of range errors caused by float strings in integer fields

… errors: broaden detection to catch 'does not seem to be an integer' errors alongside 'tuple index out of range'

…atch and add proper IndexError handling with fallback to individual record processing

…oat strings in integer fields before import using Polars casting

…s.console.print calls

…y checking model._fields before accessing it

…s in load batch processing and fall back to individual record creation

…(float to int)" This reverts commit cee856c.

…taFlow/odoo-data-flow into fix-o2m-id-field-handling-rebased

bosd · 2025-09-28T16:13:41Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant improvements to the data import process, focusing on robustness and error handling, particularly for data type mismatches that cause "tuple index out of range" errors. Key changes include refactoring CSV reading to handle encoding issues, adding safe type conversion logic before sending data to Odoo, and enhancing relational ID resolution to support both database and XML IDs.

My review focuses on several areas of code duplication that have been introduced, potential performance bottlenecks in file parsing and database updates, and the use of fixed delays (time.sleep) which could be replaced with more dynamic solutions. Addressing these points will further improve the maintainability and performance of this already much-improved import engine.

src/odoo_data_flow/lib/relational_import.py

src/odoo_data_flow/import_threaded.py

gemini-code-assist · 2025-09-28T16:16:10Z

src/odoo_data_flow/import_threaded.py

+    elif field_type == "float":
+        try:
+            # Convert numeric strings to float
+            if str_value.replace(".", "").replace("-", "").isdigit():
+                return float(str_value)
+            else:
+                # Non-numeric string - leave as-is
+                return str_value
+        except (ValueError, TypeError):
+            # Conversion failed - leave as original string
+            return field_value


The logic for converting a string to a float is a bit complex and can be simplified. The if str_value.replace(".", "").replace("-", "").isdigit(): check is not fully robust (e.g., it would fail for scientific notation or incorrectly handle multiple decimal points) and is somewhat redundant given the surrounding try-except block.

A more direct and robust approach would be to attempt the conversion and let the except block handle failures.

elif field_type == "float": try: # Attempt to convert directly to float. return float(str_value) except (ValueError, TypeError): # Conversion failed - leave as original string for Odoo to handle. return field_value

src/odoo_data_flow/import_threaded.py

gemini-code-assist · 2025-09-28T16:16:10Z

src/odoo_data_flow/import_threaded.py

+            # PRE-PROCESSING: Clean up field values to prevent type errors
+            # This prevents "tuple index out of range" errors in Odoo server processing
+            model_fields = _get_model_fields(model)
+            if model_fields:
+                processed_load_lines = []
+                for row in load_lines:
+                    processed_row = []
+                    for i, value in enumerate(row):
+                        if i < len(load_header):
+                            field_name = load_header[i]
+                            clean_field_name = field_name.split("/")[0]
+
+                            field_type = "unknown"
+                            if clean_field_name in model_fields:
+                                field_info = model_fields[clean_field_name]
+                                field_type = field_info.get("type", "unknown")
+
+                            converted_value = _safe_convert_field_value(
+                                field_name, value, field_type
+                            )
+                            processed_row.append(converted_value)
+                        else:
+                            processed_row.append(value)
+                    processed_load_lines.append(processed_row)
+                load_lines = processed_load_lines
+            else:
+                log.debug(
+                    "Model has no _fields attribute, using raw values for load method"
+                )


The pre-processing logic here to safely convert field values is a duplicate of the logic found in _create_batch_individually (lines 662-674).

To adhere to the DRY (Don't Repeat Yourself) principle and improve maintainability, this logic should be extracted into a single helper function. This function could take the header, data rows, and model fields as input and return the processed rows, and then be called from both _execute_load_batch and _create_batch_individually.

src/odoo_data_flow/importer.py

src/odoo_data_flow/lib/preflight.py

src/odoo_data_flow/lib/relational_import.py

gemini-code-assist · 2025-09-28T16:16:10Z

src/odoo_data_flow/lib/relational_import.py

+    # 4. Execute the updates
+    success = _execute_write_tuple_updates(
+        config, model, field, link_df, id_map, related_model_fk, original_filename
+    )
+
+    # Count successful updates - get from link_df
+    if link_df.height > 0:
+        successful_count = len(
+            [
+                row["external_id"]
+                for row in link_df.iter_rows(named=True)
+                if id_map.get(row["external_id"])
+            ]
+        )
+    else:
+        successful_count = 0
+    failed_count = 0 if success else "unknown"
+
+    log.info(
+        f"Finished 'Write Tuple' for '{field}': "
+        f"{successful_count} successful, {failed_count} failed."
+    )
+
+    return success


After this refactoring, the _create_relational_records function is no longer called from run_write_tuple_import and appears to be unused throughout the codebase. If it is indeed no longer needed, it should be removed to eliminate dead code and improve maintainability.

bosd · 2025-09-28T17:13:39Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces significant enhancements to the data import process, focusing on robustness and error handling. Key changes include proactive data type conversion to prevent common errors with numeric fields, more resilient logic for reading CSV files with various encodings, and a major refactoring of how relational fields (especially many-to-many) are handled. The new approach for relational imports is more idiomatic to Odoo and more efficient.

My review focuses on a few areas for improvement:

A potential performance issue with large files due to in-memory file reading.
A resource leak from a temporary file that is not being cleaned up.
Opportunities to reduce code duplication and remove dead code for better maintainability.

Overall, this is a strong set of changes that will make the import tool much more reliable.

gemini-code-assist bot reviewed Sep 28, 2025

View reviewed changes

bosd and others added 23 commits September 28, 2025 17:30

Fix KeyError: strategies issue by using safe dict access

fcc4135

Add encoding detection fallback to handle character encoding issues w…

a57ec5e

…ith problematic bytes like 0x9d

Add field type conversion to handle Odoo version differences (float t…

6022775

…o int)

Fix empty value handling: send 0 for empty numeric fields instead of …

ec7ea38

…empty strings to prevent IndexError

Make type conversion more conservative: only convert clean numeric va…

4a03730

…lues to prevent bulk import failures

Fix IndexError handling to properly catch tuple index out of range er…

6feb1d8

…rors that were being masked by generic IndexError handler

Add user-facing warnings for tuple index out of range errors to impro…

6e540b1

…ve UX

Add auto-type casting feature to prevent tuple index out of range err…

bab17d8

…ors by safely converting field values before sending to Odoo

Add immediate fallback to individual record processing when tuple ind…

6efa90d

…ex out of range errors occur in bulk load operations

Add pre-processing to convert float string values like '1.0' to integ…

be2733d

…ers for integer fields to prevent tuple index out of range errors

Fix all occurrences of model._fields direct access: properly handle w…

dafc652

…hen _fields is a function/method instead of a dictionary

Fix IndexError handling and model._fields access: properly handle whe…

deb5930

…n _fields is a method vs dictionary property, and add safer IndexError catching in both batch and individual record processing

Improve UX: Move tuple index out of range error messages to progress …

d35dc38

…console to avoid flooding main console logs

Add targeted type conversion before model.load() to prevent tuple ind…

7f3dead

…ex out of range errors caused by float strings in integer fields

Add targeted error handling for Odoo 18 integer field type conversion…

6cec0a6

… errors: broaden detection to catch 'does not seem to be an integer' errors alongside 'tuple index out of range'

Fix IndexError handling: remove problematic type conversion in load b…

e2623ee

…atch and add proper IndexError handling with fallback to individual record processing

Add preflight type validation with auto-correction: detect and fix fl…

e582e3f

…oat strings in integer fields before import using Polars casting

Fix NameError: 'progress' is not defined by properly guarding progres…

4179299

…s.console.print calls

Fix TypeError: argument of type 'function' is not iterable by properl…

8357d64

…y checking model._fields before accessing it

Add simple IndexError handler to catch tuple index out of range error…

20f99a7

…s in load batch processing and fall back to individual record creation

Pre-commit fixes

9212291

Revert "Add field type conversion to handle Odoo version differences …

0914ff5

…(float to int)" This reverts commit cee856c.

bosd force-pushed the fix-o2m-id-field-handling-rebased branch from 9abd787 to 0914ff5 Compare September 28, 2025 15:36

bosd and others added 4 commits September 28, 2025 17:36

Update src/odoo_data_flow/lib/relational_import.py

7536bc2

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update src/odoo_data_flow/importer.py

9e91830

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Fix KeyError: strategies issue by using safe dict access

eb02cce

bosd added 17 commits September 28, 2025 17:47

Add pre-processing to convert float string values like '1.0' to integ…

7cf27da

…ers for integer fields to prevent tuple index out of range errors

Fix all occurrences of model._fields direct access: properly handle w…

d50c291

…hen _fields is a function/method instead of a dictionary

Fix IndexError handling and model._fields access: properly handle whe…

dd57b87

…n _fields is a method vs dictionary property, and add safer IndexError catching in both batch and individual record processing

Improve UX: Move tuple index out of range error messages to progress …

dc31707

…console to avoid flooding main console logs

Add targeted type conversion before model.load() to prevent tuple ind…

f546758

…ex out of range errors caused by float strings in integer fields

Add targeted error handling for Odoo 18 integer field type conversion…

18f19ba

… errors: broaden detection to catch 'does not seem to be an integer' errors alongside 'tuple index out of range'

Fix IndexError handling: remove problematic type conversion in load b…

61e01c0

…atch and add proper IndexError handling with fallback to individual record processing

Add preflight type validation with auto-correction: detect and fix fl…

1c6c4de

…oat strings in integer fields before import using Polars casting

Fix NameError: 'progress' is not defined by properly guarding progres…

a3781d1

…s.console.print calls

Fix TypeError: argument of type 'function' is not iterable by properl…

783a003

…y checking model._fields before accessing it

Add simple IndexError handler to catch tuple index out of range error…

6f9e344

…s in load batch processing and fall back to individual record creation

Pre-commit fixes

513d862

Revert "Add field type conversion to handle Odoo version differences …

2f91b8a

…(float to int)" This reverts commit cee856c.

Merge branch 'fix-o2m-id-field-handling-rebased' of github.com:OdooDa…

fd9ef21

…taFlow/odoo-data-flow into fix-o2m-id-field-handling-rebased

review comments

7a5955f

review comments

13f756e

review comments

50a28f6

gemini-code-assist bot reviewed Sep 28, 2025

View reviewed changes

bosd added 7 commits September 28, 2025 18:23

review comments fix linting and mypy errors

23e8e35

remove dead code

a8d6451

optimize memory usage, read csv file with iterator

90d9b2a

optimize polars typeconversion

057ea6b

remove hard coded sleep timers

53139c3

review comments

ca878fa

Fixup review comments

efd35c9

gemini-code-assist bot reviewed Sep 28, 2025

View reviewed changes

bosd and others added 2 commits September 28, 2025 20:38

Merge branch 'master' into fix-o2m-id-field-handling-rebased

3173b8f

Fixup relational import

727919a

Fix o2m id field handling rebased #140

Are you sure you want to change the base?

Fix o2m id field handling rebased #140

Uh oh!

Conversation

bosd commented Sep 28, 2025

Uh oh!

gemini-code-assist bot commented Sep 28, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bosd commented Sep 28, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Sep 28, 2025

Choose a reason for hiding this comment

Uh oh!

bosd commented Sep 28, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants