feat(oss-opensearch): Add Scalar Quantization support #685

Akhil-Pathivada · 2025-12-24T09:24:51Z

Summary

Adds support for Lucene Scalar Quantization (SQ) and FAISS 16-bit Scalar Quantization (FP16) for OSS OpenSearch, enabling users to reduce memory usage for in-memory vector indexes.

Changes

Backend

Refactored quantization enum: fp32/fp16 → None/LuceneSQ/FaissSQfp16
Added new fields:
- confidence_interval (float, optional): For Lucene SQ quantile calculation
- clip (bool, optional): For FAISS FP16 out-of-range value handling
Implemented validator: Converts UI/CLI string values to enum with backward compatibility
Updated encoder logic: Engine-aware configuration for Lucene SQ and FAISS FP16
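
The validator described above might look roughly like the sketch below. This is not the PR's actual code: the class name `QuantizationType`, the helper `parse_quantization`, and in particular the mapping of the legacy `fp32`/`fp16` values onto the new members are assumptions made for illustration only.

```python
from enum import Enum
from typing import Union


class QuantizationType(str, Enum):
    # Member values mirror the names listed in this PR; the class name is hypothetical.
    NONE = "None"
    LUCENE_SQ = "LuceneSQ"
    FAISS_SQ_FP16 = "FaissSQfp16"


# ASSUMPTION: how legacy config values map to the new members.
# The PR only states that backward compatibility is preserved.
LEGACY_ALIASES = {
    "fp32": QuantizationType.NONE,
    "fp16": QuantizationType.FAISS_SQ_FP16,
}


def parse_quantization(value: Union[str, QuantizationType, None]) -> QuantizationType:
    """Normalize a UI/CLI value (or a legacy value) to a QuantizationType member."""
    if value is None:
        return QuantizationType.NONE
    if isinstance(value, QuantizationType):
        return value
    text = str(value).strip().lower()
    if text in LEGACY_ALIASES:
        return LEGACY_ALIASES[text]
    for member in QuantizationType:
        if member.value.lower() == text:
            return member
    raise ValueError(f"Unsupported quantization type: {value!r}")
```

Matching case-insensitively is a design choice of this sketch, so that `"lucenesq"` typed on a command line and `"LuceneSQ"` picked from a dropdown resolve to the same member.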

Frontend

Engine-aware UI: Separate quantization dropdowns for Lucene and FAISS engines
- Lucene: ["None", "LuceneSQ"]
- FAISS: ["None", "FaissSQfp16"]
Progressive disclosure: Optional parameters appear only when relevant
- confidence_interval shown for Lucene SQ
- clip shown for FAISS FP16
Prevents invalid combinations: UI logic ensures engine/quantization compatibility
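
The engine/quantization rules above can be expressed as a small lookup, sketched here in plain Python. The option lists and parameter names come from this PR; the function `visible_parameters` and the two table names are hypothetical and do not represent the actual UI code.

```python
from typing import List

# Quantization choices offered per engine, as listed in this PR.
QUANTIZATION_OPTIONS = {
    "lucene": ["None", "LuceneSQ"],
    "faiss": ["None", "FaissSQfp16"],
}

# Optional parameter disclosed for each quantization choice.
EXTRA_PARAMETER = {
    "LuceneSQ": "confidence_interval",
    "FaissSQfp16": "clip",
}


def visible_parameters(engine: str, quantization: str) -> List[str]:
    """Return the optional fields to show, rejecting invalid engine/quantization pairs."""
    allowed = QUANTIZATION_OPTIONS.get(engine.lower())
    if allowed is None:
        raise ValueError(f"Unknown engine: {engine!r}")
    if quantization not in allowed:
        raise ValueError(f"{quantization!r} is not available for engine {engine!r}")
    extra = EXTRA_PARAMETER.get(quantization)
    return [extra] if extra else []
```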

CLI

Updated options: --quantization-type now accepts None, LuceneSQ, FaissSQfp16
New parameters: --confidence-interval and --clip for fine-tuning
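
For illustration, the three options could be declared as follows. The flag names and accepted values are taken from this PR; everything else is an assumption. The sketch uses the standard-library `argparse` as a stand-in (the project's real CLI wiring is not shown here), and it treats `--clip` as a boolean switch, which the PR description does not confirm.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in parser; only the flag names come from the PR.
    parser = argparse.ArgumentParser(prog="quantization-demo")
    parser.add_argument(
        "--quantization-type",
        choices=["None", "LuceneSQ", "FaissSQfp16"],
        default="None",
    )
    # Only meaningful together with LuceneSQ.
    parser.add_argument("--confidence-interval", type=float, default=None)
    # Only meaningful together with FaissSQfp16. ASSUMPTION: modeled as a switch.
    parser.add_argument("--clip", action="store_true")
    return parser


args = build_parser().parse_args(
    ["--quantization-type", "LuceneSQ", "--confidence-interval", "0.95"]
)
```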

Technical Details

Lucene SQ

Converts 32-bit float vectors to 7-bit integers, stored one byte per dimension (roughly 4x memory reduction)
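Optional confidence_interval (0.0-1.0) controls the quantiles used to compute the quantization bounds (note: OpenSearch itself documents the accepted values as 0 or a value between 0.9 and 1.0)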
OpenSearch API: encoder: {name: "sq", parameters: {confidence_interval: <value>}}
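
A minimal sketch of how the Lucene SQ encoder could be assembled into a k-NN method definition, together with the memory arithmetic behind the 4x figure. The function names are hypothetical, the HNSW method name is an assumption about the surrounding index configuration, and the range check simply mirrors the 0.0-1.0 bound stated in this PR.

```python
from typing import Any, Dict, Optional


def lucene_sq_method(confidence_interval: Optional[float] = None) -> Dict[str, Any]:
    """Build a k-NN method definition that uses the Lucene scalar quantizer."""
    encoder: Dict[str, Any] = {"name": "sq"}
    if confidence_interval is not None:
        # Bound taken from this PR's description.
        if not 0.0 <= confidence_interval <= 1.0:
            raise ValueError("confidence_interval must be within [0.0, 1.0]")
        encoder["parameters"] = {"confidence_interval": confidence_interval}
    # ASSUMPTION: HNSW is the method in use; only the encoder shape is from the PR.
    return {"name": "hnsw", "engine": "lucene", "parameters": {"encoder": encoder}}


def vector_bytes(num_vectors: int, dimension: int, bytes_per_dimension: int) -> int:
    """Raw vector storage only; graph and other index overhead are ignored."""
    return num_vectors * dimension * bytes_per_dimension


# float32 uses 4 bytes per dimension; each quantized value occupies 1 byte.
fp32_bytes = vector_bytes(1_000_000, 768, 4)
sq_bytes = vector_bytes(1_000_000, 768, 1)
reduction = fp32_bytes / sq_bytes
```

Omitting `parameters` when no confidence_interval is given leaves the choice of default to OpenSearch rather than hard-coding one in the client.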

FAISS FP16

Converts 32-bit float vectors to 16-bit floats (2x memory reduction)
Optional clip parameter clamps values that fall outside the FP16 range [-65504, 65504] to that range
OpenSearch API: encoder: {name: "sq", parameters: {type: "fp16", clip: true}}
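
The sketch below builds the FAISS FP16 encoder fragment and illustrates what clipping to the FP16 range means for individual values. The helper names are hypothetical, and the clamping here is a client-side illustration of the idea, not the code OpenSearch or FAISS runs.

```python
import struct
from typing import Any, Dict, List

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def faiss_fp16_encoder(clip: bool = False) -> Dict[str, Any]:
    """Encoder fragment in the shape given in this PR."""
    return {"name": "sq", "parameters": {"type": "fp16", "clip": clip}}


def clip_to_fp16_range(values: List[float]) -> List[float]:
    """Clamp each component into [-65504, 65504]."""
    return [max(-FP16_MAX, min(FP16_MAX, v)) for v in values]


def roundtrip_fp16(value: float) -> float:
    """Encode a float as half precision and decode it again.

    struct raises OverflowError for values too large for the 'e' format.
    """
    return struct.unpack("<e", struct.pack("<e", value))[0]


clipped = clip_to_fp16_range([70000.0, -1_000_000.0, 1.5])
```

As far as the OpenSearch documentation describes it, leaving clip at false means vectors with out-of-range components are rejected rather than silently altered, so enabling it trades exactness at the extremes for ingest robustness.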

Testing

✅ All linter checks passed
✅ Backward compatible with existing configs
✅ Engine-aware UI prevents invalid configurations
✅ CLI and UI both functional

Akhil-Pathivada · 2025-12-24T11:17:37Z

/assign @alwayslove2013

sre-ci-robot · 2025-12-25T03:08:31Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: Akhil-Pathivada, alwayslove2013
To complete the pull request process, please assign xuanyang-cn after the PR has been reviewed.
You can assign the PR to them by writing /assign @xuanyang-cn in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

feat(oss-opensearch): Add Scalar Quantization support

f528fdc

Akhil-Pathivada force-pushed the feature/quantization-support branch from c81a1ef to f528fdc Compare December 24, 2025 11:11

Akhil-Pathivada marked this pull request as ready for review December 24, 2025 11:17

alwayslove2013 approved these changes Dec 25, 2025

alwayslove2013 merged commit eb79134 into zilliztech:main Dec 25, 2025
4 checks passed

Akhil-Pathivada deleted the feature/quantization-support branch December 25, 2025 06:03
