feat: add neural network optimizers module #13685
shretadas wants to merge 2 commits into TheAlgorithms:master from
Conversation
- Add SGD (Stochastic Gradient Descent) optimizer
- Add MomentumSGD with momentum acceleration
- Add NAG (Nesterov Accelerated Gradient) optimizer
- Add Adagrad with adaptive learning rates
- Add Adam optimizer combining momentum and RMSprop
- Include comprehensive doctests (61 tests, all passing)
- Add abstract BaseOptimizer for a consistent interface
- Include detailed mathematical documentation
- Add educational examples and performance comparisons
- Follow repository guidelines: type hints, error handling, pure Python

Implements standard optimization algorithms for neural network training with an educational focus and comprehensive test coverage.
Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.
algorithms-keeper commands and options
algorithms-keeper actions can be triggered by commenting on this PR:
- @algorithms-keeper review to trigger the checks for only added pull request files
- @algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.
Raises:
    ValueError: If parameters and gradients have different shapes
"""

def _adagrad_update_recursive(params, grads, acc_grads):
Please provide return type hint for the function: _adagrad_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: params
Please provide type hint for the parameter: grads
Please provide type hint for the parameter: acc_grads
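For context, one way the requested annotations could look, sketched on flat lists of floats. The PR's helper apparently recurses over nested containers, so the element types, default hyperparameters, and return value below are assumptions, not the PR's actual implementation:

```python
def _adagrad_update_recursive(
    params: list[float],
    grads: list[float],
    acc_grads: list[float],
    learning_rate: float = 0.01,
    epsilon: float = 1e-8,
) -> list[float]:
    """Apply one Adagrad step on flat lists (illustrative sketch only)."""
    if len(params) != len(grads):
        raise ValueError("parameters and gradients have different shapes")
    for i, grad in enumerate(grads):
        acc_grads[i] += grad * grad  # accumulate squared gradients
        # scale each step by the inverse root of the accumulated squares
        params[i] -= learning_rate * grad / (acc_grads[i] ** 0.5 + epsilon)
    return params
```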
bias_correction1 = 1 - self.beta1 ** self._time_step
bias_correction2 = 1 - self.beta2 ** self._time_step

def _adam_update_recursive(params, grads, first_moment, second_moment):
Please provide return type hint for the function: _adam_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: params
Please provide type hint for the parameter: grads
Please provide type hint for the parameter: first_moment
Please provide type hint for the parameter: second_moment
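The bias-correction lines quoted above are the standard Adam correction for zero-initialized moment estimates. A minimal, self-contained sketch of the step they belong to, written on flat lists under an illustrative name (adam_step) rather than the PR's nested recursive helper:

```python
def adam_step(
    params: list[float],
    grads: list[float],
    first_moment: list[float],
    second_moment: list[float],
    time_step: int,  # must start at 1, otherwise the corrections divide by zero
    learning_rate: float = 0.001,
    beta1: float = 0.9,
    beta2: float = 0.999,
    epsilon: float = 1e-8,
) -> list[float]:
    """One Adam update on flat parameter lists (illustration only)."""
    if len(params) != len(grads):
        raise ValueError("parameters and gradients have different shapes")
    bias_correction1 = 1 - beta1**time_step
    bias_correction2 = 1 - beta2**time_step
    for i, grad in enumerate(grads):
        first_moment[i] = beta1 * first_moment[i] + (1 - beta1) * grad
        second_moment[i] = beta2 * second_moment[i] + (1 - beta2) * grad * grad
        m_hat = first_moment[i] / bias_correction1  # debiased first moment
        v_hat = second_moment[i] / bias_correction2  # debiased second moment
        params[i] -= learning_rate * m_hat / (v_hat**0.5 + epsilon)
    return params
```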
x_adagrad = [-1.0, 1.0]
x_adam = [-1.0, 1.0]

def rosenbrock(x, y):
Please provide return type hint for the function: rosenbrock. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide descriptive name for the parameter: x
Please provide type hint for the parameter: x
Please provide descriptive name for the parameter: y
Please provide type hint for the parameter: y
| """Rosenbrock function: f(x,y) = 100*(y-x²)² + (1-x)²""" | ||
| return 100 * (y - x*x)**2 + (1 - x)**2 | ||
|
|
||
| def rosenbrock_gradient(x, y): |
Please provide return type hint for the function: rosenbrock_gradient. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide descriptive name for the parameter: x
Please provide type hint for the parameter: x
Please provide descriptive name for the parameter: y
Please provide type hint for the parameter: y
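A hedged sketch of what the requested hints and descriptive names might look like for this benchmark pair; the names x_coord/y_coord and the tuple return type are assumptions, while the gradient formula follows directly from the Rosenbrock definition quoted above:

```python
def rosenbrock(x_coord: float, y_coord: float) -> float:
    """Rosenbrock function: f(x, y) = 100*(y - x^2)^2 + (1 - x)^2."""
    return 100 * (y_coord - x_coord * x_coord) ** 2 + (1 - x_coord) ** 2


def rosenbrock_gradient(x_coord: float, y_coord: float) -> tuple[float, float]:
    """Analytic gradient of the Rosenbrock function.

    df/dx = -400*x*(y - x^2) - 2*(1 - x)
    df/dy =  200*(y - x^2)
    """
    dx = -400 * x_coord * (y_coord - x_coord * x_coord) - 2 * (1 - x_coord)
    dy = 200 * (y_coord - x_coord * x_coord)
    return dx, dy
```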
Raises:
    ValueError: If parameters and gradients have different shapes
"""

def _check_shapes_and_get_velocity(params, grads, velocity):
Please provide return type hint for the function: _check_shapes_and_get_velocity. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: params
Please provide type hint for the parameter: grads
Please provide type hint for the parameter: velocity
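One possible annotated shape for this helper, assuming flat float lists and a velocity buffer that is created lazily on the first call; both are assumptions, since the PR's body is not quoted here:

```python
def _check_shapes_and_get_velocity(
    params: list[float],
    grads: list[float],
    velocity: list[float] | None,
) -> list[float]:
    """Validate shapes and return a usable velocity buffer."""
    if len(params) != len(grads):
        raise ValueError("parameters and gradients have different shapes")
    if velocity is None:
        velocity = [0.0] * len(params)  # start from rest on the first update
    elif len(velocity) != len(params):
        raise ValueError("velocity buffer does not match parameter shape")
    return velocity
```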
Raises:
    ValueError: If parameters and gradients have different shapes
"""

def _nag_update_recursive(params, grads, velocity):
Please provide return type hint for the function: _nag_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: params
Please provide type hint for the parameter: grads
Please provide type hint for the parameter: velocity
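For reference, a minimal Nesterov step, assuming the caller evaluates gradients at the look-ahead point params + momentum * velocity (that look-ahead is what separates NAG from plain momentum SGD); the name nag_step, the flat-list types, and the default hyperparameters are illustrative, not the PR's:

```python
def nag_step(
    params: list[float],
    grads_at_lookahead: list[float],
    velocity: list[float],
    learning_rate: float = 0.01,
    momentum: float = 0.9,
) -> list[float]:
    """One NAG update using gradients taken at the look-ahead position."""
    if len(params) != len(grads_at_lookahead):
        raise ValueError("parameters and gradients have different shapes")
    for i, grad in enumerate(grads_at_lookahead):
        velocity[i] = momentum * velocity[i] - learning_rate * grad
        params[i] += velocity[i]
    return params
```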
x_momentum = [2.5]
x_nag = [2.5]

def gradient_f(x):
Please provide return type hint for the function: gradient_f. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide descriptive name for the parameter: x
Please provide type hint for the parameter: x
| """Gradient of f(x) = 0.1*x^4 - 2*x^2 + x is f'(x) = 0.4*x^3 - 4*x + 1""" | ||
| return 0.4 * x**3 - 4 * x + 1 | ||
|
|
||
| def f(x): |
Please provide descriptive name for the function: f
Please provide return type hint for the function: f. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide descriptive name for the parameter: x
Please provide type hint for the parameter: x
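One way to satisfy both the descriptive-name and type-hint requests for this pair; objective/objective_gradient and x_value are placeholder names, while the formulas are the ones quoted in the docstring above:

```python
def objective(x_value: float) -> float:
    """Benchmark objective f(x) = 0.1*x^4 - 2*x^2 + x."""
    return 0.1 * x_value**4 - 2 * x_value**2 + x_value


def objective_gradient(x_value: float) -> float:
    """Derivative f'(x) = 0.4*x^3 - 4*x + 1."""
    return 0.4 * x_value**3 - 4 * x_value + 1
```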
Raises:
    ValueError: If parameters and gradients have different shapes
"""

def _check_and_update_recursive(params, grads):
Please provide return type hint for the function: _check_and_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:
Please provide type hint for the parameter: params
Please provide type hint for the parameter: grads
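A sketch of how a fully annotated recursive SGD helper could look; the nested-list handling and the recursive type alias are assumptions based only on the helper's name and the documented ValueError:

```python
Nested = list["float | Nested"]  # floats or arbitrarily nested lists of floats


def _check_and_update_recursive(
    params: Nested, grads: Nested, learning_rate: float = 0.01
) -> Nested:
    """Recursively apply a plain SGD step, validating shapes level by level."""
    if len(params) != len(grads):
        raise ValueError("parameters and gradients have different shapes")
    updated: Nested = []
    for param, grad in zip(params, grads):
        if isinstance(param, list):
            updated.append(_check_and_update_recursive(param, grad, learning_rate))
        else:
            updated.append(param - learning_rate * grad)
    return updated
```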
[pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci
Neural Network Optimizers Module
This PR introduces a comprehensive neural network optimizers module that implements five widely used optimization algorithms for machine learning and deep learning. The primary goal is to enhance the educational value of the repository by including well-documented, tested, and modular implementations.
Fixes #13662
What's Added
- BaseOptimizer for a consistent interface

Technical Details
Algorithms Implemented:
Directory Structure:
neural_network/optimizers/
├── __init__.py # Package initialization
├── README.md # Comprehensive documentation
├── base_optimizer.py # Abstract base class
├── sgd.py # Stochastic Gradient Descent
├── momentum_sgd.py # SGD with Momentum
├── nag.py # Nesterov Accelerated Gradient
├── adagrad.py # Adagrad optimizer
├── adam.py # Adam optimizer
├── test_optimizers.py # Comprehensive test suite
└── IMPLEMENTATION_SUMMARY.md # Technical implementation details
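Only the file layout is visible in this description, so the following is a hedged sketch of how base_optimizer.py and sgd.py could fit together; the update method name and constructor signature are assumptions, not the PR's actual API:

```python
from abc import ABC, abstractmethod


class BaseOptimizer(ABC):
    """Shared interface: every optimizer exposes a single update step."""

    def __init__(self, learning_rate: float = 0.01) -> None:
        if learning_rate <= 0:
            raise ValueError("learning_rate must be positive")
        self.learning_rate = learning_rate

    @abstractmethod
    def update(self, params: list[float], grads: list[float]) -> list[float]:
        """Return parameters after one optimization step."""


class SGD(BaseOptimizer):
    """Plain stochastic gradient descent: theta <- theta - lr * grad."""

    def update(self, params: list[float], grads: list[float]) -> list[float]:
        if len(params) != len(grads):
            raise ValueError("parameters and gradients have different shapes")
        return [p - self.learning_rate * g for p, g in zip(params, grads)]


if __name__ == "__main__":
    # Minimize f(x) = x^2 (gradient 2x), starting from x = 5.0.
    optimizer = SGD(learning_rate=0.1)
    x = [5.0]
    for _ in range(50):
        x = optimizer.update(x, [2 * x[0]])
    print(round(x[0], 4))  # approaches 0.0
```

Keeping the abstract class this thin lets each optimizer file add only its own state (velocity, accumulated gradients, moment estimates) while sharing the learning-rate validation.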
Testing Coverage
Describe Your Change
Checklist
Fixes #13662.

Notes for maintainers and reviewers
[ ] to [x], and save. Only [x] is accepted to mark completion.