Neural network optimizers module by shretadas · Pull Request #13689 · TheAlgorithms/Python

shretadas · 2025-10-22T15:48:49Z

Neural Network Optimizers Module

This PR adds a comprehensive neural network optimizers module implementing 5 standard optimization algorithms used in machine learning and deep learning.
Fixes #13662

What's Added:

Add SGD (Stochastic Gradient Descent) optimizer
Add MomentumSGD with momentum acceleration
Add NAG (Nesterov Accelerated Gradient) optimizer
Add Adagrad with adaptive learning rates
Add Adam optimizer combining momentum and RMSprop
Include comprehensive doctests (61 tests, all passing)
Add abstract BaseOptimizer for consistent interface
Include detailed mathematical documentation
Add educational examples and performance comparisons
Follow repository guidelines: type hints, error handling, pure Python

Implements standard optimization algorithms for neural network training with educational focus and comprehensive testing coverage.

Technical Details:

Algorithms Implemented:

SGD: θ = θ - α∇θ (basic gradient descent)
MomentumSGD: v = βv + (1-β)∇θ, θ = θ - αv
NAG: Uses lookahead gradients for better convergence
Adagrad: Adaptive learning rates per parameter
Adam: Combines momentum + adaptive learning rates

Files Added:

neural_network/optimizers/
├── init.py # Package initialization
├── README.md # Comprehensive documentation
├── base_optimizer.py # Abstract base class
├── sgd.py # Stochastic Gradient Descent
├── momentum_sgd.py # SGD with Momentum
├── nag.py # Nesterov Accelerated Gradient
├── adagrad.py # Adagrad optimizer
├── adam.py # Adam optimizer
├── test_optimizers.py # Comprehensive test suite
└── IMPLEMENTATION_SUMMARY.md # Technical implementation details

Testing Coverage:

61 comprehensive doctests (100% pass rate)
Error handling for all edge cases
Multi-dimensional parameter support
Performance comparison examples

Describe your change:

Add an algorithm?
Fix a bug or typo in an existing algorithm?
Add or change doctests? -- Note: Please avoid changing both code and tests in a single pull request.
Documentation change?

Checklist:

- Add SGD (Stochastic Gradient Descent) optimizer - Add MomentumSGD with momentum acceleration - Add NAG (Nesterov Accelerated Gradient) optimizer - Add Adagrad with adaptive learning rates - Add Adam optimizer combining momentum and RMSprop - Include comprehensive doctests (61 tests, all passing) - Add abstract BaseOptimizer for consistent interface - Include detailed mathematical documentation - Add educational examples and performance comparisons - Follow repository guidelines: type hints, error handling, pure Python Implements standard optimization algorithms for neural network training with educational focus and comprehensive testing coverage.

algorithms-keeper

Click here to look at the relevant links ⬇️

🔗 Relevant Links

Repository:

Contributing guidelines

Project Euler solution guidelines

Python:

Formatted string literals (f-strings)

Type hints

doctest

unittest

pytest

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper commands and options

algorithms-keeper actions can be triggered by commenting on this PR:

@algorithms-keeper review to trigger the checks for only added pull request files

@algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/adagrad.py

+        Raises:
+            ValueError: If parameters and gradients have different shapes
+        """
+        def _adagrad_update_recursive(params, grads, acc_grads):


Please provide return type hint for the function: _adagrad_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: params

Please provide type hint for the parameter: grads

Please provide type hint for the parameter: acc_grads

def _check_and_update_recursive(
params: list[float],
grads: list[float]
) -> list[float]:
...

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/adam.py

+        bias_correction1 = 1 - self.beta1 ** self._time_step
+        bias_correction2 = 1 - self.beta2 ** self._time_step
+
+        def _adam_update_recursive(params, grads, first_moment, second_moment):


Please provide return type hint for the function: _adam_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: params

Please provide type hint for the parameter: grads

Please provide type hint for the parameter: first_moment

Please provide type hint for the parameter: second_moment

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/adam.py

+    x_adagrad = [-1.0, 1.0] 
+    x_adam = [-1.0, 1.0]
+
+    def rosenbrock(x, y):


Please provide return type hint for the function: rosenbrock. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide descriptive name for the parameter: x

Please provide type hint for the parameter: x

Please provide descriptive name for the parameter: y

Please provide type hint for the parameter: y

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/adam.py

+        """Rosenbrock function: f(x,y) = 100*(y-x²)² + (1-x)²"""
+        return 100 * (y - x*x)**2 + (1 - x)**2
+
+    def rosenbrock_gradient(x, y):


Please provide return type hint for the function: rosenbrock_gradient. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide descriptive name for the parameter: x

Please provide type hint for the parameter: x

Please provide descriptive name for the parameter: y

Please provide type hint for the parameter: y

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/momentum_sgd.py

+        Raises:
+            ValueError: If parameters and gradients have different shapes
+        """
+        def _check_shapes_and_get_velocity(params, grads, velocity):


Please provide return type hint for the function: _check_shapes_and_get_velocity. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: params

Please provide type hint for the parameter: grads

Please provide type hint for the parameter: velocity

algorithms-keeper · 2025-10-22T15:49:03Z

neural_network/optimizers/nag.py

+        Raises:
+            ValueError: If parameters and gradients have different shapes
+        """
+        def _nag_update_recursive(params, grads, velocity):


Please provide return type hint for the function: _nag_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: params

Please provide type hint for the parameter: grads

Please provide type hint for the parameter: velocity

algorithms-keeper · 2025-10-22T15:49:04Z

neural_network/optimizers/nag.py

+    x_momentum = [2.5]
+    x_nag = [2.5]
+
+    def gradient_f(x):


Please provide return type hint for the function: gradient_f. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide descriptive name for the parameter: x

Please provide type hint for the parameter: x

algorithms-keeper · 2025-10-22T15:49:04Z

neural_network/optimizers/nag.py

+        """Gradient of f(x) = 0.1*x^4 - 2*x^2 + x is f'(x) = 0.4*x^3 - 4*x + 1"""
+        return 0.4 * x**3 - 4 * x + 1
+
+    def f(x):


Please provide descriptive name for the function: f

Please provide return type hint for the function: f. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide descriptive name for the parameter: x

Please provide type hint for the parameter: x

algorithms-keeper · 2025-10-22T15:49:04Z

neural_network/optimizers/sgd.py

+        Raises:
+            ValueError: If parameters and gradients have different shapes
+        """
+        def _check_and_update_recursive(params, grads):


Please provide return type hint for the function: _check_and_update_recursive. If the function does not return a value, please provide the type hint as: def function() -> None:

Please provide type hint for the parameter: params

Please provide type hint for the parameter: grads

for more information, see https://pre-commit.ci

algorithms-keeper bot added documentation This PR modified documentation files require descriptive names This PR needs descriptive function and/or variable names require type hints https://docs.python.org/3/library/typing.html labels Oct 22, 2025

algorithms-keeper bot reviewed Oct 22, 2025

View reviewed changes

algorithms-keeper bot added the awaiting reviews This PR is ready to be reviewed label Oct 22, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

05b5c45

for more information, see https://pre-commit.ci

algorithms-keeper bot added the tests are failing Do not merge until tests pass label Oct 22, 2025

shretadas closed this Oct 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

Neural network optimizers module#13689

Neural network optimizers module#13689
shretadas wants to merge 2 commits intoTheAlgorithms:masterfrom
shretadas:master

shretadas commented Oct 22, 2025

Uh oh!

algorithms-keeper bot left a comment

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

shretadas Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

algorithms-keeper bot Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Comments

Conversation

shretadas commented Oct 22, 2025