Commit 7ee038a

Add performance profiling instrumentation for NetGraph
1 parent ae96cd2

5 files changed: +1227 / -15 lines
docs/reference/api-full.md

Lines changed: 93 additions & 11 deletions

@@ -10,9 +10,9 @@ For a curated, example-driven API guide, see **[api.md](api.md)**.
 > - **[CLI Reference](cli.md)** - Command-line interface
 > - **[DSL Reference](dsl.md)** - YAML syntax guide
-**Generated from source code on:** July 04, 2025 at 19:15 UTC
+**Generated from source code on:** July 05, 2025 at 15:08 UTC
 
-**Modules auto-discovered:** 48
+**Modules auto-discovered:** 50
 
 ---
 
@@ -675,6 +675,90 @@ Returns:
 
 ---
 
+## ngraph.profiling
+
+Performance profiling instrumentation for NetGraph workflow execution.
+
+CPU profiler with workflow step timing, function analysis, and bottleneck detection.
+
+### PerformanceProfiler
+
+CPU profiler for NetGraph workflow execution.
+
+Profiles workflow steps using cProfile and identifies bottlenecks.
+
+**Methods:**
+
+- `analyze_performance(self) -> 'None'`
+  - Analyze profiling results and identify bottlenecks.
+- `end_scenario(self) -> 'None'`
+  - End profiling for the entire scenario execution.
+- `get_top_functions(self, step_name: 'str', limit: 'int' = 10) -> 'List[Tuple[str, float, int]]'`
+  - Get the top CPU-consuming functions for a specific step.
+- `profile_step(self, step_name: 'str', step_type: 'str') -> 'Generator[None, None, None]'`
+  - Context manager for profiling individual workflow steps.
+- `save_detailed_profile(self, output_path: 'Path', step_name: 'Optional[str]' = None) -> 'None'`
+  - Save detailed profiling data to a file.
+- `start_scenario(self) -> 'None'`
+  - Start profiling for the entire scenario execution.
+
+### PerformanceReporter
+
+Formats and displays performance profiling results.
+
+Generates text reports with timing analysis, bottleneck identification, and optimization suggestions.
+
+**Methods:**
+
+- `generate_report(self) -> 'str'`
+  - Generate performance report.
+
+### ProfileResults
+
+Profiling results for a scenario execution.
+
+Attributes:
+    step_profiles: List of individual step performance profiles.
+    total_wall_time: Total wall-clock time for entire scenario.
+    total_cpu_time: Total CPU time across all steps.
+    total_function_calls: Total function calls across all steps.
+    bottlenecks: List of performance bottlenecks (>10% execution time).
+    analysis_summary: Performance metrics and statistics.
+
+**Attributes:**
+
+- `step_profiles` (List[StepProfile]) = []
+- `total_wall_time` (float) = 0.0
+- `total_cpu_time` (float) = 0.0
+- `total_function_calls` (int) = 0
+- `bottlenecks` (List[Dict[str, Any]]) = []
+- `analysis_summary` (Dict[str, Any]) = {}
+
+### StepProfile
+
+Performance profile data for a single workflow step.
+
+Attributes:
+    step_name: Name of the workflow step.
+    step_type: Type/class name of the workflow step.
+    wall_time: Total wall-clock time in seconds.
+    cpu_time: CPU time spent in step execution.
+    function_calls: Number of function calls during execution.
+    memory_peak: Peak memory usage during step (if available).
+    cprofile_stats: Detailed cProfile statistics object.
+
+**Attributes:**
+
+- `step_name` (str)
+- `step_type` (str)
+- `wall_time` (float)
+- `cpu_time` (float)
+- `function_calls` (int)
+- `memory_peak` (Optional[float])
+- `cprofile_stats` (Optional[pstats.Stats])
+
+---
+
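The `profile_step` context manager documented above can be approximated with the standard library alone. The sketch below is an illustrative stand-in, not the ngraph implementation: `MiniProfiler` and its dict-based records are hypothetical, but the cProfile/pstats mechanics mirror what the docs describe.

```python
import cProfile
import pstats
import time
from contextlib import contextmanager


class MiniProfiler:
    """Toy stand-in for PerformanceProfiler: times a step and counts calls."""

    def __init__(self) -> None:
        self.step_profiles = []

    @contextmanager
    def profile_step(self, step_name: str, step_type: str):
        """Profile one workflow step, recording wall time and call count."""
        prof = cProfile.Profile()
        start = time.perf_counter()
        prof.enable()
        try:
            yield
        finally:
            prof.disable()
            stats = pstats.Stats(prof)
            self.step_profiles.append({
                "step_name": step_name,
                "step_type": step_type,
                "wall_time": time.perf_counter() - start,
                "function_calls": stats.total_calls,
                "cprofile_stats": stats,
            })


profiler = MiniProfiler()
with profiler.profile_step("demo_step", "DemoStep"):
    sum(i * i for i in range(100_000))  # stand-in workload

record = profiler.step_profiles[0]
print(record["step_name"], record["wall_time"] > 0.0)
```

The recorded dicts play the role of `StepProfile`; a reporter would then aggregate them into totals and flag steps above the bottleneck threshold.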
 ## ngraph.results
 
 Results class for storing workflow step outputs.

@@ -2122,29 +2206,27 @@ Attributes:
 
 ## ngraph.workflow.network_stats
 
-Base statistical analysis of nodes and links.
+Workflow step for basic node and link statistics.
 
 ### NetworkStats
 
-A workflow step that gathers capacity and degree statistics for the network.
+Compute basic node and link statistics for the network.
 
-YAML Configuration:
-```yaml
-workflow:
-  - step_type: NetworkStats
-    name: "stats"  # Optional custom name for this step
-```
+Attributes:
+    include_disabled (bool): If True, include disabled nodes and links in statistics.
+        If False, only consider enabled entities. Defaults to False.
 
 **Attributes:**
 
 - `name` (str)
 - `seed` (Optional[int])
+- `include_disabled` (bool) = False
 
 **Methods:**
 
 - `execute(self, scenario: "'Scenario'") -> 'None'`
   - Execute the workflow step with automatic logging.
-- `run(self, scenario: "'Scenario'") -> 'None'`
+- `run(self, scenario: 'Scenario') -> 'None'`
   - Collect capacity and degree statistics.
 
 ---
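As an illustration of the kind of capacity and degree statistics a step like `NetworkStats` collects, here is a self-contained sketch over a toy topology. The link tuples and field names are hypothetical, chosen only to make the arithmetic concrete.

```python
from collections import Counter
from statistics import mean

# Toy topology: (source, target, capacity) triples; names are illustrative.
links = [
    ("a", "b", 100.0),
    ("b", "c", 40.0),
    ("a", "c", 60.0),
]

# Degree: number of links touching each node.
degree: Counter = Counter()
for src, dst, _cap in links:
    degree[src] += 1
    degree[dst] += 1

capacities = [cap for _, _, cap in links]
stats = {
    "node_count": len(degree),
    "link_count": len(links),
    "mean_degree": mean(degree.values()),
    "total_capacity": sum(capacities),
    "min_capacity": min(capacities),
    "max_capacity": max(capacities),
}
print(stats)
```

An `include_disabled`-style flag would simply filter the `links` (and any node list) before this aggregation runs.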

docs/reference/cli.md

Lines changed: 143 additions & 0 deletions

@@ -120,6 +120,7 @@ python -m ngraph run <scenario_file> [options]
 - `--results`, `-r`: Optional path to export results as JSON. If provided without a path, defaults to "results.json"
 - `--stdout`: Print results to stdout
 - `--keys`, `-k`: Space-separated list of workflow step names to include in output
+- `--profile`: Enable performance profiling with CPU analysis and bottleneck detection
 - `--help`, `-h`: Show help message
 
 ## Examples
@@ -179,6 +180,148 @@ workflow:
 
 Then `--keys build_graph` will include only the results from the BuildGraph step, and `--keys capacity_probe` will include only the CapacityProbe results.
 
+### Performance Profiling
+
+NetGraph provides performance profiling to identify bottlenecks, analyze execution time, and optimize workflow performance. The profiler performs CPU-level analysis with function-by-function timing and bottleneck detection.
+
+#### Performance Analysis
+
+Use `--profile` to get a performance analysis:
+
+```bash
+# Run scenario with profiling
+python -m ngraph run scenario.yaml --profile
+
+# Combine profiling with results export
+python -m ngraph run scenario.yaml --profile --results
+
+# Profiling with filtered output
+python -m ngraph run scenario.yaml --profile --keys capacity_probe
+```
+
+Performance profiling provides:
+
+- **Summary**: Total execution time, CPU efficiency, function call statistics
+- **Step timing analysis**: Time spent in each workflow step with percentage breakdown
+- **Bottleneck identification**: Workflow steps consuming >10% of total execution time
+- **Function-level analysis**: Top CPU-consuming functions within each bottleneck
+- **Call statistics**: Function call counts and timing distribution
+- **CPU utilization patterns**: Detailed breakdown of computational efficiency
+- **Targeted recommendations**: Specific optimization suggestions for each bottleneck
+
+#### Profiling Output
+
+Profiling generates a performance report displayed after scenario execution:
+
+```
+================================================================================
+NETGRAPH PERFORMANCE PROFILING REPORT
+================================================================================
+
+1. SUMMARY
+----------------------------------------
+Total Execution Time:   12.456 seconds
+Total CPU Time:         11.234 seconds
+CPU Efficiency:         90.2%
+Total Workflow Steps:   3
+Average Step Time:      4.152 seconds
+Total Function Calls:   1,234,567
+Function Calls/Second:  99,123
+
+1 performance bottleneck(s) identified
+
+2. WORKFLOW STEP TIMING ANALYSIS
+----------------------------------------
+Step Name        Type           Wall Time   CPU Time   Calls       % Total
+build_graph      BuildGraph     0.123s      0.098s     1,234       1.0%
+capacity_probe   CapacityProbe  11.234s     10.987s    1,200,000   90.2%
+network_stats    NetworkStats   1.099s      0.149s     33,333      8.8%
+
+3. PERFORMANCE BOTTLENECK ANALYSIS
+----------------------------------------
+Bottleneck #1: capacity_probe (CapacityProbe)
+  Wall Time: 11.234s (90.2% of total)
+  CPU Time: 10.987s
+  Function Calls: 1,200,000
+  CPU Efficiency: 97.8% (CPU-intensive workload)
+  Recommendation: Consider algorithmic optimization or parallelization
+
+4. DETAILED FUNCTION ANALYSIS
+----------------------------------------
+Top CPU-consuming functions in 'capacity_probe':
+  ngraph/lib/algorithms/max_flow.py:42(dijkstra_shortest_path)
+    Time: 8.456s, Calls: 500,000
+  ngraph/lib/algorithms/max_flow.py:156(ford_fulkerson)
+    Time: 2.234s, Calls: 250,000
+```
+
+#### Profiling Best Practices
+
+**When to Use Profiling:**
+
+- Performance optimization during development
+- Identifying bottlenecks in complex workflows
+- Analyzing scenarios with large networks or datasets
+- Benchmarking before/after optimization changes
+
+**Development Workflow:**
+
+```bash
+# 1. Profile scenario to identify bottlenecks
+python -m ngraph run scenario.yaml --profile
+
+# 2. Combine with filtering for targeted analysis
+python -m ngraph run scenario.yaml --profile --keys slow_step
+
+# 3. Profile with results export for analysis
+python -m ngraph run scenario.yaml --profile --results analysis.json
+```
+
+**Performance Considerations:**
+
+- Profiling adds moderate overhead (~15-25%)
+- Use production-like data sizes for accurate bottleneck identification
+- Profile multiple runs to account for variability in timing measurements
+- Focus optimization efforts on steps consuming >10% of total execution time
+
+**Interpreting Results:**
+
+- **CPU Efficiency**: Ratio of CPU time to wall time (higher is better for compute-bound tasks)
+- **Function Call Rate**: Calls per second (very high rates may indicate optimization opportunities)
+- **Bottleneck Percentage**: Time percentage helps prioritize optimization efforts
+- **Efficiency Ratio**: Low ratios (<30%) suggest I/O-bound operations or external dependencies
+
+#### Advanced Profiling Scenarios
+
+**Profiling Large Networks:**
+
+```bash
+# Profile capacity analysis on large networks
+python -m ngraph run large_network.yaml --profile --keys capacity_envelope_analysis
+```
+
+**Comparative Profiling:**
+
+```bash
+# Profile before optimization
+python -m ngraph run scenario_v1.yaml --profile > profile_v1.txt
+
+# Profile after optimization
+python -m ngraph run scenario_v2.yaml --profile > profile_v2.txt
+
+# Compare results manually or with diff tools
+```
+
+**Targeted Profiling:**
+
+```bash
+# Profile only specific workflow steps
+python -m ngraph run scenario.yaml --profile --keys capacity_probe network_stats
+
+# Profile with results export for further analysis
+python -m ngraph run scenario.yaml --profile --results analysis.json
+```
+
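The interpretation rules above (CPU efficiency, call rate, the >10% bottleneck threshold) reduce to simple arithmetic over each step's timings. The helper below is a hypothetical illustration, not part of ngraph, fed with the capacity_probe figures from the sample report:

```python
def summarize_step(wall_time: float, cpu_time: float,
                   total_wall_time: float, function_calls: int) -> dict:
    """Derive the report's per-step metrics from raw timings."""
    pct_total = 100.0 * wall_time / total_wall_time
    return {
        # Share of the scenario's total wall-clock time
        "pct_total": pct_total,
        # CPU efficiency: CPU time as a share of wall time
        "cpu_efficiency": 100.0 * cpu_time / wall_time if wall_time else 0.0,
        # Call rate: function calls per wall-clock second
        "calls_per_sec": function_calls / wall_time if wall_time else 0.0,
        # Bottleneck rule: step consumes >10% of total execution time
        "is_bottleneck": pct_total > 10.0,
    }


# capacity_probe figures from the sample report above
s = summarize_step(wall_time=11.234, cpu_time=10.987,
                   total_wall_time=12.456, function_calls=1_200_000)
print(f"{s['pct_total']:.1f}% of total, {s['cpu_efficiency']:.1f}% CPU-bound, "
      f"bottleneck={s['is_bottleneck']}")
```

Plugging in the sample numbers reproduces the report's 90.2% share and 97.8% CPU efficiency, and flags the step as a bottleneck.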
## Output Format
The CLI outputs results in JSON format. The structure depends on the workflow steps executed in your scenario:

ngraph/cli.py

Lines changed: 48 additions & 4 deletions

@@ -11,6 +11,7 @@
 
 from ngraph.explorer import NetworkExplorer
 from ngraph.logging import get_logger, set_global_log_level
+from ngraph.profiling import PerformanceProfiler, PerformanceReporter
 from ngraph.scenario import Scenario
 
 logger = get_logger(__name__)
@@ -392,6 +393,7 @@ def _run_scenario(
     output: Optional[Path],
     stdout: bool,
     keys: Optional[list[str]] = None,
+    profile: bool = False,
 ) -> None:
     """Run a scenario file and optionally export results as JSON.
@@ -401,16 +403,47 @@ def _run_scenario(
         stdout: Whether to also print results to stdout.
         keys: Optional list of workflow step names to include. When ``None`` all steps are
             exported.
+        profile: Whether to enable performance profiling with CPU analysis.
     """
     logger.info(f"Loading scenario from: {path}")
 
     try:
         yaml_text = path.read_text()
         scenario = Scenario.from_yaml(yaml_text)
 
-        logger.info("Starting scenario execution")
-        scenario.run()
-        logger.info("Scenario execution completed successfully")
+        if profile:
+            logger.info("Performance profiling enabled")
+            # Initialize comprehensive profiler
+            profiler = PerformanceProfiler()
+
+            # Start scenario-level profiling
+            profiler.start_scenario()
+
+            logger.info("Starting scenario execution with profiling")
+
+            # Manual execution of workflow steps with profiling
+            for step in scenario.workflow:
+                step_name = step.name or step.__class__.__name__
+                step_type = step.__class__.__name__
+
+                with profiler.profile_step(step_name, step_type):
+                    step.execute(scenario)
+
+            logger.info("Scenario execution completed successfully")
+
+            # End scenario profiling and analyze results
+            profiler.end_scenario()
+            profiler.analyze_performance()
+
+            # Generate and display performance report
+            reporter = PerformanceReporter(profiler.results)
+            performance_report = reporter.generate_report()
+            print("\n" + performance_report)
+
+        else:
+            logger.info("Starting scenario execution")
+            scenario.run()
+            logger.info("Scenario execution completed successfully")
 
         # Only export JSON if output path is provided
         if output:
@@ -493,6 +526,11 @@ def main(argv: Optional[List[str]] = None) -> None:
         nargs="+",
         help="Filter output to these workflow step names",
     )
+    run_parser.add_argument(
+        "--profile",
+        action="store_true",
+        help="Enable performance profiling with CPU analysis and bottleneck detection",
+    )
 
     # Inspect command
     inspect_parser = subparsers.add_parser(
@@ -518,7 +556,13 @@ def main(argv: Optional[List[str]] = None) -> None:
         set_global_log_level(logging.INFO)
 
     if args.command == "run":
-        _run_scenario(args.scenario, args.results, args.stdout, args.keys)
+        _run_scenario(
+            args.scenario,
+            args.results,
+            args.stdout,
+            args.keys,
+            args.profile,
+        )
     elif args.command == "inspect":
         _inspect_scenario(args.scenario, args.detail)