Migrating PGO CLI kuttl tests to chainsaw #138

philrhurst · 2025-08-27T14:23:59Z

These are the first round of kuttl tests migrated to chainsaw. The structure is a single test for each kubectl-pgo command

backup
create
delete
show
version

We can use Kyverno policies to facilitate testing with different image repos.

These are helpful templates to use when testing the PGO CLI

Instructions for how to run the tests

The chainsaw-show Test should remove resources after it completes.

tjmoore4

Looks good overall. A few questions.

testing/chainsaw/README

testing/chainsaw/e2e/values.yaml

tjmoore4 · 2025-09-10T17:47:20Z

testing/chainsaw/e2e/backup/chainsaw-test.yaml

+    - name: Sleep for 10s
+      try:
+      - sleep:
+          duration: 10s


🤔 Is 10s enough for these sleeps? I wonder if the step length or something else is still coming into play here.

Can we set timeouts on different steps to give longer timeouts for the confirm steps or whatever that is taking so long.

Yes, I would still like to investigate getting away from Sleeps and use some form of Get/Wait or Get/Assert to give underlying Resources enough time to be created before checking if they have completed.

testing/chainsaw/e2e/backup/chainsaw-test.yaml

benjaminjb · 2025-09-15T17:02:50Z

testing/chainsaw/e2e/backup/chainsaw-test.yaml

+apiVersion: chainsaw.kyverno.io/v1alpha1
+kind: Test
+metadata:
+  name: check-backup-command-longer-options


This test feels like it's testing that we pass through the options from the command line to the spec. If that's true, I wonder if we could do this as a unit test in the future, and skip the actual backup.

Yes - I think several of the kuttl tests are candidates for other implementations. But I still wanted to migrate them to chainsaw for completeness and discussion.

benjaminjb · 2025-09-15T17:06:53Z

testing/chainsaw/e2e/backup/chainsaw-test.yaml

+        - description: Compare annotations
+          assert:
+            timeout: 30s
+            resource:
+              apiVersion: v1
+              kind: ConfigMap
+              metadata:
+                name: compare-annotations
+              data:
+                (key1 != key2): true
+                (key2 != 'not-found'): true


Didn't know you could do that with Chainsaw, that's interesting.

🤔 I wonder what a step would look like that just compared the two strings in bash...

yes that would be another way to do it - using ConfigMaps to pass test data around is an interesting paradigm I've seen used with Chainsaw. I wrote this test this way to learn more about it. Open to changing the format though.

Do you like it better this way?

I'm open to either -- the cm way was a bit of a shock, but happy to use it going forward. But I'd like being a little consistent if we can. I wonder if there's anything we can't do with the CM version or if the bash version looks messier

I kind of like this way because it makes the assertion more clear (it's not in a bash script...although this would be a simple one).

benjaminjb · 2025-09-15T17:07:12Z

testing/chainsaw/e2e/backup/chainsaw-test.yaml

+apiVersion: chainsaw.kyverno.io/v1alpha1
+kind: Test
+metadata:
+  name: check-backup-command-no-flags


What is this test demonstrating?

This is the conversion of 08--backup-with-just-trigger.yaml which is triggering a backup from the PGO CLI with no options and checking the annotations

benjaminjb · 2025-09-15T17:14:21Z

testing/chainsaw/e2e/create/chainsaw-test.yaml

+      use:
+        template: '../templates/confirm-created.yaml'
+
+    - name: Sleep for 30s


If we need these sleep steps, I think it would be fine to name them all Sleep or something without the time -- it's easy enough to look at the duration to see how long and it avoids drift when we change those durations.

More interesting to me would be why we chose those durations.

Most of the Sleep 30s started at Sleep 10. I bumped them up because running all the test simultaneously was causing issues. I left the descriptive "Sleep for 30s" as I was debugging the tests to know how much time I should wait before the next step.

benjaminjb · 2025-09-15T17:31:05Z

testing/chainsaw/e2e/restore/chainsaw-test.yaml

+apiVersion: chainsaw.kyverno.io/v1alpha1
+kind: Test
+metadata:
+  name: check-restore-confirm-yes


Would it be faster to create one cluster and run several tests on that -- restore without a name, restore no, restore yes...?

Yes it is, but each "test" would then be a separate "step". It becomes an issue of clarity and preference. I.e. do we have a single Restore Test that does multiple things. Or multiple Restore Tests, each doing one single thing.

One thing I like about Chainsaw is that each test is namespace-scoped. So it's easy to separate out tests into discrete units.

benjaminjb · 2025-09-15T17:36:22Z

testing/chainsaw/e2e/restore/chainsaw-test.yaml

+            (contains($stderr, 'Apply failed')): true
+            (contains($stderr, '2 conflicts')): true
+
+    - name: run 'restore' with confirm 'yes' and --force-conflicts


Could you break this up into a few steps? Maybe

set the annotation

get the annotation (maybe the same step as the one above)

issue command

check annotation differs
?

yes, I will look into this. I think I cribbed more of the original kuttl test on this particular one than the others.

benjaminjb · 2025-09-15T17:44:24Z

testing/chainsaw/e2e/start-stop/chainsaw-test.yaml

+
+    - name: 'Confirm cluster is created'
+      use:
+        template: '../templates/confirm-created.yaml'


So the idea is that we can use the same template here because the command did nothing?

yes that's correct

benjaminjb · 2025-09-15T17:45:31Z

testing/chainsaw/e2e/start-stop/chainsaw-test.yaml

+        - sleep:
+            duration: 30s
+
+    - name: run 'stop cluster' with confirm 'y' and force-conflicts


if the start test runs the stop command, I think we can collapse this into one test, yeah?

If memory serves, there are some interesting behaviors that happen when you try to stop a cluster that was created from a manifest rather than the PGO CLI. I think the --force-conflicts is really getting tested here to make sure the cluster will stop and then be started.

The original kuttl test (and this conversion) may be worth revisiting.

benjaminjb · 2025-09-15T17:51:58Z

testing/chainsaw/e2e/version/chainsaw-test.yaml

+  name: version
+spec:
+  steps:
+  - name: step-00


can we get a more descriptive step name?

benjaminjb · 2025-09-15T17:55:02Z

testing/chainsaw/policies/v1.15.1/00-rbac.yaml

+  verbs:
+  - create
+  - update
+  - delete


Why delete?

I was having some issues getting it to work. This Github comment suggested delete.

[Bug] 1.13rc3 - APICall failed to GET resource with raw url kyverno/kyverno#11467 (comment)

The way we defined the selector for backups was quite terse. This refactor simplifies the logic.

tjmoore4

Updates look good to me. Not sure if @benjaminjb is still tracking any blockers, but I think this is good to merge.

benjaminjb · 2025-09-19T19:06:30Z

testing/chainsaw/e2e/support/chainsaw-test.yaml

+    - name: postgresVersion
+      value: 16


Don't we have a central source for pg version?

Yes we do; but the bash for this particular test has already hard-coded 16 in it. I didn't want to refactor to interrupt migrating the tests.

Yeah, this is not a blocker for me; I just hope when we're testing in the far future, when 16 is a problem or dropped, we remember to change it here too (if we're not pulling from that central location)

benjaminjb · 2025-09-19T19:07:43Z

testing/chainsaw/e2e/support/chainsaw-test.yaml

+            metadata:
+              name: kuttl-support-monitoring-cluster
+            spec:
+              postgresVersion: 16


and if we have a central location for the pg version (or even a local version), better to use it

benjaminjb · 2025-09-19T19:08:39Z

testing/chainsaw/e2e/support/chainsaw-test.yaml

+            metadata:
+              name: kuttl-support-instrumentation
+            spec:
+              postgresVersion: 16


benjaminjb · 2025-09-19T19:08:54Z

testing/chainsaw/e2e/support/chainsaw-test.yaml

+                      requests:
+                        storage: 1Gi
+                  replicas: 1
+              postgresVersion: 16


benjaminjb

Really no blockers, but re-reading these tests -- many of which I probably had a hand in -- makes me want to revise these tests. Please make a ticket so we don't forget to!

(What I really want is to shift tests left, e.g., run as much as we can as unit tests) and also combine a few of these tests if we can to run them faster. I know I have created a few tickets re: separating out the cluster work from some CLI work, so it won't just be test changes I'm envisioning in the future.)

philrhurst added 10 commits August 27, 2025 13:33

Kyverno policies

b568ee5

We can use Kyverno policies to facilitate testing with different image repos.

initial set of templates

4de4798

These are helpful templates to use when testing the PGO CLI

default config and values YAML files

22554a8

Chainsaw test for PGO CLI backup

abcad58

Chainsaw test for PGO CLI backup

10e507d

Chainsaw tests for PGO CLI create

160e1e8

Chainsaw tests for PGO CLI delete

65101b3

Chainsaw test for PGO CLI show

5ce5cd9

Chainsaw tests for PGO CLI version

2c19b4f

Add Chainsaw README

3c8b26e

Instructions for how to run the tests

philrhurst marked this pull request as ready for review August 27, 2025 16:10

philrhurst requested review from benjaminjb and tjmoore4 August 27, 2025 16:12

philrhurst added 5 commits August 27, 2025 16:15

remove skipDelete

76c8592

The chainsaw-show Test should remove resources after it completes.

Chainsaw test for PGO CLI start-stop

353c2e9

Chainsaw test for PGO CLI restore

408d93c

Update chainsaw tests with helpful sleeps

f9e2832

Update template with better check for backup complete

f1e1430

tjmoore4 reviewed Sep 10, 2025

View reviewed changes

benjaminjb reviewed Sep 15, 2025

View reviewed changes

added newlines at the end

ed0124d

philrhurst added 3 commits September 18, 2025 00:10

renamed steps in version test

2eefb50

refactored backupSelector

684fa21

The way we defined the selector for backups was quite terse. This refactor simplifies the logic.

Chainsaw test for PGO CLI support export

5b6595f

tjmoore4 approved these changes Sep 19, 2025

View reviewed changes

benjaminjb reviewed Sep 19, 2025

View reviewed changes

testing/chainsaw/e2e/support/chainsaw-test.yaml

requests:

storage: 1Gi

replicas: 1

postgresVersion: 16

Copy link

Collaborator

benjaminjb Sep 19, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto

benjaminjb approved these changes Sep 19, 2025

View reviewed changes

philrhurst merged commit db3197c into CrunchyData:main Sep 19, 2025
12 checks passed

philrhurst deleted the dev branch September 19, 2025 19:30

Migrating PGO CLI kuttl tests to chainsaw #138

Migrating PGO CLI kuttl tests to chainsaw #138

Uh oh!

Conversation

philrhurst commented Aug 27, 2025

Uh oh!

tjmoore4 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benjaminjb Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tjmoore4 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benjaminjb left a comment

benjaminjb Sep 18, 2025 •

edited

Loading