RELATED: CQ-1005 - make FlexConnect function call deadline configurable #929

lupko · 2024-12-11T19:17:26Z

FlexConnect's implementation of GetFlightInfo uses task executor to submit tasks that will invoke the FlexConnect functions. The GetFlightInfo will then wait for the task to complete (generate flight) and return result.

The GetFlightInfo implementation had hardcoded (in a constant) timeout that was used for waiting. This timeout was set to 60 seconds. This can sometimes be too little. And what's worse, it cannot be changed.

This PR introduces new setting call_deadline_ms that is expected to appear in the [flexconnect] configuration section. It is deadline in milliseconds. The default was bumped to 180 seconds.

Part of this PR are two additional changes:

Try to cancel the task for when the deadline was exceeded. This is basic sanity. The task may be stuck in queue -> cancel will throw it out immediately. The task may be invoking function that is cancellable -> propagate cancel indicator to the function so that it can act on it.
When task that executes FlexConnect function finishes invocation (and has result), it should check whether it was cancelled & switch itself to non-cancellable state before it returns the result. This is also basic hygiene. Tasks that run non-cancellable functions may be cancelled, the function still finishes the execution and returns result which is then retained by the server for configured amount of time. However, since the task was cancelled, there is no chance a client will ever come for the results -> they will hang in the memory unnecessarily.

- this was hardcoded to 60 seconds before - added new option `call_deadline_ms` - allows to configure deadline for the FlexConnection function call; in millis - if not specified, the default of 180 seconds will be used - added extra e2e test verifying the behavior - sanitized test fixtures / test function JIRA: CQ-1005

- for now, this will mainly work in cases when the task is still in the queue - once the task is running & making the function call, there is no mechanism to tell the call to cancel JIRA: CQ-1005

JIRA: CQ-1005

- see code comments for explanation JIRA: CQ-1005

lupko added 4 commits December 11, 2024 19:55

fix: cancel function invocation when deadline exceeded

aef2266

- for now, this will mainly work in cases when the task is still in the queue - once the task is running & making the function call, there is no mechanism to tell the call to cancel JIRA: CQ-1005

docs: fix typo in code comments

84db8bf

JIRA: CQ-1005

fix: test for cancellation before task returns result

a94898b

- see code comments for explanation JIRA: CQ-1005

lupko requested review from hkad98, jaceksan and pcerny as code owners December 11, 2024 19:17

lupko enabled auto-merge December 12, 2024 07:49

no23reason approved these changes Dec 12, 2024

View reviewed changes

lupko merged commit 484fc20 into gooddata:master Dec 12, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RELATED: CQ-1005 - make FlexConnect function call deadline configurable #929

RELATED: CQ-1005 - make FlexConnect function call deadline configurable #929

Uh oh!

lupko commented Dec 11, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RELATED: CQ-1005 - make FlexConnect function call deadline configurable #929

RELATED: CQ-1005 - make FlexConnect function call deadline configurable #929

Uh oh!

Conversation

lupko commented Dec 11, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants