Releases: dstackai/dstack-enterprise

0.20.10-v1

19 Feb 12:42

Services

Prefill-Decode disaggregation

dstack now supports disaggregated Prefill–Decode inference, allowing both Prefill and Decode worker types to run within a single service.

To define and run such a service, set pd_disaggregation to true under the router property (this requires the gateway to use the sglang router), and define separate replica groups for the Prefill and Decode worker types:

type: service
name: prefill-decode

env:
  - HF_TOKEN
  - MODEL_ID=zai-org/GLM-4.5-Air-FP8

image: lmsysorg/sglang:latest

replicas:
  - count: 1..4
    scaling:
      metric: rps
      target: 3
    commands:
      - |
          python -m sglang.launch_server \
            --model-path $MODEL_ID \
            --disaggregation-mode prefill \
            --disaggregation-transfer-backend mooncake \
            --host 0.0.0.0 \
            --port 8000 \
            --disaggregation-bootstrap-port 8998
    resources:
      gpu: H200

  - count: 1..8
    scaling:
      metric: rps
      target: 2
    commands:
      - |
          python -m sglang.launch_server \
            --model-path $MODEL_ID \
            --disaggregation-mode decode \
            --disaggregation-transfer-backend mooncake \
            --host 0.0.0.0 \
            --port 8000
    resources:
      gpu: H200

port: 8000
model: zai-org/GLM-4.5-Air-FP8

probes:
  - type: http
    url: /health_generate
    interval: 15s

router:
  type: sglang
  pd_disaggregation: true

Note

pd_disaggregation requires both the gateway and replicas to use the same cluster. With dstack, this can now be used with the aws, gcp, and kubernetes backends (as they support creating both clusters and gateways). Support for more backends (and eventually SSH fleets) is coming soon.

Currently, pd_disaggregation works only with SGLang. Support for vLLM is coming soon.

Support for additional scaling metrics, such as TTFT and ITL, is also coming soon to enable autoscaling of Prefill and Decode workers.

Model endpoint

Previously, if you configured the model property, dstack provided a global model endpoint at gateway.<gateway domain> (or /proxy/models/<project name>), allowing access to all models deployed in the project. This endpoint is now deprecated.

Now, any deployed model should be accessed via the service endpoint itself at <run name>.<gateway domain> (or /proxy/services/main/<service name>).

Note

If you configure the model property, dstack automatically enables CORS on the service endpoint. Future versions will allow you to disable or customize this behavior.

CLI

dstack apply

Previously, if you did not specify gpu, dstack treated it as 0..1 but did not display it in the run plan. Now, dstack properly displays this default. Additionally, if you do not specify image, dstack now defaults the GPU vendor to nvidia.

dstack apply -f dev.dstack.yml
 Project              peterschmidt85
 User                 peterschmidt85
 Type                 dev-environment
 Resources            cpu=2.. mem=8GB.. disk=100GB.. gpu=0..
 Spot policy          on-demand
 Max price            off
 Retry policy         off
 Idle duration        5m
 Max duration         off
 Inactivity duration  off

 #  BACKEND         RESOURCES                  INSTANCE TYPE  PRICE
 1  verda (FIN-01)  cpu=4 mem=16GB disk=100GB  CPU.4V.16G     $0.0279
 2  verda (FIN-02)  cpu=4 mem=16GB disk=100GB  CPU.4V.16G     $0.0279
 3  verda (FIN-03)  cpu=4 mem=16GB disk=100GB  CPU.4V.16G     $0.0279
    ...

Submit the run dev? [y/n]: 

This makes the run plan much more explicit and clear.

Full changelog: dstackai/dstack@0.20.8-v1...0.20.10-v1

0.20.8-v1

05 Feb 11:55


CLI

dstack event --watch

The dstack event command now supports a --watch option for real-time event tracking.

Event coverage has also been improved, with events for run in-place update and service registration now available.

dstack fleet

The dstack fleet command now includes fleet-level information such as nodes, resources, spot policy, and backend details, with individual instances listed underneath.

Skills

SKILL.md

If you're using agents such as Claude Code, Codex, Cursor, etc., it’s now possible to install dstack skills.

npx skills add dstackai/dstack

These skills make the agent fully aware of the configuration syntax and CLI commands.

Services

Probes

UI

The UI now displays probe statuses for services, helping monitor replica readiness and health.

until_ready

A new until_ready option for probes allows stopping probe execution once the ready_after threshold is reached. This is useful for resource-intensive probes that only need to run during startup:

probes:
  - type: http
    url: /health
    until_ready: true
    ready_after: 2

Model probes

Services that use the model property to declare a chat model with an OpenAI-compatible interface now receive an automatically configured probe that checks model availability by requesting /v1/chat/completions.
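
For reference, a minimal sketch of a service that benefits from this (the service name, image, and model are illustrative); declaring model is enough to get the automatic availability probe:

```yaml
# Sketch only: with `model` set, dstack automatically configures a probe
# that checks availability via /v1/chat/completions.
type: service
name: llama-8b

image: lmsysorg/sglang:latest
env:
  - MODEL_ID=deepseek-ai/DeepSeek-R1-Distill-Llama-8B
commands:
  - python -m sglang.launch_server --model-path $MODEL_ID --port 8000

port: 8000
model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
```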

Backends

RunPod

Community Cloud

RunPod Community Cloud is now disabled by default to ensure a more reliable experience. You can still enable Community Cloud in the backend settings. dstack Sky users can enable Community Cloud only when using their own RunPod credentials.
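
If you manage backends via the server configuration file, re-enabling Community Cloud might look like the sketch below. Note that the community_cloud option name is an assumption here; verify it against the backend settings reference:

```yaml
# Sketch only: the `community_cloud` option name is an assumption —
# check the RunPod backend settings documentation.
projects:
  - name: main
    backends:
      - type: runpod
        creds:
          type: api_key
          api_key: <your RunPod API key>
        community_cloud: true
```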

CUDO

Due to CUDO Compute winding down its public on-demand offering, the cudo backend is now deprecated.

What's changed

Full changelog: dstackai/dstack@0.20.7...0.20.8

0.20.7-v1

28 Jan 16:59


Services

Replica groups

A service can now include multiple replica groups. Each group can define its own commands, resources, and scaling rules.

type: service
name: llama-8b-service

image: lmsysorg/sglang:latest
env:
  - MODEL_ID=deepseek-ai/DeepSeek-R1-Distill-Llama-8B

replicas:
  - count: 1..2
    scaling:
      metric: rps
      target: 10
    commands:
      - |
        python -m sglang.launch_server \
          --model-path $MODEL_ID \
          --port 8000 \
          --trust-remote-code
    resources:
      gpu: 48GB

  - count: 1..4
    scaling:
      metric: rps
      target: 5
    commands:
      - |
        python -m sglang.launch_server \
          --model-path $MODEL_ID \
          --port 8000 \
          --trust-remote-code
    resources:
      gpu: 24GB

port: 8000
model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Note

Properties such as regions, port, image, and env, among others, cannot yet be configured per replica group. This support is coming soon.

Note

Native support for disaggregated prefill and decode, allowing both worker types to run within a single service, is coming soon.

Events

Events are now also supported for volumes, gateways, and secrets.

$ dstack event --target-gateway my-gateway
[2026-01-28 11:53:03] [👤admin] [gateway my-gateway] Gateway created. Status: SUBMITTED
[2026-01-28 11:53:32] [gateway my-gateway] Gateway status changed SUBMITTED -> PROVISIONING
[2026-01-28 11:54:46] [gateway my-gateway] Gateway status changed PROVISIONING -> RUNNING
[2026-01-28 11:55:08] [👤admin] [gateway my-gateway] Gateway set as default

Instance events now also include reachability and health events.

Finally, we have added Events under Concepts in the documentation.

CLI

dstack project

The dstack project and dstack project set-default commands now allow you to interactively select the default project when these commands are run without arguments.

dstack login

The dstack login command can now be run without arguments. In this case, it will interactively ask for the URL and provider if needed. If you want to use dstack Sky, you can simply press Enter without entering a URL or provider.

Also, if you have multiple projects, the command will prompt you to select the default project as well.

What's changed

Full changelog: dstackai/dstack@0.20.6...0.20.7

0.20.6-v1

21 Jan 13:31


Server deployment

Memory optimization

This release reduces peak server memory usage. Previously, memory grew with the total number of instances ever submitted; this is now fixed. We recommend upgrading if memory usage increases over time.

Logs storage

Fluent Bit + Elasticsearch/OpenSearch

Run logs can now be stored in your own log storage via Fluent Bit. At the same time, dstack can now read run logs from Elasticsearch/OpenSearch (to display them in the UI and CLI) if Fluent Bit ships the logs there.

See the docs for more details.

Fleets

Since 0.20, dstack requires at least one fleet to be created before you can submit any runs. To make this easier, we’ve simplified default fleet creation during project setup in the UI.

In addition, if your project doesn’t have a fleet, the UI will prompt you to create one.

What's changed

Full changelog: dstackai/dstack@0.20.3...0.20.6

0.20.5-v1

21 Jan 11:20


This is a hotfix release that fixes a bug in 0.20.4 with some UI pages not working (Users, Projects, Settings).

What's Changed

Full Changelog: dstackai/dstack@0.20.4...0.20.5

0.20.4-v1

21 Jan 10:39


What's changed

Full Changelog: dstackai/dstack@0.20.3...0.20.4

0.20.3-v1

08 Jan 18:08


Dev environments

Windsurf IDE

Dev environments now support Windsurf as a first-class IDE option alongside VSCode and Cursor.

type: dev-environment
ide: windsurf

repos:
- https://github.com/dstackai/dstack

resources:
  gpu: 24GB..:1

dstack provisions an instance for your dev environment and seamlessly connects your local Windsurf editor to it.

Troubleshooting

Runs/fleets/volumes/gateways JSON via CLI

You can now inspect the full JSON state of runs, fleets, volumes, and gateways using these CLI commands:

$ dstack run get <name> --json
$ dstack fleet get <name> --json
$ dstack volume get <name> --json
$ dstack gateway get <name> --json

Runs/fleets JSON via UI

The UI includes new "Inspect" tabs with read-only JSON viewers for runs and fleets, making it easier to debug and understand resource states.

What's changed

Full Changelog: dstackai/dstack@0.20.1...0.20.3

0.20.1-v1

25 Dec 11:44


CLI

No-fleets warning

Since the last major release, fleets are required before submitting runs. This update makes that requirement explicit in the CLI.

When a run is submitted for a project that has no fleets, the CLI now shows a dedicated warning. The run status has also been updated in both the CLI and UI to No fleets instead of No offers.

This removes ambiguity around failed runs that previously appeared as No offers.

dstack login

You can now authenticate the CLI using a new command, dstack login, instead of manually providing a token.

dstack Enterprise supports SSO with providers such as Okta, Microsoft Entra ID, and Google.

Services

Service configurations now support gateway: true.

For services that require gateway features (such as auto-scaling, custom domains, and WebSockets), this property makes the requirement explicit. When set, dstack ensures a default gateway is present.
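
A minimal sketch of a service declaring the gateway requirement explicitly (the name, image, and command are illustrative):

```yaml
type: service
name: my-service

image: my-org/my-app:latest   # illustrative image
commands:
  - python app.py
port: 8000

# Explicitly require gateway features; dstack ensures
# a default gateway is present
gateway: true
```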

dstack-shim

In addition to the dstack-runner auto-update mechanism introduced in 0.20.0, dstack-shim now also supports auto-updating.

See contributing/RUNNER-AND-SHIM.md for details.

What's Changed

Full Changelog: dstackai/dstack@0.20.0...0.20.1

0.20.0-v1

17 Dec 11:34


dstack 0.20 is a major release that brings significant improvements and introduces a number of breaking changes. Read below for the most important ones. For migration notes, please refer to the migration guide.

Fleets

dstack previously had two different ways to provision instances for runs: using a fleet configuration or using automatic fleet provisioning on run apply. To unify the UX, dstack no longer creates fleets automatically.

Fleets must now be created explicitly before submitting runs. This gives users full control over the provisioning lifecycle. If you don't need any limits on instance provisioning (as was the case with auto-created fleets), you can create a single elastic fleet for all runs:

type: fleet
name: default-fleet
nodes: 0..

Note that multi-node tasks require fleets with placement: cluster, which provides the best possible connectivity. You will need a separate fleet for each cluster.
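
Following the note above, a sketch of a cluster fleet for multi-node tasks (the name, backend, and GPU spec are illustrative):

```yaml
type: fleet
name: my-cluster-fleet

nodes: 4
# Required for multi-node tasks; provides the best possible connectivity
placement: cluster

backends: [aws]
resources:
  gpu: H100:8   # illustrative GPU spec
```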

Note

To keep the old behavior with auto-created fleets, set the DSTACK_FF_AUTOCREATED_FLEETS_ENABLED environment variable.

Runs

Working directory

Previously, the working_dir property had complicated semantics: it defaulted to /workflow, but for tasks and services without commands, the image's working directory was used instead.

This has now been simplified: working_dir always defaults to the image's working directory. The working directory of the default dstack images is now set to /dstack/run.
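
If a configuration relied on the old default, the previous behavior can be pinned explicitly; a sketch (the task itself is illustrative):

```yaml
type: task
name: my-task

image: python:3.12
# Pin the old default explicitly instead of inheriting
# the image's working directory
working_dir: /workflow
commands:
  - pwd
```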

Repo directory

Working with repos is now more explicit and intuitive. First, dstack now only sets up repos that are explicitly defined in run configurations via repos; repos initialized with dstack init are not set up unless specified:

type: dev-environment
ide: vscode
repos:
  # Clone the repo in the configuration's dir into `working_dir`
  - .

Second, repos[].path now defaults to working_dir (".") instead of /workflow.

Third, cloning a repo into a non-empty directory now raises an error so that mistakes are not silently ignored. The previous behavior of skipping cloning can be specified explicitly with if_exists: skip:

type: dev-environment
ide: vscode
repos:
  - local_path: .
    path: /my_volume/repo
    if_exists: skip

Events

dstack now stores important events—such as resource CRUD operations, status changes, and other information crucial for auditing and debugging. Users can view events using the dstack event CLI command or in the UI.

$ dstack event
[2025-12-11 15:05:20] [👤admin] [run clever-cheetah-1] Run submitted. Status: SUBMITTED
[2025-12-11 15:05:20] [job clever-cheetah-1-0-0] Job created on run submission. Status: SUBMITTED
[2025-12-11 15:05:26] [job clever-cheetah-1-0-0, instance cloud-fleet-0] Job assigned to instance. Instance status: BUSY (1/1 blocks busy)

CLI

JSON output

The dstack ps and dstack gateway commands now support --format json / --json arguments that print results in JSON instead of plaintext:

$ dstack ps --json
{
  "project": "main",
  "runs": [
    {
      "id": "5f2e08b5-2098-4064-86c7-0efe0eb84970",
      "project_name": "main",
      "user": "admin",
      "fleet": {
        "id": "9598d5db-67d8-4a2e-bdd2-842ab93b2f2e",
        "name": "cloud-fleet"
      },
      ...
    }
  ]
}

Verda (formerly Datacrunch)

The datacrunch backend has been renamed to verda, following the company's rebranding.

projects:
  - name: main
    backends:
      - type: verda
        creds:
          type: api_key
          client_id: xfaHBqYEsArqhKWX-e52x3HH7w8T
          client_secret: B5ZU5Qx9Nt8oGMlmMhNI3iglK8bjMhagTbylZy4WzncZe39995f7Vxh8

Gateways

Gateway configurations now support an optional instance_type property that allows overriding the default gateway instance type:

type: gateway
name: example-gateway

backend: aws
region: eu-west-1

instance_type: t3.large

domain: example.com

Currently instance_type is supported for aws and gcp backends.

All breaking changes

  • Fleets are no longer created automatically on run apply and have to be created explicitly before submitting runs.
  • The run's working_dir now always defaults to the image's working directory instead of /workflow. The working directory of dstack default images is now /dstack/run.
  • repos[].path now defaults to working_dir (".") instead of /workflow.
  • Dropped implicitly loaded repos; repos must be specified via repos configuration property.
  • Cloning a repo into a non-empty directory now raises an error. This can be changed by setting if_exists: skip.
  • Dropped CLI commands dstack config, dstack stats, and dstack gateway create.
  • Dropped Python API RunCollection methods RunCollection.get_plan(), RunCollection.exec_plan(), and RunCollection.submit().
  • Dropped local repos support: dstack init --local and dstack.api.LocalRepo.
  • Dropped Azure deprecated VM series Dsv3 and Esv4.
  • Dropped legacy server environment variables DSTACK_SERVER_METRICS_TTL_SECONDS and DSTACK_FORCE_BRIDGE_NETWORK.

Deprecations

  • Deprecated the API endpoint /api/project/{project_name}/fleets/create in favor of /api/project/{project_name}/fleets/apply.
  • Deprecated repo_dir argument in RunCollection.get_run_plan() in favor of repos[].path.

What's Changed


0.19.38-v1

21 Nov 08:40


Run plan

Since the 0.19.26 release, dstack provisions instances with respect to configured fleets, but run plan offers didn't reflect that, so you might not see the actual offers used for provisioning. This is now fixed: the run plan shows offers with respect to configured fleets. For example, you can create a fleet for provisioning spot GPU instances on AWS:

type: fleet
name: cloud-fleet
nodes: 0..
backends: [aws]
spot_policy: spot
resources: 
  gpu: 1..

and the runs respect that configuration:

✗ dstack apply                                                      
...
 #  BACKEND          RESOURCES                            INSTANCE TYPE  PRICE    
 1  aws (us-east-1)  cpu=4 mem=16GB disk=100GB T4:16GB:1  g4dn.xlarge    $0.526   
 2  aws (us-east-2)  cpu=4 mem=16GB disk=100GB T4:16GB:1  g4dn.xlarge    $0.526   
 3  aws (us-west-2)  cpu=4 mem=16GB disk=100GB T4:16GB:1  g4dn.xlarge    $0.526   
    ...                                                                           
 Shown 3 of 309 offers, $71.552 max

Gateways

dstack gateways now support SGLang Router, enabling inference request routing with policies such as cache_aware, power_of_two, round_robin, and random. Currently, the gateway supports sglang-router version 0.2.1. You can enable the SGLang router in your gateway configuration and select any of the available routing policies. Example configuration:

type: gateway
name: sglang-gateway

backend: aws
region: eu-west-1

domain: example.com
router:
  type: sglang
  policy: cache_aware

What's Changed

Full Changelog: dstackai/dstack@0.19.37...0.19.38