Skip to content

Commit 1b28b30

Browse files
Deploying to gh-pages from @ dstackai/dstack@f3d6913 🚀
1 parent b8a39dc commit 1b28b30

File tree

13 files changed

+4594
-465
lines changed

13 files changed

+4594
-465
lines changed
52.6 KB
Loading

blog/changelog/index.html

Lines changed: 67 additions & 71 deletions
Original file line numberDiff line numberDiff line change
@@ -3463,6 +3463,17 @@
34633463
</label>
34643464
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
34653465

3466+
<li class="md-nav__item">
3467+
<a href="#nebius-joins-dstack-sky-gpu-marketplace-with-production-ready-gpu-clusters" class="md-nav__link">
3468+
<span class="md-ellipsis">
3469+
3470+
Nebius joins dstack Sky GPU marketplace, with production-ready GPU clusters
3471+
3472+
</span>
3473+
</a>
3474+
3475+
</li>
3476+
34663477
<li class="md-nav__item">
34673478
<a href="#orchestrating-gpus-on-digitalocean-and-amd-developer-cloud" class="md-nav__link">
34683479
<span class="md-ellipsis">
@@ -3560,17 +3571,6 @@
35603571
</span>
35613572
</a>
35623573

3563-
</li>
3564-
3565-
<li class="md-nav__item">
3566-
<a href="#exporting-gpu-cost-and-other-metrics-to-prometheus" class="md-nav__link">
3567-
<span class="md-ellipsis">
3568-
3569-
Exporting GPU, cost, and other metrics to Prometheus
3570-
3571-
</span>
3572-
</a>
3573-
35743574
</li>
35753575

35763576
</ul>
@@ -3749,6 +3749,17 @@
37493749
</label>
37503750
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
37513751

3752+
<li class="md-nav__item">
3753+
<a href="#nebius-joins-dstack-sky-gpu-marketplace-with-production-ready-gpu-clusters" class="md-nav__link">
3754+
<span class="md-ellipsis">
3755+
3756+
Nebius joins dstack Sky GPU marketplace, with production-ready GPU clusters
3757+
3758+
</span>
3759+
</a>
3760+
3761+
</li>
3762+
37523763
<li class="md-nav__item">
37533764
<a href="#orchestrating-gpus-on-digitalocean-and-amd-developer-cloud" class="md-nav__link">
37543765
<span class="md-ellipsis">
@@ -3846,17 +3857,6 @@
38463857
</span>
38473858
</a>
38483859

3849-
</li>
3850-
3851-
<li class="md-nav__item">
3852-
<a href="#exporting-gpu-cost-and-other-metrics-to-prometheus" class="md-nav__link">
3853-
<span class="md-ellipsis">
3854-
3855-
Exporting GPU, cost, and other metrics to Prometheus
3856-
3857-
</span>
3858-
</a>
3859-
38603860
</li>
38613861

38623862
</ul>
@@ -3877,6 +3877,51 @@ <h1 id="changelog">Changelog<a class="headerlink" href="#changelog" title="Perma
38773877
<article class="md-post md-post--excerpt">
38783878
<header class="md-post__header">
38793879

3880+
<div class="md-post__meta md-meta">
3881+
<ul class="md-meta__list">
3882+
<li class="md-meta__item">
3883+
<time datetime="2025-09-18 00:00:00+00:00">September 18, 2025</time></li>
3884+
3885+
<li class="md-meta__item">
3886+
in
3887+
3888+
<a href="./" class="md-meta__link">Changelog</a></li>
3889+
3890+
3891+
3892+
<li class="md-meta__item">
3893+
3894+
4 min read
3895+
3896+
</li>
3897+
3898+
3899+
</ul>
3900+
3901+
</div>
3902+
</header>
3903+
<div class="md-post__content md-typeset">
3904+
<h2 id="nebius-joins-dstack-sky-gpu-marketplace-with-production-ready-gpu-clusters"><a class="toclink" href="../nebius-in-dstack-sky/">Nebius joins dstack Sky GPU marketplace, with production-ready GPU clusters</a></h2>
3905+
<p><code>dstack</code> is an <a href="https://github.com/dstackai/dstack" target="_blank">open-source <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a> control plane for orchestrating GPU workloads. It can provision cloud VMs, run on top of Kubernetes, or manage on-prem clusters. If you don’t want to self-host, you can use <a href="https://sky.dstack.ai" target="_blank">dstack Sky <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a>, the managed version of <code>dstack</code> that also provides access to cloud GPUs via its markfetplace.</p>
3906+
<p>With our latest release, we’re excited to announce that <a href="https://nebius.com/" target="_blank">Nebius <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a>, a purpose-built AI cloud for large scale training and inference, has joined the <code>dstack</code> Sky marketplace
3907+
to offer on-demand and spot GPUs, including clusters.</p>
3908+
<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-sky-nebius.png" width="630"/></p>
3909+
3910+
3911+
<nav class="md-post__action">
3912+
<a href="../nebius-in-dstack-sky/">
3913+
<span>Continue reading</span>
3914+
<span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
3915+
</a>
3916+
</nav>
3917+
3918+
3919+
</div>
3920+
</article>
3921+
3922+
<article class="md-post md-post--excerpt">
3923+
<header class="md-post__header">
3924+
38803925
<div class="md-post__meta md-meta">
38813926
<ul class="md-meta__list">
38823927
<li class="md-meta__item">
@@ -4311,55 +4356,6 @@ <h2 id="supporting-mpi-and-ncclrccl-tests"><a class="toclink" href="../mpi/">Sup
43114356
</div>
43124357
</article>
43134358

4314-
<article class="md-post md-post--excerpt">
4315-
<header class="md-post__header">
4316-
4317-
<div class="md-post__meta md-meta">
4318-
<ul class="md-meta__list">
4319-
<li class="md-meta__item">
4320-
<time datetime="2025-04-01 00:00:00+00:00">April 1, 2025</time></li>
4321-
4322-
<li class="md-meta__item">
4323-
in
4324-
4325-
<a href="./" class="md-meta__link">Changelog</a></li>
4326-
4327-
4328-
4329-
<li class="md-meta__item">
4330-
4331-
2 min read
4332-
4333-
</li>
4334-
4335-
4336-
</ul>
4337-
4338-
</div>
4339-
</header>
4340-
<div class="md-post__content md-typeset">
4341-
<h2 id="exporting-gpu-cost-and-other-metrics-to-prometheus"><a class="toclink" href="../prometheus/">Exporting GPU, cost, and other metrics to Prometheus</a></h2>
4342-
<h3 id="why-prometheus" style="display:none"><a class="toclink" href="../prometheus/#why-prometheus">Why Prometheus</a></h3>
4343-
<p>Effective AI infrastructure management requires full visibility into compute performance and costs. AI researchers need
4344-
detailed insights into container- and GPU-level performance, while managers rely on cost metrics to track resource usage
4345-
across projects.</p>
4346-
<p>While <code>dstack</code> provides key metrics through its UI and <a href="../dstack-metrics/"><code>dstack metrics</code></a> CLI, teams often need more granular data and prefer
4347-
using their own monitoring tools. To support this, we’ve introduced a new endpoint that allows real-time exporting all collected
4348-
metrics—covering fleets and runs—directly to Prometheus.</p>
4349-
<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-prometheus-v3.png" width="630"/></p>
4350-
4351-
4352-
<nav class="md-post__action">
4353-
<a href="../prometheus/">
4354-
<span>Continue reading</span>
4355-
<span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
4356-
</a>
4357-
</nav>
4358-
4359-
4360-
</div>
4361-
</article>
4362-
43634359

43644360

43654361

blog/changelog/page/2/index.html

Lines changed: 71 additions & 68 deletions
Original file line numberDiff line numberDiff line change
@@ -3461,6 +3461,17 @@
34613461
</label>
34623462
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
34633463

3464+
<li class="md-nav__item">
3465+
<a href="#exporting-gpu-cost-and-other-metrics-to-prometheus" class="md-nav__link">
3466+
<span class="md-ellipsis">
3467+
3468+
Exporting GPU, cost, and other metrics to Prometheus
3469+
3470+
</span>
3471+
</a>
3472+
3473+
</li>
3474+
34643475
<li class="md-nav__item">
34653476
<a href="#accessing-dev-environments-with-cursor" class="md-nav__link">
34663477
<span class="md-ellipsis">
@@ -3558,17 +3569,6 @@
35583569
</span>
35593570
</a>
35603571

3561-
</li>
3562-
3563-
<li class="md-nav__item">
3564-
<a href="#using-volumes-to-optimize-cold-starts-on-runpod" class="md-nav__link">
3565-
<span class="md-ellipsis">
3566-
3567-
Using volumes to optimize cold starts on RunPod
3568-
3569-
</span>
3570-
</a>
3571-
35723572
</li>
35733573

35743574
</ul>
@@ -3747,6 +3747,17 @@
37473747
</label>
37483748
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
37493749

3750+
<li class="md-nav__item">
3751+
<a href="#exporting-gpu-cost-and-other-metrics-to-prometheus" class="md-nav__link">
3752+
<span class="md-ellipsis">
3753+
3754+
Exporting GPU, cost, and other metrics to Prometheus
3755+
3756+
</span>
3757+
</a>
3758+
3759+
</li>
3760+
37503761
<li class="md-nav__item">
37513762
<a href="#accessing-dev-environments-with-cursor" class="md-nav__link">
37523763
<span class="md-ellipsis">
@@ -3844,17 +3855,6 @@
38443855
</span>
38453856
</a>
38463857

3847-
</li>
3848-
3849-
<li class="md-nav__item">
3850-
<a href="#using-volumes-to-optimize-cold-starts-on-runpod" class="md-nav__link">
3851-
<span class="md-ellipsis">
3852-
3853-
Using volumes to optimize cold starts on RunPod
3854-
3855-
</span>
3856-
</a>
3857-
38583858
</li>
38593859

38603860
</ul>
@@ -3875,6 +3875,55 @@ <h1 id="changelog">Changelog<a class="headerlink" href="#changelog" title="Perma
38753875
<article class="md-post md-post--excerpt">
38763876
<header class="md-post__header">
38773877

3878+
<div class="md-post__meta md-meta">
3879+
<ul class="md-meta__list">
3880+
<li class="md-meta__item">
3881+
<time datetime="2025-04-01 00:00:00+00:00">April 1, 2025</time></li>
3882+
3883+
<li class="md-meta__item">
3884+
in
3885+
3886+
<a href="../../" class="md-meta__link">Changelog</a></li>
3887+
3888+
3889+
3890+
<li class="md-meta__item">
3891+
3892+
2 min read
3893+
3894+
</li>
3895+
3896+
3897+
</ul>
3898+
3899+
</div>
3900+
</header>
3901+
<div class="md-post__content md-typeset">
3902+
<h2 id="exporting-gpu-cost-and-other-metrics-to-prometheus"><a class="toclink" href="../../../prometheus/">Exporting GPU, cost, and other metrics to Prometheus</a></h2>
3903+
<h3 id="why-prometheus" style="display:none"><a class="toclink" href="../../../prometheus/#why-prometheus">Why Prometheus</a></h3>
3904+
<p>Effective AI infrastructure management requires full visibility into compute performance and costs. AI researchers need
3905+
detailed insights into container- and GPU-level performance, while managers rely on cost metrics to track resource usage
3906+
across projects.</p>
3907+
<p>While <code>dstack</code> provides key metrics through its UI and <a href="../../../dstack-metrics/"><code>dstack metrics</code></a> CLI, teams often need more granular data and prefer
3908+
using their own monitoring tools. To support this, we’ve introduced a new endpoint that allows real-time exporting all collected
3909+
metrics—covering fleets and runs—directly to Prometheus.</p>
3910+
<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-prometheus-v3.png" width="630"/></p>
3911+
3912+
3913+
<nav class="md-post__action">
3914+
<a href="../../../prometheus/">
3915+
<span>Continue reading</span>
3916+
<span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
3917+
</a>
3918+
</nav>
3919+
3920+
3921+
</div>
3922+
</article>
3923+
3924+
<article class="md-post md-post--excerpt">
3925+
<header class="md-post__header">
3926+
38783927
<div class="md-post__meta md-meta">
38793928
<ul class="md-meta__list">
38803929
<li class="md-meta__item">
@@ -4318,52 +4367,6 @@ <h2 id="supporting-amd-accelerators-on-runpod"><a class="toclink" href="../../..
43184367
</div>
43194368
</article>
43204369

4321-
<article class="md-post md-post--excerpt">
4322-
<header class="md-post__header">
4323-
4324-
<div class="md-post__meta md-meta">
4325-
<ul class="md-meta__list">
4326-
<li class="md-meta__item">
4327-
<time datetime="2024-08-13 00:00:00+00:00">August 13, 2024</time></li>
4328-
4329-
<li class="md-meta__item">
4330-
in
4331-
4332-
<a href="../../" class="md-meta__link">Changelog</a></li>
4333-
4334-
4335-
4336-
<li class="md-meta__item">
4337-
4338-
2 min read
4339-
4340-
</li>
4341-
4342-
4343-
</ul>
4344-
4345-
</div>
4346-
</header>
4347-
<div class="md-post__content md-typeset">
4348-
<h2 id="using-volumes-to-optimize-cold-starts-on-runpod"><a class="toclink" href="../../../volumes-on-runpod/">Using volumes to optimize cold starts on RunPod</a></h2>
4349-
<p>Deploying custom models in the cloud often faces the challenge of cold start times, including the time to provision a
4350-
new instance and download the model. This is especially relevant for services with autoscaling when new model replicas
4351-
need to be provisioned quickly. </p>
4352-
<p>Let's explore how <code>dstack</code> optimizes this process using volumes, with an example of
4353-
deploying a model on RunPod.</p>
4354-
4355-
4356-
<nav class="md-post__action">
4357-
<a href="../../../volumes-on-runpod/">
4358-
<span>Continue reading</span>
4359-
<span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
4360-
</a>
4361-
</nav>
4362-
4363-
4364-
</div>
4365-
</article>
4366-
43674370

43684371

43694372

0 commit comments

Comments
 (0)