dstackai
diff --git a/‎assets/images/social/blog/toffee.png‎
33.9 KB b/‎assets/images/social/blog/toffee.png‎
33.9 KB
diff --git a/‎assets/images/social/docs/reference/cli/dstack/config.png‎
-20.6 KB b/‎assets/images/social/docs/reference/cli/dstack/config.png‎
-20.6 KB
diff --git a/‎assets/images/social/examples.png‎
338 Bytes b/‎assets/images/social/examples.png‎
338 Bytes
diff --git a/‎assets/images/social/index.png‎
-5.06 KB b/‎assets/images/social/index.png‎
-5.06 KB
diff --git a/‎blog/case-studies/index.html‎
Lines changed: 76 additions & 0 deletions b/‎blog/case-studies/index.html‎
Lines changed: 76 additions & 0 deletions
diff --git a/‎blog/ea-gtc25/index.html‎
Lines changed: 1 addition & 1 deletion b/‎blog/ea-gtc25/index.html‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎blog/index.html‎
Lines changed: 65 additions & 67 deletions b/‎blog/index.html‎
Lines changed: 65 additions & 67 deletions
@@ -3609,6 +3609,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        How Toffee streamlines inference and cut GPU costs with dstack
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#how-ea-uses-dstack-to-fast-track-ai-development" class="md-nav__link">
     <span class="md-ellipsis">
@@ -3750,6 +3761,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        How Toffee streamlines inference and cut GPU costs with dstack
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#how-ea-uses-dstack-to-fast-track-ai-development" class="md-nav__link">
     <span class="md-ellipsis">
@@ -3826,6 +3848,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        How Toffee streamlines inference and cut GPU costs with dstack
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#how-ea-uses-dstack-to-fast-track-ai-development" class="md-nav__link">
     <span class="md-ellipsis">
@@ -3855,6 +3888,49 @@ <h1 id="case-studies">Case studies<a class="headerlink" href="#case-studies" tit
         <article class="md-post md-post--excerpt">
   <header class="md-post__header">
 
+    <div class="md-post__meta md-meta">
+      <ul class="md-meta__list">
+        <li class="md-meta__item">
+          <time datetime="2025-12-05 00:00:00+00:00">December 5, 2025</time></li>
+        
+          <li class="md-meta__item">
+            in
+            
+              <a href="./" class="md-meta__link">Case studies</a></li>
+        
+        
+          
+          <li class="md-meta__item">
+            
+              4 min read
+            
+          </li>
+        
+        
+      </ul>
+      
+    </div>
+  </header>
+  <div class="md-post__content md-typeset">
+    <h2 id="how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack"><a class="toclink" href="../toffee/">How Toffee streamlines inference and cut GPU costs with dstack</a></h2>
+<p>In a recent engineering <a href="https://research.toffee.ai/blog/how-we-use-dstack-at-toffee">blog post</a>, Toffee shared how they use <code>dstack</code> to run large-language and image-generation models across multiple GPU clouds, while keeping their core backend on AWS. This case study summarizes key insights and highlights how <code>dstack</code> became the backbone of Toffee’s multi-cloud inference stack.</p>
+<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-toffee.png" width="630" /></p>
+
+    
+      <nav class="md-post__action">
+        <a href="../toffee/">
+            <span>Continue reading</span>
+            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
+        </a>
+      </nav>
+    
+    
+  </div>
+</article>
+      
+        <article class="md-post md-post--excerpt">
+  <header class="md-post__header">
+    
     <div class="md-post__meta md-meta">
       <ul class="md-meta__list">
         <li class="md-meta__item">
 
@@ -3792,7 +3792,7 @@
   <span class="md-ellipsis">
 
 
-    NVIDIA GTC 2025 ↗
+    NVIDIA GTC 2025
 
 
 
 
@@ -3486,6 +3486,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        How Toffee streamlines inference and cut GPU costs with dstack
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#sglang-router-integration-and-disaggregated-inference-roadmap" class="md-nav__link">
     <span class="md-ellipsis">
@@ -3583,17 +3594,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Supporting Hot Aisle AMD AI Developer Cloud
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -3859,6 +3859,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        How Toffee streamlines inference and cut GPU costs with dstack
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#sglang-router-integration-and-disaggregated-inference-roadmap" class="md-nav__link">
     <span class="md-ellipsis">
@@ -3956,17 +3967,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Supporting Hot Aisle AMD AI Developer Cloud
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -3993,6 +3993,49 @@ <h1 id="blog">Blog<a class="headerlink" href="#blog" title="Permanent link">&par
         <article class="md-post md-post--excerpt">
   <header class="md-post__header">
 
+    <div class="md-post__meta md-meta">
+      <ul class="md-meta__list">
+        <li class="md-meta__item">
+          <time datetime="2025-12-05 00:00:00+00:00">December 5, 2025</time></li>
+        
+          <li class="md-meta__item">
+            in
+            
+              <a href="case-studies/" class="md-meta__link">Case studies</a></li>
+        
+        
+          
+          <li class="md-meta__item">
+            
+              4 min read
+            
+          </li>
+        
+        
+      </ul>
+      
+    </div>
+  </header>
+  <div class="md-post__content md-typeset">
+    <h2 id="how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack"><a class="toclink" href="toffee/">How Toffee streamlines inference and cut GPU costs with dstack</a></h2>
+<p>In a recent engineering <a href="https://research.toffee.ai/blog/how-we-use-dstack-at-toffee">blog post</a>, Toffee shared how they use <code>dstack</code> to run large-language and image-generation models across multiple GPU clouds, while keeping their core backend on AWS. This case study summarizes key insights and highlights how <code>dstack</code> became the backbone of Toffee’s multi-cloud inference stack.</p>
+<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-toffee.png" width="630" /></p>
+
+    
+      <nav class="md-post__action">
+        <a href="toffee/">
+            <span>Continue reading</span>
+            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
+        </a>
+      </nav>
+    
+    
+  </div>
+</article>
+      
+        <article class="md-post md-post--excerpt">
+  <header class="md-post__header">
+    
     <div class="md-post__meta md-meta">
       <ul class="md-meta__list">
         <li class="md-meta__item">
@@ -4376,51 +4419,6 @@ <h2 id="introducing-passive-gpu-health-checks"><a class="toclink" href="gpu-helt
   </div>
 </article>
 
-        <article class="md-post md-post--excerpt">
-  <header class="md-post__header">
-    
-    <div class="md-post__meta md-meta">
-      <ul class="md-meta__list">
-        <li class="md-meta__item">
-          <time datetime="2025-08-11 00:00:00+00:00">August 11, 2025</time></li>
-        
-          <li class="md-meta__item">
-            in
-            
-              <a href="changelog/" class="md-meta__link">Changelog</a></li>
-        
-        
-          
-          <li class="md-meta__item">
-            
-              3 min read
-            
-          </li>
-        
-        
-      </ul>
-      
-    </div>
-  </header>
-  <div class="md-post__content md-typeset">
-    <h2 id="supporting-hot-aisle-amd-ai-developer-cloud"><a class="toclink" href="hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
-<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads—without manual infrastructure setup.  </p>
-<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
-<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-hotaisle.png" width="630"/></p>
-<p>Today, we’re excited to announce native integration with <a href="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing.  </p>
-
-    
-      <nav class="md-post__action">
-        <a href="hotaisle/">
-            <span>Continue reading</span>
-            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
-        </a>
-      </nav>
-    
-    
-  </div>
-</article>
-
Original file line number	Diff line number	Diff line change
`@@ -3792,7 +3792,7 @@`
`3792`	`3792`	`<span class="md-ellipsis">`
`3793`	`3793`
`3794`	`3794`
`3795`		`- NVIDIA GTC 2025 ↗`
	`3795`	`+ NVIDIA GTC 2025`
`3796`	`3796`
`3797`	`3797`
`3798`	`3798`