You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<h2id="how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack"><aclass="toclink" href="../toffee/">How Toffee streamlines inference and cut GPU costs with dstack</a></h2>
3916
+
<p>In a recent engineering <ahref="https://research.toffee.ai/blog/how-we-use-dstack-at-toffee">blog post</a>, Toffee shared how they use <code>dstack</code> to run large-language and image-generation models across multiple GPU clouds, while keeping their core backend on AWS. This case study summarizes key insights and highlights how <code>dstack</code> became the backbone of Toffee’s multi-cloud inference stack.</p>
<h2id="how-toffee-streamlines-inference-and-cut-gpu-costs-with-dstack"><aclass="toclink" href="toffee/">How Toffee streamlines inference and cut GPU costs with dstack</a></h2>
4021
+
<p>In a recent engineering <ahref="https://research.toffee.ai/blog/how-we-use-dstack-at-toffee">blog post</a>, Toffee shared how they use <code>dstack</code> to run large-language and image-generation models across multiple GPU clouds, while keeping their core backend on AWS. This case study summarizes key insights and highlights how <code>dstack</code> became the backbone of Toffee’s multi-cloud inference stack.</p>
<h2id="supporting-hot-aisle-amd-ai-developer-cloud"><aclass="toclink" href="hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
4407
-
<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads—without manual infrastructure setup. </p>
4408
-
<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
<p>Today, we’re excited to announce native integration with <ahref="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing. </p>
0 commit comments