Commit 97f750d

Deploying to gh-pages from @ dstackai/dstack@a05cf74 🚀
1 parent 807fd5e commit 97f750d

6 files changed, +128 -127 lines changed

assets/images/social/examples.png

-338 Bytes

assets/images/social/partners.png

-427 Bytes

blog/sglang-router/index.html

Lines changed: 1 addition & 0 deletions
@@ -3994,6 +3994,7 @@ <h2 id="limitations-and-roadmap">Limitations and roadmap<a class="headerlink" hr
 <ul>
 <li>Enabling prefill and decode worker separation for full disaggregation (today, only standard workers are supported).</li>
 <li>Introducing auto-scaling based on TTFT (Time to First Token) and ITL (Inter-Token Latency), complementing the current requests-per-second scaling metric.</li>
+<li>Supporting multi-node replicas, enabling a single replica to span multiple nodes instead of being limited to one.</li>
 <li>Extending native support to more emerging inference stacks.</li>
 </ul>
 <h2 id="whats-next">What's next?<a class="headerlink" href="#whats-next" title="Permanent link">&para;</a></h2>

search/search_index.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

0 commit comments
