<p>Using private subnets assumes that both the <code>dstack</code> server and users can access the configured VPC's private subnets.
Additionally, <a href="https://cloud.google.com/nat/docs/overview">Cloud NAT</a> must be configured to provide access to external resources for provisioned instances.</p>
<p>Log in to the SSH TUI as described in the <a href="https://hotaisle.xyz/quick-start/">Hot Aisle Quick Start <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a>.
+Create a new team and generate an API key for the member in the team.</p>
<p>Log into your <a href="https://lambdalabs.com/service/gpu-cloud">Lambda Cloud <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a> account, click API keys in the sidebar, and then click the <code>Generate API key</code>
docs/concepts/services/index.html
Lines changed: 25 additions & 1 deletion
@@ -4453,7 +4453,31 @@ <h3 id="probes">Probes<a class="headerlink" href="#probes" title="Permanent link
</ul>
<p>If multiple probes are configured for the service, their statuses are displayed in the order in which the probes appear in the configuration.</p>
</details>
-<p>Probes are executed for each service replica while the replica is <code>running</code>. Probe statuses do not affect how <code>dstack</code> handles replicas, except during <a href="#rolling-deployment">rolling deployments</a>.</p>
+<p>Probes are executed for each service replica while the replica is <code>running</code>. A probe execution is considered successful if the replica responds with a <code>2xx</code> status code. Probe statuses do not affect how <code>dstack</code> handles replicas, except during <a href="#rolling-deployment">rolling deployments</a>.</p>
+<details class="info">
+<summary>HTTP request configuration</summary>
+<p>You can configure the HTTP request method, headers, and other properties. To include secret values in probe requests, use environment variable interpolation, which is enabled for the <code>url</code>, <code>headers[i].value</code>, and <code>body</code> properties.</p>
<h6 class="reference-item" id="url"><code>url</code> - (Optional) The URL to request. Defaults to <code>/</code>.<a class="headerlink" href="#url" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="method"><code>method</code> - (Optional) The HTTP method to use for the probe (e.g., <code>get</code>, <code>post</code>, etc.). Defaults to <code>get</code>.<a class="headerlink" href="#method" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="headers"><code>headers</code> - (Optional) A list of HTTP headers to include in the request.<a class="headerlink" href="#headers" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="body"><code>body</code> - (Optional) The HTTP request body to send with the probe.<a class="headerlink" href="#body" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="timeout"><code>timeout</code> - (Optional) Maximum amount of time the HTTP request is allowed to take. Defaults to <code>10s</code>.<a class="headerlink" href="#timeout" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="interval"><code>interval</code> - (Optional) Minimum amount of time between the end of one probe execution and the start of the next. Defaults to <code>15s</code>.<a class="headerlink" href="#interval" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="ready_after"><code>ready_after</code> - (Optional) The number of consecutive successful probe executions required for the replica to be considered ready. Used during rolling deployments. Defaults to <code>1</code>.<a class="headerlink" href="#ready_after" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="name"><code>name</code> - The name of the HTTP header.<a class="headerlink" href="#name" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="value"><code>value</code> - The value of the HTTP header.<a class="headerlink" href="#value" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="on_events"><code>on_events</code> - (Optional) The list of events that should be handled with retry. Supported events are <code>no-capacity</code>, <code>interruption</code>, <code>error</code>. Omit to retry on all events.<a class="headerlink" href="#on_events" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="duration"><code>duration</code> - (Optional) The maximum period of retrying the run, e.g., <code>4h</code> or <code>1d</code>.<a class="headerlink" href="#duration" title="Permanent link">¶</a></h6>
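The probe semantics documented in docs/concepts/services/index.html above can be illustrated with a minimal Python sketch. It is not dstack's implementation: the names `probe_succeeded` and `ReadinessTracker` are hypothetical, and the assumption that a failed execution resets the success count is inferred only from the word "consecutive" in the `ready_after` description.

```python
from dataclasses import dataclass


def probe_succeeded(status_code: int) -> bool:
    """A probe execution counts as successful on any 2xx response."""
    return 200 <= status_code < 300


@dataclass
class ReadinessTracker:
    """Hypothetical helper that counts consecutive successful probe executions."""

    ready_after: int = 1  # documented default for `ready_after`
    consecutive_successes: int = 0

    def record(self, status_code: int) -> bool:
        """Record one probe execution and return the current readiness."""
        if probe_succeeded(status_code):
            self.consecutive_successes += 1
        else:
            # Assumption: any failure breaks the streak of successes.
            self.consecutive_successes = 0
        return self.is_ready

    @property
    def is_ready(self) -> bool:
        return self.consecutive_successes >= self.ready_after
```

With `ready_after=2`, a replica in this sketch is reported ready only after two successful executions in a row, so a single `5xx` response in between restarts the count.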
docs/reference/environment-variables/index.html
Lines changed: 4 additions & 0 deletions
@@ -3916,6 +3916,10 @@ <h2 id="server">Server<a class="headerlink" href="#server" title="Permanent link
<li><code id="DSTACK_SERVER_BACKGROUND_PROCESSING_DISABLED">DSTACK_SERVER_BACKGROUND_PROCESSING_DISABLED</code> - Disables background processing if set to any value. Useful to run only web frontend and API server.</li>
<li><code id="DSTACK_SERVER_MAX_PROBES_PER_JOB">DSTACK_SERVER_MAX_PROBES_PER_JOB</code> - Maximum number of probes allowed in a run configuration. Validated at apply time.</li>
<li><code id="DSTACK_SERVER_MAX_PROBE_TIMEOUT">DSTACK_SERVER_MAX_PROBE_TIMEOUT</code> - Maximum allowed timeout for a probe. Validated at apply time.</li>
+<li><code id="DSTACK_SERVER_METRICS_RUNNING_TTL_SECONDS">DSTACK_SERVER_METRICS_RUNNING_TTL_SECONDS</code> – Maximum age of metrics samples for running jobs.</li>
+<li><code id="DSTACK_SERVER_METRICS_FINISHED_TTL_SECONDS">DSTACK_SERVER_METRICS_FINISHED_TTL_SECONDS</code> – Maximum age of metrics samples for finished jobs.</li>
+<li><code id="DSTACK_SERVER_INSTANCE_HEALTH_TTL_SECONDS">DSTACK_SERVER_INSTANCE_HEALTH_TTL_SECONDS</code> – Maximum age of instance health checks.</li>
+<li><code id="DSTACK_SERVER_INSTANCE_HEALTH_MIN_COLLECT_INTERVAL_SECONDS">DSTACK_SERVER_INSTANCE_HEALTH_MIN_COLLECT_INTERVAL_SECONDS</code> – Minimum time interval between consecutive health checks of the same instance.</li>
<h6 class="reference-item" id="type"><code>type</code> - The type of backend. Must be <code>hotaisle</code>.<a class="headerlink" href="#type" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="team_handle"><code>team_handle</code> - The Hot Aisle team handle.<a class="headerlink" href="#team_handle" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="regions"><code>regions</code> - (Optional) The list of Hot Aisle regions. Omit to use all regions.<a class="headerlink" href="#regions" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="_creds"><a href="#hotaisle-creds"><code>creds</code></a> - The credentials.<a class="headerlink" href="#_creds" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="type"><code>type</code> - The type of credentials. Must be <code>api_key</code>.<a class="headerlink" href="#type" title="Permanent link">¶</a></h6>
+<h6 class="reference-item" id="api_key"><code>api_key</code> - The Hot Aisle API key.<a class="headerlink" href="#api_key" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="type"><code>type</code> - The type of backend. Must be <code>lambda</code>.<a class="headerlink" href="#type" title="Permanent link">¶</a></h6>
<h6 class="reference-item" id="regions"><code>regions</code> - (Optional) The list of Lambda regions. Omit to use all regions.<a class="headerlink" href="#regions" title="Permanent link">¶</a></h6>