Commit 722c9e2

Deploying to gh-pages from @ dstackai/dstack@1d931dc 🚀
1 parent 5de823b commit 722c9e2

65 files changed

+2818
-103
lines changed

-1.35 KB

assets/images/social/blog/index.png

-1.34 KB
47.1 KB
-1.34 KB
-1.25 KB
-1.33 KB
-1.26 KB
-1.25 KB
-1.25 KB

assets/images/social/docs/fleets.png

-1.33 KB
-1.26 KB
-1.31 KB

assets/images/social/docs/index.png

-1.32 KB
-1.31 KB
-1.28 KB
-1.34 KB
-1.25 KB

assets/images/social/docs/tasks.png

-1.33 KB

assets/images/social/index.png

-1.24 KB

assets/images/social/pricing.png

-1.24 KB

assets/images/social/privacy.png

-620 Bytes

assets/images/social/terms.png

-1.25 KB

blog/archive/2024/index.html

+62
@@ -2172,6 +2172,17 @@
 </label>
 <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+<li class="md-nav__item">
+<a href="#optimizing-inference-cold-starts-on-runpod-with-volumes" class="md-nav__link">
+<span class="md-ellipsis">
+
+Optimizing inference cold starts on RunPod with volumes
+
+</span>
+</a>
+
+</li>
+
 <li class="md-nav__item">
 <a href="#dstack-sky-now-allows-using-your-own-cloud-accounts" class="md-nav__link">
 <span class="md-ellipsis">
@@ -2322,6 +2333,17 @@
 </label>
 <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+<li class="md-nav__item">
+<a href="#optimizing-inference-cold-starts-on-runpod-with-volumes" class="md-nav__link">
+<span class="md-ellipsis">
+
+Optimizing inference cold starts on RunPod with volumes
+
+</span>
+</a>
+
+</li>
+
 <li class="md-nav__item">
 <a href="#dstack-sky-now-allows-using-your-own-cloud-accounts" class="md-nav__link">
 <span class="md-ellipsis">
@@ -2362,6 +2384,46 @@ <h1 id="2024">2024<a class="headerlink" href="#2024" title="Permanent link">&par
 <article class="md-post md-post--excerpt">
 <header class="md-post__header">
 
+<div class="md-post__meta md-meta">
+<ul class="md-meta__list">
+<li class="md-meta__item">
+<time datetime="2024-08-13 00:00:00">August 13, 2024</time></li>
+
+
+
+<li class="md-meta__item">
+
+2 min read
+
+</li>
+
+
+</ul>
+
+</div>
+</header>
+<div class="md-post__content md-typeset">
+<h2 id="optimizing-inference-cold-starts-on-runpod-with-volumes"><a class="toclink" href="../../volumes-on-runpod/">Optimizing inference cold starts on RunPod with volumes</a></h2>
+<p>Deploying custom models in the cloud often faces the challenge of cold start times, including the time to provision a
+new instance and download the model. This is especially relevant for services with autoscaling when new model replicas
+need to be provisioned quickly. </p>
+<p>Let's explore how <code>dstack</code> optimizes this process using volumes, with an example of
+deploying a model on RunPod.</p>
+
+
+<nav class="md-post__action">
+<a href="../../volumes-on-runpod/">
+Continue reading
+</a>
+</nav>
+
+
+</div>
+</article>
+
+<article class="md-post md-post--excerpt">
+<header class="md-post__header">
+
 <div class="md-post__meta md-meta">
 <ul class="md-meta__list">
 <li class="md-meta__item">

blog/dstack-sky-own-cloud-accounts/index.html

+2
@@ -18,6 +18,8 @@
 <link rel="prev" href="../dstack-sky/">
 
 
+<link rel="next" href="../volumes-on-runpod/">
+
 
 
 

blog/index.html

+62
@@ -2088,6 +2088,17 @@
 </label>
 <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+<li class="md-nav__item">
+<a href="#optimizing-inference-cold-starts-on-runpod-with-volumes" class="md-nav__link">
+<span class="md-ellipsis">
+
+Optimizing inference cold starts on RunPod with volumes
+
+</span>
+</a>
+
+</li>
+
 <li class="md-nav__item">
 <a href="#dstack-sky-now-allows-using-your-own-cloud-accounts" class="md-nav__link">
 <span class="md-ellipsis">
@@ -2322,6 +2333,17 @@
 </label>
 <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+<li class="md-nav__item">
+<a href="#optimizing-inference-cold-starts-on-runpod-with-volumes" class="md-nav__link">
+<span class="md-ellipsis">
+
+Optimizing inference cold starts on RunPod with volumes
+
+</span>
+</a>
+
+</li>
+
 <li class="md-nav__item">
 <a href="#dstack-sky-now-allows-using-your-own-cloud-accounts" class="md-nav__link">
 <span class="md-ellipsis">
@@ -2362,6 +2384,46 @@ <h1 id="blog">Blog<a class="headerlink" href="#blog" title="Permanent link">&par
 <article class="md-post md-post--excerpt">
 <header class="md-post__header">
 
+<div class="md-post__meta md-meta">
+<ul class="md-meta__list">
+<li class="md-meta__item">
+<time datetime="2024-08-13 00:00:00">August 13, 2024</time></li>
+
+
+
+<li class="md-meta__item">
+
+2 min read
+
+</li>
+
+
+</ul>
+
+</div>
+</header>
+<div class="md-post__content md-typeset">
+<h2 id="optimizing-inference-cold-starts-on-runpod-with-volumes"><a class="toclink" href="volumes-on-runpod/">Optimizing inference cold starts on RunPod with volumes</a></h2>
+<p>Deploying custom models in the cloud often faces the challenge of cold start times, including the time to provision a
+new instance and download the model. This is especially relevant for services with autoscaling when new model replicas
+need to be provisioned quickly. </p>
+<p>Let's explore how <code>dstack</code> optimizes this process using volumes, with an example of
+deploying a model on RunPod.</p>
+
+
+<nav class="md-post__action">
+<a href="volumes-on-runpod/">
+Continue reading
+</a>
+</nav>
+
+
+</div>
+</article>
+
+<article class="md-post md-post--excerpt">
+<header class="md-post__header">
+
 <div class="md-post__meta md-meta">
 <ul class="md-meta__list">
 <li class="md-meta__item">
