diff --git a/docs/docs/concepts/backends.md b/docs/docs/concepts/backends.md
index 291857139e..7c1eb0b791 100644
--- a/docs/docs/concepts/backends.md
+++ b/docs/docs/concepts/backends.md
@@ -1,18 +1,16 @@
 # Backends
 
-Backends allow `dstack` to manage compute across various environments.
-They can be configured via `~/.dstack/server/config.yml` or through the [project settings page](../concepts/projects.md#backends) in the UI.
+Backends allow `dstack` to provision fleets across cloud providers or Kubernetes clusters.
 
-`dstack` supports three types of backends: 
+`dstack` supports two types of backends: 
 
   * [VM-based](#vm-based) – use `dstack`'s native integration with cloud providers to provision VMs, manage clusters, and orchestrate container-based runs.  
   * [Container-based](#container-based) – use either `dstack`'s native integration with cloud providers or Kubernetes to orchestrate container-based runs; provisioning in this case is delegated to the cloud provider or Kubernetes.  
-  * [On-prem](#on-prem) – use `dstack`'s native support for on-prem servers without needing Kubernetes.  
 
-!!! info "dstack Sky"
-    If you're using [dstack Sky](https://sky.dstack.ai), backend configuration is optional. dstack Sky lets you use pre-configured backends to access GPU marketplace.
+??? info "SSH fleets"
+    When using `dstack` with on-prem servers, backend configuration isn’t required. Simply create [SSH fleets](../concepts/fleets.md#ssh-fleets) once the server is up.
 
-See the examples of backend configuration below.
+Backends can be configured via `~/.dstack/server/config.yml` or through the [project settings page](../concepts/projects.md#backends) in the UI. See the examples of backend configuration below.
 
 ## VM-based
 
@@ -1154,16 +1152,6 @@ projects:
 
 Also, the `vastai` backend supports on-demand instances only. Spot instance support coming soon.
 
-## On-prem
-
-In on-prem environments, the [Kubernetes](#kubernetes) backend can be used if a Kubernetes cluster is already set up and configured.  
-However, often [SSH fleets](../concepts/fleets.md#ssh-fleets) are a simpler and lighter alternative.
-
-### SSH fleets
-
-SSH fleets require no backend configuration. 
-All you need to do is [provide hostnames and SSH credentials](../concepts/fleets.md#ssh-fleets), and `dstack` sets up a fleet that can orchestrate container-based runs on your servers.
-
 SSH fleets support the same features as [VM-based](#vm-based) backends.
 
 !!! info "What's next"
diff --git a/docs/docs/concepts/dev-environments.md b/docs/docs/concepts/dev-environments.md
index ca64857868..4d46e73ac4 100644
--- a/docs/docs/concepts/dev-environments.md
+++ b/docs/docs/concepts/dev-environments.md
@@ -2,9 +2,12 @@
 
 A dev environment lets you provision an instance and access it with your desktop IDE.
 
+??? info "Prerequisites"
+    Before running a dev environment, make sure you’ve [installed](../installation/index.md) the server and CLI, and created a [fleet](fleets.md).
+
 ## Apply a configuration
 
-First, define a dev environment configuration as a YAML file in your project folder.
+First, define a dev environment configuration as a YAML file.
 The filename must end with `.dstack.yml` (e.g. `.dstack.yml` or `dev.dstack.yml` are both acceptable).
 
 <div editor-title="examples/.dstack.yml"> 
diff --git a/docs/docs/concepts/fleets.md b/docs/docs/concepts/fleets.md
index 58c0f29be1..90c323be37 100644
--- a/docs/docs/concepts/fleets.md
+++ b/docs/docs/concepts/fleets.md
@@ -4,27 +4,31 @@ Fleets act both as pools of instances and as templates for how those instances a
 
 `dstack` supports two kinds of fleets: 
 
-* [Backend fleets](#backend-fleets) – dynamically provisioned through configured backends; they are supported with any type of backends: [VM-based](backends.md#vm-based) and [container-based](backends.md#container-based) (incl. [`kubernetes`](backends.md#kubernetes))
+* [Backend fleets](#backend-fleets) – dynamically provisioned through configured backends; they are supported with any type of backends: [VM-based](backends.md#vm-based) and [container-based](backends.md#container-based) (incl. [Kubernetes](backends.md#kubernetes))
 * [SSH fleets](#ssh-fleets) – created using on-prem servers; do not require backends
 
-When you run `dstack apply` to start a dev environment, task, or service, `dstack` will reuse idle instances from an existing fleet whenever available.
+When you submit a dev environment, task, or service, `dstack` reuses idle instances or provisions new ones based on the fleet configuration. 
+
+> You must create a fleet before submitting runs.
 
 ## Backend fleets
 
-If you configured [backends](backends.md), `dstack` can provision fleets on the fly.
-However, it’s recommended to define fleets explicitly.
+Backend fleets allow provisioning compute across cloud providers or Kubernetes clusters. 
 
-### Apply a configuration
+??? info "Prerequisites"
+    Before creating a backend fleet, make sure to configure the corresponding [backends](backends.md).
+
+### Apply the configuration
 
-To create a backend fleet, define a configuration as a YAML file in your project directory. The file must have a
+To create a backend fleet, define a configuration as a YAML file. The file must have a
 `.dstack.yml` extension (e.g. `.dstack.yml` or `fleet.dstack.yml`).
 
-<div editor-title="examples/misc/fleets/.dstack.yml">
+<div editor-title="fleet.dstack.yml">
     
     ```yaml
     type: fleet
     # The name is optional, if not specified, generated randomly
-    name: default-fleet
+    name: default
     
     # Can be a range or a fixed number
     # Allow to provision of up to 2 instances
@@ -48,7 +52,7 @@ To create or update the fleet, pass the fleet configuration to [`dstack apply`](
 <div class="termy">
 
 ```shell
-$ dstack apply -f examples/misc/fleets/.dstack.yml
+$ dstack apply -f fleet.dstack.yml
 
 Provisioning...
 ---> 100%
@@ -59,18 +63,15 @@ Provisioning...
 
 </div>
 
-`dstack` always keeps the minimum number of nodes provisioned. Additional instances, up to the maximum limit, are provisioned on demand.
-
-!!! info "Container-based backends"
-    For [container-based](backends.md#container-based) backends  (such as `kubernetes`, `runpod`, etc), `nodes` must be defined as a range starting with `0`. In these cases, instances are provisioned on demand as needed.
+If `nodes` is a range that starts above `0`, `dstack` pre-creates the initial number of instances up front, while any additional ones are created on demand. 
 
-    <!-- TODO: Ensure the user sees the error or warning otherwise -->
+> Setting the `nodes` range to start above `0` is supported only for [VM-based backends](backends.md#vm-based).
 
 ??? info "Target number of nodes"
 
     If `nodes` is defined as a range, you can start with more than the minimum number of instances by using the `target` parameter when creating the fleet.
 
-    <div editor-title=".dstack.yml"> 
+    <div editor-title="fleet.dstack.yml"> 
 
     ```yaml
     type: fleet
diff --git a/docs/docs/concepts/services.md b/docs/docs/concepts/services.md
index 716ab211ed..24a0187de8 100644
--- a/docs/docs/concepts/services.md
+++ b/docs/docs/concepts/services.md
@@ -2,6 +2,9 @@
 
 Services allow you to deploy models or web apps as secure and scalable endpoints.
 
+??? info "Prerequisites"
+    Before running a service, make sure you’ve [installed](../installation/index.md) the server and CLI, and created a [fleet](fleets.md).
+
 ## Apply a configuration
 
 First, define a service configuration as a YAML file in your project folder.
diff --git a/docs/docs/concepts/tasks.md b/docs/docs/concepts/tasks.md
index ea0d775fb1..d6e2fb5591 100644
--- a/docs/docs/concepts/tasks.md
+++ b/docs/docs/concepts/tasks.md
@@ -1,11 +1,13 @@
 # Tasks
 
-A task allows you to run arbitrary commands on one or more nodes.
-They are best suited for jobs like training or batch processing.
+A task allows you to run arbitrary commands on one or more nodes. They are best suited for jobs like training or batch processing.
+
+??? info "Prerequisites"
+    Before running a task, make sure you’ve [installed](../installation/index.md) the server and CLI, and created a [fleet](fleets.md).
 
 ## Apply a configuration
 
-First, define a task configuration as a YAML file in your project folder.
+First, define a task configuration as a YAML file.
 The filename must end with `.dstack.yml` (e.g. `.dstack.yml` or `dev.dstack.yml` are both acceptable).
 
 [//]: # (TODO: Make tabs - single machine & distributed tasks & web app)
diff --git a/docs/docs/installation/index.md b/docs/docs/installation/index.md
index 5d24373fd9..aad8741b66 100644
--- a/docs/docs/installation/index.md
+++ b/docs/docs/installation/index.md
@@ -8,11 +8,7 @@
 
 ### Configure backends
 
-To orchestrate compute across cloud providers or existing Kubernetes clusters, you need to configure backends.
-
-Backends can be set up in `~/.dstack/server/config.yml` or through the [project settings page](../concepts/projects.md#backends) in the UI.
-
-For more details, see [Backends](../concepts/backends.md).
+To orchestrate compute across cloud providers or Kubernetes clusters, you need to configure [backends](../concepts/backends.md).
 
 ??? info "SSH fleets"
     When using `dstack` with on-prem servers, backend configuration isn’t required. Simply create [SSH fleets](../concepts/fleets.md#ssh-fleets) once the server is up.
@@ -79,8 +75,7 @@ The server can run on your laptop or any environment with access to the cloud an
 To verify that backends are properly configured, use the [`dstack offer`](../reference/cli/dstack/offer.md#list-gpu-offers) command to list available GPU offers.
 
 !!! info "Server deployment"
-    For more details on server deployment options, see the
-    [Server deployment](../guides/server-deployment.md) guide.
+    For more details on server deployment options, see the [Server deployment](../guides/server-deployment.md) guide.
 
 ## Set up the CLI
 
@@ -206,5 +201,5 @@ This configuration is stored in `~/.dstack/config.yml`.
     1. Follow [Quickstart](../quickstart.md)
     2. See [Backends](../concepts/backends.md)
     3. Check the [server deployment](../guides/server-deployment.md) guide
-    4. Browse [examples](/examples)
+    4. Browse [examples](../../examples.md)
     5. Join the community via [Discord](https://discord.gg/u8SmfwPpMd)
diff --git a/docs/docs/quickstart.md b/docs/docs/quickstart.md
index 7b71519dbe..759ec1b573 100644
--- a/docs/docs/quickstart.md
+++ b/docs/docs/quickstart.md
@@ -1,81 +1,108 @@
 # Quickstart
 
-> Before using `dstack`, ensure you've [installed](installation/index.md) the server.
+??? info "Prerequsites"
+    Before using `dstack`, ensure you've [installed](installation/index.md) the server and the CLI.
 
-## Set up a directory
-    
-Set up a directory where you'll store you project files and `dstack` configurations.
+## Create a fleet
 
-<div class="termy">
+Before you can submit your first run, you have to create a [fleet](concepts/fleets.md). 
 
-```shell
-$ mkdir quickstart && cd quickstart
-```
+=== "Backend fleet"
+    If you're using cloud providers or Kubernetes clusters and have configured the corresponding [backends](concepts/backends.md), create a fleet as follows:
 
-</div>
+    <div editor-title="fleet.dstack.yml"> 
 
-## Create a fleet
+    ```yaml
+    type: fleet
+    name: default
 
-If [backends](concepts/backends.md) are configured, `dstack` can create a new [backend fleet](concepts/fleets.md#backend-fleets) on the fly. However, it’s recommended to create them explicitly.
+    # Allow to provision of up to 2 instances
+    nodes: 0..2
 
-<h3>Define a configuration</h3>
+    # Deprovision instances above the minimum if they remain idle
+    idle_duration: 1h
 
-Create the following fleet configuration inside your project folder:
+    resources:
+      # Allow to provision up to 8 GPUs
+      gpu: 0..8
+    ```
 
-<div editor-title="fleet.dstack.yml"> 
+    </div>
 
-```yaml
-type: fleet
-name: default
+    Pass the fleet configuration to `dstack apply`:
 
-# Allow to provision of up to 2 instances
-nodes: 0..2
+    <div class="termy">
 
-# Deprovision instances above the minimum if they remain idle
-idle_duration: 1h
+    ```shell
+    $ dstack apply -f fleet.dstack.yml
+        
+      #  BACKEND  REGION           RESOURCES                 SPOT  PRICE
+      1  gcp      us-west4         2xCPU, 8GB, 100GB (disk)  yes   $0.010052
+      2  azure    westeurope       2xCPU, 8GB, 100GB (disk)  yes   $0.0132
+      3  gcp      europe-central2  2xCPU, 8GB, 100GB (disk)  yes   $0.013248
 
-resources:
-  # Allow to provision up to 8 GPUs
-  gpu: 0..8
-```
+    Create the fleet? [y/n]: y
 
-</div>
+      FLEET    INSTANCE  BACKEND  RESOURCES  PRICE  STATUS  CREATED 
+      defalut  -         -        -          -      -       10:36
+    ```
 
-<h3>Apply the configuration</h3>
+    </div>
 
-Apply the configuration via [`dstack apply`](reference/cli/dstack/apply.md):
+    If `nodes` is a range that starts above `0`, `dstack` pre-creates the initial number of instances up front, while any additional ones are created on demand. 
+    
+    > Setting the `nodes` range to start above `0` is supported only for [VM-based backends](concepts/backends.md#vm-based).
 
-<div class="termy">
+    If the fleet needs to be a cluster, the [placement](concepts/fleets.md#backend-placement) property must be set to `cluster`. 
+    
+=== "SSH fleet"
+    If you have a group of on-prem servers accessible via SSH, you can create an SSH fleet as follows:
 
-```shell
-$ dstack apply -f fleet.dstack.yml
+    <div editor-title="fleets.dstack.yml"> 
     
-     #  BACKEND  REGION           RESOURCES                 SPOT  PRICE
-     1  gcp      us-west4         2xCPU, 8GB, 100GB (disk)  yes   $0.010052
-     2  azure    westeurope       2xCPU, 8GB, 100GB (disk)  yes   $0.0132
-     3  gcp      europe-central2  2xCPU, 8GB, 100GB (disk)  yes   $0.013248
+    ```yaml
+    type: fleet
+    name: my-fleet
+
+    ssh_config:
+      user: ubuntu
+      identity_file: ~/.ssh/id_rsa
+      hosts:
+        - 3.255.177.51
+        - 3.255.177.52
+    ```
+      
+    </div>
 
-Fleet cloud-fleet does not exist yet.
-Create the fleet? [y/n]: y
- FLEET          INSTANCE  BACKEND  RESOURCES  PRICE  STATUS  CREATED 
- defalut-fleet  -         -        -          -      -       10:36
-```
+    Pass the fleet configuration to `dstack apply`:
 
-</div>
+    <div class="termy">
+
+    ```shell
+    $ dstack apply -f fleet.dstack.yml
+        
+    Provisioning...
+    ---> 100%
+
+      FLEET     INSTANCE  GPU             PRICE  STATUS  CREATED 
+      my-fleet  0         L4:24GB (spot)  $0     idle    3 mins ago      
+                1         L4:24GB (spot)  $0     idle    3 mins ago    
+    ```
+
+    </div>
 
-Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
+    > Hosts must have Docker and GPU drivers installed and meet the other [requirements](concepts/fleets.md#ssh-fleets).
+
+    If the fleet needs to be a cluster, the [placement](concepts/fleets.md#ssh-placement) property must be set to `cluster`.
 
 ## Submit your first run
 
 `dstack` supports three types of run configurations.
 
 === "Dev environment"
-
     A [dev environment](concepts/dev-environments.md) lets you provision an instance and access it with your desktop IDE.
 
-    <h3>Define a configuration</h3>
-
-    Create the following run configuration inside your project folder:
+    Create the following run configuration:
 
     <div editor-title=".dstack.yml"> 
 
@@ -96,9 +123,7 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
 
     </div>
 
-    <h3>Apply the configuration</h3>
-
-    Apply the configuration via [`dstack apply`](reference/cli/dstack/apply.md):
+    Apply the configuration via `dstack apply`:
 
     <div class="termy">
 
@@ -121,17 +146,12 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
     
     </div>
 
-    Open the link to access the dev environment using your desktop IDE.
-
-    Alternatively, you can access it via `ssh <run name>`.
+    Open the link to access the dev environment using your desktop IDE. Alternatively, you can access it via `ssh <run name>`.
 
 === "Task"
-
     A [task](concepts/tasks.md) allows you to schedule a job or run a web app. Tasks can be distributed and can forward ports.
 
-    <h3>Define a configuration</h3>
-
-    Create the following run configuration inside your project folder:
+    Create the following run configuration:
 
     <div editor-title="task.dstack.yml"> 
 
@@ -161,9 +181,7 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
     By default, tasks run on a single instance. To run a distributed task, specify 
     [`nodes`](concepts/tasks.md#distributed-tasks), and `dstack` will run it on a cluster.
 
-    <h3>Apply the configuration</h3>
-
-    Run the configuration via [`dstack apply`](reference/cli/dstack/apply.md):
+    Run the configuration via `dstack apply`:
 
     <div class="termy">
 
@@ -190,12 +208,9 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
     If you specified `ports`, they will be automatically forwarded to `localhost` for convenient access.
 
 === "Service"
-
     A [service](concepts/services.md) allows you to deploy a model or any web app as an endpoint.
 
-    <h3>Define a configuration</h3>
-
-    Create the following run configuration inside your project folder:
+    Create the following run configuration:
 
     <div editor-title="service.dstack.yml"> 
 
@@ -226,9 +241,7 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
 
     </div>
 
-    <h3>Apply the configuration</h3>
-
-    Run the configuration via [`dstack apply`](reference/cli/dstack/apply.md):
+    Run the configuration via `dstack apply`:
 
     <div class="termy">
 
@@ -254,15 +267,9 @@ Alternatively, you can create an [SSH fleet](concepts/fleets#ssh-fleets).
     
     </div>
 
-    !!! info "Gateway"
-        To enable [auto-scaling](concepts/services.md#replicas-and-scaling),
-        [rate limits](concepts/services.md#rate-limits),
-        or use a custom domain with HTTPS, 
-        set up a [gateway](concepts/gateways.md) before running the service.
-        If you're using [dstack Sky](https://sky.dstack.ai),
-        a gateway is pre-configured for you.
+    > To enable auto-scaling rate limits, or use a custom domain with HTTPS, set up a [gateway](concepts/gateways.md) before running the service.
 
-`dstack apply` automatically provisions instances and runs the workload according to the configuration.
+`dstack apply` automatically provisions instances with created fleets and runs the workload according to the configuration.
 
 ## Troubleshooting