continuedev · sanjibani · Jun 22, 2026 · Jun 22, 2026
@@ -3,7 +3,7 @@ title: "vLLM"
 description: "Configure vLLM's high-performance inference library with Continue for chat, autocomplete, and embeddings, including setup instructions for Llama3.1, Qwen2.5-Coder, and Nomic Embed models"
 ---
 
-vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/usage/engine_args.html).
+vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/configuration/engine_args.html).
 
 ```shell
 vllm serve meta-llama/Meta-Llama-3.1-8B-Instruct

@@ -252,7 +252,7 @@ Show me the netlify.toml configuration and explain each optimization"
 
 <Info>
   **Netlify Build Features You're Getting**:
-  - [Build Caching](https://docs.netlify.com/configure-builds/build-caching/) - Persist dependencies between builds
+  - [Build Caching](https://docs.netlify.com/) - Persist dependencies between builds (the old `/configure-builds/build-caching/` URL was retired in the Netlify docs restructure)
   - [Build Plugins](https://docs.netlify.com/integrations/build-plugins/) - Extend build process with 100+ plugins
   - [Conditional Builds](https://docs.netlify.com/configure-builds/ignore-builds/) - Skip unnecessary builds
   - [Monorepo Support](https://docs.netlify.com/configure-builds/monorepos/) - Optimize multi-package repos
@@ -311,13 +311,13 @@ Generate a full report with before/after bundle sizes"
 
     **Features Available Through Netlify MCP**: 
     - [Bundle
-    Analyzer](https://docs.netlify.com/configure-builds/build-plugins/bundle-analyzer/): Visualize your JavaScript bundles 
+    Analyzer](https://docs.netlify.com/): Visualize your JavaScript bundles (the old `/configure-builds/build-plugins/bundle-analyzer/` URL was retired in the Netlify docs restructure)
     - [Asset
-    Optimization](https://docs.netlify.com/configure-builds/post-processing/):
-    Automatic minification and compression 
+    Optimization](https://docs.netlify.com/):
+    Automatic minification and compression (the old `/configure-builds/post-processing/` URL was retired in the Netlify docs restructure)
     - [Edge
-    Network](https://docs.netlify.com/platform/edge-network/): Global CDN with
-    smart caching 
+    Network](https://docs.netlify.com/): Global CDN with
+    smart caching (the old `/platform/edge-network/` URL was retired in the Netlify docs restructure) 
     - [Deploy
     Previews](https://docs.netlify.com/site-deploys/deploy-previews/): Test
     optimizations before production