diff --git a/docs/customize/model-providers/more/vllm.mdx b/docs/customize/model-providers/more/vllm.mdx index 3d15982dbb7..9c9fce67ce4 100644 --- a/docs/customize/model-providers/more/vllm.mdx +++ b/docs/customize/model-providers/more/vllm.mdx @@ -3,7 +3,7 @@ title: "vLLM" description: "Configure vLLM's high-performance inference library with Continue for chat, autocomplete, and embeddings, including setup instructions for Llama3.1, Qwen2.5-Coder, and Nomic Embed models" --- -vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/usage/engine_args.html). +vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/configuration/engine_args.html). ```shell vllm serve meta-llama/Meta-Llama-3.1-8B-Instruct diff --git a/docs/guides/netlify-mcp-continuous-deployment.mdx b/docs/guides/netlify-mcp-continuous-deployment.mdx index 86348ffb8a7..3dabf3e534a 100644 --- a/docs/guides/netlify-mcp-continuous-deployment.mdx +++ b/docs/guides/netlify-mcp-continuous-deployment.mdx @@ -252,7 +252,7 @@ Show me the netlify.toml configuration and explain each optimization" **Netlify Build Features You're Getting**: - - [Build Caching](https://docs.netlify.com/configure-builds/build-caching/) - Persist dependencies between builds + - [Build Caching](https://docs.netlify.com/) - Persist dependencies between builds (the old `/configure-builds/build-caching/` URL was retired in the Netlify docs restructure) - [Build Plugins](https://docs.netlify.com/integrations/build-plugins/) - Extend build process with 100+ plugins - [Conditional Builds](https://docs.netlify.com/configure-builds/ignore-builds/) - Skip unnecessary builds - [Monorepo Support](https://docs.netlify.com/configure-builds/monorepos/) - Optimize multi-package repos @@ -311,13 +311,13 @@ Generate a full report with before/after bundle sizes" **Features Available Through Netlify MCP**: - [Bundle - Analyzer](https://docs.netlify.com/configure-builds/build-plugins/bundle-analyzer/): Visualize your JavaScript bundles + Analyzer](https://docs.netlify.com/): Visualize your JavaScript bundles (the old `/configure-builds/build-plugins/bundle-analyzer/` URL was retired in the Netlify docs restructure) - [Asset - Optimization](https://docs.netlify.com/configure-builds/post-processing/): - Automatic minification and compression + Optimization](https://docs.netlify.com/): + Automatic minification and compression (the old `/configure-builds/post-processing/` URL was retired in the Netlify docs restructure) - [Edge - Network](https://docs.netlify.com/platform/edge-network/): Global CDN with - smart caching + Network](https://docs.netlify.com/): Global CDN with + smart caching (the old `/platform/edge-network/` URL was retired in the Netlify docs restructure) - [Deploy Previews](https://docs.netlify.com/site-deploys/deploy-previews/): Test optimizations before production