Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docs/customize/model-providers/more/vllm.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ title: "vLLM"
description: "Configure vLLM's high-performance inference library with Continue for chat, autocomplete, and embeddings, including setup instructions for Llama3.1, Qwen2.5-Coder, and Nomic Embed models"
---

vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/usage/engine_args.html).
vLLM is an open-source library for fast LLM inference which typically is used to serve multiple users at the same time. It can also be used to run a large model on multiple GPU:s (e.g. when it doesn´t fit in a single GPU). Run their OpenAI-compatible server using `vllm serve`. See their [server documentation](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) and the [engine arguments documentation](https://docs.vllm.ai/en/latest/configuration/engine_args.html).

```shell
vllm serve meta-llama/Meta-Llama-3.1-8B-Instruct
Expand Down
12 changes: 6 additions & 6 deletions docs/guides/netlify-mcp-continuous-deployment.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -252,7 +252,7 @@ Show me the netlify.toml configuration and explain each optimization"

<Info>
**Netlify Build Features You're Getting**:
- [Build Caching](https://docs.netlify.com/configure-builds/build-caching/) - Persist dependencies between builds
- [Build Caching](https://docs.netlify.com/) - Persist dependencies between builds (the old `/configure-builds/build-caching/` URL was retired in the Netlify docs restructure)
- [Build Plugins](https://docs.netlify.com/integrations/build-plugins/) - Extend build process with 100+ plugins
- [Conditional Builds](https://docs.netlify.com/configure-builds/ignore-builds/) - Skip unnecessary builds
- [Monorepo Support](https://docs.netlify.com/configure-builds/monorepos/) - Optimize multi-package repos
Expand Down Expand Up @@ -311,13 +311,13 @@ Generate a full report with before/after bundle sizes"

**Features Available Through Netlify MCP**:
- [Bundle
Analyzer](https://docs.netlify.com/configure-builds/build-plugins/bundle-analyzer/): Visualize your JavaScript bundles
Analyzer](https://docs.netlify.com/): Visualize your JavaScript bundles (the old `/configure-builds/build-plugins/bundle-analyzer/` URL was retired in the Netlify docs restructure)
- [Asset
Optimization](https://docs.netlify.com/configure-builds/post-processing/):
Automatic minification and compression
Optimization](https://docs.netlify.com/):
Automatic minification and compression (the old `/configure-builds/post-processing/` URL was retired in the Netlify docs restructure)
- [Edge
Network](https://docs.netlify.com/platform/edge-network/): Global CDN with
smart caching
Network](https://docs.netlify.com/): Global CDN with
smart caching (the old `/platform/edge-network/` URL was retired in the Netlify docs restructure)
- [Deploy
Previews](https://docs.netlify.com/site-deploys/deploy-previews/): Test
optimizations before production
Expand Down
Loading