Skip to content

feat(vllm): single-shot mode + no-gateway offchain call#9

Draft
rickstaa wants to merge 2 commits into
mainfrom
rs/vllm-single-shot-direct
Draft

feat(vllm): single-shot mode + no-gateway offchain call#9
rickstaa wants to merge 2 commits into
mainfrom
rs/vllm-single-shot-direct

Conversation

@rickstaa

@rickstaa rickstaa commented Jul 1, 2026

Copy link
Copy Markdown
Member

Draft — blocked on go-livepeer#3955 (single-shot on-chain payment). Opening now so we don't lose the no-gateway offchain path. Relates to #4, #5.

What

  • runners.json: mode persistent → single-shot; add routing: "label".
  • README: document calling vLLM directly through the orchestrator (/apps/vllm/app/...) with plain curl — no gateway.py, no SDK. Offchain-only.
  • README: on-chain section is a non-rendering pre-merge TODO (this PR merges only after #3955, when single-shot billing exists — so no "not implemented" caveat ships).

Why

Single-shot runners are reachable at the orchestrator's single-shot proxy route (ProxyLiveRunnerSingleShot), so a plain curl streams SSE with zero dependencies — and it shows the orch reverse-proxy is streaming-transparent. Persistent runners can't (route returns 400 "runner is not single-shot").

Pre-merge checklist (do before undrafting)

rickstaa and others added 2 commits July 1, 2026 15:15
Switch the vLLM runner to single-shot with label routing, and document
calling it directly through the orchestrator (/apps/vllm/app/...) with
plain curl -- no gateway.py, no SDK. Offchain-only until single-shot
on-chain payment lands (go-livepeer#3955), so the on-chain section is a
pending note for now.

Relates to #4, #5.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant