GitHub - llmrb/mruby-llm: mruby's most capable AI runtime

About

mruby-llm is mruby's most capable AI runtime.

It brings a single runtime for providers, agents, tools, skills, MCP, A2A (Agent2Agent), streaming, files, and persisted state to mruby in a form that can be embedded into small standalone applications. The project began as a fork of llm.rb, and a large number of features turned out to be portable. Both projects generally improve each other and code continues to flow both ways.

It supports OpenAI, OpenAI-compatible endpoints, Anthropic, Google Gemini, DeepSeek, xAI, Z.ai, Ollama, and llama.cpp. The mruby port keeps the same overall execution model as llm.rb, but adapts it to mruby constraints.

Quick start

LLM::Context

The LLM::Context object is at the heart of the runtime. Almost all other features build on top of it. It is a low-level interface to a model, and requires tool execution to be managed manually. The LLM::Agent class is almost the same as LLM::Context, but it manages tool execution for you:

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
ctx = LLM::Context.new(llm, stream: $stdout)
ctx.talk("Hello world")

LLM::Agent

The LLM::Agent object is implemented on top of LLM::Context. It provides the same interface, but manages tool execution for you. It also includes loop guards that detect repeated tool-call patterns and advise the model to change course rather than raise an error:

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
agent = LLM::Agent.new(llm, stream: $stdout)
agent.talk("Hello world")

Agents (Advanced)

An agent can be configured to require confirmation before a tool is executed. When a matching tool is called, mruby-llm runs on_tool_confirmation. That callback must decide whether to cancel the tool call or approve it and execute it by calling fn.spawn(strategy).wait, and it must always return an instance of LLM::Function::Return:

require "llm"

class Agent < LLM::Agent
  tools DeleteFile
  confirm "delete-file"

  def on_tool_confirmation(fn, strategy)
    path = fn.arguments.path
    if path.start_with?("/tmp/")
      fn.spawn(strategy).wait
    else
      fn.cancel(reason: "Deletion requires approval")
    end
  end
end

llm = LLM.openai(key: ENV["KEY"])
Agent.new(llm, stream: $stdout).talk("Delete /tmp/example.txt.")

Tools

The LLM::Tool class can be subclassed to implement your own tools that extend the abilities of a model:

class ReadFile < LLM::Tool
  name "read-file"
  description "Read a file"
  parameter :path, String, "The filename or path"
  required %i[path]

  def call(path:)
    {contents: File.read(path)}
  end
end

MCP

The LLM::MCP object lets mruby-llm use tools provided by an MCP server. Those tools are exposed through the same runtime as local tools, so you can pass them to either LLM::Context or LLM::Agent.

Use the stdio transport to run a local MCP server:

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
mcp = LLM::MCP.stdio(argv: ["ruby", "server.rb"])

mcp.run do
  ctx = LLM::Context.new(llm, stream: $stdout, tools: mcp.tools)
  ctx.talk("Use the available tools to inspect the environment.")
  ctx.talk(ctx.wait(:call)) while ctx.functions?
end

Use the HTTP transport with remote MCP servers:

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
mcp = LLM::MCP.http(
  url: "https://remote-mcp.example.com"
)

mcp.run do
  ctx = LLM::Context.new(llm, stream: $stdout, tools: mcp.tools)
  ctx.talk("Use the available tools.")
  ctx.talk(ctx.wait(:call)) while ctx.functions?
end

A2A (Agent 2 Agent)

The LLM::A2A object lets mruby-llm use skills provided by a remote A2A agent. Those skills are exposed through the same runtime as local tools, so you can pass them to either LLM::Context or LLM::Agent.

Use remote skills as local tools:

a2a = LLM::A2A.rest(
  url: "https://remote-agent.example.com",
  headers: {"Authorization" => "Bearer token"}
)
llm = LLM.openai(key: ENV["OPENAI_SECRET"])
ctx = LLM::Context.new(llm, tools: a2a.skills)
ctx.talk "Analyze this CSV and summarize the trends."
ctx.talk(ctx.wait(:call)) while ctx.functions?

For more on direct messaging, task operations, push notification configs, and JSON-RPC, see the LLM::A2A API docs.

Skills

Skills are reusable instructions loaded from a SKILL.md directory. They let you package behavior and tool access together, and they plug into the same runtime as tools, agents, and MCP:

---
name: release
description: Prepare a release
tools: ["read-file"]
---

## Task

Review the release state and summarize what changed.

class ReleaseAgent < LLM::Agent
  model "gpt-4.1-mini"
  skills "./skills/release"
end

LLM::Stream

The LLM::Stream object lets you observe output and runtime events as they happen. You can subclass it to handle streamed content in your own application:

require "llm"

class Stream < LLM::Stream
  def on_content(content)
    $stdout << content
  end
end

llm = LLM.openai(key: ENV["KEY"])
ctx = LLM::Context.new(llm, stream: Stream.new)
ctx.talk "Write a haiku about Ruby."

LLM::Stream (advanced)

The LLM::Stream object can also resolve tool calls while output is still streaming. In on_tool_call, you can spawn the tool, push the work onto the stream queue, and later drain it with wait:

require "llm"

class Stream < LLM::Stream
  def on_content(content)
    $stdout << content
  end

  def on_tool_call(tool, error)
    return queue << error if error
    queue << ctx.spawn(tool, :call)
  end
end

llm = LLM.openai(key: ENV["KEY"])
ctx = LLM::Context.new(llm, stream: Stream.new, tools: [ReadFile])
ctx.talk "Read README.md and summarize the quick start."
ctx.talk(ctx.wait) while ctx.functions?

Concurrency

llm.rb can run tool work concurrently. In mruby-llm, the available strategy is :call for sequential execution. On LLM::Agent, you can set this with concurrency:

class Agent < LLM::Agent
  model "gpt-4.1-mini"
  tools ReadFile
  concurrency :call
end

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
agent = Agent.new(llm, stream: $stdout)
agent.talk "Read README.md and CHANGELOG.md and compare them."

Context Compaction

Long-lived conversations can be compacted automatically. The LLM::Compactor summarizes older history and replaces it with a compact summary. Set token_threshold: as a percentage of the model's context window:

require "llm"

class Stream < LLM::Stream
  def on_compaction(ctx, compactor)
    puts "Compacting #{ctx.messages.size} messages..."
  end

  def on_compaction_finish(ctx, compactor)
    puts "Compacted to #{ctx.messages.size} messages."
  end
end

llm = LLM.openai(key: ENV["OPENAI_SECRET"])
ctx = LLM::Context.new(
  llm,
  stream: Stream.new,
  compactor: {
    token_threshold: "90%",
    retention_window: 8
  }
)

Serialization

The LLM::Context object can be serialized to JSON, which makes it suitable for storing in a file, a database column, or a Redis queue:

require "llm"

llm = LLM.openai(key: ENV["OPENAI_SECRET"])

# Serialize a context
ctx1 = LLM::Context.new(llm)
ctx1.talk "Remember that my favorite language is Ruby"
string = ctx1.to_json

# Restore a context (from JSON)
ctx2 = LLM::Context.new(llm, stream: $stdout)
ctx2.restore(string:)
ctx2.talk "What is my favorite language?"

Integration

Add to your mruby build config:

MRuby::Build.new("app") do |conf|
  curldir = File.expand_path(ENV["CURLDIR"] || "/usr/local")
  conf.toolchain

  conf.cc.include_paths << File.join(curldir, "include")
  conf.linker.library_paths << File.join(curldir, "lib")

  conf.gembox "default"
  conf.gem github: "llmrb/mruby-llm", branch: "main"
  conf.enable_debug
end

For local development in this repository, use the bundled Makefile:

make
make test

The Makefile expects an mruby checkout at ../mruby. Override that with MRUBY_DIR=/absolute/path/to/mruby if needed.

For direct integration into another mruby build, build through your mruby checkout:

ruby minirake MRUBY_CONFIG=/absolute/path/to/build_config.rb

Dependencies are declared in mrbgem.rake. In practice the main external build requirement is libcurl, because the runtime depends on mruby-curl and mruby-http. The local build also expects curl headers and libraries under /usr/local by default; override with CURLDIR=/absolute/path.

Dependencies

Declared mrbgem dependencies include:

mruby-http
mruby-curl
mruby-json
mruby-stringio
mruby-process
mruby-io
mruby-time
mruby-env
mruby-struct
mruby-regexp

See mrbgem.rake.

Resources

doc site has the API docs.
llm.rb is the CRuby runtime this is based on.

License

BSD Zero Clause
See LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
.github/workflows		.github/workflows
data		data
examples		examples
mrblib		mrblib
spec		spec
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
build.rb		build.rb
llm-small.png		llm-small.png
llm.png		llm.png
mrbgem.rake		mrbgem.rake

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Quick start

LLM::Context

LLM::Agent

Agents (Advanced)

Tools

MCP

A2A (Agent 2 Agent)

Skills

LLM::Stream

LLM::Stream (advanced)

Concurrency

Context Compaction

Serialization

Integration

Dependencies

Resources

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Quick start

LLM::Context

LLM::Agent

Agents (Advanced)

Tools

MCP

A2A (Agent 2 Agent)

Skills

LLM::Stream

LLM::Stream (advanced)

Concurrency

Context Compaction

Serialization

Integration

Dependencies

Resources

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages