evo2 infer: `generate` CLI mode (steer from the command line) by polinabinder1 · Pull Request #1632 · NVIDIA-BioNeMo/bionemo-framework

polinabinder1 · 2026-06-11T20:01:35Z

Summary

Adds a generate CLI mode so you can steer SAE features from the command line — a fourth
mode alongside serve / encode / batch. Stacked on #1622 (base = pbinder/evo2-sae-serve)
so the verified engine + server core stays frozen.

# 1. find an active feature on a sequence
./launch_inference.sh encode --sequence ATGGCC...GTGCAT --organism Human --top-k 8
#    → {"feature_id": 29244, "label": "motif_ATG", "max_activation": 109.8}, ...
# 2. steer generation with it (strength ~2-3x its peak; repeat --clamp for more features)
./launch_inference.sh generate --prompt ATGGCC... --organism Human --clamp 29244:300
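
The "strength ~2-3x its peak" rule of thumb in step 2 can be sketched as a small calculation. This is only an illustration: `clamp_strength_range` is a hypothetical helper, not part of the PR, and it assumes each `encode` result can be read as a standalone JSON object with a `max_activation` field, as in the sample output above.

```python
import json
from typing import Tuple


def clamp_strength_range(encode_record: str) -> Tuple[int, int]:
    """Return (2x, 3x) of a feature's peak activation, rounded to integers.

    Hypothetical helper: the PR only states the 2-3x rule of thumb.
    """
    record = json.loads(encode_record)
    peak = record["max_activation"]
    return round(peak * 2), round(peak * 3)


# Sample record copied from the encode output shown in the PR description.
record = '{"feature_id": 29244, "label": "motif_ATG", "max_activation": 109.8}'
low, high = clamp_strength_range(record)
print(f"--clamp 29244:<strength>, with strength between {low} and {high}")
```

For the 109.8 peak above this gives a range of 220 to 329, which brackets the strength of 300 used in the example command.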

What's here (diff vs #1622)

cli.py: generate subparser + handler (reuses engine.generate); _parse_clamps turns
repeatable --clamp FEATURE_ID[:STRENGTH] args into feature specs.
launch_inference.sh: usage now documents generate + the encode→generate steering loop.
tests/test_cli.py: CPU coverage of _parse_clamps (3 tests, pass).
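
A minimal sketch of how repeatable `--clamp FEATURE_ID[:STRENGTH]` arguments could be parsed is shown below. It is not the PR's `_parse_clamps`: the `(feature_id, strength)` tuple shape and the `DEFAULT_STRENGTH` value are assumptions made for illustration, since the PR states neither the feature-spec format nor the default.

```python
import argparse
from typing import List, Optional, Tuple

# Assumed default; the real default strength is not stated in the PR.
DEFAULT_STRENGTH = 1.0


def parse_clamps(clamps: Optional[List[str]]) -> List[Tuple[int, float]]:
    """Turn ["29244:300", "17"] into [(29244, 300.0), (17, DEFAULT_STRENGTH)]."""
    specs = []
    for raw in clamps or []:
        # partition() leaves sep empty when no ":" is present.
        feature_id, sep, strength = raw.partition(":")
        specs.append((int(feature_id), float(strength) if sep else DEFAULT_STRENGTH))
    return specs


parser = argparse.ArgumentParser(prog="generate")
parser.add_argument("--prompt", required=True)
# action="append" collects every --clamp occurrence into one list.
parser.add_argument("--clamp", action="append", default=[],
                    metavar="FEATURE_ID[:STRENGTH]")

args = parser.parse_args(
    ["--prompt", "ATGGCC", "--clamp", "29244:300", "--clamp", "17"]
)
print(parse_clamps(args.clamp))
```

The three cases named under Tests / verification (id:strength, default strength, empty) correspond to `"29244:300"`, `"17"`, and an empty list in this sketch.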

Tests / verification

_parse_clamps — 3 CPU tests pass (id:strength, default strength, empty).
The generation path itself is the engine's generate, GPU-verified by test_steering in #1622 (evo2 SAE recipe: live inference engine + steering server + CLI).

Merge #1622 first; GitHub auto-retargets this to main afterward.

🤖 Generated with Claude Code

Fourth CLI mode alongside serve/encode/batch: `launch_inference.sh generate --prompt ATGC... --clamp FEATURE_ID[:STRENGTH]` runs steered generation from the command line (repeat --clamp for several features). Reuses the verified engine.generate; `_parse_clamps` turns the repeatable --clamp args into feature specs. The usage text documents the encode->generate steering loop. Stacked on #1622 (the engine + server), so the verified core stays frozen. test_cli.py covers the clamp parsing (CPU); the generation path itself is the engine's (GPU-verified in #1622's test_steering).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Signed-off-by: Polina Binder <pbinder@nvidia.com>

coderabbitai · 2026-06-11T20:02:16Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

…_sae serve`

Shrink the inference PR to the engine + server + their tests. The encode/batch/generate command-line tools (cli.py) and launch_inference.sh move to the stacked CLI PR (#1632); the server stays launchable here via `python -m evo2_sae serve` (__main__.py, env-configured). fasta.py stays (shared by the extraction-side chunk_fasta.py and, via the base, the CLI).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Signed-off-by: Polina Binder <pbinder@nvidia.com>

polinabinder1 · 2026-06-11T21:49:19Z

Merged into the steering PR #1634, which now holds the generate CLI alongside the shared sae.steering primitive + harness, and unifies all clamping (engine/CLI/dashboard/harness) onto one implementation (sae.steering, B).

copy-pr-bot · 2026-06-11T22:00:02Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

polinabinder1 requested review from jstjohn, jwilber, pstjohn, savitha-eng and trvachov as code owners June 11, 2026 20:01

polinabinder1 marked this pull request as draft June 11, 2026 21:18

polinabinder1 closed this Jun 11, 2026

polinabinder1 mentioned this pull request Jun 11, 2026

evo2 SAE steering: one clamp (sae.steering) for engine, CLI + harness #1634

Closed

polinabinder1 wants to merge 1 commit into pbinder/evo2-sae-serve from pbinder/evo2-sae-cli-generate
