Feat/memory update by alcholiclg · Pull Request #912 · modelscope/ms-agent

alcholiclg · 2026-05-19T16:29:28Z

Change Summary

refractor memory.
support for different memory backend: mem0, mempalace, byterover, supermemory, reme, file-based(ours).
refractor session log and context compression pipeline.

Related issue number

Checklist

The pull request title is a good summary of the changes - it will be used in the changelog
Unit tests for the changes exist
Run pre-commit install and pre-commit run --all-files before git commit, and passed lint check.
Documentation reflects the changes where applicable

gemini-code-assist

Code Review

This pull request introduces a new unified memory architecture, featuring a pluggable backend system, a robust session logging mechanism, and a non-destructive context assembly pipeline. The reviewer identified several critical performance and functional issues, including the use of blocking synchronous calls (subprocess.run and time.sleep) within asynchronous methods, inefficient object instantiation inside loops, missing fields during message restoration that could break tool-use interactions, and a performance bottleneck in the session log metadata update logic. All comments provide actionable feedback to improve the system's scalability and correctness.

gemini-code-assist · 2026-05-19T16:32:32Z

+                    messages = [_Msg(
+                        role=m.get('role', 'user'),
+                        content=m.get('content', ''),
+                        tool_calls=m.get('tool_calls'),
+                    ) for m in restored]


When restoring messages from the session log, the tool_call_id and name fields are omitted. This will break the conversation history for tool-use interactions, as many LLM providers require these fields to correctly associate tool outputs with their corresponding calls. These fields should be included in the restoration logic, similar to how they are handled in _dicts_to_messages in the ContextAssembler.

messages = [_Msg( role=m.get('role', 'user'), content=m.get('content', ''), tool_calls=m.get('tool_calls'), tool_call_id=m.get('tool_call_id'), name=m.get('name'), ) for m in restored]

gemini-code-assist · 2026-05-19T16:32:33Z

+        result = subprocess.run(
+            cmd, capture_output=True, text=True,
+            timeout=timeout, cwd=cwd, env=env,
+        )


subprocess.run is a synchronous, blocking call. When executed within an asynchronous context (such as the inject or search methods of this backend), it will block the entire event loop until the CLI command completes. This can cause significant latency and prevent other concurrent tasks from running. Consider using asyncio.create_subprocess_exec or wrapping the call in asyncio.to_thread to avoid blocking the loop.

gemini-code-assist · 2026-05-19T16:32:33Z

+
+        try:
+            fts = FTSRetriever(self._config)
+            results = await fts.search(query, limit=5)


Instantiating FTSRetriever on every call to _inject_fts_context is inefficient. The FTSRetriever constructor performs a SQLite connection and runs schema initialization scripts (_ensure_schema). Since inject is called in every iteration of the agent loop, this will significantly degrade performance. It is better to initialize the retriever once (e.g., in the backend's start method or lazily as a cached instance attribute) and reuse it.

gemini-code-assist · 2026-05-19T16:32:33Z

+            if isinstance(results, dict):
+                if self._is_transient_error(results):
+                    time.sleep(1)
+                    results = search_memories(


time.sleep(1) is a blocking call that will freeze the entire event loop in an asynchronous application. This prevents other concurrent tasks from making progress. Use await asyncio.sleep(1) instead (this will require importing asyncio).

await asyncio.sleep(1)

gemini-code-assist · 2026-05-19T16:32:33Z

+        """Rewrite the first line (metadata header) of the JSONL file."""
+        if not self._path.exists():
+            self._ensure_metadata()
+            return
+        lines = self._path.read_text(encoding="utf-8").splitlines()
+        meta_line = json.dumps(
+            {**meta, "_type": "metadata"}, ensure_ascii=False
+        )
+        if lines and lines[0].strip():
+            try:
+                first = json.loads(lines[0])
+                if first.get("_type") == "metadata":
+                    lines[0] = meta_line
+                else:
+                    lines.insert(0, meta_line)
+            except json.JSONDecodeError:
+                lines.insert(0, meta_line)
+        else:
+            lines.insert(0, meta_line)
+        self._path.write_text("\n".join(lines) + "\n", encoding="utf-8")
+        self._metadata = meta


The _rewrite_metadata method reads the entire session log file into memory, modifies the first line, and writes the whole content back to disk. This results in O(N) complexity for both memory and I/O, where N is the number of messages in the session. As the conversation history grows, this will become a major performance bottleneck. Consider storing mutable session metadata (like last_consolidated) in a separate sidecar file (e.g., {session_key}.meta) to keep the main log file strictly append-only.

suluyana · 2026-06-26T08:18:26Z

+            orchestrator.set_llm(self.llm)
+            orchestrator.init_update_queue()
+        if self.session_log is not None and hasattr(orchestrator, '_session_log'):
+            orchestrator._session_log = self.session_log


llm_agent.py
Lines 1296-1299
await self.load_memory()
...
self._init_session_log()

load_memory先于init_session_log被调用，因此load_memory时self.session_log里的值是否未被初始化

suluyana · 2026-06-26T08:26:57Z

+    def last_consolidated(self, value: int) -> None:
+        meta = self._load_metadata()
+        meta["last_consolidated"] = value
+        self._rewrite_metadata(meta)


_rewrite_metadata 是全文件读写。长session下，如果高频，可以再优化一下

suluyana · 2026-06-26T08:44:04Z

        """
        for memory_tool in self.memory_tools:
            messages = await memory_tool.run(messages)
        return messages


inject_memory 与 condense_memory 实现相同

suluyana · 2026-06-26T08:48:06Z


            if self.runtime.round == 0:
                # New task: create standardized messages first
                messages = await self.create_messages(messages)


_init_session_log 中并不会更新round，_init_session_log后round还是0，仍然会走create_messages，这是符合预期的吗

suluyana · 2026-06-26T08:49:08Z

+                        if self.session_log is not None:
+                            self.session_log.append(
+                                self._msg_to_dict(cutoff_msg))
+                        self.save_history(messages)


save_history和session_log是否有功能重叠，能否直接合并？

suluyana · 2026-06-26T09:33:38Z

+            return
+
+        session_dir = getattr(
+            session_cfg, 'dir', None


yaml的配置示例文档缺乏。

suluyana · 2026-06-26T09:33:52Z

+        session_key = getattr(session_cfg, 'session_key', None) if session_cfg else None
+        self.session_log = SessionLog(session_dir, session_key=session_key)
+
+        compaction_cfg = getattr(self.config, 'compaction', None)


yaml的配置示例文档缺乏。

suluyana · 2026-06-26T09:35:08Z

+    show("backend_options", cfg.backend_options)
+
+    step("Config with backend_options...")
+    cfg2 = MemoryConfig(


alcholiclg added 2 commits May 12, 2026 18:27

add .cursor to gitignore

9c338a4

refactor memory and session log

2583687

alcholiclg had a problem deploying to testci May 19, 2026 16:29 — with GitHub Actions Error

gemini-code-assist Bot reviewed May 19, 2026

View reviewed changes

suluyana reviewed Jun 26, 2026

View reviewed changes

Uh oh!

Conversation

alcholiclg commented May 19, 2026

Change Summary

Related issue number

Checklist

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants