fix: stop session summaries polluting LanceDB tools #486
AliceLJY wants to merge 1 commit into fix/subagent-wrapper-continuation-lines
Force-pushed from a20ad65 to a66a8bc.
Cleaned up the branch and force-pushed to keep this PR scoped to #485 only. Removed one unrelated commit so the diff now contains just the session-summary filtering fix and its regression coverage. Re-ran the targeted checks:
Force-pushed from a66a8bc to 4f77407.
Force-pushed from 4f77407 to c61a9e0.
AliceLJY
left a comment
Self-review passed. Summary:
Write path removed: deleted storeSystemSessionSummary + complex before_reset handler (-108 lines). Replaced with noop logger.
Read-time filtering for legacy data:
- isSessionSummaryEntry(): metadata type check
- filterSessionSummaryResults(): post-filter for recall
- listVisibleEntries(): batch over-fetch + re-page so hidden rows don't consume visible offsets
Recall over-fetch buffer: min(includeFullText ? 40 : 24, max(safeLimit*4, safeLimit+8)) → .slice(0, safeLimit)
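For readers who want the buffer formula spelled out, here is a minimal sketch. Only the arithmetic comes from the comment above; the helper names `overFetchLimit` and `trimToLimit` are hypothetical and do not appear in the PR.

```typescript
// Hypothetical helper: computes how many rows to request from the retriever
// so that filtering out session-summary rows still leaves enough results.
// Formula from the PR comment:
//   min(includeFullText ? 40 : 24, max(safeLimit * 4, safeLimit + 8))
function overFetchLimit(safeLimit: number, includeFullText: boolean): number {
  const cap = includeFullText ? 40 : 24;
  return Math.min(cap, Math.max(safeLimit * 4, safeLimit + 8));
}

// Hypothetical helper: after filtering, trim back to what the caller asked for,
// mirroring the `.slice(0, safeLimit)` step in the comment.
function trimToLimit<T>(rows: T[], safeLimit: number): T[] {
  return rows.slice(0, safeLimit);
}
```

With `safeLimit = 3` and no full text this requests 12 rows; with `safeLimit = 10` it hits the cap of 24 (or 40 with full text). Note that the cap can fall below `safeLimit` if `safeLimit` itself is larger than the cap, so this sketch assumes `safeLimit` has already been clamped upstream.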
Tests: recall filtering, list pagination with interleaved session-summary rows, offset handling.
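The read-time filtering and re-paging covered by these tests can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in the PR: the `MemoryEntry` shape, the `metadata.type === "session-summary"` check, the synchronous `FetchPage` callback, and the `batchSize` parameter are all hypothetical; only the three function names come from the self-review.

```typescript
// Hypothetical row shape; the real store schema is not shown in this thread.
interface MemoryEntry {
  id: string;
  text: string;
  metadata?: { type?: string };
}

// Assumption: legacy summary rows are tagged with metadata.type === "session-summary".
function isSessionSummaryEntry(entry: MemoryEntry): boolean {
  return entry.metadata?.type === "session-summary";
}

// Post-filter for recall results: drop hidden rows, keep order.
function filterSessionSummaryResults(entries: MemoryEntry[]): MemoryEntry[] {
  return entries.filter((entry) => !isSessionSummaryEntry(entry));
}

// Stand-in for the raw store query (offset/limit over ALL rows, hidden or not).
type FetchPage = (offset: number, limit: number) => MemoryEntry[];

// Reads raw rows in batches and applies offset/limit to visible rows only,
// so hidden session-summary rows never consume a visible offset slot.
function listVisibleEntries(
  fetchPage: FetchPage,
  offset: number,
  limit: number,
  batchSize = 50,
): MemoryEntry[] {
  const visible: MemoryEntry[] = [];
  let skipped = 0;
  let rawOffset = 0;
  while (visible.length < limit) {
    const batch = fetchPage(rawOffset, batchSize);
    if (batch.length === 0) break; // store exhausted
    rawOffset += batch.length;
    for (const entry of batch) {
      if (isSessionSummaryEntry(entry)) continue; // hidden row
      if (skipped < offset) {
        skipped++; // visible row that belongs to an earlier page
        continue;
      }
      visible.push(entry);
      if (visible.length === limit) break;
    }
  }
  return visible;
}
```

The point of counting `skipped` against visible rows only is that page 2 of a listing starts where page 1 ended from the user's point of view, regardless of how many legacy summary rows sit in between.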
Fixes #485. mergeStateStatus: CLEAN.
@rwmjhb ready for your review.
Review: fix: stop session summaries polluting LanceDB tools
Good problem: session summaries shouldn't appear in
Must Fix
1. The PR removes session-memory entries from tool results, but also appears to disable the
Should Fix
Also
Thanks for the detailed review! All points addressed:
Must Fix:
Should Fix (over-fetch vs retriever cap): Good catch. The retriever clamps to
Should Fix:
Also: Rebased onto current
rwmjhb
left a comment
Review: fix: stop session summaries polluting LanceDB tools
session-summary rows polluting LanceDB bloat the database and degrade recall result quality, so this is a high-value fix.
Must Fix
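- systemSessionMemory is silently disabled: the current implementation turns systemSessionMemory into a no-op, which breaks legacy and plugin-only configurations that depend on this strategy. The feature should be kept and its write path fixed, rather than disabled outright.
- Build failure: stale_base=true; a rebase should resolve it.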
Notes
- The PR comment mentions retargeting to #495. Is the base branch correct?
- There is no cleanup plan for data that is already polluted; consider adding a migration in a follow-up PR.
Summary
- systemSessionMemory from writing session-summary rows into the main LanceDB store
- session-summary rows from memory_recall and memory_list
- /new handling and user-facing tool filtering
Testing
- node --test test/session-summary-before-reset.test.mjs
- node --test test/recall-text-cleanup.test.mjs
- node test/plugin-manifest-regression.mjs
Fixes #485.