Jianjie Zheng
2026
Evaluating Memory Capability in Continuous Lifelog Scenario
Jianjie Zheng | Zhichen Liu | Zhanyu Shen | Jingxiang Qu | Guanhua Chen | Yile Wang | Yang Xu | Yang Liu | Sijie Cheng
Findings of the Association for Computational Linguistics: ACL 2026
Jianjie Zheng | Zhichen Liu | Zhanyu Shen | Jingxiang Qu | Guanhua Chen | Yile Wang | Yang Xu | Yang Liu | Sijie Cheng
Findings of the Association for Computational Linguistics: ACL 2026
Nowadays, wearable devices can continuously lifelog ambient conversations, creating substantial opportunities for memory systems. However, existing benchmarks primarily focus on online one-on-one chatting or human-AI interactions, thus neglecting the unique demands of real-world scenarios. Given the scarcity of public lifelogging audio datasets, we propose a hierarchical synthesis framework to curate LifelogBench, a novel benchmark comprising two complementary subsets: EgoMem, built on real-world egocentric videos, and LifeMem, constructed using simulated virtual community. Crucially, to address the issue of temporal leakage in traditional offline settings, we propose an Online Evaluation protocol that strictly adheres to temporal causality, ensuring systems are evaluated in a realistic streaming fashion. Our experimental results reveal a counterintuitive finding: current sophisticated memory systems fail to outperform a simple RAG-based baseline. This highlights the detrimental impact of over-designed structures and lossy compression in current approaches, emphasizing the necessity of high-fidelity context preservation for lifelog scenarios.
GIFT: Guided Fine-Tuning and Transfer for Enhancing Instruction-Tuned Language Models
Zhiwen Ruan | Yichao Du | Jianjie Zheng | Longyue Wang | Yun Chen | Peng Li | Jinsong Su | Yang Liu | Guanhua Chen
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Zhiwen Ruan | Yichao Du | Jianjie Zheng | Longyue Wang | Yun Chen | Peng Li | Jinsong Su | Yang Liu | Guanhua Chen
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Instruction-tuned large language models (LLMs) exhibit strong instruction-following and generalization abilities, enabled by expensive post-training pipelines. However, adapting them to specific downstream tasks remains challenging: direct fine-tuning often disrupts this delicate balance, while existing adapter-based transfer methods typically treat the instruction-tuned model as a passive target that only participates at the final merging stage. We propose GIFT (Guided Fine-Tuning and Transfer), a simple and efficient framework that incorporates instruction-level guidance into task adaptation. GIFT fine-tunes a low-rank adapter on the pretrained base model using token-level confidence signals derived from the instruction-tuned model. The learned adapter is then merged into the instruction-tuned model, yielding task-specialized models that preserve general instruction-following behavior. We evaluate GIFT on mathematical reasoning and knowledge-intensive benchmarks across multiple model families and scales. Results show that GIFT consistently outperforms direct fine-tuning and representative transfer-based baselines, while maintaining robust generalization and favorable test-time scaling behavior.