Technology > AI (Artificial Intelligence)3/14/2024 2:30 PM
Long-term memory for LLMs is an unsolved problem, and doing naive retrieval from a vector database doesn’t work. The recent iteration of MemGPT (Packer et al.) takes a big step in this direction. Taking the LLM as an OS analog, the authors propose “virtual context management” to manage both memory in-context window and in external storage. Recent advances in function calling allow these agents to read and write from these data sources, and modify their own context. We'll do both a presentation on the paper as well as a Q&A session.