Search papers, labs, and topics across Lattice.
This paper outlines the evolution of Large Language Models (LLMs) from simple conversational agents to sophisticated autonomous systems, termed Digital Colleagues, capable of reasoning and self-improvement. It identifies two key dimensions of this transformation: the cognitive core, which is shifting from fast-thinking, next-token prediction to more deliberate reasoning through advanced techniques like Chain-of-Thought and reinforcement learning, and the task execution level, which is moving towards persistent, skill-based workspaces that enhance tool use and governance. The findings indicate that this new "Workspace + Skill" paradigm significantly improves the reliability and effectiveness of LLMs in complex tasks, moving beyond traditional instruction-response interactions.
LLMs are evolving from reactive chatbots to proactive digital colleagues, fundamentally changing how AI can assist in complex tasks.
Large Language Models (LLMs) are undergoing a fundamental transformation from conversational generators into integrated AI systems capable of reasoning, action, memory, and self-improvement. We conceptualize this transition as a shift from Chatbot to Digital Colleague: from conversational answers to persistent work. We organize this transition along two tightly coupled dimensions. First, at the cognitive core level, LLMs are advancing from Chatbot-era "fast thinking" systems driven by next-token prediction toward Thinking LLMs that leverage inference-time computation, Chain-of-Thought reasoning, reflection, process supervision, and reinforcement learning to support more deliberate and reliable cognition. Second, at the tool-augmented task execution level, LLMs are progressing from tool-calling Agents that invoke external resources in an ad hoc manner toward OpenClaw-style workstation systems (OpenClaw) equipped with persistent Workspaces, skills, verification loops, and governance. The "Workspace + Skill" paradigm makes episodic tool use colleague-like via state persistence, reusable procedures, task closure, and experience reuse. We examine data construction shifts from instruction-response pairs to State-Action-Observation trajectories and evaluation from static benchmarks to sandboxed, auditable, self-evolving AI ecosystems.