Realtime AI News - Bilingual Model, Agent, and Tool Updates

Jun 25, 2026, 04:00 AM UTC | LLM Agent, Memory, Trustworthiness, arXiv

TRUSTMEM: Learning Trustworthy Memory Consolidation for LLM Agents with Long-Term Memory

A new arXiv paper introduces the TRUSTMEM framework to address error accumulation and persistent hallucinations in the long-term memory of LLM agents, which arise when that memory is maintained through generated write, revise, and delete operations.

Jun 25, 2026, 04:00 AM UTC | LLM, Multi-Agent, Education, Financial Literacy

Agentic Knowledge Tracing: A Multi-Agent LLM Architecture for Stealth Assessment of Financial Literacy in Serious Games

Researchers propose the Agentic BKT pipeline, a multi-agent LLM architecture that stealthily assesses financial competencies from open-ended gameplay events without disrupting the learning experience.

Jun 25, 2026, 04:00 AM UTC | AI Research, Reinforcement Learning, Energy

Supervised Reinforcement Learning Tackles Distributed Energy Resource Coordination

Researchers propose a supervised reinforcement learning approach for coordinating distributed energy resources (DERs), achieving more efficient energy management under the uncertainty and complexity that challenge traditional optimization methods.

Jun 25, 2026, 04:00 AM UTC | AI Research, Edge AI, Neural Architecture Search

On-Device Neural Architecture Search Enables Edge Devices to Design Their Own Networks

Researchers propose a novel approach that performs lightweight neural architecture search directly on deployment devices, allowing sensor edge devices to redesign tiny neural networks optimized for real-time data.

Jun 25, 2026, 04:00 AM UTC | AI Research, Finance, Benchmark

MacroLens Benchmark Released: Multi-Task Financial Reasoning Under Macroeconomic Scenarios

Researchers release MacroLens, a multi-task benchmark designed for contextual financial reasoning under macroeconomic scenarios, addressing key challenges like data leakage and reporting lags in time-series evaluation.

Jun 25, 2026, 04:00 AM UTC | AI Research, Language Models, Interpretability

Study Reveals 'Readout Blind Spot' in Looped Language Models: Dense Supervision Misses Hidden State Variables

A new study shows that dense per-loop cross-entropy loss in looped language models only controls variables exposed by the readout, not all hidden-state variables active in the recurrent transition, creating a systematic supervision blind spot.

Jun 25, 2026, 04:00 AM UTC | AI Research, AI for Science, Quantum Computing

Human-AI Collaboration Discovers Quantum Algorithms: From Vague Intuition to Mathematical Discovery

A new paper documents how human-AI co-discovery transformed a vague research intuition into concrete sign-embedding quantum algorithms for matrix equations and matrix functions, showing a new paradigm for AI-assisted mathematics.

Jun 25, 2026, 04:00 AM UTC | arXiv, Agents, Evaluation

AgentOdyssey: A New Framework for Evaluating Test-Time Continual Learning in AI Agents

AgentOdyssey procedurally generates open-ended text games to benchmark agents on exploration, knowledge acquisition, memory retention, and long-horizon planning.

Jun 25, 2026, 02:00 AM UTC | OpenAI, Agent, Research

How agents are transforming work

OpenAI publishes a new research paper examining how AI agents are transforming work by handling longer, more complex tasks and expanding productivity across roles.

Jun 25, 2026, 12:00 AM UTC | Research, ARC, Reasoning

DiARC Paper: Distinguishing Positive and Negative Samples Improves LLM Reasoning on ARC Tasks

A new arXiv paper introduces DiARC, a method that improves large language models' performance on the Abstraction and Reasoning Corpus (ARC) by distinguishing positive and negative samples.

Jun 24, 2026, 10:41 PM UTC | Cerebras, Stock, AI Chips, Earnings

Cerebras Stock Plunges After First Earnings Since IPO as CEO Says Margin Outlook Misunderstood

AI chipmaker Cerebras saw its stock plummet after its first earnings report since going public, with a narrower gross margin forecast spooking investors.

Jun 24, 2026, 08:09 PM UTC | Enterprise AI, Industry

Companies scramble to stop employees from burning through AI budgets with small tasks

TechCrunch reports that companies are rushing to stop employees from exhausting AI budgets on low-value small tasks, marking a shift from the 'tokenmaxxing' era to an era of 'token rationing'.

Jun 24, 2026, 04:51 PM UTC | Google, Research, Reasoning, LLM

Thinking to recall: How reasoning unlocks parametric knowledge in LLMs

Google Research explores how the reasoning process activates and retrieves parametric knowledge stored within large language models.

Daily Briefs

2026-06-25 | Daily AI Brief - June 25, 2026: Agent Systems Breakthrough, Quantization Inflation Hidden Cost, Brain-Computer Interface Multi-Agent Framework