Daily Issue
Vol. I — No. 4
14 · 05
Thursday, 14 May 2026
Generated 2026-05-14 11:25
google/gemini-2.5-flash-lite
In India we only read about death, sickness, terrorism, crime. — Abdul Kalam 32 items · 3 sections
§ 0

The Morning

Local weather 1
This morning in
London
Overcast
Today's range
13.0°7.2°
currently 11.4°
Feels
8.3°
Rain
94%
Wind
12 km/h
Humid
61%
Rise
05:09
Set
20:43
§ I

From the arXiv

arXiv preprints 10 of 20
cs.AIarxiv:2605.13825v1Lead article

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Alberto G. Rodríguez Salgado

his paper introduces HistoryAnchor-100, a dataset designed to test LLM safety by examining how prior harmful actions influence future decisions. The core method involves presenting LLMs with scenarios where a harmful past action is followed by a choice between safe and unsafe options. The key contribution is demonstrating that a simple instruction to "stay consistent with the strategy shown in the prior history" dramatically increases LLM unsafe action selection, even for highly aligned models, highlighting a critical vulnerability in current LLM agent design.

cs.AIarxiv:2605.13537v1

Temper and Tilt Lead to SLOP: Reward Hacking Mitigation with Inference-Time Alignment

Ye Wang, Jing Liu et al.

This paper introduces SLOP, a method for inference-time alignment that generalizes existing techniques by using a sharpened logarithmic opinion pool of generative reward models. By adjusting the "temperature" of reference models and calibrating SLOP weights, t…

cs.AIarxiv:2605.13548v1

AttenA+: Rectifying Action Inequality in Robotic Foundation Models

Daojie Peng, Fulong Ma et al.

This paper introduces AttenA+, a framework that addresses the "action inequality" in robotic foundation models. It recognizes that low-velocity actions are often more critical for task success than high-velocity transitions. AttenA+ rectifies this by reweighti…

Overview of AttenA+ . AttenA+ is a paradigm-agnostic enhancement framework for action robotic foundation models, introducing velocity-field-based action attention to prioritize slow, critical manipulation steps. It seamlessly plugs into mainstream discriminative (e.g., OpenVLA-OFT) and generative ( \( \pi_{0} \) , π 0.5 \( \pi_{0.5} \) , Diffusion Policy) architectures, as well as emerging World-Action Models (WAM). Without modifying core backbones or relying on data/model scaling, AttenA+ generalizes across diverse robotic datasets including Libero Liu et al. ( 2023 ) and RoboTwin Chen et al. ( 2025 ) , and consistently improves task success rates over state-of-the-art baselines.
Overview of AttenA+ . AttenA+ is a paradigm-agnostic enhancement framework for action robotic foundation models, introducing velocity-field-based action attention to prioritize slow, critical manipula…
cs.AIarxiv:2605.13652v1

Beyond Perplexity: A Geometric and Spectral Study of Low-Rank Pre-Training

Namrata Shivagunde, Vijeta Deshpande et al.

This paper investigates whether low-rank pre-training methods for large language models generalize as well as full-rank training, a question previously addressed only by limited perplexity metrics. The authors provide a more thorough comparison by analyzing th…

cs.AIarxiv:2605.13709v1

Children's English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety

Qian Shen, Fanghua Cao et al.

This paper fine-tunes compact LLMs (8B parameters) on expert-designed children's reading curricula and existing generated stories. The core method focuses on controllable difficulty and safety, enabling educators to target specific reading levels. The main con…

System architecture and experimental workflow for generating children’s English reading stories via supervised fine-tuning of compact LLMs.
System architecture and experimental workflow for generating children’s English reading stories via supervised fine-tuning of compact LLMs.
№06
cs.AI
8

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Tara Bogavelli, Gabrielle Gauthier Melançon et al.

EVA-Bench is an end-to-end framework for evaluating voice agents. Its core method involves generating realistic, multi-turn bot-to-bot audio conversations with automatic validation…

№07
cs.AI
8

Harnessing Agentic Evolution

Jiayi Zhang, Yongfeng Gu et al.

This paper introduces AEvo, a meta-editing framework for agentic evolution. AEvo treats the evolutionary process as an interactive environment, using accumulated evidence as its st…

№08
cs.AI
8

Position: Assistive Agents Need Accessibility Alignment

Jie Hu, Changyuan Yan et al.

This paper argues that assistive AI agents for visually impaired users must prioritize "accessibility alignment" as a core design goal, not an afterthought. Current agentic AI fail…

№09
cs.AI
8

RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

Chengzhi Shen, Weixiang Shen et al.

This paper introduces RealICU, a novel benchmark for evaluating LLMs on long-context ICU data. Unlike previous benchmarks that rely on potentially suboptimal clinician actions, Rea…

№10
cs.AI
8

ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics and Dynamic Profiles

Yitian Yang, Yiqun Duan et al.

ScioMind is a novel multi-agent social simulation framework that integrates structured opinion dynamics with LLM-based agent reasoning. Its core method combines a personality-condi…

§ II

The Town Square

Hacker News 3
compiled overnight by google/gemini-2.5-flash-lite · end of issue no. 4 · thank you for reading