2026-05-14 — Linnet Daily

In India we only read about death, sickness, terrorism, crime. — Abdul Kalam 32 items · 3 sections

§I arXiv Papers (20) §II Hacker News (3) §III GitHub Trending (9)

§0 Weather §I arXiv Papers §II Hacker News §III GitHub Trending

§ 0

The Morning

Local weather 1

This morning in

London

Overcast

Today's range

13.0°↓7.2°

currently 11.4°

Feels

8.3°

Rain

94%

Wind

12 km/h

Humid

61%

Rise

05:09

Set

20:43

§ I

From the arXiv

arXiv preprints 10 of 20

cs.AIarxiv:2605.13825v1Lead article

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Alberto G. Rodríguez Salgado

his paper introduces HistoryAnchor-100, a dataset designed to test LLM safety by examining how prior harmful actions influence future decisions. The core method involves presenting LLMs with scenarios where a harmful past action is followed by a choice between safe and unsafe options. The key contribution is demonstrating that a simple instruction to "stay consistent with the strategy shown in the prior history" dramatically increases LLM unsafe action selection, even for highly aligned models, highlighting a critical vulnerability in current LLM agent design.

Read abstract →Full PDF

cs.AIarxiv:2605.13537v1

Temper and Tilt Lead to SLOP: Reward Hacking Mitigation with Inference-Time Alignment

Ye Wang, Jing Liu et al.

This paper introduces SLOP, a method for inference-time alignment that generalizes existing techniques by using a sharpened logarithmic opinion pool of generative reward models. By adjusting the "temperature" of reference models and calibrating SLOP weights, t…

abstract pdf

cs.AIarxiv:2605.13548v1

AttenA+: Rectifying Action Inequality in Robotic Foundation Models

Daojie Peng, Fulong Ma et al.

This paper introduces AttenA+, a framework that addresses the "action inequality" in robotic foundation models. It recognizes that low-velocity actions are often more critical for task success than high-velocity transitions. AttenA+ rectifies this by reweighti…

abstract pdf

Overview of AttenA+ . AttenA+ is a paradigm-agnostic enhancement framework for action robotic foundation models, introducing velocity-field-based action attention to prioritize slow, critical manipulation steps. It seamlessly plugs into mainstream discriminative (e.g., OpenVLA-OFT) and generative ( \( \pi_{0} \) , π 0.5 \( \pi_{0.5} \) , Diffusion Policy) architectures, as well as emerging World-Action Models (WAM). Without modifying core backbones or relying on data/model scaling, AttenA+ generalizes across diverse robotic datasets including Libero Liu et al. ( 2023 ) and RoboTwin Chen et al. ( 2025 ) , and consistently improves task success rates over state-of-the-art baselines. — Overview of AttenA+ . AttenA+ is a paradigm-agnostic enhancement framework for action robotic foundation models, introducing velocity-field-based action attention to prioritize slow, critical manipula…

cs.AIarxiv:2605.13652v1

Beyond Perplexity: A Geometric and Spectral Study of Low-Rank Pre-Training

Namrata Shivagunde, Vijeta Deshpande et al.

This paper investigates whether low-rank pre-training methods for large language models generalize as well as full-rank training, a question previously addressed only by limited perplexity metrics. The authors provide a more thorough comparison by analyzing th…

abstract pdf

cs.AIarxiv:2605.13709v1

Children's English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety

Qian Shen, Fanghua Cao et al.

This paper fine-tunes compact LLMs (8B parameters) on expert-designed children's reading curricula and existing generated stories. The core method focuses on controllable difficulty and safety, enabling educators to target specific reading levels. The main con…

abstract pdf

System architecture and experimental workflow for generating children’s English reading stories via supervised fine-tuning of compact LLMs.

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Tara Bogavelli, Gabrielle Gauthier Melançon et al.

EVA-Bench is an end-to-end framework for evaluating voice agents. Its core method involves generating realistic, multi-turn bot-to-bot audio conversations with automatic validation…

abstract pdf

Harnessing Agentic Evolution

Jiayi Zhang, Yongfeng Gu et al.

This paper introduces AEvo, a meta-editing framework for agentic evolution. AEvo treats the evolutionary process as an interactive environment, using accumulated evidence as its st…

abstract pdf

Position: Assistive Agents Need Accessibility Alignment

Jie Hu, Changyuan Yan et al.

This paper argues that assistive AI agents for visually impaired users must prioritize "accessibility alignment" as a core design goal, not an afterthought. Current agentic AI fail…

abstract pdf

RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

Chengzhi Shen, Weixiang Shen et al.

This paper introduces RealICU, a novel benchmark for evaluating LLMs on long-context ICU data. Unlike previous benchmarks that rely on potentially suboptimal clinician actions, Rea…

abstract pdf

ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics and Dynamic Profiles

Yitian Yang, Yiqun Duan et al.

ScioMind is a novel multi-agent social simulation framework that integrates structured opinion dynamics with LLM-based agent reasoning. Its core method combines a personality-condi…

abstract pdf

See all 20 papers →

§ II

The Town Square

Hacker News 3

264

pts

Top story

Tell HN: Dont use Claude Design, lost access to my projects after unsubscribing

A user reported losing access to their projects after unsubscribing from Claude Design, highlighting a potential issue with the service's data retention policy.

13 May discuss on HN →

205

The US is winning the AI race where it matters most: commercialization

avkcode.github.io13 May

169

Meta won't let you block its AI account on Threads

theverge.com13 May