RAMP: Hybrid DRL for Online Learning of Numeric Action Models

Original Source

ArXiv AI (cs.AI)

by Yarin Benyamin, Argaman Mordoch, Shahaf S. Shperberg, Roni Stern

Read Full Article

arXiv:2604.08685v1 Announce Type: new Abstract: Automated planning algorithms require an action model specifying the preconditions and effects of each action, but obtaining such a model is often hard. Learning action models from observations is feasible, but existing algorithms for numeric domains are offline, requiring expert traces as input. We propose the Reinforcement learning, Action Model learning, and Planning (RAMP) strategy for learning numeric planning action models online via interactions with the environment. RAMP simultaneously trains a Deep Reinforcement Learning (DRL) policy, learns a numeric action model from past interactions, and uses that model to plan future actions when possible. These components form a positive feedback loop: the RL policy gathers data to refine the action model, while the planner generates plans to continue training the RL policy. To facilitate this integration of RL and numeric planning, we developed Numeric PDDLGym, an automated framework for converting numeric planning problems to Gym environments. Experimental results on standard IPC numeric domains show that RAMP significantly outperforms PPO, a well-known DRL algorithm, in terms of solvability and plan quality.

Tags:AI

Original Content Credit

This summary is sourced from ArXiv AI (cs.AI). For the complete article with full details, research data, and author insights, please visit the original source.

Visit ArXiv AI (cs.AI)

Stellantis Ramps Up AI Strategy With Microsoft Deal

AI Business

Industry News1m

Stellantis Ramps Up AI Strategy With Microsoft Deal

One of the world's biggest automakers is plunging full-on into AI.

Apr 16, 2026

IEEE Entrepreneurship Connects Hardware Startups With Investors

IEEE Spectrum AI

Business AI1m

IEEE Entrepreneurship Connects Hardware Startups With Investors

Roughly 90 percent of hard tech startups fail due to funding constraints, longer R&D timelines for developing hardware, and the complexity of manufacturing their products, according to a number of studies. Generally, these startups require up to 50 percent more investor financing

Apr 16, 2026

Wired AI

Industry News1m

The Battle for OpenAI’s Soul

In Musk v. Altman, a jury will soon determine whether OpenAI has strayed from its founding mission to ensure AGI benefits humanity. Here’s what to know.

Apr 16, 2026

RAMP: Hybrid DRL for Online Learning of Numeric Action Models

Related Articles

Stellantis Ramps Up AI Strategy With Microsoft Deal

IEEE Entrepreneurship Connects Hardware Startups With Investors

The Battle for OpenAI’s Soul