Learning for nondeterministic models can take advantage of most of the techniques developed for probabilistic models (Chapter 10). Indeed, in reinforcement learning (RL) the probabilities of action transitions are not needed, so RL techniques can be applied to nondeterministic models too. For instance, we can use the algorithms for Q-learning, parametric Q-learning, and deep Q-learning. However, these algorithms do not provide explicit descriptive models of actions. In this chapter, we therefore discuss some intuitions about, and some challenges in, extending the techniques for learning deterministic action specifications to nondeterministic models. Note, however, that learning lifted action schemas for nondeterministic models is still an open problem.
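To make concrete why transition probabilities are not needed, here is a minimal tabular Q-learning sketch, assuming a hypothetical environment interface with reset, step, and actions methods; it learns only from sampled outcomes of nondeterministic actions and never references a probability distribution.

```python
import random
from collections import defaultdict

def q_learning(env, episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Tabular Q-learning: only sampled transitions are needed,
    never the transition probabilities themselves."""
    Q = defaultdict(float)  # Q[(state, action)] -> estimated value
    for _ in range(episodes):
        s, done = env.reset(), False
        while not done:
            # epsilon-greedy choice among the applicable actions
            if random.random() < epsilon:
                a = random.choice(env.actions(s))
            else:
                a = max(env.actions(s), key=lambda x: Q[(s, x)])
            s2, r, done = env.step(a)  # nondeterministic outcome, sampled
            best_next = 0.0 if done else max(Q[(s2, a2)] for a2 in env.actions(s2))
            # temporal-difference update toward r + gamma * max_a' Q(s2, a')
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
    return Q
```

The result is a table of state-action values rather than an explicit descriptive model of the actions, which is precisely the limitation noted above.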
Temporal models are quite rich, allowing concurrency and temporal constraints to be handled. However, developing temporal models is a bottleneck that machine learning techniques can help ease. In this chapter, we first briefly address the problem of learning heuristics for temporal planning (Section 19.1). We then consider the issue of learning durative action schemas and temporal methods (Section 19.2). The chapter outlines the proposed approaches, based on techniques seen earlier in the book, without going into detailed descriptions of the corresponding procedures.
This chapter addresses the issues of acting with temporal models. It presents methods for handling dynamic controllability (Section 18.1), dispatching (Section 18.2), and execution and refinement of a temporal plan (Section 18.3). It proposes methods for acting with a reactive temporal refinement engine (Section 18.4), planning with Monte Carlo rollouts (Section 18.5), and integrating planning and acting (Section 18.6).
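As a toy illustration of the temporal reasoning that dispatching builds on (not the dynamic-controllability procedures of this chapter), the following sketch checks the consistency of a simple temporal network by running Floyd–Warshall on its distance graph; the data representation is an assumption made for the example.

```python
import math

def stn_consistent(num_points, constraints):
    """Check consistency of a Simple Temporal Network (STN).

    constraints: list of (i, j, lo, hi) meaning lo <= t_j - t_i <= hi.
    Returns the all-pairs shortest-path matrix of the distance graph,
    or None if the network has a negative cycle (inconsistent)."""
    # distance graph: edge i->j with weight hi, edge j->i with weight -lo
    d = [[0.0 if i == j else math.inf for j in range(num_points)]
         for i in range(num_points)]
    for i, j, lo, hi in constraints:
        d[i][j] = min(d[i][j], hi)
        d[j][i] = min(d[j][i], -lo)
    # Floyd-Warshall shortest paths
    for k in range(num_points):
        for i in range(num_points):
            for j in range(num_points):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    # a negative diagonal entry signals a negative cycle, i.e., inconsistency
    if any(d[i][i] < 0 for i in range(num_points)):
        return None
    return d
```

A dispatcher can read the earliest and latest admissible execution times of each time point off the resulting minimal network.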
In this chapter we introduce different representations and techniques for acting with nondeterministic models: nondeterministic state transition systems (Section 11.1), automata (Section 11.2), behavior trees (Section 11.3), and Petri nets (Section 11.4).
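For a flavor of one of these representations, here is a minimal behavior-tree sketch, assuming hypothetical Sequence, Fallback, and leaf Action nodes with the usual tick semantics; the example tree at the end is invented for illustration.

```python
SUCCESS, FAILURE, RUNNING = "success", "failure", "running"

class Sequence:
    """Ticks children in order; stops at the first child that does not succeed."""
    def __init__(self, *children): self.children = children
    def tick(self):
        for c in self.children:
            status = c.tick()
            if status != SUCCESS:
                return status
        return SUCCESS

class Fallback:
    """Ticks children in order; stops at the first child that does not fail."""
    def __init__(self, *children): self.children = children
    def tick(self):
        for c in self.children:
            status = c.tick()
            if status != FAILURE:
                return status
        return FAILURE

class Action:
    """Leaf node wrapping a primitive action (here a plain function)."""
    def __init__(self, fn): self.fn = fn
    def tick(self): return self.fn()

# hypothetical example: try to open a door, otherwise ask for help
tree = Fallback(
    Sequence(Action(lambda: SUCCESS),    # door reachable?
             Action(lambda: FAILURE)),   # open door (fails here)
    Action(lambda: SUCCESS))             # ask for help
assert tree.tick() == SUCCESS
```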
In the past, techniques for natural language translation were not very relevant for acting and planning systems. However, with the recent advent of large language models and their various multimodal extensions into foundation models, this is no longer the case. This last part introduces large language models and their potential benefits in acting, planning, and learning. It discusses the perceiving, monitoring, and goal reasoning functions for deliberation.
Learning to act with probabilistic models is the area of reinforcement learning (RL), the topic of this chapter. RL in some ways parallels how natural beings adapt to their environment, relying on feedback mechanisms and extending homeostatic regulation to complex behaviors. With continual learning, an actor can cope with a continually changing environment. This chapter first introduces the main principles of reinforcement learning. It presents a simple Q-learning RL algorithm. It shows how to generalize a learned relation with a parametric representation. It introduces neural network methods, which play a major role in learning and are needed for deep RL (Section 10.5) and policy-based RL (Section 10.6). The issues of aided reinforcement learning with shaped rewards, imitation learning, and inverse reinforcement learning are addressed next. Section 10.8 is about probabilistic planning and RL.
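As a minimal sketch of the parametric representation mentioned above, the following code approximates Q(s, a) as a linear function of a feature vector and adjusts the weights with a semi-gradient temporal-difference update; the feature function phi and the actions callback are assumptions for illustration, not the book's notation.

```python
import numpy as np

def linear_q_update(w, phi, s, a, r, s2, actions,
                    gamma=0.95, alpha=0.01, done=False):
    """One semi-gradient Q-learning update for the linear approximation
    Q(s, a) = w . phi(s, a), where phi maps a state-action pair to a
    numpy feature vector."""
    q_sa = w @ phi(s, a)
    target = r if done else r + gamma * max(w @ phi(s2, a2) for a2 in actions(s2))
    # the gradient of Q(s, a) with respect to w is phi(s, a)
    return w + alpha * (target - q_sa) * phi(s, a)
```

Deep Q-learning replaces the linear form with a neural network and applies the analogous update to the network's weights by backpropagation.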
This part of the book is devoted to acting, planning, and learning with operational models of actions expressed in a hierarchical, task-oriented representation. Operational models are valuable for acting: they allow detailed descriptions of complex actions that handle dynamic environments with exogenous events. The representation relies on hierarchical refinement methods that describe alternative ways to handle tasks and react to events. A method can be any complex algorithm that decomposes a task into subtasks and primitive actions. Subtasks are refined recursively. Actions trigger the execution of sensory-motor procedures in closed loops that query and change the world stochastically.
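As a hypothetical illustration of a refinement method, the sketch below decomposes a fetch task into subtasks and primitive actions; the task, the state fields, and the action names are invented for the example and do not follow the book's formal syntax.

```python
def m_fetch(state, obj):
    """One possible method for the task fetch(obj): search for the object
    if its location is unknown, otherwise go to it and pick it up.
    Returns a list of subtasks/actions, or None if the method does not apply."""
    loc = state.location_of.get(obj)
    if loc is None:
        # refine into a search subtask, then retry the fetch task
        return [("search", obj), ("fetch", obj)]
    if state.robot_at == loc:
        return [("pickup", obj)]               # primitive action
    return [("goto", loc), ("pickup", obj)]    # subtask followed by a primitive action
```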