Search results for Engineering

Part VIII - Other Topics and Perspectives
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 535-536
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In the past, techniques for natural language translation were not very relevant for acting and planning systems. However, with the recent advent of large language models and their various multimodal extensions into foundation models, this is no longer the case. This last part introduces large language models and their potential benefits in acting, planning, and learning. It discusses the perceiving, monitoring, and goal reasoning functions for deliberation.

Appendix C - Discrete-Time Martingales
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 593-595
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

List of Algorithms
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 569-572
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Index
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 629-636
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Part I - Stochastic Models and Bayesian Inference
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 9-10
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

10 - Reinforcement Learning
from Part III - Probabilistic Models
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 228-270
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Learning to act with probabilistic models is the area of reinforcement learning (RL), the topic of this chapter. RL in some ways parallels the adaptation mechanisms of natural beings to their environment, relying on feedback mechanisms and extending the homeostasis regulations to complex behaviors. With continual learning, an actor can cope with a continually changing environment.This chapter first introduces the main principles of reinforcement learning. It presents a simple Q-learning RL algorithm. It shows how to generalize a learned relation with a parametric representation. it introduces neural network methods, which play a major in learning and are needed for deep RL (Section 10.5) and policy-based RL (Section 10.6). The issues of aided reinforcement learning with shaped rewards, imitation learning, and inverse reinforcement learning are addressed next. Section 10.8 is about probabilistic planning and RL.

Index
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 610-614
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Appendix D - Markov Processes
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 596-597
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Appendix F - Summary of POMDP Algorithms
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 602-604
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Part V - Hierarchical Refinement Models
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 347-348
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This part of the book is devoted to acting, planning, and learning with operational models of actions expressed with a hierarchical task-oriented representation. Operational models are valuable for acting. They allow for detailed descriptions of complex actions handling dynamic environments with exogenous events. The representation relies on hierarchical refinement methods that describe alternative ways to handle tasks and react to events. A method can be any complex algorithm, decomposing a task into subtasks and primitive actions. Subtasks are refined recursively. Actions trigger the execution of sensory-motor procedures in closed loops that query and change the world stochastically.

6 - Nonparametric Bayesian Inference
from Part I - Stochastic Models and Bayesian Inference
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 114-162
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

7 - Fully Observed Markov Decision Processes
from Part II - POMDPs. Models, Algorithms and Applications
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 165-189
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Design and optimization of MIMO dielectric resonator antenna with high gain and circular polarization features using machine learning algorithms
Swati Anand Dwivedi, Raghavendra Sharma, Vivek Singh Kushwah
Journal:

International Journal of Microwave and Wireless Technologies / Volume 17 / Issue 3 / April 2025

Published online by Cambridge University Press:

05 June 2025, pp. 504-513
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This paper elaborates the design and analysis of cross-aperture-coupled twin port ceramic radiator. Stimulation of alumina ceramic using a cross slot helps to produce circular waves within 7.35–7.8 GHz. The polarization diversity concept helps to improve the separation level by above 25 dB. Loading of double negative unit cell made metasurface (MS) improves the antenna gain over 11.5 dBi within the working spectrum. Machine learning (ML) techniques, i.e. Decision Tree and Random Forest are utilized to predict the |S11|/Axial ratio parameters. Experimental verification/ML prediction and optimized simulated consequences confirm that the structured radiator works efficiently between 7.21 and 8.2 GHz with over 25 dB isolation between the ports. Directive pattern and decent values of (MIMO) parameters make the radiator applicable for the 6G communication system.

21 - Bayesian Inverse Reinforcement Learning
from Part V - Inverse Reinforcement Learning
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 549-569
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

21 - Task and Motion Planning
from Part VII - Motion and Manipulation Models in Robotics
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 485-525
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Task and motion planning (TAMP) problems combine abstract causal relations from preconditions to effects with computational geometry, kinematics, and dynamics. This chapter is about the integration of planning for motion/manipulation with planning for abstract actions. It introduces the main sampling-based algorithms for motion planning. Manipulation planning is subsequently introduced. A few approaches specific to TAMP are then presented.

17 - Temporal Representation and Planning
from Part VI - Temporal Models
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 391-418
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter is about planning approaches with explicit time in the descriptive and operational models of actions, as well as in the models of the expected evolution of the world not caused by the actor. It describes a planning algorithm that handles durative and concurrent activities with respect to a predicted dynamics. Section 17.1 presents a knowledge representation for modeling actions and tasks with temporal variables using temporal refinement methods. Temporal plans and planning problems are defined as chronicles, i.e., collections of assertions and tasks with explicit temporal constraints. A planning algorithm with temporal refinement methods is developed in Section 17.2. The basic techniques for managing temporal and domain constraints are then presented in Section 17.3.

Part IV - Stochastic Gradient Algorithms and Reinforcement Learning
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 423-424
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Analysis and modelling of non-local eddy diffusivity in turbulent channel flow
Fujihiro Hamba
Journal:

Journal of Fluid Mechanics / Volume 1012 / 10 June 2025

Published online by Cambridge University Press:

05 June 2025, A21
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Local eddy viscosity and diffusivity models are widely used to understand and predict turbulent flows. However, the local approximations in space and time are not always valid for actual turbulent flows. Recently, a non-local eddy diffusivity model for turbulent scalar flux was proposed to improve the local model and was validated using direct numerical simulation (DNS) of homogeneous isotropic turbulence with an inhomogeneous mean scalar (Hamba 2022 J. Fluid Mech. 950, A38). The model was modified using the scale-space energy density in preparation for application to inhomogeneous turbulence (Hamba 2023 J. Fluid Mech. 977, A11). In this paper, the model is further improved by incorporating the effects of turbulence anisotropy, inhomogeneity and wall boundaries. The needed inputs from the flow to evaluate the model are the Reynolds stress and the energy dissipation rate. With the improved model, one- and two-dimensional profiles ofthe non-local eddy diffusivity in turbulent channel flow are evaluated and compared with the exact DNS values. The DNS results reveal a contribution to the scalar flux from the mean scalar gradient in a wide upstream region. Additionally, the temporal profile of the non-local eddy diffusivity moves downstream, diffuses anisotropically and is tilted towards the bottom wall. The model reproduces this behaviour of mean flow convection and anisotropic turbulent diffusion well. These results indicate that the non-local eddy diffusivity model is useful for gaining insights into scalar transport in inhomogeneous turbulence.

4 - Learning Deterministic Models
from Part I - Deterministic State-Transition Systems
Malik Ghallab, LAAS-CNRS, Toulouse, Dana Nau, University of Maryland, College Park, Paolo Traverso, Fondazione Bruno Kessler, Trento, Italy
Foreword by Michela Milano, Università degli Studi, Bologna, Italy
Book:

Acting, Planning, and Learning

Published online:

19 May 2025

Print publication:

05 June 2025, pp 71-94
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter is about two key aspects of learning with deterministic models: learning heuristics to speed up the search for a solution plan and the automated synthesis of the model itself. We discuss how to learn heuristics for exploring parts of the search space that are more likely to lead to solutions. We then address the problem of how to learn a deterministic model, with a focus on learning action schemas.

2 - Stochastic State Space Models
from Part I - Stochastic Models and Bayesian Inference
Vikram Krishnamurthy, Cornell University, New York
Book:

Partially Observed Markov Decision Processes

Published online:

16 May 2025

Print publication:

05 June 2025, pp 11-33
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Engineering

Refine search

Refine search

Actions for selected content:

212275 results in Engineering

Part VIII - Other Topics and Perspectives

Summary

Appendix C - Discrete-Time Martingales

List of Algorithms

Index

Part I - Stochastic Models and Bayesian Inference

10 - Reinforcement Learning

Summary

Index

Appendix D - Markov Processes

Appendix F - Summary of POMDP Algorithms

Part V - Hierarchical Refinement Models

Summary

6 - Nonparametric Bayesian Inference

7 - Fully Observed Markov Decision Processes

Design and optimization of MIMO dielectric resonator antenna with high gain and circular polarization features using machine learning algorithms

21 - Bayesian Inverse Reinforcement Learning

21 - Task and Motion Planning

Summary

17 - Temporal Representation and Planning

Summary

Part IV - Stochastic Gradient Algorithms and Reinforcement Learning

Analysis and modelling of non-local eddy diffusivity in turbulent channel flow

4 - Learning Deterministic Models

Summary

2 - Stochastic State Space Models

Engineering

Refine search

Refine search

Actions for selected content:

Save Search

212275 results in Engineering

Summary

Summary

Summary

Summary

Summary

Summary