A Tensor-Based Markov Decision Process Representation

Daniela Kuinchtner; Felipe Meneguzzi; Afonso Sales

doi:10.1007/978-3-030-60884-2_23

A Tensor-Based Markov Decision Process Representation

Daniela Kuinchtner^* (Collaborator), Felipe Meneguzzi^* (Collaborator), Afonso Sales^* (Collaborator)

^*Corresponding author for this work

Pontifícia Universidade Católica do Rio Grande do Sul

Research output: Chapter in Book/Report/Conference proceeding › Chapter

1 Citation (Scopus)

Abstract

A Markov Decision Process (MDP) is a sequential decision problem for a fully observable and stochastic environment. MDPs are widely used to model reinforcement learning problems. Researchers developed multiple solvers with increasing efficiency, each of which requiring fewer computational resources to find solutions for large MDPs. However, few of these solvers leverage advances in tensor processing to further increase solver efficiency, such as Google’s TPUs (https://cloud.google.com/tpu) and TensorFlow (https://www.tensorflow.org/). In this paper, we formalize an MDP problem in terms of Tensor Algebra, by representing transition models of MDPs compactly using tensors as vectors with fewer elements than its total size. Our method aims to facilitate implementation of various efficient MDP solvers reducing computational cost to generate monolithic MDPs.

Original language	English
Title of host publication	Advances in Soft Computing.
Subtitle of host publication	19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Mexico City, Mexico, October 12–17, 2020, Proceedings, Part I
Publisher	Springer
Pages	313–324
Volume	12468
ISBN (Electronic)	978-3-030-60884-2
ISBN (Print)	978-3-030-60883-5
DOIs	https://doi.org/10.1007/978-3-030-60884-2_23
Publication status	Published - 7 Oct 2020

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer International Publisher
Volume	12468
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Keywords

Artificial intelligence
CANDECOMP/PARAFAC decomposition
Compact transition model
Markov Decision Process
Tensor algebra
Tensor decomposition

Access to Document

10.1007/978-3-030-60884-2_23

Cite this

Kuinchtner, D., Meneguzzi, F., & Sales, A. (2020). A Tensor-Based Markov Decision Process Representation. In Advances in Soft Computing. : 19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Mexico City, Mexico, October 12–17, 2020, Proceedings, Part I (Vol. 12468, pp. 313–324). (Lecture Notes in Computer Science; Vol. 12468). Springer . https://doi.org/10.1007/978-3-030-60884-2_23

A Tensor-Based Markov Decision Process Representation. / Kuinchtner, Daniela (Collaborator); Meneguzzi, Felipe (Collaborator); Sales, Afonso (Collaborator).
Advances in Soft Computing. : 19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Mexico City, Mexico, October 12–17, 2020, Proceedings, Part I. Vol. 12468 Springer , 2020. p. 313–324 (Lecture Notes in Computer Science; Vol. 12468).

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Kuinchtner, D, Meneguzzi, F & Sales, A 2020, A Tensor-Based Markov Decision Process Representation. in Advances in Soft Computing. : 19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Mexico City, Mexico, October 12–17, 2020, Proceedings, Part I. vol. 12468, Lecture Notes in Computer Science, vol. 12468, Springer , pp. 313–324. https://doi.org/10.1007/978-3-030-60884-2_23

Kuinchtner D, Meneguzzi F, Sales A. A Tensor-Based Markov Decision Process Representation. In Advances in Soft Computing. : 19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Mexico City, Mexico, October 12–17, 2020, Proceedings, Part I. Vol. 12468. Springer . 2020. p. 313–324. (Lecture Notes in Computer Science). doi: 10.1007/978-3-030-60884-2_23

@inbook{fa656b23dcfe4646b0266b4b7443e7f7,

title = "A Tensor-Based Markov Decision Process Representation",

abstract = "A Markov Decision Process (MDP) is a sequential decision problem for a fully observable and stochastic environment. MDPs are widely used to model reinforcement learning problems. Researchers developed multiple solvers with increasing efficiency, each of which requiring fewer computational resources to find solutions for large MDPs. However, few of these solvers leverage advances in tensor processing to further increase solver efficiency, such as Google{\textquoteright}s TPUs (https://cloud.google.com/tpu) and TensorFlow (https://www.tensorflow.org/). In this paper, we formalize an MDP problem in terms of Tensor Algebra, by representing transition models of MDPs compactly using tensors as vectors with fewer elements than its total size. Our method aims to facilitate implementation of various efficient MDP solvers reducing computational cost to generate monolithic MDPs.",

keywords = "Artificial intelligence, CANDECOMP/PARAFAC decomposition, Compact transition model, Markov Decision Process, Tensor algebra, Tensor decomposition",

author = "Daniela Kuinchtner and Felipe Meneguzzi and Afonso Sales",

year = "2020",

month = oct,

day = "7",

doi = "10.1007/978-3-030-60884-2_23",

language = "English",

isbn = "978-3-030-60883-5",

volume = "12468",

series = "Lecture Notes in Computer Science",

publisher = "Springer ",

pages = "313–324",

booktitle = "Advances in Soft Computing.",

}

TY - CHAP

T1 - A Tensor-Based Markov Decision Process Representation

A2 - Kuinchtner, Daniela

A2 - Meneguzzi, Felipe

A2 - Sales, Afonso

PY - 2020/10/7

Y1 - 2020/10/7

N2 - A Markov Decision Process (MDP) is a sequential decision problem for a fully observable and stochastic environment. MDPs are widely used to model reinforcement learning problems. Researchers developed multiple solvers with increasing efficiency, each of which requiring fewer computational resources to find solutions for large MDPs. However, few of these solvers leverage advances in tensor processing to further increase solver efficiency, such as Google’s TPUs (https://cloud.google.com/tpu) and TensorFlow (https://www.tensorflow.org/). In this paper, we formalize an MDP problem in terms of Tensor Algebra, by representing transition models of MDPs compactly using tensors as vectors with fewer elements than its total size. Our method aims to facilitate implementation of various efficient MDP solvers reducing computational cost to generate monolithic MDPs.

AB - A Markov Decision Process (MDP) is a sequential decision problem for a fully observable and stochastic environment. MDPs are widely used to model reinforcement learning problems. Researchers developed multiple solvers with increasing efficiency, each of which requiring fewer computational resources to find solutions for large MDPs. However, few of these solvers leverage advances in tensor processing to further increase solver efficiency, such as Google’s TPUs (https://cloud.google.com/tpu) and TensorFlow (https://www.tensorflow.org/). In this paper, we formalize an MDP problem in terms of Tensor Algebra, by representing transition models of MDPs compactly using tensors as vectors with fewer elements than its total size. Our method aims to facilitate implementation of various efficient MDP solvers reducing computational cost to generate monolithic MDPs.

KW - Artificial intelligence

KW - CANDECOMP/PARAFAC decomposition

KW - Compact transition model

KW - Markov Decision Process

KW - Tensor algebra

KW - Tensor decomposition

UR - https://doi.org/10.1007/978-3-030-60884-2_23

U2 - 10.1007/978-3-030-60884-2_23

DO - 10.1007/978-3-030-60884-2_23

M3 - Chapter

SN - 978-3-030-60883-5

VL - 12468

T3 - Lecture Notes in Computer Science

SP - 313

EP - 324

BT - Advances in Soft Computing.

PB - Springer

ER -

A Tensor-Based Markov Decision Process Representation

Abstract

Publication series

Keywords

Access to Document

Other files and links

Fingerprint

Cite this