About: We characterize the class of nondeterministic [Formula: see text]-automata that can be used for the analysis of finite Markov decision processes (MDPs). We call these automata ‘good-for-MDPs’ (GFM). We show that GFM automata are closed under classic simulation as well as under more powerful simulation relations that leverage properties of optimal control strategies for MDPs. This closure enables us to exploit state-space reduction techniques, such as those based on direct and delayed simulation, that guarantee simulation equivalence. We demonstrate the promise of GFM automata by defining a new class of automata with favorable properties—they are Büchi automata with low branching degree obtained through a simple construction—and show that going beyond limit-deterministic automata may significantly benefit reinforcement learning.

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: We characterize the class of nondeterministic [Formula: see text]-automata that can be used for the analysis of finite Markov decision processes (MDPs). We call these automata ‘good-for-MDPs’ (GFM). We show that GFM automata are closed under classic simulation as well as under more powerful simulation relations that leverage properties of optimal control strategies for MDPs. This closure enables us to exploit state-space reduction techniques, such as those based on direct and delayed simulation, that guarantee simulation equivalence. We demonstrate the promise of GFM automata by defining a new class of automata with favorable properties—they are Büchi automata with low branching degree obtained through a simple construction—and show that going beyond limit-deterministic automata may significantly benefit reinforcement learning. Goto Sponge NotDistinct Permalink

An Entity of Type : fabio:Abstract, within Data Space : covidontheweb.inria.fr associated with source document(s)

Attributes	Values
type	abstract
value	We characterize the class of nondeterministic [Formula: see text]-automata that can be used for the analysis of finite Markov decision processes (MDPs). We call these automata ‘good-for-MDPs’ (GFM). We show that GFM automata are closed under classic simulation as well as under more powerful simulation relations that leverage properties of optimal control strategies for MDPs. This closure enables us to exploit state-space reduction techniques, such as those based on direct and delayed simulation, that guarantee simulation equivalence. We demonstrate the promise of GFM automata by defining a new class of automata with favorable properties—they are Büchi automata with low branching degree obtained through a simple construction—and show that going beyond limit-deterministic automata may significantly benefit reinforcement learning.
Subject	Reinforcement learning Optimal control Markov models Mathematical optimization Markov processes Models of computation Optimal decisions Automata (computation) Dynamic programming Stochastic control Belief revision Finite automata Model checking
part of	Good-for-MDPs Automata for Probabilistic Analysis and Reinforcement Learning
is abstract of	Good-for-MDPs Automata for Probabilistic Analysis and Reinforcement Learning
is hasSource of	covid:ann/target/8818b5fe2fef36a738445b78ca9b00383c621893 covid:ann/target/1041b9f505c1e8df1ff6e88198f6e0f468910bd6 covid:ann/target/7ad45fb017203dac118539dacdee07550e7f929d covid:ann/target/fad69a0869e8c24763fb658ab96f78b3eeb16e92 covid:ann/target/083c1a855daa619c65f75b463e49d4b7ee196b50 covid:ann/target/63966da6b769de8cd6551d8bf7680eeb7cde1b9e covid:ann/target/6888e759e2066c5e2800e3e0db1e656003ed58ec covid:ann/target/07b92ec885f3cbbb3abb34c4e470135aaea3eb4b covid:ann/target/aa27f26c789098e0507bd00f99fdc7ce08286fb3 covid:ann/target/2ddc0bcbc37d4a53a278c73c2ec8dda756aec265 covid:ann/target/8e16b5bd1e47ebea032a3e5e7c0c4b87eda3013d covid:ann/target/ef688d6f3170530f66ff98883f575f75c9f2bb86 covid:ann/target/136b821b24a15b4cf240532d854670de503e93c1 covid:ann/target/27083064805e39a539b0bd5c535c764fc1249010 covid:ann/target/5a718271733582eb9fbf95d4a173fb8c89d69078 covid:ann/target/6c185be49a595cede8aa0d1e795f2cafa6ea3850 covid:ann/target/743f4d30c553874bd002ccde6f08affea269f1b9 covid:ann/target/e650c29ea0983096142ea93799ce398c3e10629d covid:ann/target/d7b00684d75494432499206145f5c9f312aee334 covid:ann/target/b98a6f8acf3bf30da6ea9fba3b8ff81e905d94c7 covid:ann/target/7f3fe0c7f7032b73a81465bed92a8d83ae3e6013 covid:ann/target/f7da058203fd14193945193c7063426f79b24c1b covid:ann/target/4cb4a096849dda52f05b1de23ede8189997d75c0 covid:ann/target/7414bb40bb3f06e42f130381766f2f0727449431 covid:ann/target/5f41e55100f2d07a6cf87e12b5794ce558933ccc covid:ann/target/cb491ba8cbdacc1d5fc8630d1fe80fe219949434 covid:ann/target/070f9eb3712abc8ec06e49af36aeb0f345bf8b84 covid:ann/target/675dd3c28844c649a86642146cee4f7116f92213 covid:ann/target/73da4e6f465805fece241b104203c16acb7d6447 covid:ann/target/a0f44fb3616fab623acdc788759d095c0526ac61 covid:ann/target/05fa21b9bed29622defb772dc2eeb3b5fb1135a0 covid:ann/target/59253a040cbaeaf23c40516bfbfc1f56460bc9a3 covid:ann/target/f3dd64101fe756bd3a28465875b23cc44eb2cc80 covid:ann/target/dc6ce78c1f8ff867b87f31f7f8f2e73505213f33 covid:ann/target/c0a9aa0f2c264aadb7f4075db58b7672fe991662 covid:ann/target/6be114274440612883e7b3b5dab7fdc60ab77d9e covid:ann/target/43c92f0919013fd59c71e0327bbe38a19db50a01 covid:ann/target/dc60188c3060ae88ad9ebb599ec4631c08780ee4 covid:ann/target/f93e9b30d4b8740fbc768c31524ba896e56e35cd covid:ann/target/132e6c25fd74441751a976ff9b43f775cbaf9987 covid:ann/target/cc6e0aea5b8cfa558f4879d1344063aca5d6ce53 covid:ann/target/07252d5e0d360caeb4e78e3fbd62269620bdd77f covid:ann/target/7c3b70440f66a840fc51207bcf9b66523cdd5480 covid:ann/target/8565f8334dca636c09b770c4fffdd045e52cc2ad covid:ann/target/8d2b20693205efb2f55f12aad7cc7ceee6dbdf30 covid:ann/target/ea05159ba370d356b63ed7425d9b38c76768e336 covid:ann/target/068972a8307d9af17eb783179e887a889cb4c524 covid:ann/target/559c5f84afbf67e7f5a495f097c2194a0677ebe3 covid:ann/target/8e4e4903573c4610c07c8eae7bd1044030624a8f covid:ann/target/b4c30ce161a917888b9c32a7aeffa9c21492c0d9 covid:ann/target/5acd335eeedfa52261ed6bb0205760a435e722fd covid:ann/target/cf227b634602a30a3e25a1b2c7192d1c91dffcaf covid:ann/target/d2019aedd7bd7ad0327fc9b6a0d73cf4fb396867 covid:ann/target/364d057979a3defc5f8df8219b66a8654c11999b

Faceted Search & Find service v1.13.91 as of Mar 24 2020

Alternative Linked Data Documents: Sponger | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software