Matches in SemOpenAlex for { <https://semopenalex.org/work/W3097844894> ?p ?o ?g. }
- W3097844894 abstract "The Markov decision process is the mathematical formalization underlying the modern field of reinforcement learning when transition and reward functions are unknown. We derive a pseudo-Boolean cost function that is equivalent to a K-spin Hamiltonian representation of the discrete, finite, discounted Markov decision process with infinite horizon. This K-spin Hamiltonian furnishes a starting point from which to solve for an optimal policy using heuristic quantum algorithms such as adiabatic quantum annealing and the quantum approximate optimization algorithm on near-term quantum hardware. In arguing that the variational minimization of our Hamiltonian is approximately equivalent to the Bellman optimality condition for a prevalent class of environments we establish an interesting analogy with classical field theory. Along with proof-of-concept calculations to corroborate our formulation by simulated and quantum annealing against classical Q-Learning, we analyze the scaling of physical resources required to solve our Hamiltonian on quantum hardware." @default.
- W3097844894 created "2020-11-09" @default.
- W3097844894 creator A5008431584 @default.
- W3097844894 creator A5035694353 @default.
- W3097844894 creator A5087880949 @default.
- W3097844894 creator A5089047341 @default.
- W3097844894 date "2020-10-30" @default.
- W3097844894 modified "2023-09-26" @default.
- W3097844894 title "K-spin Hamiltonian for quantum-resolvable Markov decision processes" @default.
- W3097844894 cites W1631356911 @default.
- W3097844894 cites W1833245741 @default.
- W3097844894 cites W1983561139 @default.
- W3097844894 cites W1983624953 @default.
- W3097844894 cites W1984357125 @default.
- W3097844894 cites W1986613903 @default.
- W3097844894 cites W1990005421 @default.
- W3097844894 cites W1994287036 @default.
- W3097844894 cites W2001226941 @default.
- W3097844894 cites W2013520638 @default.
- W3097844894 cites W2026659355 @default.
- W3097844894 cites W2032100464 @default.
- W3097844894 cites W2074078071 @default.
- W3097844894 cites W2079905842 @default.
- W3097844894 cites W2102008187 @default.
- W3097844894 cites W2117941808 @default.
- W3097844894 cites W2128988466 @default.
- W3097844894 cites W2146364776 @default.
- W3097844894 cites W2170644461 @default.
- W3097844894 cites W2521267242 @default.
- W3097844894 cites W2580674237 @default.
- W3097844894 cites W2773252567 @default.
- W3097844894 cites W2794731980 @default.
- W3097844894 cites W2901983189 @default.
- W3097844894 cites W2902907165 @default.
- W3097844894 cites W2928404583 @default.
- W3097844894 cites W2964327027 @default.
- W3097844894 cites W2990778383 @default.
- W3097844894 cites W3009578594 @default.
- W3097844894 cites W3098762758 @default.
- W3097844894 cites W3113752344 @default.
- W3097844894 cites W3157585079 @default.
- W3097844894 doi "https://doi.org/10.1007/s42484-020-00026-6" @default.
- W3097844894 hasPublicationYear "2020" @default.
- W3097844894 type Work @default.
- W3097844894 sameAs 3097844894 @default.
- W3097844894 citedByCount "1" @default.
- W3097844894 countsByYear W30978448942022 @default.
- W3097844894 crossrefType "journal-article" @default.
- W3097844894 hasAuthorship W3097844894A5008431584 @default.
- W3097844894 hasAuthorship W3097844894A5035694353 @default.
- W3097844894 hasAuthorship W3097844894A5087880949 @default.
- W3097844894 hasAuthorship W3097844894A5089047341 @default.
- W3097844894 hasBestOaLocation W30978448942 @default.
- W3097844894 hasConcept C105795698 @default.
- W3097844894 hasConcept C106189395 @default.
- W3097844894 hasConcept C121332964 @default.
- W3097844894 hasConcept C121864883 @default.
- W3097844894 hasConcept C126255220 @default.
- W3097844894 hasConcept C130787639 @default.
- W3097844894 hasConcept C137019171 @default.
- W3097844894 hasConcept C159886148 @default.
- W3097844894 hasConcept C192353077 @default.
- W3097844894 hasConcept C28826006 @default.
- W3097844894 hasConcept C33923547 @default.
- W3097844894 hasConcept C58053490 @default.
- W3097844894 hasConcept C62520636 @default.
- W3097844894 hasConcept C84114770 @default.
- W3097844894 hasConcept C90408235 @default.
- W3097844894 hasConcept C98763669 @default.
- W3097844894 hasConceptScore W3097844894C105795698 @default.
- W3097844894 hasConceptScore W3097844894C106189395 @default.
- W3097844894 hasConceptScore W3097844894C121332964 @default.
- W3097844894 hasConceptScore W3097844894C121864883 @default.
- W3097844894 hasConceptScore W3097844894C126255220 @default.
- W3097844894 hasConceptScore W3097844894C130787639 @default.
- W3097844894 hasConceptScore W3097844894C137019171 @default.
- W3097844894 hasConceptScore W3097844894C159886148 @default.
- W3097844894 hasConceptScore W3097844894C192353077 @default.
- W3097844894 hasConceptScore W3097844894C28826006 @default.
- W3097844894 hasConceptScore W3097844894C33923547 @default.
- W3097844894 hasConceptScore W3097844894C58053490 @default.
- W3097844894 hasConceptScore W3097844894C62520636 @default.
- W3097844894 hasConceptScore W3097844894C84114770 @default.
- W3097844894 hasConceptScore W3097844894C90408235 @default.
- W3097844894 hasConceptScore W3097844894C98763669 @default.
- W3097844894 hasFunder F4320306084 @default.
- W3097844894 hasIssue "2" @default.
- W3097844894 hasLocation W30978448941 @default.
- W3097844894 hasLocation W30978448942 @default.
- W3097844894 hasOpenAccess W3097844894 @default.
- W3097844894 hasPrimaryLocation W30978448941 @default.
- W3097844894 hasRelatedWork W1757085302 @default.
- W3097844894 hasRelatedWork W1843088339 @default.
- W3097844894 hasRelatedWork W2011679439 @default.
- W3097844894 hasRelatedWork W2033466009 @default.
- W3097844894 hasRelatedWork W2053277979 @default.
- W3097844894 hasRelatedWork W2909673371 @default.
- W3097844894 hasRelatedWork W2990398346 @default.
- W3097844894 hasRelatedWork W3016250062 @default.
- W3097844894 hasRelatedWork W4287815547 @default.