Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386821711> ?p ?o ?g. }
Showing items 1 to 86 of
86
with 100 items per page.
- W4386821711 abstract "This paper develops a distributed policy evaluation scheme under the scenario that a group of agents collaborate to estimate the value function of a given policy with the global state and the local reward. Under this framework, we formulate a cost function as a combination of linear function approximation and eligibility traces. Combining gradient decent and consensus iteration with time-averaging, a distributed policy evaluation algorithm is proposed and the convergence is shown by appropriately setting the gradient learning rates and the numbers of consensus iterations. The proposed policy evaluation scheme has the potential to improve the robust performance in the learning process. Experimental results validate the theoretical analyses of our algorithm." @default.
- W4386821711 created "2023-09-19" @default.
- W4386821711 creator A5005160651 @default.
- W4386821711 creator A5041338528 @default.
- W4386821711 creator A5058825605 @default.
- W4386821711 creator A5059845882 @default.
- W4386821711 creator A5088659212 @default.
- W4386821711 creator A5092761707 @default.
- W4386821711 date "2023-07-24" @default.
- W4386821711 modified "2023-09-26" @default.
- W4386821711 title "A Distributed Policy Evaluation Scheme Over Consensus Iteration" @default.
- W4386821711 cites W1542941925 @default.
- W4386821711 cites W2073900753 @default.
- W4386821711 cites W2075268401 @default.
- W4386821711 cites W2107338086 @default.
- W4386821711 cites W2114791779 @default.
- W4386821711 cites W2142184324 @default.
- W4386821711 cites W2144672231 @default.
- W4386821711 cites W2896481994 @default.
- W4386821711 cites W2908643158 @default.
- W4386821711 cites W2963028406 @default.
- W4386821711 cites W2963128910 @default.
- W4386821711 cites W3027121709 @default.
- W4386821711 cites W3085271681 @default.
- W4386821711 cites W3092429416 @default.
- W4386821711 cites W3108171015 @default.
- W4386821711 doi "https://doi.org/10.23919/ccc58697.2023.10240148" @default.
- W4386821711 hasPublicationYear "2023" @default.
- W4386821711 type Work @default.
- W4386821711 citedByCount "0" @default.
- W4386821711 crossrefType "proceedings-article" @default.
- W4386821711 hasAuthorship W4386821711A5005160651 @default.
- W4386821711 hasAuthorship W4386821711A5041338528 @default.
- W4386821711 hasAuthorship W4386821711A5058825605 @default.
- W4386821711 hasAuthorship W4386821711A5059845882 @default.
- W4386821711 hasAuthorship W4386821711A5088659212 @default.
- W4386821711 hasAuthorship W4386821711A5092761707 @default.
- W4386821711 hasConcept C111919701 @default.
- W4386821711 hasConcept C126255220 @default.
- W4386821711 hasConcept C127162648 @default.
- W4386821711 hasConcept C134306372 @default.
- W4386821711 hasConcept C14036430 @default.
- W4386821711 hasConcept C162324750 @default.
- W4386821711 hasConcept C2777303404 @default.
- W4386821711 hasConcept C31258907 @default.
- W4386821711 hasConcept C33923547 @default.
- W4386821711 hasConcept C41008148 @default.
- W4386821711 hasConcept C50522688 @default.
- W4386821711 hasConcept C57869625 @default.
- W4386821711 hasConcept C77618280 @default.
- W4386821711 hasConcept C78458016 @default.
- W4386821711 hasConcept C86803240 @default.
- W4386821711 hasConcept C98045186 @default.
- W4386821711 hasConceptScore W4386821711C111919701 @default.
- W4386821711 hasConceptScore W4386821711C126255220 @default.
- W4386821711 hasConceptScore W4386821711C127162648 @default.
- W4386821711 hasConceptScore W4386821711C134306372 @default.
- W4386821711 hasConceptScore W4386821711C14036430 @default.
- W4386821711 hasConceptScore W4386821711C162324750 @default.
- W4386821711 hasConceptScore W4386821711C2777303404 @default.
- W4386821711 hasConceptScore W4386821711C31258907 @default.
- W4386821711 hasConceptScore W4386821711C33923547 @default.
- W4386821711 hasConceptScore W4386821711C41008148 @default.
- W4386821711 hasConceptScore W4386821711C50522688 @default.
- W4386821711 hasConceptScore W4386821711C57869625 @default.
- W4386821711 hasConceptScore W4386821711C77618280 @default.
- W4386821711 hasConceptScore W4386821711C78458016 @default.
- W4386821711 hasConceptScore W4386821711C86803240 @default.
- W4386821711 hasConceptScore W4386821711C98045186 @default.
- W4386821711 hasFunder F4320321001 @default.
- W4386821711 hasLocation W43868217111 @default.
- W4386821711 hasOpenAccess W4386821711 @default.
- W4386821711 hasPrimaryLocation W43868217111 @default.
- W4386821711 hasRelatedWork W1521353230 @default.
- W4386821711 hasRelatedWork W1596397513 @default.
- W4386821711 hasRelatedWork W2087258916 @default.
- W4386821711 hasRelatedWork W2088260668 @default.
- W4386821711 hasRelatedWork W2089375346 @default.
- W4386821711 hasRelatedWork W2314394688 @default.
- W4386821711 hasRelatedWork W2394099118 @default.
- W4386821711 hasRelatedWork W2739624643 @default.
- W4386821711 hasRelatedWork W4229450262 @default.
- W4386821711 hasRelatedWork W4280566261 @default.
- W4386821711 isParatext "false" @default.
- W4386821711 isRetracted "false" @default.
- W4386821711 workType "article" @default.