Matches in SemOpenAlex for { <https://semopenalex.org/work/W4327652945> ?p ?o ?g. }
Showing items 1 to 79 of
79
with 100 items per page.
- W4327652945 endingPage "99" @default.
- W4327652945 startingPage "84" @default.
- W4327652945 abstract "This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum number of actions needed to transition between them. Compared to previous methods, our approach does not require any domain knowledge, learning from offline and unlabeled data. We show how this representation can be leveraged to learn goal-conditioned policies, providing a notion of similarity between states and goals and a useful heuristic distance to guide planning and reinforcement learning algorithms. Finally, we empirically validate our method in classic control domains and multi-goal environments, demonstrating that our method can successfully learn representations in large and/or continuous domains." @default.
- W4327652945 created "2023-03-18" @default.
- W4327652945 creator A5003267105 @default.
- W4327652945 creator A5057116576 @default.
- W4327652945 date "2023-01-01" @default.
- W4327652945 modified "2023-10-17" @default.
- W4327652945 title "State Representation Learning for Goal-Conditioned Reinforcement Learning" @default.
- W4327652945 cites W2165533158 @default.
- W4327652945 cites W2746553466 @default.
- W4327652945 cites W4294786244 @default.
- W4327652945 doi "https://doi.org/10.1007/978-3-031-26412-2_6" @default.
- W4327652945 hasPublicationYear "2023" @default.
- W4327652945 type Work @default.
- W4327652945 citedByCount "0" @default.
- W4327652945 crossrefType "book-chapter" @default.
- W4327652945 hasAuthorship W4327652945A5003267105 @default.
- W4327652945 hasAuthorship W4327652945A5057116576 @default.
- W4327652945 hasBestOaLocation W43276529452 @default.
- W4327652945 hasConcept C105795698 @default.
- W4327652945 hasConcept C106189395 @default.
- W4327652945 hasConcept C111919701 @default.
- W4327652945 hasConcept C11413529 @default.
- W4327652945 hasConcept C119857082 @default.
- W4327652945 hasConcept C134306372 @default.
- W4327652945 hasConcept C154945302 @default.
- W4327652945 hasConcept C159886148 @default.
- W4327652945 hasConcept C173801870 @default.
- W4327652945 hasConcept C17744445 @default.
- W4327652945 hasConcept C199539241 @default.
- W4327652945 hasConcept C2776359362 @default.
- W4327652945 hasConcept C2778572836 @default.
- W4327652945 hasConcept C33923547 @default.
- W4327652945 hasConcept C36503486 @default.
- W4327652945 hasConcept C41008148 @default.
- W4327652945 hasConcept C41608201 @default.
- W4327652945 hasConcept C48103436 @default.
- W4327652945 hasConcept C72434380 @default.
- W4327652945 hasConcept C94625758 @default.
- W4327652945 hasConcept C97541855 @default.
- W4327652945 hasConceptScore W4327652945C105795698 @default.
- W4327652945 hasConceptScore W4327652945C106189395 @default.
- W4327652945 hasConceptScore W4327652945C111919701 @default.
- W4327652945 hasConceptScore W4327652945C11413529 @default.
- W4327652945 hasConceptScore W4327652945C119857082 @default.
- W4327652945 hasConceptScore W4327652945C134306372 @default.
- W4327652945 hasConceptScore W4327652945C154945302 @default.
- W4327652945 hasConceptScore W4327652945C159886148 @default.
- W4327652945 hasConceptScore W4327652945C173801870 @default.
- W4327652945 hasConceptScore W4327652945C17744445 @default.
- W4327652945 hasConceptScore W4327652945C199539241 @default.
- W4327652945 hasConceptScore W4327652945C2776359362 @default.
- W4327652945 hasConceptScore W4327652945C2778572836 @default.
- W4327652945 hasConceptScore W4327652945C33923547 @default.
- W4327652945 hasConceptScore W4327652945C36503486 @default.
- W4327652945 hasConceptScore W4327652945C41008148 @default.
- W4327652945 hasConceptScore W4327652945C41608201 @default.
- W4327652945 hasConceptScore W4327652945C48103436 @default.
- W4327652945 hasConceptScore W4327652945C72434380 @default.
- W4327652945 hasConceptScore W4327652945C94625758 @default.
- W4327652945 hasConceptScore W4327652945C97541855 @default.
- W4327652945 hasLocation W43276529451 @default.
- W4327652945 hasLocation W43276529452 @default.
- W4327652945 hasOpenAccess W4327652945 @default.
- W4327652945 hasPrimaryLocation W43276529451 @default.
- W4327652945 hasRelatedWork W1490038383 @default.
- W4327652945 hasRelatedWork W2353483528 @default.
- W4327652945 hasRelatedWork W2482498454 @default.
- W4327652945 hasRelatedWork W2937181779 @default.
- W4327652945 hasRelatedWork W2947128950 @default.
- W4327652945 hasRelatedWork W2952288475 @default.
- W4327652945 hasRelatedWork W3201878770 @default.
- W4327652945 hasRelatedWork W36691172 @default.
- W4327652945 hasRelatedWork W4297803875 @default.
- W4327652945 hasRelatedWork W4362647313 @default.
- W4327652945 isParatext "false" @default.
- W4327652945 isRetracted "false" @default.
- W4327652945 workType "book-chapter" @default.