Matches in SemOpenAlex for { <https://semopenalex.org/work/W4306179830> ?p ?o ?g. }
- W4306179830 endingPage "100588" @default.
- W4306179830 startingPage "100588" @default.
- W4306179830 abstract "Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool to represent molecular graphs, and the most popular molecular string representation, Smiles, has powered cheminformatics since the late 1980s. However, in the context of AI and ML in chemistry, Smiles has several shortcomings-most pertinently, most combinations of symbols lead to invalid results with no valid chemical interpretation. To overcome this issue, a new language for molecules was introduced in 2020 that guarantees 100% robustness: SELF-referencing embedded string (Selfies). Selfies has since simplified and enabled numerous new applications in chemistry. In this perspective, we look to the future and discuss molecular string representations, along with their respective opportunities and challenges. We propose 16 concrete future projects for robust molecular representations. These involve the extension toward new chemical domains, exciting questions at the interface of AI and robust languages, and interpretability for both humans and machines. We hope that these proposals will inspire several follow-up works exploiting the full potential of molecular string representations for the future of AI in chemistry and materials science." @default.
- W4306179830 created "2022-10-14" @default.
- W4306179830 creator A5002793023 @default.
- W4306179830 creator A5005389429 @default.
- W4306179830 creator A5007068765 @default.
- W4306179830 creator A5017347363 @default.
- W4306179830 creator A5017832889 @default.
- W4306179830 creator A5022581666 @default.
- W4306179830 creator A5024328518 @default.
- W4306179830 creator A5027355573 @default.
- W4306179830 creator A5028002938 @default.
- W4306179830 creator A5028051805 @default.
- W4306179830 creator A5028303962 @default.
- W4306179830 creator A5029665492 @default.
- W4306179830 creator A5032235212 @default.
- W4306179830 creator A5043852892 @default.
- W4306179830 creator A5046175815 @default.
- W4306179830 creator A5047497212 @default.
- W4306179830 creator A5047903834 @default.
- W4306179830 creator A5049443242 @default.
- W4306179830 creator A5051256159 @default.
- W4306179830 creator A5052771582 @default.
- W4306179830 creator A5057778679 @default.
- W4306179830 creator A5061244637 @default.
- W4306179830 creator A5063351330 @default.
- W4306179830 creator A5064016511 @default.
- W4306179830 creator A5065581694 @default.
- W4306179830 creator A5068540671 @default.
- W4306179830 creator A5071495561 @default.
- W4306179830 creator A5075317126 @default.
- W4306179830 creator A5078632284 @default.
- W4306179830 creator A5082060838 @default.
- W4306179830 creator A5083411426 @default.
- W4306179830 date "2022-10-01" @default.
- W4306179830 modified "2023-10-03" @default.
- W4306179830 title "SELFIES and the future of molecular string representations" @default.
- W4306179830 cites W1009610150 @default.
- W4306179830 cites W1508604947 @default.
- W4306179830 cites W1967191398 @default.
- W4306179830 cites W1971521451 @default.
- W4306179830 cites W1972549610 @default.
- W4306179830 cites W1977042981 @default.
- W4306179830 cites W1977635268 @default.
- W4306179830 cites W1980090013 @default.
- W4306179830 cites W1983153145 @default.
- W4306179830 cites W1986378419 @default.
- W4306179830 cites W1988359950 @default.
- W4306179830 cites W1991042362 @default.
- W4306179830 cites W1994733652 @default.
- W4306179830 cites W1996925740 @default.
- W4306179830 cites W2005946215 @default.
- W4306179830 cites W2008151531 @default.
- W4306179830 cites W2016986895 @default.
- W4306179830 cites W2017516062 @default.
- W4306179830 cites W2020996919 @default.
- W4306179830 cites W2021395631 @default.
- W4306179830 cites W2038095698 @default.
- W4306179830 cites W2038702914 @default.
- W4306179830 cites W2041686943 @default.
- W4306179830 cites W2046376233 @default.
- W4306179830 cites W2047715248 @default.
- W4306179830 cites W2061585475 @default.
- W4306179830 cites W2065338952 @default.
- W4306179830 cites W2066655294 @default.
- W4306179830 cites W2070942956 @default.
- W4306179830 cites W2079177760 @default.
- W4306179830 cites W2081694179 @default.
- W4306179830 cites W2082464146 @default.
- W4306179830 cites W2083224653 @default.
- W4306179830 cites W2087563523 @default.
- W4306179830 cites W2104979868 @default.
- W4306179830 cites W2150099651 @default.
- W4306179830 cites W2156462198 @default.
- W4306179830 cites W2167901312 @default.
- W4306179830 cites W2170973067 @default.
- W4306179830 cites W2172216479 @default.
- W4306179830 cites W2244690429 @default.
- W4306179830 cites W2315074804 @default.
- W4306179830 cites W2319902168 @default.
- W4306179830 cites W2324964582 @default.
- W4306179830 cites W2328906361 @default.
- W4306179830 cites W2329052088 @default.
- W4306179830 cites W2416896172 @default.
- W4306179830 cites W2523535195 @default.
- W4306179830 cites W2594183968 @default.
- W4306179830 cites W2613900957 @default.
- W4306179830 cites W2747592475 @default.
- W4306179830 cites W2756398738 @default.
- W4306179830 cites W2765135381 @default.
- W4306179830 cites W2775235705 @default.
- W4306179830 cites W2883583109 @default.
- W4306179830 cites W2884775584 @default.
- W4306179830 cites W2887459817 @default.
- W4306179830 cites W2889935426 @default.
- W4306179830 cites W2901476322 @default.
- W4306179830 cites W2903262661 @default.
- W4306179830 cites W2952734199 @default.
- W4306179830 cites W2953128081 @default.