Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385968090> ?p ?o ?g. }
- W4385968090 abstract "ChatGPT-like models have revolutionized various applications in artificial intelligence, from summarization and coding to translation, matching or even surpassing human performance. However, the current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF (Reinforcement Learning with Human Feedback) training pipeline for these powerful models, particularly when training at the scale of billions of parameters. This paper introduces DeepSpeed-Chat, a novel system that democratizes RLHF training, making it accessible to the AI community. DeepSpeed-Chat offers three key capabilities: an easy-to-use training and inference experience for ChatGPT-like models, a DeepSpeed-RLHF pipeline that replicates the training pipeline from InstructGPT, and a robust DeepSpeed-RLHF system that combines various optimizations for training and inference in a unified way. The system delivers unparalleled efficiency and scalability, enabling training of models with hundreds of billions of parameters in record time and at a fraction of the cost. With this development, DeepSpeed-Chat paves the way for broader access to advanced RLHF training, even for data scientists with limited resources, thereby fostering innovation and further development in the field of AI." @default.
- W4385968090 created "2023-08-19" @default.
- W4385968090 creator A5004330728 @default.
- W4385968090 creator A5012018288 @default.
- W4385968090 creator A5012467907 @default.
- W4385968090 creator A5018356320 @default.
- W4385968090 creator A5021145662 @default.
- W4385968090 creator A5022644245 @default.
- W4385968090 creator A5033384149 @default.
- W4385968090 creator A5037534081 @default.
- W4385968090 creator A5040302174 @default.
- W4385968090 creator A5041925582 @default.
- W4385968090 creator A5042655522 @default.
- W4385968090 creator A5043209884 @default.
- W4385968090 creator A5047471855 @default.
- W4385968090 creator A5049687601 @default.
- W4385968090 creator A5059714609 @default.
- W4385968090 creator A5069302961 @default.
- W4385968090 creator A5069595628 @default.
- W4385968090 creator A5077768924 @default.
- W4385968090 creator A5088562217 @default.
- W4385968090 date "2023-08-02" @default.
- W4385968090 modified "2023-10-01" @default.
- W4385968090 title "DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales" @default.
- W4385968090 doi "https://doi.org/10.48550/arxiv.2308.01320" @default.
- W4385968090 hasPublicationYear "2023" @default.
- W4385968090 type Work @default.
- W4385968090 citedByCount "0" @default.
- W4385968090 crossrefType "posted-content" @default.
- W4385968090 hasAuthorship W4385968090A5004330728 @default.
- W4385968090 hasAuthorship W4385968090A5012018288 @default.
- W4385968090 hasAuthorship W4385968090A5012467907 @default.
- W4385968090 hasAuthorship W4385968090A5018356320 @default.
- W4385968090 hasAuthorship W4385968090A5021145662 @default.
- W4385968090 hasAuthorship W4385968090A5022644245 @default.
- W4385968090 hasAuthorship W4385968090A5033384149 @default.
- W4385968090 hasAuthorship W4385968090A5037534081 @default.
- W4385968090 hasAuthorship W4385968090A5040302174 @default.
- W4385968090 hasAuthorship W4385968090A5041925582 @default.
- W4385968090 hasAuthorship W4385968090A5042655522 @default.
- W4385968090 hasAuthorship W4385968090A5043209884 @default.
- W4385968090 hasAuthorship W4385968090A5047471855 @default.
- W4385968090 hasAuthorship W4385968090A5049687601 @default.
- W4385968090 hasAuthorship W4385968090A5059714609 @default.
- W4385968090 hasAuthorship W4385968090A5069302961 @default.
- W4385968090 hasAuthorship W4385968090A5069595628 @default.
- W4385968090 hasAuthorship W4385968090A5077768924 @default.
- W4385968090 hasAuthorship W4385968090A5088562217 @default.
- W4385968090 hasBestOaLocation W43859680901 @default.
- W4385968090 hasConcept C119857082 @default.
- W4385968090 hasConcept C121332964 @default.
- W4385968090 hasConcept C153294291 @default.
- W4385968090 hasConcept C154945302 @default.
- W4385968090 hasConcept C170858558 @default.
- W4385968090 hasConcept C199360897 @default.
- W4385968090 hasConcept C202444582 @default.
- W4385968090 hasConcept C2522767166 @default.
- W4385968090 hasConcept C26517878 @default.
- W4385968090 hasConcept C2776214188 @default.
- W4385968090 hasConcept C2777211547 @default.
- W4385968090 hasConcept C33923547 @default.
- W4385968090 hasConcept C38652104 @default.
- W4385968090 hasConcept C41008148 @default.
- W4385968090 hasConcept C43521106 @default.
- W4385968090 hasConcept C48044578 @default.
- W4385968090 hasConcept C77088390 @default.
- W4385968090 hasConcept C9652623 @default.
- W4385968090 hasConceptScore W4385968090C119857082 @default.
- W4385968090 hasConceptScore W4385968090C121332964 @default.
- W4385968090 hasConceptScore W4385968090C153294291 @default.
- W4385968090 hasConceptScore W4385968090C154945302 @default.
- W4385968090 hasConceptScore W4385968090C170858558 @default.
- W4385968090 hasConceptScore W4385968090C199360897 @default.
- W4385968090 hasConceptScore W4385968090C202444582 @default.
- W4385968090 hasConceptScore W4385968090C2522767166 @default.
- W4385968090 hasConceptScore W4385968090C26517878 @default.
- W4385968090 hasConceptScore W4385968090C2776214188 @default.
- W4385968090 hasConceptScore W4385968090C2777211547 @default.
- W4385968090 hasConceptScore W4385968090C33923547 @default.
- W4385968090 hasConceptScore W4385968090C38652104 @default.
- W4385968090 hasConceptScore W4385968090C41008148 @default.
- W4385968090 hasConceptScore W4385968090C43521106 @default.
- W4385968090 hasConceptScore W4385968090C48044578 @default.
- W4385968090 hasConceptScore W4385968090C77088390 @default.
- W4385968090 hasConceptScore W4385968090C9652623 @default.
- W4385968090 hasLocation W43859680901 @default.
- W4385968090 hasOpenAccess W4385968090 @default.
- W4385968090 hasPrimaryLocation W43859680901 @default.
- W4385968090 hasRelatedWork W2136308941 @default.
- W4385968090 hasRelatedWork W2285613413 @default.
- W4385968090 hasRelatedWork W2308250245 @default.
- W4385968090 hasRelatedWork W2351187795 @default.
- W4385968090 hasRelatedWork W2364375860 @default.
- W4385968090 hasRelatedWork W2380641910 @default.
- W4385968090 hasRelatedWork W2389846579 @default.
- W4385968090 hasRelatedWork W2561691764 @default.
- W4385968090 hasRelatedWork W2589098947 @default.
- W4385968090 hasRelatedWork W4299638067 @default.
- W4385968090 isParatext "false" @default.
- W4385968090 isRetracted "false" @default.
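
Each row above follows the shape `- subject predicate object @graph.`, where the object is either a quoted literal (such as the date or title) or a bare identifier (such as a creator or concept ID). As a minimal sketch, assuming that row shape holds for the whole listing, the rows could be split into fields as follows. The names `Match`, `LINE_RE`, and `parse_match` are hypothetical helpers introduced for illustration and are not part of SemOpenAlex.

```python
import re
from typing import NamedTuple, Optional


class Match(NamedTuple):
    """One row of the listing: subject, predicate, object, graph."""
    subject: str
    predicate: str
    obj: str
    graph: str
    is_literal: bool  # True when the object was a quoted string


# Assumed row shape: "- <subject> <predicate> <object> @<graph>."
# The object is either a double-quoted literal or a single bare token.
LINE_RE = re.compile(
    r'^-\s+(?P<s>\S+)\s+(?P<p>\S+)\s+(?P<o>".*"|\S+)\s+@(?P<g>\S+?)\.$'
)


def parse_match(line: str) -> Optional[Match]:
    """Parse one listing row; return None for lines that are not rows."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    obj = m.group("o")
    is_literal = obj.startswith('"')
    if is_literal:
        obj = obj[1:-1]  # drop the surrounding quotes
    return Match(m.group("s"), m.group("p"), obj, m.group("g"), is_literal)


if __name__ == "__main__":
    rows = [
        '- W4385968090 date "2023-08-02" @default.',
        "- W4385968090 type Work @default.",
        "- W4385968090 creator A5004330728 @default.",
    ]
    for row in rows:
        print(parse_match(row))
```

Lines that do not begin with `- `, such as the header line, yield `None`, so the helper can be applied to the listing as a whole and the non-row lines filtered out.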