DeepSpeed Chat

An End-To-End RLHF Pipeline To Train ChatGPT-like Models

About DeepSpeed Chat

Microsoft announced the release of DeepSpeed-Chat, a low-cost, open-source solution for RLHF training that will allow anyone to create high-quality ChatGPT-style models even with a single GPU. Microsoft claims that you can train up to a 13B model on a single GPU, or at low-cost of $300 on Azure Cloud using DeepSpeed-Chat.

DeepSpeed-Chat RLHF training experience is made possible using DeepSpeed-Inference and DeepSpeed-Training to offer 15x faster throughput than SoTA, while also supporting model sizes that are up to 7.5x larger on the same hardware. DeepSpeed-Chat makes complex RLHF training fast, affordable, and easily accessible to the AI community. It democratizes ChatGPT-like models!

The initial release of DeepSpeed-Chat includes the following three capabilities:

  • Easy-to-use Training and Inference Experience for ChatGPT Like Models.
  • DeepSpeed-RLHF Pipeline.
  • DeepSpeed-RLHF System

DeepSpeed Chat screenshots

Ready to start building?

At Apideck we're building the world's biggest API network. Discover and integrate over 12,000 APIs.

Check out the API Tracker