DeepSpeed Chat

An End-To-End RLHF Pipeline To Train ChatGPT-like Models

Acerca de DeepSpeed Chat

Microsoft announced the release of DeepSpeed-Chat, a low-cost, open-source solution for RLHF training that will allow anyone to create high-quality ChatGPT-style models even with a single GPU. Microsoft claims that you can train up to a 13B model on a single GPU, or at low-cost of $300 on Azure Cloud using DeepSpeed-Chat.

DeepSpeed-Chat RLHF training experience is made possible using DeepSpeed-Inference and DeepSpeed-Training to offer 15x faster throughput than SoTA, while also supporting model sizes that are up to 7.5x larger on the same hardware. DeepSpeed-Chat makes complex RLHF training fast, affordable, and easily accessible to the AI community. It democratizes ChatGPT-like models!

The initial release of DeepSpeed-Chat includes the following three capabilities:

  • Easy-to-use Training and Inference Experience for ChatGPT Like Models.
  • DeepSpeed-RLHF Pipeline.
  • DeepSpeed-RLHF System

DeepSpeed Chat capturas de pantalla

Ready to start building?

At Apideck we're building the world's biggest API network. Discover and integrate over 12,000 APIs.

Check out the API Tracker