Skip to content

Tags: spancer/DeepSpeed

Tags

v0.6.7

Toggle v0.6.7's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
use HF NeoX (deepspeedai#2087)

Co-authored-by: Olatunji Ruwase <[email protected]>
Co-authored-by: Jeff Rasley <[email protected]>

v0.6.6

Toggle v0.6.6's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Improving memory utilization of Z2+MoE (deepspeedai#2079)

* Shards expert parameter groups
* Do upscaling, optimizer and deletion of fp32 grads one-by-one on each parameter group in zero-2
Co-authored-by: Olatunji Ruwase <[email protected]>

v0.6.5

Toggle v0.6.5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fixing several bugs in the inference-api and the kernels (deepspeedai…

…#1951)

Co-authored-by: Jeff Rasley <[email protected]>

v0.6.4

Toggle v0.6.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Improve z3 trace management (deepspeedai#1916)

* Fix OOM and type mismatch

* Toggle prefetching

* Disable z3 prefetching for inference (temp workaround)

* Fix zero3 tracing issues

* Remove debug prints

* Enable prefetch for inference

* Code clarity

* Invalidate trace cache

* Trace cache invalidation when needed
Separate nvme prefetch from all-gather prefetch

* Track last used step id

* Use debug name in error message

* Construct param trace from module trace

Co-authored-by: Jeff Rasley <[email protected]>

v0.6.3

Toggle v0.6.3's commit message
bumped to v0.6.3

v0.6.2

Toggle v0.6.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix OOM and type mismatch (deepspeedai#1884)

Co-authored-by: Jeff Rasley <[email protected]>

v0.6.1

Toggle v0.6.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
qkv_out can be a single tensor or a list. Handling these cases separe…

…tely. (deepspeedai#1850)

Co-authored-by: Jeff Rasley <[email protected]>

v0.6.0

Toggle v0.6.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
force amd install via sudo (deepspeedai#1815)

v0.5.10

Toggle v0.5.10's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
[ZeRO] Default disable elastic ckpt in stage 1+2 and reduce CPU memor…

…y overhead during ckpt load (deepspeedai#1525)

Co-authored-by: Olatunji Ruwase <[email protected]>

v0.5.9

Toggle v0.5.9's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Various small documentation text improvements (deepspeedai#1665)

Co-authored-by: Jeff Rasley <[email protected]>