What's Changed
- Fused attention: Switch to Flash Decoding by @casper-hansen in #656
- Add EXAONE support by @lgai-exaone in #651
- Fix EXAONE import by @casper-hansen in #659
- Fix "Expected all tensors to be on the same device" by @casper-hansen in #664
- Multi-GPU fix by @casper-hansen in #668
- Fix missing embed_tokens by @casper-hansen in #671
- Install hub main branch by @casper-hansen in #672
- Pin huggingface_hub by @casper-hansen in #673
- Bump to 0.2.7.post3 by @casper-hansen in #674
New Contributors
- @lgai-exaone made their first contribution in #651
Full Changelog: v0.2.7.post2...v0.2.7.post3