-
Notifications
You must be signed in to change notification settings - Fork 2.6k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Update to using Model Optimizer (formerly AMMO) in PTQ workflow (#9178)
* Update PTQ to use nvidia-modelopt Signed-off-by: Jan Lasek <[email protected]> * Restore PTQ tests Signed-off-by: Jan Lasek <[email protected]> * Update docs Signed-off-by: Jan Lasek <[email protected]> * Comment on apply_rope_fusion Signed-off-by: Jan Lasek <[email protected]> * Support for calibration PP > 1 Signed-off-by: Jan Lasek <[email protected]> * Apply isort and black reformatting Signed-off-by: janekl <[email protected]> * Fix cicd-main.yml indent Signed-off-by: Jan Lasek <[email protected]> * Set data/tensor parallel groups Signed-off-by: Jan Lasek <[email protected]> * Install only torch dependecies Signed-off-by: Jan Lasek <[email protected]> * Follow up on recent modelopt changes Signed-off-by: Jan Lasek <[email protected]> * Model support matrix Signed-off-by: Jan Lasek <[email protected]> * Apply isort and black reformatting Signed-off-by: janekl <[email protected]> * Rename PTQ script as it should be model-agnostic Signed-off-by: Jan Lasek <[email protected]> * Remove unused import Signed-off-by: Jan Lasek <[email protected]> * Update setup instructions Signed-off-by: Jan Lasek <[email protected]> --------- Signed-off-by: Jan Lasek <[email protected]> Signed-off-by: janekl <[email protected]> Co-authored-by: janekl <[email protected]>
- Loading branch information
Showing
9 changed files
with
204 additions
and
119 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.