add config safety check #6

Closed

Conversation

HuiyingLi
Collaborator

What does this PR do?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Jenkins CI

To run Jenkins, a NeMo User with write access must comment jenkins on the PR.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

dimapihtar and others added 15 commits April 17, 2024 11:31
* fsdp tp > 1 bug fix

Signed-off-by: dimapihtar <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix fsdp for fp16 precision

Signed-off-by: dimapihtar <[email protected]>

---------

Signed-off-by: dimapihtar <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Use fp16 in unet if cfg.precision is 16bit

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* remove temp fix

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* guard unet_precision value setting

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* formatting

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* throw a warning if unet_config.unet_precision is not set but trainer.precision is set

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* use model.optim.name=megatron_fused_adam in sd_train

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* use bf16 in sd_train

Signed-off-by: Alexandros Koumparoulis <[email protected]>

---------

Signed-off-by: Alexandros Koumparoulis <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…IA#8959)

* Add a last stage job to enable/for blocking check

* temp for test

* fix typo

* fix typo

* fix GHA deps needs

* Revert "temp for test"

This reverts commit 5d71d09.
* Detect 'arcname' prefix in utils when handling .nemo tarball

Signed-off-by: Jan Lasek <[email protected]>

* Address megatron_amp_O2 = True case in quantization

Signed-off-by: Jan Lasek <[email protected]>

* Add Megatron-LM to PYTHONPATH correctly in Jenkinsfile

Signed-off-by: Jan Lasek <[email protected]>

---------

Signed-off-by: Jan Lasek <[email protected]>
* PR NVIDIA#8803: Update embedding init prototype to match mc

Signed-off-by: Jaemin Choi <[email protected]>

* PR NVIDIA#8810: Fix import of get_gpt_layer_ammo_spec

Signed-off-by: Jaemin Choi <[email protected]>

* PR NVIDIA#8853: Fix memory leak at loss func

Signed-off-by: Jaemin Choi <[email protected]>

---------

Signed-off-by: Jaemin Choi <[email protected]>
Signed-off-by: Shriya Palsamudram <[email protected]>
Co-authored-by: Jaemin Choi <[email protected]>
Co-authored-by: Eric Harper <[email protected]>
Co-authored-by: Shriya Palsamudram <[email protected]>
Co-authored-by: Pablo Garay <[email protected]>
* initial commit

Signed-off-by: Chen Cui <[email protected]>

* enable pp support for merge script and fix output precision

Signed-off-by: Chen Cui <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove incomplete script for next release

Signed-off-by: Chen Cui <[email protected]>

---------

Signed-off-by: Chen Cui <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adi Renduchintala <[email protected]>
Co-authored-by: Eric Harper <[email protected]>
* Move cached embedding devices and dtype for onnx export consistency

Signed-off-by: Mingyuan Ma <[email protected]>

* Add old trt export/inference script, currently not working in latest container.

Signed-off-by: Mingyuan Ma <[email protected]>

* Add NeMo TRT inference pipeline and quantization workflow

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add guards to avoid undefined variables

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* minor fix

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add conversion script from hf sdxl to nemo sdxl

Signed-off-by: Mingyuan Ma <[email protected]>

* Update quantize pipeline to adapt to variable image dimension

Signed-off-by: Mingyuan Ma <[email protected]>

* update sdxl pipeline to be aware of additional emb channels

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add guards for potential local var

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* copyright header

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update calib prompt file path

Signed-off-by: Mingyuan Ma <[email protected]>

* Update file paths

Signed-off-by: Mingyuan Ma <[email protected]>

* minor update

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update default quantization config

Signed-off-by: Mingyuan Ma <[email protected]>

* remove unused imports/vars

Signed-off-by: Mingyuan Ma <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove unused imports

Signed-off-by: Mingyuan Ma <[email protected]>

---------

Signed-off-by: Mingyuan Ma <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…8866)

* Replaced unpacking of nemo checkpoints on export with a VFS-like TarPath object.

Signed-off-by: Alexey Panteleev <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Fixed the signature of ZarrPathStore.__delitem__

Signed-off-by: Alexey Panteleev <[email protected]>

---------

Signed-off-by: Alexey Panteleev <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Onur Yilmaz <[email protected]>
Signed-off-by: eharper <[email protected]>
* Update Latest News

Adds links to articles on
* NeMo framework on GKE
* Responsible Gen AI using NeMo and Picasso
* NeMo powering Amazon Titan foundation models

Signed-off-by: Shashank Verma <[email protected]>

* Minor updates to latest news in README

* Remove bullets
* Editing text for clarity

Signed-off-by: Shashank Verma <[email protected]>

* Format latest news as a dropdown list

* Uses embedded html to format news to dropdown, hiding lengthy details
* Fixes formatting of the title

Signed-off-by: Shashank Verma <[email protected]>

* Add break to improve readability of latest news image

Signed-off-by: Shashank Verma <[email protected]>

* Add LLM and MM section in latest news

Signed-off-by: Shashank Verma <[email protected]>

* Add margin in latest news expandable lists

Signed-off-by: Shashank Verma <[email protected]>

* Remove styling of expandable list

* Github appears to not render styled elements when
embedded as raw html in rst

Signed-off-by: Shashank Verma <[email protected]>

* Fold the first news item by default

Signed-off-by: Shashank Verma <[email protected]>

---------

Signed-off-by: Shashank Verma <[email protected]>
Signed-off-by: Shashank Verma <[email protected]>
continue
if isinstance(module, (torch.nn.Linear, LinearWrapper)):
if (
(quant_level >= 2 and "ff.net" in name)

Check failure

Code scanning / CodeQL

Incomplete URL substring sanitization High

The string "ff.net" may be at an arbitrary position in the sanitized URL.
module.weight_quantizer.enable()
elif isinstance(module, (torch.nn.Linear, LinearWrapper)):
if (
(quant_level >= 2 and "ff.net" in name)

Check failure

Code scanning / CodeQL

Incomplete URL substring sanitization High

The string "ff.net" may be at an arbitrary position in the sanitized URL.
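
A note on the two alerts above: `name` in the flagged lines looks like a dotted module path rather than a URL (an assumption; the snippet does not show where `name` comes from), so the URL-sanitization warning may not apply as written. Even so, a plain substring test such as `"ff.net" in name` also matches unrelated names like `stuff.net`. The sketch below shows one stricter alternative that compares whole dotted components. The helper name `has_path_segment` and the example module names are hypothetical and are not part of this PR.

```python
def has_path_segment(module_name: str, segment: str) -> bool:
    """Return True if `segment` occurs in `module_name` as whole dotted components.

    Hypothetical helper for illustration only; it is not taken from the PR.
    """
    parts = module_name.split(".")
    target = segment.split(".")
    n = len(target)
    # Slide a window of len(target) components across the module path.
    return any(parts[i:i + n] == target for i in range(len(parts) - n + 1))


# Example module names are made up for this sketch.
print(has_path_segment("transformer_blocks.0.ff.net.2", "ff.net"))
print(has_path_segment("stuff.net", "ff.net"))
```

Whether this change would also clear the CodeQL alert has not been checked; it only tightens the match.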

    def __repr__(self):
        s = "PercentileCalibrator("
        s += super(MaxCalibrator, self).__repr__()

Check failure

Code scanning / CodeQL

First argument to super() is not enclosing class Error

First argument to super() should be PercentileCalibrator.
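
To illustrate the alert above, here is a minimal sketch with a toy class hierarchy. `BaseCalibrator` and `BuggyPercentileCalibrator` are hypothetical names, and the real inheritance chain in this PR is not shown on the page, so the hierarchy is an assumption. In this toy setup `PercentileCalibrator` does not derive from `MaxCalibrator`, so `super(MaxCalibrator, self)` raises `TypeError`. If the real class did derive from `MaxCalibrator`, the call would instead silently skip `MaxCalibrator.__repr__`. In Python 3 the zero-argument `super()` avoids both problems.

```python
class BaseCalibrator:
    """Hypothetical base class, used only for this illustration."""

    def __init__(self, num_bits=8):
        self.num_bits = num_bits

    def __repr__(self):
        return f"num_bits={self.num_bits})"


class MaxCalibrator(BaseCalibrator):
    def __repr__(self):
        return "MaxCalibrator(" + super().__repr__()


class BuggyPercentileCalibrator(BaseCalibrator):
    def __repr__(self):
        s = "PercentileCalibrator("
        # Bug: `self` is not an instance of MaxCalibrator here,
        # so this super() call raises TypeError.
        s += super(MaxCalibrator, self).__repr__()
        return s


class PercentileCalibrator(BaseCalibrator):
    def __repr__(self):
        s = "PercentileCalibrator("
        # Zero-argument super() always refers to the enclosing class.
        s += super().__repr__()
        return s


print(repr(PercentileCalibrator()))
try:
    repr(BuggyPercentileCalibrator())
except TypeError as exc:
    print("TypeError:", exc)
```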
# limitations under the License.

import math
import time

Check notice

Code scanning / CodeQL

Unused import Note

Import of 'time' is not used.