Bunch of changes: GPTneoX for more methods, added Wandb, reorganized, compute optimized modify llama #10

prajwal1210 · 2024-05-06T22:08:18Z

No description provided.

… results, passing args to all methods to easily add new args

siddharth9820 · 2024-05-07T18:27:47Z

pushed a basic outline for the inference benchmark.

To run - torchrun --nproc_per_node 4 infer.py --model-id ..

siddharth9820 · 2024-05-07T23:51:07Z

@prajwal1210 commands to run the text generation benchmark -

HF Baseline
torchrun --nproc_per_node 1 infer.py --prompt-length 128 --gen-length 16 --batch-size 1 --seed 42

PCA TOPK (unoptimized)
torchrun --nproc_per_node 1 infer.py --prompt-length 128 --gen-length 16 --batch-size 1 --seed 42 --method pca-topk

PCA TOPK (optimized)
torchrun --nproc_per_node 1 infer.py --prompt-length 128 --gen-length 16 --batch-size 1 --seed 42 --method pca-topk --use-optimized-code

…; bunch of other changes

prajwal1210 added 3 commits May 3, 2024 10:42

Added parallel saving support; added modify_gptneox for all methods

20b7d2f

Reorganized examples/ folder, Added wandb support to track experiment…

4d54546

… results, passing args to all methods to easily add new args

Added compute optimised modify_llama_optimised in pca_topk

05488b9

prajwal1210 requested a review from siddharth9820 May 6, 2024 22:08

siddharth9820 approved these changes May 7, 2024

View reviewed changes

siddharth9820 added 2 commits May 7, 2024 10:05

some qol changes

a02c3f9

add basic inference benchmark

33c6fc0

siddharth9820 added 3 commits May 7, 2024 11:59

run pca_topk with text generation benchmark

be6471e

sample tokens from wikitext

58652f8

changes

68c3dfc

prajwal1210 added 2 commits May 10, 2024 12:48

Changes: Wandb logging change for perplexity and tasks; TP in lm_eval…

df8b247

…; bunch of other changes

Bug Fix: Optimised PCA-TopK modify_llama code

cdbbe0a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bunch of changes: GPTneoX for more methods, added Wandb, reorganized, compute optimized modify llama #10

Bunch of changes: GPTneoX for more methods, added Wandb, reorganized, compute optimized modify llama #10

prajwal1210 commented May 6, 2024

siddharth9820 commented May 7, 2024

siddharth9820 commented May 7, 2024

Bunch of changes: GPTneoX for more methods, added Wandb, reorganized, compute optimized modify llama #10

Are you sure you want to change the base?

Bunch of changes: GPTneoX for more methods, added Wandb, reorganized, compute optimized modify llama #10

Conversation

prajwal1210 commented May 6, 2024

siddharth9820 commented May 7, 2024

siddharth9820 commented May 7, 2024