Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Log_props output in TRT-LLM #11009

Closed
wants to merge 52 commits into from

Conversation

oyilmaz-nvidia
Copy link
Collaborator

What does this PR do ?

Adds the functionality to return log_props from TRT-LLM using PyTriton.

oyilmaz-nvidia and others added 30 commits June 13, 2024 10:12
Signed-off-by: Onur Yilmaz <[email protected]>
Signed-off-by: Onur Yilmaz <[email protected]>
Signed-off-by: Onur Yilmaz <[email protected]>
@oyilmaz-nvidia
Copy link
Collaborator Author

@athitten If this PR works for you, I'll do some cleanup and start the CI.

@athitten
Copy link
Collaborator

athitten commented Oct 25, 2024

@athitten If this PR works for you, I'll do some cleanup and start the CI.

@oyilmaz-nvidia sorry I dint get a chance to test it out yet. I will test it tomorrow. But the PR looks good otherwise. Thank you!

Copy link
Collaborator

@athitten athitten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested it locally and it works, although the logProbs of the input tokens are 0.0 and logProbs of output tokens are as expected. It's probably some flag or setting on TRTLLM side to get logProbs of input tokens as well.
Thank you @oyilmaz-nvidia

@oyilmaz-nvidia oyilmaz-nvidia enabled auto-merge (squash) October 28, 2024 15:44
@oyilmaz-nvidia
Copy link
Collaborator Author

Closing this since it'll be included here: #11233

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants