Log_props output in TRT-LLM #11009
Signed-off-by: Onur Yilmaz <[email protected]>
@athitten If this PR works for you, I'll do some cleanup and start the CI.
@oyilmaz-nvidia Sorry, I didn't get a chance to test it out yet. I will test it tomorrow. Otherwise the PR looks good. Thank you!
Tested it locally and it works. One caveat: the logProbs of the input tokens are 0.0, while the logProbs of the output tokens are as expected. There is probably a flag or setting on the TRT-LLM side that returns logProbs for the input tokens as well.
Thank you @oyilmaz-nvidia
Closing this since it'll be included here: #11233
What does this PR do?
Adds the ability to return log_probs (per-token log probabilities) from TRT-LLM through PyTriton.
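
For readers unfamiliar with the quantity being returned, here is a minimal sketch of how a per-token log probability is derived from logits. It is plain Python and does not use the TRT-LLM or PyTriton APIs; the function names `log_softmax` and `token_log_probs` and the sample logits are hypothetical and are not taken from this PR.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one vocabulary-sized list of logits."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum for x in logits]


def token_log_probs(step_logits, token_ids):
    """Return the log probability of each chosen token, one per generation step.

    step_logits: one list of logits per step (hypothetical sample data below).
    token_ids:   the token id selected at each step.
    """
    return [log_softmax(logits)[tok] for logits, tok in zip(step_logits, token_ids)]


# Hypothetical example: two generation steps over a 3-token vocabulary.
step_logits = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.0]]
generated = [0, 1]
log_probs = token_log_probs(step_logits, generated)
print(log_probs)
```

A log probability of exactly 0.0 corresponds to a probability of 1.0, so a run of 0.0 values at the input positions more plausibly indicates values that were not populated than genuine model scores.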