-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TDT blog: adding leaderboard screenshot #8292
Conversation
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>
To put things in perspective, our Parakeet-TDT model with 1.1 billion parameters outperforms similar-sized Parakeet-RNNT-1.1b in accuracy, as measured as the average performance among 9 benchmarks on the [HuggingFace Leaderboard](https://huggingface.co/spaces/hf-audio/open_asr_leaderboard). Notably, Parakeet-TDT is the first model to achieve an average WER below 7.0 on the leaderboard. Additionally, it achieves an impressive real-time factor (RTF) of 8.8e-3, 64% faster than Parakeet-RNNT-1.1b's RTF of 14.4e-3. Remarkably, Parakeet-TDT's RTF is even 40% faster than Parakeet-RNNT-0.6b (RTF 12.3), despite the latter having about half the model size. | ||
|
||
<figure markdown> | ||
![HuggingFace Leaderboard](leaderboard_01312024.png) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't embed image into the commit, follow the instruction in the readme to upload the image into NeMo release
Signed-off-by: Hainan Xu <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Signed-off-by: Hainan Xu <[email protected]>
Wait the URL is wrong for the image. Please fix it by uploading to latest release - 1.22 |
Signed-off-by: Hainan Xu <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks !
What does this PR do ?
TDT blog: adding leaderboard screenshot
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Jenkins CI
To run Jenkins, a NeMo User with write access must comment
jenkins
on the PR.Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information