Machine Learning Scrapbook
Subscribe
Sign in
Medusa: Multiple Decoding Heads for Faster…
Celine
Oct 12, 2024
Enhancing existing LLMs with multi-token prediction
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Medusa: Multiple Decoding Heads for Faster…
Enhancing existing LLMs with multi-token prediction