Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepSeek-R1 model on HuggingFace Hub #363

Open
saman-rahbar opened this issue Feb 10, 2025 · 0 comments
Open

DeepSeek-R1 model on HuggingFace Hub #363

saman-rahbar opened this issue Feb 10, 2025 · 0 comments

Comments

@saman-rahbar
Copy link

Hello everyone,

I wanted to flag a recent change in the DeepSeek-R1 model on HuggingFace Hub (including its distilled version). the chat template in tokenizer_config.json was updated to add a line "\n" at the end. it seems this modification aims to ensure the model begins from the “thinking” process rather than skipping that step entirely.

However, this change has unfortunately disrupted the functionality of the reasoning-parser in vLLM for DeepSeek-R1. I’m not certain who oversees the deepseek_r1 reasoning-parser within vLLM, but I would greatly appreciate any assistance in resolving this issue as soon as possible.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant