@LuJunru
Contributor

@LuJunru LuJunru commented Jan 8, 2026

What does this PR do?

This PR adds the implementation for the released Youtu-LLM model. The model has the following features:

  • Type: Autoregressive Causal Language Models with Dense MLA
  • Release versions: Base and Instruct

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @Cyrilvallez

@github-actions
Contributor

github-actions bot commented Jan 8, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, youtu_llm

@github-actions
Contributor

github-actions bot commented Jan 8, 2026

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43165&sha=0797aa

@LuJunru LuJunru closed this Jan 8, 2026
@LuJunru LuJunru deleted the master branch January 8, 2026 08:32
@LuJunru
Contributor Author

LuJunru commented Jan 8, 2026

The base repository was forked from the transformers repo 8 years ago and later synchronized with upstream commits, which led to branch naming issues. I will close this pull request and continue from a new fork.

The new PR is here: #43166.
