Qwen3MoE

This model was released on 2025-04-29 and added to Hugging Face Transformers on 2025-03-31.

Qwen3MoE is the mixture-of-experts (MoE) architecture used by Qwen3-235B-A22B, which was released alongside its dense counterpart, Qwen3 (blog post).
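For readers new to the term, the snippet below is a minimal, library-free sketch of generic top-k expert routing, the idea that lets a mixture-of-experts model with 235B total parameters activate only about 22B of them per token. It is not the Transformers implementation of Qwen3MoE: the function names (`route_top_k`, `moe_layer`), the toy scalar experts, and the choice to renormalize the selected weights are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k):
    """Hypothetical helper: pick the k highest-scoring experts.

    Returns a list of (expert_index, weight) pairs whose weights are
    renormalized to sum to 1 (an assumption of this sketch).
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_layer(x, router_logits, experts, k):
    """Hypothetical helper: combine the outputs of the selected experts only.

    Experts that are not selected are never called, which is where the
    compute savings of a sparse MoE layer come from.
    """
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))


# Four toy "experts", each a scalar function standing in for a feed-forward block.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]

# Router scores for a single token; experts 1 and 3 score highest.
router_logits = [0.1, 2.0, -1.0, 2.0]

selected = route_top_k(router_logits, k=2)
output = moe_layer(1.0, router_logits, experts, k=2)
print(selected, output)
```

With `k=2`, only two of the four toy experts run for this token; a real MoE layer applies the same selection independently to every token, using learned router weights and full feed-forward networks as experts.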

To be released with the official model launch.


[[autodoc]] Qwen3MoeConfig

[[autodoc]] Qwen3MoeModel - forward

[[autodoc]] Qwen3MoeForCausalLM - forward

[[autodoc]] Qwen3MoeForSequenceClassification - forward

[[autodoc]] Qwen3MoeForTokenClassification - forward

[[autodoc]] Qwen3MoeForQuestionAnswering - forward