*This model was contributed to Hugging Face Transformers on 2025-08-22.*

# HunYuanMoEV1 HunYuanMoEV1 is Tencent's mixture-of-experts language model with 80B total parameters and 13B active parameters per token. It uses fine-grained expert routing with Grouped Query Attention, supports 256K context length, and offers dual-mode reasoning (fast and slow thinking). The example below demonstrates how to generate text with [`Pipeline`] or the [`AutoModelForCausalLM`] class. ```python from transformers import pipeline pipe = pipeline( task="text-generation", model="tencent/Hunyuan-A13B-Instruct", ) pipe("The future of artificial intelligence is") ``` ```python from transformers import AutoModelForCausalLM, AutoTokenizer tokenizer = AutoTokenizer.from_pretrained("tencent/Hunyuan-A13B-Instruct") model = AutoModelForCausalLM.from_pretrained( "tencent/Hunyuan-A13B-Instruct", device_map="auto", ) input_ids = tokenizer("The future of artificial intelligence is", return_tensors="pt").to(model.device) output = model.generate(**input_ids, max_new_tokens=50) print(tokenizer.decode(output[0], skip_special_tokens=True)) ``` ## HunYuanMoEV1Config [[autodoc]] HunYuanMoEV1Config ## HunYuanMoEV1Model [[autodoc]] HunYuanMoEV1Model - forward ## HunYuanMoEV1ForCausalLM [[autodoc]] HunYuanMoEV1ForCausalLM - forward ## HunYuanMoEV1ForSequenceClassification [[autodoc]] HunYuanMoEV1ForSequenceClassification - forward