*This model was contributed to Hugging Face Transformers on 2026-05-20.*

# Cohere2 MoE

[Command A+] is a Mixture-of-Experts (MoE) language model from Cohere. It features a hybrid attention pattern that combines sliding window and full attention layers, uses both shared and routed experts, and supports a very large context window.

## Cohere2MoeConfig

[[autodoc]] Cohere2MoeConfig

## Cohere2MoeModel

[[autodoc]] Cohere2MoeModel
    - forward

## Cohere2MoeForCausalLM

[[autodoc]] Cohere2MoeForCausalLM
    - forward
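
The hybrid attention pattern mentioned in the overview can be illustrated with a small, library-free sketch. This is not the Transformers implementation: the helper names (`causal_mask`, `sliding_window_mask`, `layer_types`), the window size, and the interleaving ratio are all hypothetical and chosen only for illustration. The model's real values live in `Cohere2MoeConfig`.

```python
# Illustrative sketch only. Function names, window size, and the layer
# interleaving ratio are assumptions, not values taken from the model.
from typing import List


def causal_mask(seq_len: int) -> List[List[bool]]:
    """Full causal attention: position i may attend to every position j <= i."""
    return [[j <= i for j in range(seq_len)] for i in range(seq_len)]


def sliding_window_mask(seq_len: int, window: int) -> List[List[bool]]:
    """Sliding-window causal attention: position i attends only to the
    `window` most recent positions, itself included."""
    return [[i - window < j <= i for j in range(seq_len)] for i in range(seq_len)]


def layer_types(num_layers: int, full_every: int) -> List[str]:
    """Hypothetical interleaving: every `full_every`-th layer uses full
    attention, and all other layers use sliding-window attention."""
    return [
        "full" if (i + 1) % full_every == 0 else "sliding"
        for i in range(num_layers)
    ]


if __name__ == "__main__":
    # Print the assumed layer pattern and one row of each mask type.
    print(layer_types(num_layers=8, full_every=4))
    print(causal_mask(4)[3])
    print(sliding_window_mask(4, window=2)[3])
```

Sliding-window layers keep per-token attention cost bounded by the window size, while the occasional full-attention layer lets information travel across the whole context. That trade-off is the usual motivation for mixing the two layer types in long-context models.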