ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference Deployments

ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference Deployments

• Computer Science > Distributed, Parallel, and Cluster Computing [Submitted on 24 Feb 2026] Title:ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference D

Semantic Parallelism: Redefining Efficient MoE Inference via Model-Data Co-Scheduling

Semantic Parallelism: Redefining Efficient MoE Inference via Model-Data Co-Scheduling

• Computer Science > Machine Learning [Submitted on 6 Mar 2025 (v1), last revised 24 Feb 2026 (this version, v4)] Title:Semantic Parallelism: Redefining Efficient MoE Inference via

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

• China’s open‑source AI community has pivoted to Mixture‑of‑Experts (MoE) architectures for cost‑effective scalability. • MoE allows dynamic compute allocation, enabling models to