Huawei Unveils AI Data Platform with Pioneering '3+1' Architecture at MWC 2026
At MWC 2026, Huawei introduced a new AI Data Platform featuring a unique '3+1' architecture designed to solve critical AI inference bottlenecks, including hallucinations and memory loss, while offering flexible deployment options for enterprises.
Barcelona — Huawei has officially launched its AI Data Platform at the Mobile World Congress (MWC) 2026, introducing a proprietary "3+1" architecture aimed at addressing the industry-wide imbalance between AI training and inference capabilities. Yuan Yuan, President of Huawei's Data Storage Product Line, unveiled the solution during the company's product and solutions launch event on March 3.
The platform arrives as the industry grapples with a persistent "heavy training, light inference" mentality that has hindered the deployment of AI models in core operator businesses. Huawei identifies three critical bottlenecks in current inference systems: frequent hallucinations, suboptimal response experiences, and the absence of inference memory. The new architecture specifically targets these issues through specialized storage and optimization of knowledge, KV Cache, and memory, unified under a sophisticated management layer.
The "3+1" Architecture Breakdown
The foundation of the platform rests on three specialized data repositories plus a unified management technology:
Knowledge Base: Designed for high-precision multimodal retrieval, this component converts text, images, and video resources into fine-grained knowledge through multimodal lossless parsing and token-level encoding. By combining multi-dimensional retrieval and comparison mechanisms, the system achieves over 95% retrieval accuracy in intelligent query scenarios.
PB-Level KV Cache: Addressing efficiency in AI customer service applications, this massive cache infrastructure expands context windows during single conversations while enabling the reuse of historical KV Cache across multi-turn dialogues. By eliminating redundant computations, Huawei reports a 90% reduction in first-token latency and significantly improved model response speeds.
Memory Bank: This component provides contextual memory management that extracts and precipitates historical data and experience into recallable memory. The system enables continuous model evolution through accumulated context, allowing AI systems to become "smarter with use" over time—particularly valuable for business data insight applications.
UCM Technology: The "+1" in the architecture refers to Huawei's Unified Cache Management (UCM) technology, which implements a three-layer cache architecture for full lifecycle management and intelligent scheduling of memory data across the knowledge base, KV Cache, and memory bank.
Flexible Deployment Models
Huawei offers two deployment configurations to accommodate different infrastructure requirements. The integrated deployment utilizes the OceanStor A800 as its foundation, combining all capabilities with performance and flexible scalability. Alternatively, the separated deployment employs a "data engine node + OceanStor Dorado" architecture, allowing organizations to add data engine nodes to existing systems—protecting historical investments while enabling smooth business transformation.
The launch represents Huawei's latest push into AI infrastructure, positioning storage and data management as critical enablers for practical AI application deployment beyond the training phase.