04/20 2026

On April 17, Hangzhou Yuanchuan Micro, just six months old, announced the completion of hundreds of millions of yuan in angel round funding. Led by top-tier investors including Zhejiang Innovation Investment, Origin Capital, Frees Fund, Shenzhen Capital Group, and SMIC Capital, with industrial players like SigmaStar and Jwsmart also participating, all funds are earmarked for R&D and mass production of LPU+ architecture inference chips.
In 2026, amid intense competition in training chips and surging demand for inference, how did this startup secure collective approval from top-tier capital and industry players? The answer lies in its technical approach, team expertise, and product delivery capabilities.
01
Why Does This Team Dare to Challenge GPU Dominance?
Yuanchuan Micro's confidence stems from a battle-hardened chip development team. Founder Yang Bin, a graduate of Xidian University, joined Huawei in 2000 and brings 25 years of chip architecture and mass production experience. He led Huawei's processor team in Silicon Valley, spearheaded wireless baseband chip development, and positioned Huawei as a global leader in base station baseband technology.
80% of the core team hails from industry leaders like Huawei, HiSilicon, Cambrian, and Horizon Robotics, averaging over 15 years of experience and having delivered dozens of mass-produced chips. They cover full-stack R&D from architecture design, compilers, SDKs, to system integration—practitioners who turn chip blueprints into commercial products.
The team's choice of LPU+ was no mere trend-following. After DeepSeek-R1's 2025 technical report, Yang Bin recognized the inflection point: large models becoming truly usable, with inference emerging as the primary battleground for computing power. Traditional GPU architectures, with their separated memory and compute and multi-level caches, proved inefficient for real-time inference. He took the leap, targeting the toughest challenge in the inference space—rebuilding computing infrastructure with dataflow architecture.
02
What Makes LPU+ So Powerful, and Why Is It 6x Faster Than the H100?
Yuanchuan Micro's core weapon is its self-developed LPU+ non-Von Neumann dataflow architecture, breaking free from traditional GPU/NPU paths.
Conventional chips rely on multi-level caches to move data, akin to layered warehouse transfers—high latency, significant overhead. LPU+ uses hardware dataflow to directly connect compute units, enabling automated, pipeline-style data movement within on-chip SRAM. Scheduling is front-loaded at the compilation stage, with fixed paths per clock cycle, achieving ASIC-level data movement efficiency.
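The compile-time scheduling idea can be sketched as a toy simulator. Note this is purely illustrative under stated assumptions: Yuanchuan Micro's actual toolchain and ISA are not public, and the names (`compile_schedule`, `execute`) and the dict-as-SRAM model are hypothetical. The point it demonstrates is the general dataflow principle the paragraph describes: every operation is pinned to a clock cycle before any data arrives, so execution is a fixed replay with no runtime arbitration or cache lookups.

```python
# Toy illustration of compile-time dataflow scheduling -- NOT Yuanchuan
# Micro's toolchain, just a minimal sketch of the general idea: pin
# every op to a clock cycle at compile time so execution becomes a
# fixed replay with no runtime arbitration or cache traffic.

def compile_schedule(dag):
    """dag maps op -> list of producers (other ops or external inputs).
    Returns op -> cycle: each op fires one cycle after its slowest
    producer (ASAP scheduling); external inputs are ready at cycle -1."""
    schedule = {}

    def cycle_of(node):
        if node not in dag:          # external input, available up front
            return -1
        if node not in schedule:
            schedule[node] = 1 + max(cycle_of(d) for d in dag[node])
        return schedule[node]

    for op in dag:
        cycle_of(op)
    return schedule

def execute(dag, schedule, inputs, funcs):
    """Replay the fixed schedule cycle by cycle. A plain dict stands in
    for on-chip SRAM: results flow straight from producer to consumer."""
    sram = dict(inputs)
    for cycle in range(max(schedule.values()) + 1):
        for op, c in schedule.items():
            if c == cycle:
                sram[op] = funcs[op](*(sram[d] for d in dag[op]))
    return sram

# Example: out = (a + b) * (a - b). The "compiler" fixes add and sub at
# cycle 0 and mul at cycle 1 before any data arrives.
dag = {"add": ["a", "b"], "sub": ["a", "b"], "mul": ["add", "sub"]}
funcs = {"add": lambda x, y: x + y,
         "sub": lambda x, y: x - y,
         "mul": lambda x, y: x * y}
sched = compile_schedule(dag)   # {'add': 0, 'sub': 0, 'mul': 1}
out = execute(dag, sched, {"a": 5, "b": 3}, funcs)["mul"]  # 16
```

Because the schedule is fully determined before execution, each cycle's data paths are fixed in advance, which is the property the article credits for ASIC-level data-movement efficiency.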
Real-world performance speaks volumes:
Mountain Series: Built for data-center large models, it delivers 6x the token throughput of NVIDIA's H100, cuts per-token cost by 75%, reduces energy consumption to one third, and offers 100-500 TOPS of compute, suiting autonomous driving, AI clusters, and LLM servers.
River Series: Built for edge-side agents, it consumes just 1-5 W, delivers 10-50 TOPS, and achieves sub-1 ms latency, ideal for robots, smart cameras, and in-vehicle terminals.
Its advantage lies not in isolated parameters but in architectural generational leaps: smaller chip area for the same computing power, freeing up space for high-bandwidth SRAM—effectively "free" high-speed storage that breaks the "memory wall" and ensures fast, cost-effective, and stable inference.
03
Can It Really Replace GPUs Commercially?
Technology must deliver in the real world. From day one, Yuanchuan Micro focused on commercialization, with two product lines precisely addressing industry needs.
The Mountain Series targets cost reduction in cloud inference. With current high costs for large model API calls, its 6x speed, 1/4 cost, and 1/3 energy consumption enable significant TCO optimization for smart computing centers and cloud providers, accelerating large-scale GPU replacement. The River Series captures edge intelligence opportunities, with low power and latency perfectly matching real-time demands in embodied AI, industrial control, and smart homes—making it ideal for robots and in-vehicle terminals.
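The TCO claim above can be turned into a back-of-envelope calculation. The ratios (1/4 per-token cost, 1/3 energy) are the article's own marketing figures, and the baseline numbers below (per-million-token serving cost, power draw, electricity price) are hypothetical placeholders, not measurements.

```python
# Back-of-envelope monthly TCO comparison using the ratios claimed in
# the article (1/4 per-token cost, 1/3 energy vs. an H100 baseline).
# All baseline figures are hypothetical placeholders, not measured data.

def inference_tco(tokens_per_month, cost_per_mtok, power_kw, hours, kwh_price):
    """Monthly cost = per-token serving cost + energy cost."""
    serving = tokens_per_month / 1e6 * cost_per_mtok
    energy = power_kw * hours * kwh_price
    return serving + energy

# Hypothetical H100 baseline: 1B tokens/month, $2 per million tokens,
# 0.7 kW draw, 720 h uptime, $0.10/kWh.
base = inference_tco(1e9, cost_per_mtok=2.0,
                     power_kw=0.7, hours=720, kwh_price=0.10)

# Same workload with the article's claimed ratios applied.
lpu = inference_tco(1e9, cost_per_mtok=2.0 * 0.25,
                    power_kw=0.7 / 3, hours=720, kwh_price=0.10)

print(f"baseline ${base:,.1f}/mo vs LPU+ ${lpu:,.1f}/mo")
```

Under these placeholder inputs the monthly bill drops roughly fourfold, dominated by the serving-cost term; the claimed 6x throughput would further shrink the fleet size needed for a fixed workload.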
Investors' logic is clear: financial firms like Frees Fund and Origin Capital value the architectural disruption; SMIC Capital secures foundry capacity; SigmaStar and Jwsmart contribute application scenarios and sales channels, forming a "capital + technology + industry" closed loop that accelerates the journey from tape-out to commercialization.
04
My Reflections: The Inference Revolution Has Just Begun
2026 marks a watershed for AI computing power, with inference demand exceeding 70% and market size surpassing training chips for the first time. The industry shifts from "stacking computing power" to "optimizing efficiency." Yuanchuan Micro's rise signals a golden age for specialized inference chips.
NVIDIA's heavy investment in Groq's LPU confirms dataflow architecture as an optimal solution for inference—LPU+ is no niche path. Yuanchuan Micro's value lies in its domestic supply chain focus and iterative architectures better suited to agents and multimodality, bypassing CUDA ecosystem barriers by overtaking in specialized application scenarios.
For the industry, this is not just one company's success but a microcosm of China's chip sector evolving from "following" to "leading." As the training market saturates, Yuanchuan Micro and similar players are redefining the trillion-dollar inference blue ocean.
Over the next year, Yuanchuan Micro will advance its first chip tape-out and customer validation. When LPU+ achieves mass production, we may witness AI inference cost barriers shattered and intelligent agents becoming ubiquitous.
Disclaimer: This article is compiled, translated, and edited from public media content and is for reference only. The views and content do not constitute guidance, and no project recommendations or commitments are made to readers. If this article inadvertently infringes your rights, please contact us for prompt resolution.
This article does not constitute investment advice