February 10, 2026 3 min read 567 words

Zihan Yang, Shuyuan Tu, Licheng Zhang, Qi Dai, Yu-Gang Jiang, Zuxuan Wu

Also: Русский

This article is AI-generated from a scientific publication. We recommend verifying information in the original source.

Why It Matters

Engineers can now build real-time AI image tools for mobile and web apps that run efficiently on consumer hardware, enabling on-device creative applications without sacrificing image quality.

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

In Brief

ArcFlow is a new way to make AI-generated images faster and more efficient. It uses a smarter method to skip many of the slow steps normally needed in image creation, while still producing high-quality results. "Non-linear flow" refers to a smooth, curved path the AI takes to build an image, unlike simpler straight-line shortcuts.

The Problem

Creating detailed AI images usually takes a long time because the process involves many small steps to gradually "clean up" noise and form a clear picture. This is called diffusion, and while it produces excellent results, it’s slow—especially for real-time apps or mobile devices. Faster methods often sacrifice image quality because they take short cuts that don’t match how the original model builds images. This trade-off between speed and quality has long been a major challenge in AI image generation.

The Solution

To fix this, researchers created ArcFlow, a system that mimics how top AI models create images but does it in just two steps. Instead of using simple straight-line approximations (which often miss the true path), ArcFlow models the image-building process as a smooth, curved path—like a ball rolling down a winding hill rather than a straight ramp. This curved path, or "non-linear flow," better matches how the original model actually works. The team used a mathematical trick to calculate this path exactly, avoiding small errors that happen when computers estimate curves step-by-step. This allows ArcFlow to stay accurate even when skipping most steps. The system is trained by comparing its output to a powerful pre-trained model (like Qwen-Image-20B), using only small, lightweight changes to the original model—just 5% of the parameters are adjusted. This makes training fast and efficient. shows how ArcFlow (with only 1.8G trainable parameters) matches or beats models with far more parameters, like TwinFlow (38G), in image quality.

The ArcFlow model, with 1.8G trainable parameters, produces high-quality images comparable to or better than models with significantly more parameters, such as TwinFlow (38G) and Qwen-Image (1.8G).

Key Findings

ArcFlow achieves a 40x speedup using only 2 numerical function evaluations (NFEs), compared to original multi-step models, with no major drop in image quality.
ArcFlow outperforms other fast models in image quality, as shown by a lower FID Score—a measure of how close generated images are to real ones. confirms that ArcFlow consistently achieves the lowest FID Score across all training iterations, beating both piFlow and TwinFlow.

TwinFlow consistently achieves the highest FID Score, while ArcFlow/Ours achieves the lowest FID Score across all iterations.

In direct comparisons, ArcFlow produces images with more complex seamless transitions and higher detail, especially in textured or intricate scenes. highlights this by showing close-ups where ArcFlow captures finer details (like fabric folds or light reflections) better than Qwen-Image-Lightning.

The figure demonstrates that the ArcFlow method produces images with more complex seamless transitions and higher detail compared to the Qwen-Image-Lightning method.

Why It Matters

This breakthrough could make AI image generation much faster and more accessible. Imagine using AI to design game assets, create concept art, or generate visuals for social media—all in real time, on a phone or laptop. Because ArcFlow uses so little extra computation, it could run on devices with limited power, like smartphones or tablets. This opens up new possibilities for creative tools, real-time design, and on-device AI features without needing expensive hardware.

Limitations

The researchers report that ArcFlow’s performance may vary slightly depending on the prompt or image style, though it remains stable across benchmarks.
The method relies on a pre-trained teacher model (like Qwen-Image-20B or FLUX.1-dev), so it cannot generate images from scratch.
While the model is efficient, the full impact on diverse real-world applications—such as video generation or interactive design—has not yet been tested.

Read Original Paper

All Articles