diff-font：稳健单发字体生成的扩散模型

论文标题

diff-font：稳健单发字体生成的扩散模型

Diff-Font: Diffusion Model for Robust One-Shot Font Generation

论文作者

He, Haibin, Chen, Xinyuan, Wang, Chaoyue, Liu, Juhua, Du, Bo, Tao, Dacheng, Qiao, Yu

论文摘要

字体生成是一项艰巨且耗时的任务，尤其是在这些语言中，使用具有大量字符（例如中文）结构的意识形态图。为了解决这个问题，很少有字体产生甚至一击字体产生引起了很多关注。但是，大多数现有的字体生成方法仍然可能遭受（i）较大的跨膜差距挑战；（ii）微妙的跨第一个变化问题；（iii）不正确的复杂字符。在本文中，我们提出了一种基于扩散模型的新型单发字体生成方法，该方法名为DIFF-FONT，可以在大型数据集上稳定训练。提出的模型旨在通过仅给出一个样本作为参考来生成整个字体库。具体而言，构建了大型笔划的数据集，并提出了一个划分的扩散模型来保留每个生成的字符的结构和完成。据我们所知，拟议的Diff-Font是开发传播模型来处理字体生成任务的第一部作品。训练有素的DIFF-FONT不仅对字体差距和字体变化也很强，而且在困难的角色产生方面也实现了有希望的表现。与以前的字体生成方法相比，我们的模型在定性和定量上都达到最先进的性能。

Font generation is a difficult and time-consuming task, especially in those languages using ideograms that have complicated structures with a large number of characters, such as Chinese. To solve this problem, few-shot font generation and even one-shot font generation have attracted a lot of attention. However, most existing font generation methods may still suffer from (i) large cross-font gap challenge; (ii) subtle cross-font variation problem; and (iii) incorrect generation of complicated characters. In this paper, we propose a novel one-shot font generation method based on a diffusion model, named Diff-Font, which can be stably trained on large datasets. The proposed model aims to generate the entire font library by giving only one sample as the reference. Specifically, a large stroke-wise dataset is constructed, and a stroke-wise diffusion model is proposed to preserve the structure and the completion of each generated character. To our best knowledge, the proposed Diff-Font is the first work that developed diffusion models to handle the font generation task. The well-trained Diff-Font is not only robust to font gap and font variation, but also achieved promising performance on difficult character generation. Compared to previous font generation methods, our model reaches state-of-the-art performance both qualitatively and quantitatively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题