实时计算成像的两步训练深度学习框架，没有物理先验

论文标题

实时计算成像的两步训练深度学习框架，没有物理先验

A Two-step-training Deep Learning Framework for Real-time Computational Imaging without Physics Priors

论文作者

Shang, Ruibo, Hoffer-Hawlik, Kevin, Luke, Geoffrey P.

论文摘要

深度学习（DL）是许多应用程序计算成像的强大工具。一种常见的策略是重建初步图像作为神经网络的输入，以实现优化的图像。通常，以成像模型的先验知识获取初步图像。但是，一个突出的挑战是实际成像模型与假定模型的偏差程度。模型不匹配降低了初步图像的质量，因此会影响DL预测。另一个主要挑战是，由于大多数成像逆问题是错误的，并且网络被过度参数化，因此DL网络具有灵活性，可以从与成像模型无直接相关的数据中提取功能。为了解决这些挑战，提出了一个两步训练的DL（TST-DL）框架，用于没有物理先验的实时计算成像。首先，对单个完全连接的层（FCL）进行训练以直接学习模型。然后，该FCL是固定的，并与未经训练的U-NET体系结构进行了连接，以进行二步训练，以提高输出图像保真度，从而带来四个主要优势。首先，它不依赖于成像模型的准确表示，因为该模型是直接学习的。其次，可以实现实时成像。第三，TST-DL网络朝着所需方向进行训练，并且预测得到了改进，因为第一步被限制以学习模型，第二步通过学习最佳正常化程序来改善结果。第四，该方法适合数据的任何大小和维度。我们使用线性单像素相机成像模型演示了此框架。与其他DL框架和基于模型的迭代优化方法的结果相比，该结果进行了定量的比较。我们将此概念进一步扩展到图像脱离自动相关的应用中。

Deep learning (DL) is a powerful tool in computational imaging for many applications. A common strategy is to reconstruct a preliminary image as the input of a neural network to achieve an optimized image. Usually, the preliminary image is acquired with the prior knowledge of the imaging model. One outstanding challenge, however, is the degree to which the actual imaging model deviates from the assumed model. Model mismatches degrade the quality of the preliminary image and therefore affect the DL predictions. Another main challenge is that since most imaging inverse problems are ill-posed and the networks are over-parameterized, DL networks have flexibility to extract features from the data that are not directly related to the imaging model. To solve these challenges, a two-step-training DL (TST-DL) framework is proposed for real-time computational imaging without physics priors. First, a single fully-connected layer (FCL) is trained to directly learn the model. Then, this FCL is fixed and concatenated with an un-trained U-Net architecture for a second-step training to improve the output image fidelity, resulting in four main advantages. First, it does not rely on an accurate representation of the imaging model since the model is directly learned. Second, real-time imaging can be achieved. Third, the TST-DL network is trained in the desired direction and the predictions are improved since the first step is constrained to learn the model and the second step improves the result by learning the optimal regularizer. Fourth, the approach accommodates any size and dimensionality of data. We demonstrate this framework using a linear single-pixel camera imaging model. The results are quantitatively compared with those from other DL frameworks and model-based iterative optimization approaches. We further extend this concept to nonlinear models in the application of image de-autocorrelation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题