Image Deconvolution with the Half-quadratic Splitting Method

Published: 2024/1/18

When working on image reconstruction or other inverse problems, one often runs into a method called half-quadratic splitting (HQS), a classic technique from optimization. After collecting material on it on and off, I found that a lecture note from Stanford's computational imaging course explains its use in image deconvolution in great detail. This post is a set of study notes based on that lecture note.

Image Formation

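In general, image formation can be modeled as:

$b = c \ast x + \eta \tag{1}$

Here $b$ is the observed blurry image, $c$ is the convolution kernel, which can be regarded as the point spread function (PSF) of the optical system, $\eta$ is signal-independent noise, and $x$ is the sharp image we want to reconstruct.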

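By the convolution theorem, convolution in the spatial domain corresponds to element-wise multiplication in the frequency domain, so the equation above can be written as:

$b = \mathcal{F}^{-1} \{ \mathcal{F}(c) \cdot \mathcal{F}(x) \} + \eta \tag{2}$

Note that this equivalence holds only when the convolution uses circular boundary conditions.

To simplify the later derivations, we rewrite the convolution as a matrix-vector product:

$b = c \ast x + \eta \Leftrightarrow \mathbf{b} = \mathbf{C} \mathbf{x} + \mathbf{\eta} \tag{3}$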
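The equivalence between the matrix form and the Fourier form can be checked numerically. The following is a minimal sketch in 1D, assuming a toy kernel and a random signal (both made up for illustration); the 2D case works the same way with `fft2`.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 8
x = rng.standard_normal(N)      # toy 1D "image" (assumption)
c = np.zeros(N)
c[:3] = [0.5, 0.3, 0.2]         # toy blur kernel, zero-padded to length N

# Matrix form: circulant matrix with C[i, j] = c[(i - j) mod N],
# so that C @ x is a circular convolution.
idx = (np.arange(N)[:, None] - np.arange(N)[None, :]) % N
C = c[idx]
b_matrix = C @ x

# Fourier form: element-wise product of the two spectra.
b_fft = np.real(np.fft.ifft(np.fft.fft(c) * np.fft.fft(x)))

print(np.allclose(b_matrix, b_fft))
```

The two results agree only because the matrix is circulant, which is the matrix counterpart of circular boundary conditions.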

Inverse Filtering and Wiener Deconvolution

Ignoring the noise, equation (2) can be written in the Fourier domain as:

$\mathcal{F} (b) = \mathcal{F}(c) \cdot \mathcal{F}(x)$

so $x$ can be computed directly:

$\tilde{x}_{if} = \mathcal{F}^{-1} \{ \frac{\mathcal{F}(b)}{\mathcal{F}(c)} \} \tag{4}$

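The subscript "if" stands for inverse filtering. If the kernel is known, we compute its Fourier transform, divide the Fourier transform of the observed image by it to obtain the sharp image in the frequency domain, and then apply an inverse Fourier transform to get the final sharp image.

This looks simple and efficient, but it has a problem: wherever $\mathcal{F}(c)$ is close to 0, the division amplifies the noise in the observed image. Wiener deconvolution avoids this by adding a damping factor: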

$\tilde{x}_{wf} = \mathcal{F}^{-1} \left\{ \frac{|\mathcal{F}(c)|^{2}}{|\mathcal{F}(c)|^{2} + \frac{1}{SNR}} \frac{\mathcal{F}(b)}{\mathcal{F}(c)} \right\} \tag{5}$

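SNR is the signal-to-noise ratio. If the observed image contains no noise, the SNR is very high, $\frac{1}{SNR}$ tends to 0, and the Wiener filter reduces to the inverse filter.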
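Both filters take only a few lines with NumPy. The sketch below is an illustration under stated assumptions: the test image, the 3x3 kernel, the noise level, and `snr=100` are arbitrary choices, and `psf2otf` is a small helper defined here. The Wiener filter is written in the algebraically equivalent form $\mathcal{F}(c)^{*} \mathcal{F}(b) / (|\mathcal{F}(c)|^{2} + 1/SNR)$, which avoids dividing by $\mathcal{F}(c)$.

```python
import numpy as np

def psf2otf(psf, shape):
    """Zero-pad the PSF to `shape`, move its center to index (0, 0), and FFT it."""
    padded = np.zeros(shape)
    kh, kw = psf.shape
    padded[:kh, :kw] = psf
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.fft.fft2(padded)

def inverse_filter(b, otf):
    """Equation (4): divide by F(c); unstable where |F(c)| is close to 0."""
    return np.real(np.fft.ifft2(np.fft.fft2(b) / otf))

def wiener_filter(b, otf, snr):
    """Equation (5), simplified to conj(F(c)) F(b) / (|F(c)|^2 + 1/SNR)."""
    num = np.conj(otf) * np.fft.fft2(b)
    return np.real(np.fft.ifft2(num / (np.abs(otf) ** 2 + 1.0 / snr)))

rng = np.random.default_rng(0)
x = np.zeros((32, 32))
x[8:24, 8:24] = 1.0                      # toy sharp image (assumption)
k1d = np.array([0.2, 0.6, 0.2])          # toy kernel whose transform has no zeros
otf = psf2otf(np.outer(k1d, k1d), x.shape)
b_clean = np.real(np.fft.ifft2(otf * np.fft.fft2(x)))
b_noisy = b_clean + 0.05 * rng.standard_normal(x.shape)

def mse(estimate):
    return float(np.mean((estimate - x) ** 2))

print("inverse filter, no noise:", mse(inverse_filter(b_clean, otf)))
print("inverse filter, noisy:   ", mse(inverse_filter(b_noisy, otf)))
print("Wiener filter, noisy:    ", mse(wiener_filter(b_noisy, otf, snr=100.0)))
```

Without noise the inverse filter recovers the image up to rounding error; with noise, the frequencies where $|\mathcal{F}(c)|$ is small dominate its error, which is what the damping term in the Wiener filter suppresses.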

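Compared with plain inverse filtering, Wiener deconvolution gives more reasonable results. The main limitation of both methods is that they cannot exploit prior knowledge about natural images. Next, we look at how to combine such priors with optimization.

Natural Image Priors

A natural image prior is a mathematical model that describes which pixel distributions are likely in natural images. When solving an ill-posed inverse problem, infinitely many solutions may agree with the observations, and the prior helps us pick the ones that look most plausible.

A prior models the distribution of natural images. Here we express it as a regularizer, which usually corresponds to the negative log of the prior. Common regularizers encode smoothness, sparsity, sparse gradients, self-similarity, and so on. For example, sparsity can be expressed with the $\ell_1$ norm, $\|\mathbf{x}\|_1 = \sum_i |x_i|$, and smoothness with the $\ell_2$ norm of the image derivatives, $\|\Delta \mathbf{x}\|_2$. The most widely used regularizer for image restoration, however, is total variation (TV). TV is based on the following observation: most natural images are smooth over large regions, and edges appear only at the boundaries between objects. Inside an object the pixel values are smooth or similar, while at object boundaries they change abruptly.

To model TV we need the first-order derivatives of the image. We write the horizontal derivative as $\mathbf{D}_x\mathbf{x}$ and the vertical derivative as $\mathbf{D}_y\mathbf{x}$, defined as: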

$\mathbf{D}_x\mathbf{x} \Leftrightarrow \mathbf{x} \ast d_x, \quad d_x = \begin{bmatrix} 0 & 0 & 0\\ 0 & -1 & 1 \\ 0 & 0 & 0 \end{bmatrix}$

$\mathbf{D}_y\mathbf{x} \Leftrightarrow \mathbf{x} \ast d_y, \quad d_y = \begin{bmatrix} 0 & 0 & 0\\ 0 & -1 & 0 \\ 0 & 1 & 0 \end{bmatrix}$

These first derivatives can be computed either as a matrix-vector product or as a convolution.
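As a quick check, the sketch below evaluates the convolution with $d_x$ and $d_y$ in the Fourier domain and compares it with a direct finite difference. It assumes circular boundary conditions and the true-convolution convention (kernel flipped), under which $\mathbf{D}_x\mathbf{x}$ at pixel $(i, j)$ is $x_{i,j-1} - x_{i,j}$; the sign and direction are a convention and do not affect TV. `psf2otf` is a helper defined here.

```python
import numpy as np

def psf2otf(psf, shape):
    """Zero-pad the kernel to `shape`, move its center to index (0, 0), and FFT it."""
    padded = np.zeros(shape)
    kh, kw = psf.shape
    padded[:kh, :kw] = psf
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.fft.fft2(padded)

d_x = np.array([[0, 0, 0], [0, -1, 1], [0, 0, 0]], dtype=float)
d_y = np.array([[0, 0, 0], [0, -1, 0], [0, 1, 0]], dtype=float)

rng = np.random.default_rng(0)
x = rng.standard_normal((6, 8))          # toy image (assumption)

# Convolution form, evaluated in the Fourier domain.
Fx = np.fft.fft2(x)
Dx_conv = np.real(np.fft.ifft2(psf2otf(d_x, x.shape) * Fx))
Dy_conv = np.real(np.fft.ifft2(psf2otf(d_y, x.shape) * Fx))

# Direct form: circular finite differences along columns and rows.
Dx_direct = np.roll(x, 1, axis=1) - x
Dy_direct = np.roll(x, 1, axis=0) - x

print(np.allclose(Dx_conv, Dx_direct), np.allclose(Dy_conv, Dy_direct))
```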

The TV regularizer is the $\ell_1$ norm of the image gradient. It comes in two forms, anisotropic TV and isotropic TV:

Anisotropic TV regularizer

$TV_{anisotropic}(\mathbf{x}) = \left \| \mathbf{D}_x\mathbf{x} \right \|_{1} + \left \| \mathbf{D}_y\mathbf{x} \right \|_{1} = \sum_{i=1}^{N} \left| (\mathbf{D}_x\mathbf{x})_{i} \right| + \left| (\mathbf{D}_y\mathbf{x})_{i} \right| = \sum_{i=1}^{N} \sqrt{(\mathbf{D}_x\mathbf{x})^{2}_{i}}+\sqrt{(\mathbf{D}_y\mathbf{x})^{2}_{i}}$

Isotropic TV regularizer

$TV_{isotropic}(\mathbf{x}) = \left \| \mathbf{D}\mathbf{x} \right \|_{2,1} = \sum_{i=1}^{N} \sqrt{(\mathbf{D}_x\mathbf{x})^{2}_{i} + (\mathbf{D}_y\mathbf{x})^{2}_{i}}$

The expressions show the difference: isotropic TV couples the two gradient directions at each pixel $i$ under a single norm, whereas anisotropic TV penalizes the two directions separately.
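A small worked example makes the difference concrete. The sketch below assumes circular finite differences and a toy image with a single bright pixel; at that pixel both gradient components are nonzero, so the two definitions disagree there.

```python
import numpy as np

def gradients(x):
    """Circular finite differences in the horizontal and vertical directions."""
    gx = np.roll(x, 1, axis=1) - x
    gy = np.roll(x, 1, axis=0) - x
    return gx, gy

def tv_anisotropic(x):
    gx, gy = gradients(x)
    return float(np.sum(np.abs(gx)) + np.sum(np.abs(gy)))

def tv_isotropic(x):
    gx, gy = gradients(x)
    return float(np.sum(np.sqrt(gx ** 2 + gy ** 2)))

x = np.zeros((4, 4))
x[1, 1] = 1.0                    # a single bright pixel

print(tv_anisotropic(x))         # |gx| and |gy| each sum to 2, total 4
print(tv_isotropic(x))           # sqrt(2) at pixel (1, 1) plus 1 at two neighbors
```

Since $\sqrt{a^2 + b^2} \leq |a| + |b|$, isotropic TV never exceeds anisotropic TV for the same image.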

Regularized Deconvolution with Half-quadratic Splitting

The Half-quadratic Splitting Method

Next we introduce the HQS optimization method. Before applying it to image restoration, we look at its general form. Consider the following generic observation model:

$\mathbf{b} = \mathbf{A}\mathbf{x} + \mathbf{\eta}$

where $\mathbf{x} \in R^{N}$ is the unknown vector, $\mathbf{b} \in R^{M}$ is the observation vector, $\eta$ is noise, and $\mathbf{A} \in R^{M \times N}$ is the system matrix. The problem is usually solved in the following regularized form:

$\min_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{A}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \lambda \Psi(\mathbf{x})$

The first term is called the data term and the second the regularization term. Minimizing this objective directly does not always converge well. A more robust approach is to rewrite it as:

$\min_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{A}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \lambda \Psi(\mathbf{z}) \\ \text{subject to} \quad \mathbf{D}\mathbf{x} - \mathbf{z} = 0$

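This formulation introduces an auxiliary variable $\mathbf{z}$ that splits the objective into two parts, the data term $f(\mathbf{x}) = \frac{1}{2} \left \| \mathbf{A}\mathbf{x} - \mathbf{b} \right \|_2^{2}$ and the regularization term $g(\mathbf{z}) = \lambda \Psi(\mathbf{z})$. The two terms depend on separate variables, and $\mathbf{x}$ and $\mathbf{z}$ are linked only through the constraint. Moving the constraint into the objective as a quadratic penalty with weight $p > 0$ gives: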

$L_p(\mathbf{x}, \mathbf{z}) = f(\mathbf{x}) + g(\mathbf{z}) + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2}$

An objective of this form can be minimized by alternating between the two variables, with the $\mathbf{x}$-update and the $\mathbf{z}$-update carried out separately:

$\mathbf{x} \gets \mathbf{prox}_{f, p} (\mathbf{z}) = \argmin_{\mathbf{x}} L_p(\mathbf{x}, \mathbf{z}) = \argmin_{\mathbf{x}} f(\mathbf{x}) + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2} \tag{13}$

$\mathbf{z} \gets \mathbf{prox}_{g, p} (\mathbf{D}\mathbf{x}) = \argmin_{\mathbf{z}} L_p(\mathbf{x}, \mathbf{z}) = \argmin_{\mathbf{z}} g(\mathbf{z}) + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2} \tag{14}$

As these updates show, updating $\mathbf{x}$ involves only $f(\mathbf{x})$ and updating $\mathbf{z}$ involves only $g(\mathbf{z})$; the two functions never have to be handled at the same time. This makes the framework very flexible, since it can be combined with many different regularizers, an approach also known as plug-and-play.

Although HQS applies to many inverse problems, here we focus on image deconvolution with a known, fixed kernel, for which the corresponding matrix is a circulant Toeplitz matrix. We first define the following correspondences:

$\mathcal{F}^{-1} \{ \mathcal{F}(c) \cdot \mathcal{F}(x) \} = \mathbf{C}\mathbf{x} \\ \mathcal{F}^{-1} \{ \mathcal{F}(c)^{*} \cdot \mathcal{F}(x) \} = \mathbf{C}^{T}\mathbf{x} \\ \mathcal{F}^{-1} \{ \frac{\mathcal{F}(b)}{\mathcal{F}(c)} \} = \mathbf{C}^{-1}\mathbf{b}$
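These correspondences can be verified numerically. The sketch below assumes a toy real-valued kernel whose Fourier transform has no zeros (needed for the inverse). It checks the adjoint identity $\langle \mathbf{C}\mathbf{x}, \mathbf{y} \rangle = \langle \mathbf{x}, \mathbf{C}^{T}\mathbf{y} \rangle$ and that $\mathbf{C}^{-1}$ undoes $\mathbf{C}$.

```python
import numpy as np

rng = np.random.default_rng(0)
shape = (8, 8)
x = rng.standard_normal(shape)
y = rng.standard_normal(shape)

# Toy kernel, already zero-padded to the image size (assumption).
c = np.zeros(shape)
c[0, 0], c[0, 1], c[1, 0] = 0.6, 0.2, 0.2
Fc = np.fft.fft2(c)

def C(v):
    """C v: circular convolution with c."""
    return np.real(np.fft.ifft2(Fc * np.fft.fft2(v)))

def C_T(v):
    """C^T v: multiply by the complex conjugate of F(c)."""
    return np.real(np.fft.ifft2(np.conj(Fc) * np.fft.fft2(v)))

def C_inv(v):
    """C^{-1} v: divide by F(c); requires F(c) to have no zeros."""
    return np.real(np.fft.ifft2(np.fft.fft2(v) / Fc))

lhs = np.sum(C(x) * y)       # <C x, y>
rhs = np.sum(x * C_T(y))     # <x, C^T y>
print(np.isclose(lhs, rhs), np.allclose(C_inv(C(x)), x))
```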

Standard Form of HQS with TV and Denoising Regularizers

Next we consider HQS with the TV regularizer. Using the correspondences above, the TV-regularized objective can be written as:

$\min_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{C}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \lambda \Psi(\mathbf{z}) \\ \text{subject to} \quad \mathbf{D}\mathbf{x} - \mathbf{z} = 0$

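where $\mathbf{D} = [\mathbf{D}_{x}^{T}, \mathbf{D}_{y}^{T}]^{T} \in R^{2N \times N}$ stacks the first-order derivatives in the $x$ and $y$ directions, and $\Psi(\mathbf{z}) = \left \| \mathbf{z} \right \|_{1}$ for anisotropic TV. Here $\mathbf{z} \in R^{2N}$ is twice as long as $\mathbf{x} \in R^{N}$, because every pixel has a gradient in both the $x$ and $y$ directions.

More generally, we can use a regularizer that acts on the image itself and pulls the estimate toward the space of natural images. In that case $\mathbf{D}$ is the identity and HQS takes the form: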

$\min_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{C}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \lambda \Psi(\mathbf{z}) \\ \text{subject to} \quad \mathbf{x} - \mathbf{z} = 0$

Efficient Implementation of the x-Update using Inverse Filtering

As described above, HQS alternates between updating $\mathbf{x}$ and $\mathbf{z}$. We start with the $\mathbf{x}$-update:

$\mathbf{prox}_{f, p} (\mathbf{z}) = \argmin_{\mathbf{x}} f(\mathbf{x}) + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2} = \argmin_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{C}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2} \tag{20}$

Expanding this expression gives:

$\frac{1}{2} \left \| \mathbf{C}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \frac{p}{2} \left \| \mathbf{D}\mathbf{x} - \mathbf{z} \right \|_{2}^{2} \\ = \frac{1}{2}(\mathbf{C}\mathbf{x} - \mathbf{b})^{T}(\mathbf{C}\mathbf{x} - \mathbf{b}) + \frac{p}{2}(\mathbf{D}\mathbf{x} - \mathbf{z})^{T}(\mathbf{D}\mathbf{x} - \mathbf{z}) \\ = \frac{1}{2}(\mathbf{x}^{T}\mathbf{C}^{T}\mathbf{C}\mathbf{x} - 2 \mathbf{x}^{T}\mathbf{C}^{T} \mathbf{b} + \mathbf{b}^{T}\mathbf{b} ) + \frac{p}{2}(\mathbf{x}^{T}\mathbf{D}^{T}\mathbf{D}\mathbf{x} - 2 \mathbf{x}^{T}\mathbf{D}^{T} \mathbf{z} + \mathbf{z}^{T}\mathbf{z} )$

Differentiating with respect to $\mathbf{x}$ gives:

$\mathbf{C}^{T}\mathbf{C}\mathbf{x} - \mathbf{C}^{T} \mathbf{b} + p \mathbf{D}^{T}\mathbf{D}\mathbf{x} - p \mathbf{D}^{T} \mathbf{z}$

Setting the derivative to 0 and solving for $\mathbf{x}$:

$\mathbf{x} = (\mathbf{C}^{T}\mathbf{C} + p\mathbf{D}^{T}\mathbf{D} )^{-1}(\mathbf{C}^{T} \mathbf{b} + p \mathbf{D}^{T} \mathbf{z} ) \tag{24}$
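The closed-form solution can be checked on a small dense problem. The sketch below is a 1D illustration with made-up kernels and random data: it builds explicit circulant matrices for $\mathbf{C}$ and $\mathbf{D}$, solves the linear system, and confirms that the derivative above vanishes at the solution.

```python
import numpy as np

def circulant(kernel):
    """Circulant matrix M with M[i, j] = kernel[(i - j) mod n]."""
    n = len(kernel)
    idx = (np.arange(n)[:, None] - np.arange(n)[None, :]) % n
    return kernel[idx]

rng = np.random.default_rng(0)
n = 8
c = np.zeros(n)
c[:3] = [0.5, 0.3, 0.2]          # toy blur kernel (assumption)
d = np.zeros(n)
d[:2] = [-1.0, 1.0]              # toy finite-difference kernel (assumption)
C, D = circulant(c), circulant(d)
b = rng.standard_normal(n)
z = rng.standard_normal(n)
p = 0.5                          # arbitrary penalty weight

# Equation (24), solved as a linear system instead of forming the inverse.
x = np.linalg.solve(C.T @ C + p * D.T @ D, C.T @ b + p * D.T @ z)

# The derivative from the text, evaluated at x; it should be numerically zero.
grad = C.T @ C @ x - C.T @ b + p * D.T @ D @ x - p * D.T @ z
print(np.max(np.abs(grad)))
```

Forming dense matrices costs $O(N^2)$ memory, which is only feasible for tiny signals; this is why the Fourier-domain form matters for real images.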

For 2D image deconvolution with circular boundary conditions, the expression above can be transformed to the Fourier domain and solved there, since every matrix product in it has a Fourier-domain counterpart.

Special Case of TV Regularizer

If the regularizer is TV, then $\mathbf{D}$ is the finite-difference operator, and the two factors of equation (24) can be written as:

$(\mathbf{C}^{T}\mathbf{C} + p\mathbf{D}^{T}\mathbf{D} ) \Leftrightarrow \mathcal{F}^{-1} \{ \mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{c\} + p (\mathcal{F}\{d_x\}^{*} \cdot \mathcal{F}\{d_x\} + \mathcal{F}\{d_y\}^{*} \cdot \mathcal{F}\{d_y\}) \}$

$(\mathbf{C}^{T} \mathbf{b} + p \mathbf{D}^{T} \mathbf{z} ) \Leftrightarrow \mathcal{F}^{-1} \{ \mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{b\} + p (\mathcal{F}\{d_x\}^{*} \cdot \mathcal{F}\{z_1\} + \mathcal{F}\{d_y\}^{*} \cdot \mathcal{F}\{z_2\}) \}$

This gives the solution of equation (20):

$\mathbf{prox}_{f, p} (\mathbf{z}) = \mathcal{F}^{-1} \left( \frac{\mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{b\} + p (\mathcal{F}\{d_x\}^{*} \cdot \mathcal{F}\{z_1\} + \mathcal{F}\{d_y\}^{*} \cdot \mathcal{F}\{z_2\})}{\mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{c\} + p (\mathcal{F}\{d_x\}^{*} \cdot \mathcal{F}\{d_x\} + \mathcal{F}\{d_y\}^{*} \cdot \mathcal{F}\{d_y\})} \right)$
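The formula maps directly onto a few FFT calls. The sketch below is one possible implementation, assuming circular boundary conditions; the kernel, the random inputs, and `p = 0.5` are arbitrary, and `psf2otf` and `x_update_tv` are names chosen here. It then checks that the gradient of the objective in equation (20) vanishes at the result.

```python
import numpy as np

def psf2otf(psf, shape):
    """Zero-pad the kernel to `shape`, move its center to index (0, 0), and FFT it."""
    padded = np.zeros(shape)
    kh, kw = psf.shape
    padded[:kh, :kw] = psf
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.fft.fft2(padded)

def x_update_tv(b, z1, z2, Fc, Fdx, Fdy, p):
    """Fourier-domain x-update; z1 and z2 are the horizontal and vertical parts of z."""
    num = np.conj(Fc) * np.fft.fft2(b) + p * (
        np.conj(Fdx) * np.fft.fft2(z1) + np.conj(Fdy) * np.fft.fft2(z2))
    den = np.abs(Fc) ** 2 + p * (np.abs(Fdx) ** 2 + np.abs(Fdy) ** 2)
    return np.real(np.fft.ifft2(num / den))

rng = np.random.default_rng(0)
shape = (16, 16)
k1d = np.array([0.2, 0.6, 0.2])                       # toy blur kernel (assumption)
Fc = psf2otf(np.outer(k1d, k1d), shape)
Fdx = psf2otf(np.array([[0, 0, 0], [0, -1, 1], [0, 0, 0]], dtype=float), shape)
Fdy = psf2otf(np.array([[0, 0, 0], [0, -1, 0], [0, 1, 0]], dtype=float), shape)
b = rng.standard_normal(shape)
z1 = rng.standard_normal(shape)
z2 = rng.standard_normal(shape)
p = 0.5

x_new = x_update_tv(b, z1, z2, Fc, Fdx, Fdy, p)

def apply(F, v):
    """Apply the circulant operator whose Fourier multiplier is F."""
    return np.real(np.fft.ifft2(F * np.fft.fft2(v)))

# Gradient C^T (C x - b) + p D^T (D x - z), which should vanish at the minimizer.
grad = (apply(np.conj(Fc), apply(Fc, x_new) - b)
        + p * apply(np.conj(Fdx), apply(Fdx, x_new) - z1)
        + p * apply(np.conj(Fdy), apply(Fdy, x_new) - z2))
print(np.max(np.abs(grad)))
```

The denominator does not depend on $\mathbf{z}$, so inside an iterative solver it can be computed once and reused.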

Special Case of Denoising Regularizer

For a more general regularizer, $\mathbf{D}$ can be taken to be the identity matrix, and the solution of equation (20) becomes even simpler:

$\mathbf{prox}_{f, p} (\mathbf{z}) = \mathcal{F}^{-1} \left( \frac{\mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{b\} + p \mathcal{F}\{z\} } {\mathcal{F}\{c\}^{*} \cdot \mathcal{F}\{c\} + p } \right)$
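The role of $p$ in this update is easy to see from its two limits. The sketch below uses a toy kernel and random inputs (all assumptions): with a tiny $p$ the update approaches the inverse filter of equation (4), and with a huge $p$ it simply returns $\mathbf{z}$.

```python
import numpy as np

def x_update_identity(b, z, Fc, p):
    """Fourier-domain x-update for D = I."""
    num = np.conj(Fc) * np.fft.fft2(b) + p * np.fft.fft2(z)
    return np.real(np.fft.ifft2(num / (np.abs(Fc) ** 2 + p)))

rng = np.random.default_rng(0)
shape = (16, 16)
c = np.zeros(shape)
c[0, 0], c[0, 1], c[1, 0] = 0.6, 0.2, 0.2   # toy kernel, already zero-padded
Fc = np.fft.fft2(c)                          # has no zeros for this kernel
b = rng.standard_normal(shape)
z = rng.standard_normal(shape)

x_small_p = x_update_identity(b, z, Fc, p=1e-12)   # data term dominates
x_large_p = x_update_identity(b, z, Fc, p=1e12)    # penalty term dominates
x_inverse = np.real(np.fft.ifft2(np.fft.fft2(b) / Fc))

print(np.max(np.abs(x_small_p - x_inverse)))
print(np.max(np.abs(x_large_p - z)))
```

For intermediate values, $p$ trades off fidelity to the blurry observation against staying close to the current $\mathbf{z}$.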

Updating z with the TV Regularizer

The $\mathbf{z}$-update of equation (14) can be written as:

$\mathbf{prox}_{g, p} (\mathbf{v}) = \mathcal{S}_{\lambda/p}(\mathbf{v}) = \argmin_{\mathbf{z}} \lambda \left \| \mathbf{z} \right \|_{1} + \frac{p}{2} \left \| \mathbf{v} - \mathbf{z} \right \|_{2}^{2}$

where $\mathbf{v} = \mathbf{D} \mathbf{x}$ and $\mathcal{S}$ is the element-wise soft-thresholding operator, a piecewise function:

$\mathcal{S}_{k}(v) = \left\{\begin{matrix} v - k & v > k \\ 0 & |v| \leq k \\ v + k & v < -k \end{matrix}\right.$
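Soft thresholding is a one-liner in NumPy. The sketch below applies it to a few sample values and, for one scalar, compares the result against a brute-force minimization of $\lambda |z| + \frac{p}{2}(v - z)^2$ on a grid; the values of $\lambda$, $p$, and $v$ are arbitrary.

```python
import numpy as np

def soft_threshold(v, k):
    """Element-wise S_k(v): shrink every entry toward 0 by k, clipping at 0."""
    return np.sign(v) * np.maximum(np.abs(v) - k, 0.0)

v = np.array([-2.0, -0.5, 0.0, 0.3, 1.5])
z = soft_threshold(v, 1.0)       # threshold k = lam / p = 1
print(z)

# Brute-force check of the prox definition for the single value v = 1.5.
lam, p = 1.0, 1.0
grid = np.linspace(-3.0, 3.0, 6001)
objective = lam * np.abs(grid) + 0.5 * p * (1.5 - grid) ** 2
z_brute = grid[np.argmin(objective)]
print(z_brute, soft_threshold(np.array([1.5]), lam / p)[0])
```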

Isotropic TV Norm

For isotropic TV, the regularizer is:

$g(\mathbf{z}) = \lambda \left \| \mathbf{D}\mathbf{x} \right \|_{2,1} = \lambda \sum_{i=1}^{N} \sqrt{(\mathbf{D}_x\mathbf{x})^{2}_{i} + (\mathbf{D}_y\mathbf{x})^{2}_{i}}$

The deconvolution problem with the isotropic regularizer can then be written as:

$\min_{\mathbf{x}} \frac{1}{2} \left \| \mathbf{C}\mathbf{x} - \mathbf{b} \right \|_2^{2} + \lambda \sum_{i=1}^{N} \left \| \begin{bmatrix} z_i\\ z_{i+N} \end{bmatrix} \right \|_{2} \\ \text{subject to} \quad \mathbf{D}\mathbf{x} - \mathbf{z} = 0$

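$z_i$ denotes the $i$-th element of $\mathbf{z}$: the entries with $1 \leq i \leq N$ hold the horizontal finite differences and those with $N+1 \leq i \leq 2N$ hold the vertical finite differences.

The $\mathbf{z}$-update can be written as:

$\mathbf{z} \gets \mathbf{prox}_{g, p} (\mathbf{v}) = \argmin_{\mathbf{z}} \lambda \sum_{i=1}^{N} \left \| \begin{bmatrix} z_i\\ z_{i+N} \end{bmatrix} \right \|_{2} + \frac{p}{2} \left \| \mathbf{v} - \mathbf{z} \right \|_{2}^{2} \quad \mathbf{v} = \mathbf{D}\mathbf{x}$

The resulting $\mathbf{z}$-update is:

$\begin{bmatrix} z_i\\ z_{i+N} \end{bmatrix} \gets \mathcal{S}_{\lambda /p} \left( \begin{bmatrix} v_i\\ v_{i+N} \end{bmatrix} \right), \quad 1 \leq i \leq N$

where $\mathcal{S}$ here denotes vector soft-thresholding, $\mathcal{S}_{k}(\mathbf{a}) = \max \left( 0, 1 - \frac{k}{\left \| \mathbf{a} \right \|_{2}} \right) \mathbf{a}$, which shrinks the length of each two-element gradient vector rather than each entry separately.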
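For the isotropic case, the thresholding acts on the length of each per-pixel gradient vector. The sketch below uses the standard vector soft-thresholding formula $\max(0, 1 - k / \|\mathbf{a}\|_2)\,\mathbf{a}$; the function name and the small guard against division by zero are choices made here for illustration.

```python
import numpy as np

def vector_soft_threshold(v1, v2, k):
    """Shrink each pair (v1[i], v2[i]) toward the origin by k in Euclidean length."""
    norm = np.sqrt(v1 ** 2 + v2 ** 2)
    # np.maximum(norm, 1e-12) only guards against dividing by zero.
    scale = np.maximum(1.0 - k / np.maximum(norm, 1e-12), 0.0)
    return scale * v1, scale * v2

v1 = np.array([3.0, 0.3, 0.0])   # horizontal gradients at three pixels
v2 = np.array([4.0, 0.4, 0.0])   # vertical gradients at the same pixels
z1, z2 = vector_soft_threshold(v1, v2, 1.0)
print(z1, z2)
```

The first pixel has gradient length 5, which is shrunk to 4 while keeping its direction; the second has length 0.5, below the threshold, so it is set to zero.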

Updating z with DnCNN or any Gaussian Denoiser as the Regularizer

Looking at equation (14) again, now without the matrix $\mathbf{D}$ (that is, with $\mathbf{D} = \mathbf{I}$):

$\argmin_{\mathbf{z}} g(\mathbf{z}) + \frac{p}{2} \left \| \mathbf{x} - \mathbf{z} \right \|_{2}^{2} \\ = \argmin_{\mathbf{z}} \lambda \Psi(\mathbf{z}) + \frac{p}{2} \left \| \mathbf{x} - \mathbf{z} \right \|_{2}^{2} \\ = \argmin_{\mathbf{z}} \Psi(\mathbf{z}) + \frac{p}{2 \lambda} \left \| \mathbf{x} - \mathbf{z} \right \|_{2}^{2} \tag{36}$

Equation (36) can be viewed as a denoising problem, equivalent to:

$\argmin_{\mathbf{x}} \Psi(\mathbf{x}) + \frac{1}{2 \sigma^{2}} \left \| \mathbf{v} - \mathbf{x} \right \|_{2}^{2}$

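where $\mathbf{v}$ is the observed noisy image and $\mathbf{x}$ is the noise-free image to recover. Based on this observation, with a denoising regularizer the $\mathbf{z}$-update simply becomes a denoising step, which can use a classical denoiser or a CNN-based one such as DnCNN: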

$\mathbf{prox}_{\mathcal{D}, p} (\mathbf{x}) = \mathcal{D} \left( \mathbf{x}, \sigma^{2} = \frac{\lambda}{p} \right)$
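In code, this update is just a call to whatever denoiser is plugged in. The sketch below shows the interface only: `z_update_denoiser` and `toy_denoiser` are hypothetical names, and the toy denoiser is the exact solution for the quadratic prior $\Psi(\mathbf{x}) = \frac{1}{2}\|\mathbf{x}\|_2^2$, used as a stand-in because its output can be verified by hand. It is not DnCNN or any real denoising network.

```python
import numpy as np

def z_update_denoiser(x, lam, p, denoiser):
    """z-update of equation (36): denoise x assuming noise variance sigma^2 = lam / p."""
    return denoiser(x, lam / p)

def toy_denoiser(v, sigma2):
    """Hypothetical stand-in: minimizer of 0.5*||x||^2 + ||v - x||^2 / (2*sigma2)."""
    return v / (1.0 + sigma2)

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 4))
lam, p = 0.2, 0.8                # arbitrary values, so sigma^2 = 0.25

z = z_update_denoiser(x, lam, p, toy_denoiser)

# Optimality check for lam * Psi(z) + p/2 * ||x - z||^2 with this quadratic Psi.
grad = lam * z - p * (x - z)
print(np.max(np.abs(grad)))
```

Any function with the same `(image, noise variance)` signature could be swapped in, which is what makes the scheme plug-and-play.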

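To summarize, HQS with either the TV regularizer or a denoising regularizer alternates between the $\mathbf{x}$-update and the $\mathbf{z}$-update derived above, repeating the two steps until the estimate stops changing or a fixed number of iterations is reached.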
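The full loop for the anisotropic TV case can be sketched as follows. This is an illustration under stated assumptions, not a tuned implementation: the test image, the 5x5 box blur, the noise level, and the values of `lam`, `p`, and `num_iters` are arbitrary, and `p` is kept fixed. The function also records the penalized objective $L_p$ after every iteration; because each update minimizes $L_p$ exactly over one variable, that sequence cannot increase.

```python
import numpy as np

def psf2otf(psf, shape):
    """Zero-pad the kernel to `shape`, move its center to index (0, 0), and FFT it."""
    padded = np.zeros(shape)
    kh, kw = psf.shape
    padded[:kh, :kw] = psf
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    return np.fft.fft2(padded)

def soft_threshold(v, k):
    return np.sign(v) * np.maximum(np.abs(v) - k, 0.0)

def hqs_tv(b, psf, lam, p, num_iters):
    """HQS deconvolution with anisotropic TV; returns the estimate and L_p per iteration."""
    fft2, ifft2 = np.fft.fft2, np.fft.ifft2
    Fc = psf2otf(psf, b.shape)
    Fdx = psf2otf(np.array([[0, 0, 0], [0, -1, 1], [0, 0, 0]], dtype=float), b.shape)
    Fdy = psf2otf(np.array([[0, 0, 0], [0, -1, 0], [0, 1, 0]], dtype=float), b.shape)
    Fcb = np.conj(Fc) * fft2(b)
    den = np.abs(Fc) ** 2 + p * (np.abs(Fdx) ** 2 + np.abs(Fdy) ** 2)
    z1 = np.zeros_like(b)
    z2 = np.zeros_like(b)
    history = []
    for _ in range(num_iters):
        # x-update: closed-form solve in the Fourier domain.
        Fx = (Fcb + p * (np.conj(Fdx) * fft2(z1) + np.conj(Fdy) * fft2(z2))) / den
        x = np.real(ifft2(Fx))
        # z-update: soft-threshold the gradients v = D x with threshold lam / p.
        v1 = np.real(ifft2(Fdx * Fx))
        v2 = np.real(ifft2(Fdy * Fx))
        z1 = soft_threshold(v1, lam / p)
        z2 = soft_threshold(v2, lam / p)
        # Penalized objective L_p(x, z), recorded for monitoring.
        residual = np.real(ifft2(Fc * Fx)) - b
        history.append(0.5 * np.sum(residual ** 2)
                       + lam * (np.sum(np.abs(z1)) + np.sum(np.abs(z2)))
                       + 0.5 * p * (np.sum((v1 - z1) ** 2) + np.sum((v2 - z2) ** 2)))
    return x, history

rng = np.random.default_rng(0)
x_true = np.zeros((32, 32))
x_true[8:24, 8:24] = 1.0                          # toy sharp image
psf = np.full((5, 5), 1.0 / 25.0)                 # toy box blur
b = np.real(np.fft.ifft2(psf2otf(psf, x_true.shape) * np.fft.fft2(x_true)))
b = b + 0.01 * rng.standard_normal(b.shape)

x_hat, history = hqs_tv(b, psf, lam=0.01, p=1.0, num_iters=50)
print("L_p first / last:", history[0], history[-1])
print("MSE blurry / restored:",
      np.mean((b - x_true) ** 2), np.mean((x_hat - x_true) ** 2))
```

Replacing the two `soft_threshold` calls with vector soft-thresholding gives the isotropic variant, and replacing the whole z-update with a denoiser call (with $\mathbf{D} = \mathbf{I}$ in the x-update) gives the plug-and-play variant.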
