4. The Central Path

The problem solving algorithm will be developed as an central path algorithm. By firstly develop the barrier function for the constrains then find an central path, using the barrier ‘force field’.

Suppose the strictly feasible holds.

Build the Log-barrier function (carefully choosen) based on a function (primal objective function, or with the duality function together), with a relaxation on the constrains to parameter \(\gamma\ (\eta)\). Then found the analytic center \(x(\gamma)\), with the sequence of \(\{ x(\gamma) \}\), we could get closer and closer to the optimal, with the convergence properties analysised later.

4.1 Barrier Function for a linear matrix inequality

As the former described problems (the primal SDP problem , the dual problem, and the primal-dual problem) are all SDPs. We will generally consider the linear matrix inequality constrain. Its barrier is (finity only the problem is strictly feasible) :

\[\begin{split}\phi(x) = \begin{cases} \log\det F(x)^{-1} \quad if F(x)>0 \\ + \infty \quad \quad otherwise \end{cases}\end{split}\]

The upper function is : analytic, stritly convex, and self-concordant . It has the following results. Using \(\nabla \log\det X = X^{-1}\).

\[(\nabla \phi(x))_{i} = -Tr(F_{i}\nabla\log\det(F)) = - Tr(F_{i}F(x)^{-1})\]
\[(\nabla^{2} \phi(x))_{i,j} = -Tr(F_{j}\nabla(Tr(F^{-1}F_{i})))= Tr(F(x)^{-1}F_{i}F(x)^{-1}F_{j})\]
\[\begin{split}\begin{align*} \log\det (X+\delta X)^{-1} &\approx \log\det X^{-1} + \sum_{i}(\nabla \phi(x))_{i}\delta x_{i} + \sum_{i}\sum_{j} \frac{1}{2} \delta x_{i}(\Delta^{2} ]\phi(x))_{i,j}\delta x_{j} \\ & = \log\det X^{-1} - \sum_{i} Tr(X^{-1}F_{i})\delta x_{i} + \sum_{i,j}\frac{1}{2} \delta x_{i} Tr(X^{-1}F_{i}X^{-1}F_{j}) x_{j} \\ & = \log\det X^{-1} - Tr(X^{-1}\delta X) + \frac{1}{2} Tr(X^{-1}\delta X X^{-1}\delta X) \end{align*}\end{split}\]

The first line uses the second order approximation, and using \(\delta X = \sum_{i}\delta x_{i}F_{i}\).

4.2 Analytic center

Suppose X is bounded. As the barrier is stirctly convex, it has an unique minimizer (The Analytic Center of the linear inequality):

\[x^{*} = \arg\min \phi(x)\]

Using the first-order optimal condition:

\[(\nabla \phi(x^{*}))_{i} = - Tr(F(x^{*})^{-1}F_{i}) = 0\]

The log function of a linear set (detX) could be seen as product of a set of distances. We have the similar interpretation as the general center path algorith, as a point that maximized the product of a set of distances.

We can use the Newton’s method to solve the analytic center as an iterative process of solve the minimize problem:

\[\begin{split}\begin{align*} \delta x^{N} &= \arg\min_{v\in \mathcal{R}^{m}} \phi(x+v) \\ &= \arg\min_{v\in \mathcal{R}^{m}} \log\det(F(x) +F(v))^{-1} \\ &= \arg\min_{v\in \mathcal{R}^{m}}\{ - (\log\det F(x)^{-1} - Tr(F(x)^{-1}F(v)) + \frac{1}{2} Tr(F(x)^{-1}F(v) F(x)^{-1}F(v))) \} \\ &= \arg\min_{v\in \mathcal{R}^{m}} \{ -\sum_{i}v_{i}Tr(F^{-1/2}F_{i}F^{-1/2}) + \frac{1}{2} \sum_{i,j}v_{i}v_{j}Tr((F^{-1/2}F_{i}F^{-1/2})(F^{-1/2}F_{j}F^{-1/2})) \} \\ &= \arg\min_{v\in \mathcal{R}^{m}} (-2Tr(A) + Tr(A^{T}A) )\\ &= \arg\min_{v\in \mathcal{R}^{m}} Tr((-I+A)^{T}(-I+A)) \\ &= \arg\min_{v\in \mathcal{R}^{m}} \| -I + A \|_{F} \end{align*}\end{split}\]

Where \(F(v) = \sum_{i}v_{i}F_{i}\), and \(A =\sum_{i}v_{i} F^{-1/2}F_{i}F^{-1/2}\), and the final line uses the Frobenius norm. Which is a least squares problem with m variables and n(n+1)/2 equations.

Then the step size could be calcuated using line search algorithm \(\hat{p} = \arg\min_{p} \phi(x+p\delta x^{N})\) Which could be simplfied by eigenvalue calculation.

The convergence analysis could be seen in the paper.

4.4 Central Path: Objective

Consider the following inequalities \(F(x)>0, c^{T}x=\gamma\) , where \(p^{*}<\gamma<\bar{p}\) (from our assumptions), where \(\bar{p} = \sup \{c^{T}x\mid F(x)>0 \}\) the bound of the objective problem. We could define the analytic center of these inequalities:

\[\begin{split}\begin{align*} x^{*}(\gamma) = & \arg\min && \log\det F(x)^{-1} \\ & subject\ to && F(x) > 0, c^{T}x =\gamma \end{align*}\end{split}\]

The curve of \(x^{*}(\gamma)\) is called the central path for the semidefinite problem in Chapter 1. As \(\gamma\) getting close to \(p^{*}\), x converges to an optimal point.

With the lagrangian:

\[\mathcal{L}(x, \lambda) = \log\det F(x)^{-1} + \lambda (c^{T}x - \gamma)\]

Apply the opimal condition:

\[\partial \mathcal{L}(x, \lambda) / \partial x = - Tr (F(x^{*}(\gamma))^{-1}F_{i}) + \lambda c_{i} = 0\]
\[Tr (F(x^{*}(\gamma))^{-1}F_{i}) = \lambda c_{i} , \quad i = 1,...,m\]

It shows that \(F(x^{*}(\gamma))^{-1}/\lambda\) is dual feasible when \(\lambda>0\) (as it satisfies the dual feasible conditions).

The duality gap with primal-dual feasible pair \(x=x^{*}(\gamma), Z=F(x^{*}(\gamma))^{-1}/\lambda\) is:

\[\eta = Tr(F(x)Z) = Tr(F(x^{*}(\gamma)) F(x^{*}(\gamma))^{-1}/\lambda ) = Tr(I/\lambda) = n/\lambda\]

We could also see that it solve the following dual SDP [1] :

\[\begin{split}\begin{align} &minimize \quad && \log\det Z^{-1} \\ &subject\ to && Tr(F_{i}Z) =c_{i}, i=1,...,m, \\ & && Z>0, \\ & && -Tr(F_{0}Z) = \gamma - n/\lambda \end{align}\end{split}\]

The Lagrangian multiplier \(\lambda\) is related to the duality gap of the point on the path of centers and the associated dual feasible point.

Define the deviation from the central path:

\[\psi(x) = \log\det F(x)^{-1} - \log\det F(x^{*}(c^{T}x))^{-1}\]

The difference between the value of the barrier function at the point x and the minimum of the barrier function over all points with the same value of cost function as x. (if it equals zeros, then x in the central path.)

See the convergence properties in the paper.

\[\# Newton \ steps \le 5 + 11\Phi(x)\]
[1](1, 2) See the proof here .

4.5 Central Path: Duality gap

Optimal the sum of primal and dual function, with a relaxation on the duality gap constraint.

\[\begin{split}\begin{align*} (x^{*}(\eta), Z^{*}(\eta)) = & \arg\min && -\log\det F(x) - \log\det Z \\ & subject\ to && F(x) > 0, Z>0\\ & && Tr(F_{i}Z) = c_{i}, i = 1,...,m,\\ & && c^{T}x + Tr(F_{0}Z) = \eta \end{align*}\end{split}\]

The constrains are primal strictly feasible conditions, dual strictly feasible conditions, and the definition of duality gap. The objective functions are the sum of barrier functions for primal variable and dual variable. We also have [2] :

\[Z^{*}(\eta) F(x^{*}(\eta)) = (\eta/n) I\]

And \(\eta = c^{T}x +Tr(F_{0}Z) = Tr(F(x)Z)\) (using \(Tr(F_{i}Z) = c_{i}\)), Therefore the optimal of the upper problem is :

\[\begin{split}\begin{align} -\log\det F(x^{*}(\eta))Z^{*}(\eta) &= - \log\det(\eta/n)I = - n\log(\eta/n) = n\log n - n\log(\eta) \\ &= n\log n - n\log Tr(F(x)Z) \end{align}\end{split}\]

The difference (a measure of the deviation of x, Z from centrality) is :

\[\begin{split}\begin{align} \psi(x, Z) &= - \log\det F(x)Z + \log\det F(x^{*}(\eta))Z^{*}(\eta) \\ &= - \log\det F(x)Z + n\log Tr(F(x)Z) - n\log n \\ \end{align}\end{split}\]
[2]Similar to the proof in [1] , we could get that \(Z^{*}(\eta) = F(x^{*}(\eta))^{-1}\eta/n\), which is equivalent to the expression.