This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
Introduction
As a shared model for information representation, ontology has been introduced into nearly all fields of computer science. Acting as a conceptual semantic framework, ontology is highly effective and is widely employed in engineering applications in fields such as biology, medicine, pharmacy, materials science, mechanical engineering and chemistry (for instance, see Coronnello et al. [2], Vishnu et al. [3], Roantree et al. [4], Kim and Park [5], Hinkelmann et al. [6], Pesaranghader et al. [7], Daly et al. [8], Agapito et al. [9], Umadevi et al. [10] and Cohen [11]).
An ontology model can be regarded as a graph G = (V, E), in which each vertex v expresses a concept and each edge e = vivj represents a direct link between two concepts vi and vj. The aim of ontology similarity computation is to learn a similarity function Sim : V × V → ℝ+ ∪ {0} which maps each pair of vertices to a real-valued score. The purpose of ontology mapping is to build links between two or more different ontologies based on the similarity between their concepts. Let two graphs G1 and G2 express two ontologies O1 and O2, respectively. The target is to determine, for each v ∈ V(G1), a set Sv ⊆ V(G2) whose vertices are semantically highly similar to the concept corresponding to v. Hence, one may compute the similarity S(v, vj) for each vj ∈ V(G2), select a parameter 0 < M < 1, and let Sv consist of the vertices vj with S(v, vj) ≥ M. From this perspective, the essence of ontology mapping is to yield a similarity function S and to determine a suitable parameter M according to the specific application.
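Under these definitions, obtaining Sv from a learned similarity function and a threshold M is straightforward. Below is a minimal sketch; the toy similarity matrix and the function name are our own illustration, not part of the paper's method:

```python
import numpy as np

def ontology_mapping(sim, threshold):
    """For each vertex v in G1, collect the vertices of G2 whose
    similarity score S(v, vj) reaches the threshold M.

    sim: (n1, n2) array with sim[i, j] = S(vi, vj), vi in G1, vj in G2.
    threshold: the parameter M with 0 < M < 1.
    Returns a dict: index of v in G1 -> list of indices forming Sv in G2.
    """
    n1, n2 = sim.shape
    return {i: [j for j in range(n2) if sim[i, j] >= threshold]
            for i in range(n1)}

# Toy similarity matrix between a 2-vertex G1 and a 3-vertex G2.
S = np.array([[0.9, 0.2, 0.6],
              [0.1, 0.8, 0.3]])
print(ontology_mapping(S, 0.5))  # {0: [0, 2], 1: [1]}
```

In practice the choice of M trades precision against recall: a larger M yields smaller, more reliable sets Sv.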
Several effective learning tricks have been proposed for ontology similarity measuring and ontology mapping. Gao and Zhu [12] studied gradient learning algorithms for ontology similarity computing and ontology mapping. Gao and Xu [13] obtained a stability analysis for ontology learning algorithms. Gao et al. [14] presented an ontology sparse vector learning approach for ontology similarity measuring and ontology mapping based on the ADAL trick. Gao et al. [15] studied ontology optimization tactics based on distance calculating techniques. More theoretical analysis of ontology learning algorithms can be found in Gao et al. [16].
In this paper, we propose a new ontology learning trick based on affine transformation. Furthermore, we demonstrate the efficiency of the algorithm in biological and chemical applications via experiments.
Setting
Let V be an instance space. We use a p-dimensional vector to express the semantic information of each vertex in the ontology graph. Specifically, let v = {v1, ···, vp} be the vector corresponding to a vertex v. To ease the presentation, we slightly abuse notation and use v to represent both the ontology vertex and its corresponding vector. In this learning setting, the aim of an ontology algorithm is to yield an ontology function f : V → ℝ; the similarity between two vertices can then be determined according to the difference between their corresponding real numbers. Obviously, the ontology function can be regarded as a dimensionality reduction operator f : ℝp → ℝ.
In recent years, the application of ontology algorithms has faced many challenges. In the fields of chemistry and biology, the situation can become very complex since we need to deal with high-dimensional or big data. Against this background, sparse vector learning algorithms have been introduced into biological and chemical ontology computation (see Afzali et al. [17], Khormuji and Bazrafkan [18], Ciaramella and Borzi [19], Lorincz et al. [20], Saadat et al. [21], Yamamoto et al. [22], Lorintiu et al. [23], Mesnil and Ruzzene [24], Gopi et al. [25], and Dowell and Pinson [26] for more details). For example, suppose we aim to find which genes cause a certain genetic disease: there are millions of genes in the human body, so the computation task is complex and tough, yet in fact only a few classes of genes cause this kind of genetic disease. A sparse vector learning algorithm can effectively help scientists pinpoint these genes among the mass of candidate disease genes.
One computational model of the ontology function via a sparse vector is expressed by

$f(v) = \sum_{i=1}^{p} v_i w_i + \delta,$

where w = {w1, ···, wp} is a sparse vector used to shrink irrelevant components to zero and δ is a noise term. Using this model, the key to determining the ontology function f is to learn the optimal sparse vector w.
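The model above can be sketched numerically; the dimension, the choice of nonzero components and the noise scale below are illustrative assumptions, not values from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

p = 10                        # dimension of a vertex's semantic vector
w = np.zeros(p)
w[[1, 4]] = [2.0, -1.5]       # sparse: only two relevant components

v = rng.normal(size=p)        # semantic vector of one ontology vertex
delta = 0.01 * rng.normal()   # small noise term

f_v = v @ w + delta           # the ontology function value f(v)
```

Because w is sparse, only the components v1 and v4 of the semantic vector influence f(v); all other attributes are shrunk away.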
For example, the standard framework with a penalty term via the l1-norm of the unknown sparse vector w ∈ ℝp can be stated as:

$\min_{\mathbf{w} \in \mathbb{R}^p} \; l(\mathbf{w}) + \lambda \|\mathbf{w}\|_1,$

where λ > 0 is a balance parameter and l is the principal function measuring the error of w. The balance term λ||w||1 controls the sparsity of the sparse vector w.
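As an illustration of this l1-penalized framework, the sketch below takes the squared loss as the principal function l and solves the problem with ISTA (proximal gradient with soft-thresholding), a standard solver for such objectives; the data and parameter values are hypothetical:

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of t*||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_ista(V, y, lam, n_iter=500):
    """Minimize 0.5*||y - V w||^2 + lam*||w||_1 by ISTA."""
    L = np.linalg.norm(V, 2) ** 2          # Lipschitz constant of the gradient
    w = np.zeros(V.shape[1])
    for _ in range(n_iter):
        grad = V.T @ (V @ w - y)           # gradient of the squared loss
        w = soft_threshold(w - grad / L, lam / L)
    return w

rng = np.random.default_rng(1)
V = rng.normal(size=(50, 20))
w_true = np.zeros(20)
w_true[[0, 3]] = [1.0, -2.0]               # only two relevant components
y = V @ w_true + 0.01 * rng.normal(size=50)

w_hat = lasso_ista(V, y, lam=1.0)
print(np.nonzero(w_hat)[0])                # indices of the recovered components
```

Larger λ drives more components of w exactly to zero, which is what makes the learned ontology function interpretable.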
where ρ ≥ 0 is also a balance parameter. Let $\tilde{\mathbf{V}} = \begin{pmatrix} \mathbf{V} \\ \sqrt{\rho}\,\mathbf{I} \end{pmatrix}$ and $\tilde{\mathbf{y}} = \begin{pmatrix} \mathbf{y} \\ \mathbf{0} \end{pmatrix}$. Then the ontology sparse vector learning problem (4) can be expressed as
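The equivalence behind this augmentation can be checked numerically: stacking √ρ I under V and zeros under y folds the ridge-type term ρ||w||² into the least-squares loss. A quick sketch, assuming the loss is ½||y − Vw||² + ½ρ||w||² (up to the paper's scaling conventions):

```python
import numpy as np

rng = np.random.default_rng(2)
n, p, rho = 30, 8, 0.5
V = rng.normal(size=(n, p))
y = rng.normal(size=n)
w = rng.normal(size=p)          # any candidate sparse vector

# Augmented design matrix and response, as defined above.
V_tilde = np.vstack([V, np.sqrt(rho) * np.eye(p)])
y_tilde = np.concatenate([y, np.zeros(p)])

# Squared loss on the augmented data ...
lhs = 0.5 * np.sum((y_tilde - V_tilde @ w) ** 2)
# ... equals squared loss plus the ridge term on the original data.
rhs = 0.5 * np.sum((y - V @ w) ** 2) + 0.5 * rho * np.sum(w ** 2)
print(np.isclose(lhs, rhs))  # True
```

The identity holds for every w, so the augmented problem and problem (4) share the same minimizer.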
Set $\mathcal{D} = \left\{ \theta : \left| \mathbf{v}_i^T \theta \right| \le 1,\; i \in \{1, 2, \cdots, p\} \right\}$ as the feasible set of ontology problem (8). Obviously, 𝒟 can be regarded as the intersection of a collection of closed half spaces, which is a closed convex set, and 𝒟 ≠ ∅ since 0 ∈ 𝒟. By means of (8), the projection of $\frac{\mathbf{y}}{\lambda}$ onto 𝒟 is the dual optimal solution θ*, which is stated as $\theta^* = \mathbb{P}_{\mathcal{D}}\!\left(\frac{\mathbf{y}}{\lambda}\right)$.
Next, we present our dual framework of the ontology problem, which can be formulated as a projection problem. Set

It is not hard to check that the dual optimal solution of the ontology problem is the projection of $\frac{\tilde{\mathbf{y}}}{\lambda}$ onto $\bar{\mathcal{H}}$.
In the following, we show an equivalent optimization model of our ontology sparse vector problem. The discussion splits into two cases according to whether $\bar{\mathbf{v}}_p$ equals zero or not.
If $\bar{\mathbf{v}}_p = 0$, then we can drop the condition $\bar{\mathbf{v}}_p^T \theta = 0$ and the ontology framework reduces to
This implies that the dual optimal solution of the ontology problem is the projection of $\frac{\tilde{\mathbf{y}}^{\perp}}{\lambda}$ onto the feasible set $\mathcal{H}^{\perp}$. Finally, we obtain the final version of the ontology sparse vector learning problem, which has the same optimal solution as ontology problem (20):
Experiments
In this section, we test the feasibility of our new algorithm via the following four simulation experiments on ontology similarity measure and ontology mapping. After obtaining the sparse vector w, the ontology function is given by $f_{\mathbf{w}}(v) = \sum_{i=1}^{p} v_i w_i$, in which we ignore the noise term.
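A direct implementation of this ontology function, together with one plausible way to rank vertices by closeness of their one-dimensional scores (the ranking rule is our own illustration; the paper only specifies f_w):

```python
import numpy as np

def ontology_scores(X, w):
    """Apply the learned ontology function f_w(v) = sum_i v_i w_i
    to every vertex vector (the rows of X), ignoring the noise term."""
    return X @ w

def most_similar(X, w, idx, N):
    """Return the N vertices whose f_w scores are closest to that of
    vertex idx -- one way to turn the scores into a similarity ranking."""
    s = ontology_scores(X, w)
    order = np.argsort(np.abs(s - s[idx]))
    return [j for j in order if j != idx][:N]

# Three toy vertex vectors and a learned sparse vector (hypothetical).
X = np.array([[1.0, 0.0],
              [0.9, 0.1],
              [0.0, 1.0]])
w = np.array([1.0, -1.0])
print(most_similar(X, w, 0, 1))  # [1]: its score is closest to vertex 0's
```

The closest N vertices returned this way are what the P@N criterion compares against the expert's list.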
Ontology similarity measure experiment on biology data
In biology, the “GO” ontology (denoted by O1, available at http://www.geneontology.org; Fig. 1 presents the basic structure of O1) is a widely used database for gene researchers. We apply this data set in our first experiment. We use P@N (precision ratio; see Craswell and Hawking [27] for more details) to measure the effectiveness of the experiment. In the first step, the N concepts closest to each vertex (those with the highest similarity) were determined by experts. In the second step, the first N concepts for each vertex on the ontology graph are determined by the algorithm, and the precision ratios are obtained. In addition to our ontology learning algorithm, the approaches of Huang et al. [29], Gao and Liang [30] and Gao et al. [16] are applied to the “GO” ontology, and the precision ratios inferred from these tricks are compared. Partial experiment results are given in Tab. 1.
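The P@N criterion itself is simple to compute: it is the fraction of the algorithm's first N concepts that also appear among the expert's N closest concepts. A sketch with hypothetical concept names:

```python
def precision_at_n(expert_top, computed_top, N):
    """P@N: fraction of the algorithm's first N concepts that also
    appear among the expert's N closest concepts for a vertex."""
    return len(set(expert_top[:N]) & set(computed_top[:N])) / N

# Hypothetical ranked concept lists for one vertex.
expert = ["cell", "membrane", "nucleus", "ribosome"]
computed = ["cell", "nucleus", "cytoplasm", "membrane"]
print(precision_at_n(expert, computed, 3))  # 2/3: "cell" and "nucleus" match
```

Averaging this quantity over all vertices gives the precision ratios reported in the tables.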
The experiment results of ontology similarity measure
From Tab. 1, taking N = 3, 5, 10 or 20, the precision ratio obtained by our sparse vector ontology learning algorithm is higher than the precision ratios computed by the algorithms of Huang et al. [29], Gao and Liang [30] and Gao et al. [16]. Moreover, these precision ratios increase apparently as N increases. Thus, we conclude that the ontology learning algorithm described in our paper is superior to those proposed by Huang et al. [29], Gao and Liang [30] and Gao et al. [16].
Ontology mapping experiment on physical data
Physical ontologies O2 and O3 (the structures of O2 and O3 are shown in Fig. 2 and Fig. 3, respectively) are used in our second experiment, which aims to test the utility of ontology mapping. The ontology mapping between O2 and O3 is determined by means of our new ontology learning algorithm, and the P@N criterion is applied again to test the quality of the experiment. Huang et al. [29], Gao and Liang [30] and Gao et al. [31] also applied their ontology algorithms to the “Physical” ontology, and we compare the precision ratios obtained from the four methods. Several experiment results are given in Tab. 2.
It can be seen that our algorithm is more efficient than the ontology learning algorithms proposed in Huang et al. [29], Gao and Liang [30] and Gao et al. [31], in particular when N is sufficiently large.
Ontology similarity measure experiment on plant data
In this part, the “PO” ontology O4 (available at http://www.plantontology.org; Fig. 4 shows the basic structure of O4) is used to test the efficiency of our new ontology learning algorithm for ontology similarity calculating. This ontology is well known in plant science and can be used as a dictionary for scientists to learn and search concepts and botanical features. The P@N standard is used again in this experiment. Furthermore, the ontology learning approaches in Wang et al. [28], Huang et al. [29] and Gao and Liang [30] are applied to the “PO” ontology for comparison. The accuracies of these ontology learning algorithms are computed, and parts of the results are compared and presented in Tab. 3.
The experiment results of ontology similarity measure
Tab. 3 reveals that the precision ratio obtained by our ontology sparse vector learning algorithm is higher than the precision ratios of the ontology learning algorithms of Wang et al. [28], Huang et al. [29] and Gao and Liang [30] when N = 3, 5 or 10. Furthermore, these precision ratios increase apparently as N increases. Therefore, we conclude that the ontology sparse vector learning algorithm described in our paper is superior to the tricks recommended in Wang et al. [28], Huang et al. [29] and Gao and Liang [30].
Ontology mapping experiment on humanoid robotics data
Humanoid robotics ontologies (denoted by O5 and O6, constructed by Gao and Zhu [12]; the structures of O5 and O6 are shown in Fig. 5 and Fig. 6, respectively) are employed in our last experiment. Humanoid robotics ontologies express humanoid robotics concepts in an orderly and clear way, and this experiment aims to determine the ontology mapping between O5 and O6. Again, we use the P@N criterion to measure the accuracy of the data obtained in the experiment. Besides our ontology learning algorithm, the ontology algorithms proposed in Gao and Lan [32], Gao and Liang [30] and Gao et al. [31] are also applied to the humanoid robotics ontologies, and the precision ratios obtained from the four ontology learning algorithms are compared. Partial experiment results are given in Tab. 4.
The experiment results presented in Tab. 4 imply that our ontology sparse vector learning algorithm performs more efficiently than the ontology learning algorithms of Gao and Lan [32], Gao and Liang [30] and Gao et al. [31], especially when N is sufficiently large.
Conclusion
In this paper, an affine transformation based ontology computation technique is presented. The technique is suitable for similarity measuring and ontology mapping in biological and chemical ontology engineering applications. The main approach rests on an affine transformation and its theoretical derivation. Finally, simulation data show that our ontology scheme achieves high efficiency in the biology, physics, plant and humanoid robotics fields. The ontology sparse vector learning algorithm proposed in our paper illustrates promising application prospects in multiple disciplines.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.