1 Introduction

Over the last decade, the Learning with Errors (LWE) problem [16] has proved to be extremely versatile for the construction of various cryptographic primitives. Since LWE is as hard as worst-case lattice problems, it is considered one of the most important post-quantum candidates. Let us recall that an LWE instance consists of a random \((m \times n)\)-matrix \(\mathbf {A}\) with elements from \(\mathbb {Z}_q\) and an m-dimensional vector \({\varvec{b}} \in \mathbb {Z}_q^m\), where \({\varvec{b}} = \mathbf {A}{\varvec{s}}+ {\varvec{e}} \bmod q\) for a secret random \({\varvec{s}} \in \mathbb {Z}_q^n\) and an error vector \({\varvec{e}} \in \mathbb {Z}_q^m\) whose entries are drawn from a discretized normal distribution.

The LWE decisional problem is to distinguish \((\mathbf {A}, {\varvec{b}})\) from \((\mathbf {A}, {\varvec{u}})\) for random \({\varvec{u}}\in \mathbb {Z}_q^m\). While LWE has some intriguing hardness properties, it is known that one has to choose quite large n in order to reach a desired security level against lattice reduction attacks. This in turn makes the size of LWE instances \((\mathbf {A}, {\varvec{b}})\), and thus the size of public keys, undesirably large. For practical reasons, people therefore looked into various variants of LWE, such as ring-LWE [13, 14], LWE with short secret [2, 15] or LWE with short error [10, 15]. Recently, some special instances of ring-LWE were identified to have serious weaknesses [4, 6], but these instances were not suggested for cryptographic use. Moreover, it was shown that LWE with binary secrets and errors can be attacked in slightly subexponential time \(2^{\mathcal {O}(n / \log \log n)}\) by a BKW-type algorithm [11], where LWE dimension \(n=128\) was practically broken within half a day. Also, LWE with binary secret leads to more efficient lattice attacks [3]. While choosing special variants of LWE seems to slightly decrease the security, the improved attacks do not substantially endanger the security of these variants in general.

In this paper, we look at another LWE variant due to Galbraith [8]. In this variant, \(\mathbf {A}\) is replaced by a binary matrix. This makes Galbraith’s variant very tempting for lightweight devices that are not capable of storing a sufficiently large LWE instance.

In [8], Galbraith instantiates Regev’s encryption system [16] with his binary matrix \(\mathbf {A}\) and suggests to use the parameters \((n,m,q) = (256, 640, 4093)\) that were originally proposed by Lindner and Peikert [12] for Regev’s original scheme. Galbraith also gives a thorough security analysis based on lattices, where in his experiments he fixes n and tries to break encryption for increasing m. Based on this analysis, he concludes that instances with \(m \ge 400\) might be hard to break with lattice techniques.

For Regev’s original scheme, security follows from the hardness of LWE for appropriate parameters; this is not automatically the case for a binary matrix \(\mathbf {A}\) without changing parameters. For Galbraith’s choices, in order to break encryption, one can solve an equation of the form \({\varvec{u}}\mathbf {A}= {\varvec{c}}_1\) for a known matrix \(\mathbf {A}\in \{0,1\}^{m \times n}\), a known ciphertext component \({\varvec{c}}_1 \in \mathbb {Z}^n\) and an unknown vector \({\varvec{u}}\in \{0,1\}^m\). In other words, one has to find a subset of the rows of \(\mathbf {A}\) that sums to \({\varvec{c}}_1\). We therefore call this problem a vectorial integer subset sum. If the unknown vector \({\varvec{u}}\) is short, a vectorial integer subset sum can certainly be solved by finding a closest vector in some appropriate lattice. This is the standard analysis that was carried out in [8] against this avenue of attack.

However, a vectorial integer subset sum is by its very definition also an Integer Linear Programming (ILP) problem: we are looking for an integral solution \({\varvec{u}}\in \mathbb {Z}^m\) to n linear equations over the integers. While ILP is NP-hard in general, it is also known that in many cases removing the integrality constraint on \({\varvec{u}}\) provides a lot of useful information about the problem. Removing the integrality constraint is called an LP relaxation of the problem. Without integrality constraints, the resulting Linear Program can be solved in polynomial time, using e.g. the ellipsoid method [9].

We show, under a mild assumption on \(\mathbf {A}\), that for parameters \(m \le 2n\) the vectorial integer subset sum problem can be solved by its LP relaxation with success probability \(\frac{1}{2}\). More precisely, the LP solution then has the property that it is already integral. This in turn means that vectorial integer subset sums with \(m \le 2n\) can be solved in polynomial time. In practice, we are able to solve instances with \(n=256\) and \(m \le 2n\) in a fraction of a second. Notice that this is already a regime for m that seems infeasible to reach with current lattice reduction algorithms.

However, \(m \le 2n\) does not quite suffice to break Galbraith’s \((n,m)=(256,640)\)-challenge in practice. Namely, for instances with \(m > 2n\) the success probability of our MATLAB ILP solver drops quite quickly – at least when we allow only some fixed, small computation time. Yet, when looking at a large number of instances of our vectorial integer subset sums, we observe experimentally that there is still a significant number of weak instances that are vulnerable to LP relaxation combined with some additional tricks (such as the cutting plane method). More concretely, we are able to show that at least 1 out of \(2^{15}\) instances of Regev-type encryptions with \((n,m)=(256,640)\) can be solved in about 30 minutes. Interestingly, we are able to compute a simple score for every instance I that accurately predicts whether I is indeed weak – based on an estimate of the volume of the search space that comes from the LP relaxation. We find that such a quick test for identifying weak instances is a quite remarkable property of Linear Programming; we are not aware of a similar property for other cryptanalytic methods. We hope that our results motivate more cryptanalytic research using (Integer) Linear Programming.

Note that our attack breaks Galbraith’s instantiation of LWE encryption with binary matrices, but does not break binary LWE itself. Due to that, our attack allows ciphertext recovery, but not key recovery.

Our paper is organized as follows. In Sect. 2, we recall Galbraith’s scheme and its cryptanalysis challenges. In Sect. 3, we model vectorial integer subset sums as Integer Linear Programs. We attack instances with \(m \le 2n\) in Sect. 4 and show that they actually admit a polynomial time attack. In Sect. 5, we show how to identify weak instances for large m and we present our experimental results for Galbraith’s large challenge \((n,m)=(256, 640)\).

2 Galbraith’s Binary Matrix LWE

Let us briefly recall Regev’s LWE encryption scheme. Let q be prime. One chooses a public \(\mathbf {A}\in _R \mathbb {Z}_q^{m \times n}\) and a private \({\varvec{s}} \in _R \mathbb {Z}_q^n\). One then computes \({\varvec{b}} = \mathbf {A}{\varvec{s}} + {\varvec{e}} \bmod q\), where the \(e_i\) are sampled from a discrete normal distribution with mean 0 and standard deviation \(\sigma \). The public key consists of \((\mathbf {A}, {\varvec{b}})\).

For encrypting some message \(M \in \{0,1\}\), one chooses a random nonce \({\varvec{u}}\in _R \{0,1\}^m\) and computes the ciphertext

$$\begin{aligned} {\varvec{c}} = ({\varvec{c}}_1, c_2) = ({\varvec{u}}\mathbf {A}\bmod q, \langle {\varvec{u}}, {\varvec{b}} \rangle + M \bigl \lfloor \tfrac{q}{2} \bigr \rfloor \bmod q) \in \mathbb {Z}_q^n \times \mathbb {Z}_q. \end{aligned}$$

For decryption to 0 respectively 1, one checks whether \({\varvec{c}}_1 {\varvec{s}} - c_2 \bmod q\) is closer to 0 respectively \(\frac{q}{2}\).
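For concreteness, the following minimal sketch implements the scheme as just described, in Python with NumPy. The standard deviation \(\sigma = 3.0\) and the seed are illustrative assumptions, not parameters from [8, 12]; replacing the sampling of \(\mathbf {A}\) by binary entries gives Galbraith’s variant discussed below.

```python
# A minimal sketch of Regev-type encryption; sigma and the seed are
# illustrative assumptions for this sketch.
import numpy as np

rng = np.random.default_rng(1)
n, m, q, sigma = 256, 640, 4093, 3.0   # (n, m, q) as suggested in [12]

def keygen():
    A = rng.integers(0, q, size=(m, n))   # Galbraith: rng.integers(0, 2, size=(m, n))
    s = rng.integers(0, q, size=n)                           # private s
    e = np.rint(rng.normal(0, sigma, size=m)).astype(int)    # discretized normal error
    b = (A @ s + e) % q
    return (A, b), s

def encrypt(pk, M):
    A, b = pk
    u = rng.integers(0, 2, size=m)                           # random binary nonce u
    return (u @ A) % q, (u @ b + M * (q // 2)) % q           # (c1, c2)

def decrypt(s, ct):
    c1, c2 = ct
    v = (c2 - c1 @ s) % q                   # = <u, e> + M * floor(q/2) mod q
    return int(min(v, q - v) > q // 4)      # closer to q/2 than to 0 => M = 1

pk, s = keygen()
assert decrypt(s, encrypt(pk, 0)) == 0 and decrypt(s, encrypt(pk, 1)) == 1
```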

After analyzing lattice attacks, Lindner and Peikert [12] suggest to use the parameters

$$ (n,m,q) = (256,640,4093) $$

for medium security level and estimate that these parameters offer roughly 128-bit security. However, for these parameters the public key \((\mathbf {A}, {\varvec{b}})\) already takes 247 kilobytes, which is way too much for constrained devices.

Therefore, Galbraith [8] suggested to construct the public matrix \(\mathbf {A}\) with binary entries simply from the seed of a PRNG. All that one has to store in this case is the seed itself, and the vector \({\varvec{b}}\). A similar trick is also used in other contexts to shorten the public key size [5].

Moreover, Galbraith gives a thorough security analysis of his LWE variant, based on its lattice complexity. In his security analysis he considers the problem of recovering the nonce \({\varvec{u}}\) from

$$\begin{aligned} {\varvec{c}}_1 = {\varvec{u}}\mathbf {A}. \end{aligned}$$
(1)

Notice that since now \(\mathbf {A}\in \{0,1\}^{m \times n}\), every entry of \({\varvec{c}}_1\) is an inner product of two random binary length-m vectors. Thus, the entries of \({\varvec{c}}_1\) are random variables from a binomial distribution \(B(m, \frac{1}{4})\) with expected value \(\frac{m}{4}\). Since every entry is at most \(m < q\), the equality \({\varvec{c}}_1 = {\varvec{u}}\mathbf {A}\) does not only hold modulo q, but also over the integers.

Hence, recovering \({\varvec{u}}\) from \(({\varvec{c}}_1, \mathbf {A})\) can be seen as a vectorial integer subset sum problem. Once \({\varvec{u}}\) is recovered, one can easily subtract \(\langle {\varvec{u}}, {\varvec{b}} \rangle \) from \(c_2\) and thus recover the message M. Hence, solving the vectorial integer subset sum problem gives a ciphertext-only message recovery attack.

We would like to stress that this attack does not allow for key recovery of \({\varvec{s}}\). We also note that in Regev’s original scheme, the security proof shows IND-CPA security assuming that the LWE problem is hard. For this reduction, we need that \(c_1\) is essentially independent of \(\mathbf {A}\), which is proven using the Leftover Hash Lemma by setting parameters sufficiently large. In particular, \({\varvec{u}}\) is required to have sufficient entropy and Eq. (1) has many solutions for \({\varvec{u}}\) in Regev’s non-binary scheme, whereas the parameters in Galbraith’s binary scheme are set such that \({\varvec{u}}\) is the unique solution to Eq. (1). Due to that, our attack does not give an attack on binary LWE. In fact, binary LWE was shown to be at least as secure as standard LWE in [1], provided n is increased by a factor \(\mathcal {O}(\log q)\). Consequently, it seems unlikely that the attack extends to binary LWE.

2.1 Previous Cryptanalysis and Resulting Parameter Suggestions

In his security analysis, Galbraith attacks the vectorial integer subset sum by lattice methods. Namely, he first finds an arbitrary integer solution \({\varvec{w}} \in \mathbb {Z}^m\) with \({\varvec{c}}_1 = {\varvec{w}} \mathbf {A}\). Then he solves CVP with target vector \({\varvec{w}}\) in the lattice

$$ L = \{ {\varvec{v}} \in \mathbb {Z}^m \mid {\varvec{v}} \mathbf {A}\equiv 0 \bmod q \}. $$

Let \({\varvec{v}}\) be a CVP-solution, then we usually have \({\varvec{u}}= {\varvec{w}} - {\varvec{v}}\).

Galbraith reports that for \(n = 256\) and \(m \in [260,340]\), the CVP-method works well. He further conjectures that with additional tricks one should be able to handle values up to \(m=380\) or 390, but that “it would be impressive to solve cases with \(m > 400\) without exploiting weeks or months of computing resources”.

Based on his analysis, Galbraith raised the two following cryptanalysis challenges:

  • C1 with \((n,m) = (256, 400)\): The goal is to compute \({\varvec{u}}\) from \((\mathbf {A},c_1)\) in less than a day on an ordinary PC.

  • C2 with \((n,m) = (256, 640)\): The goal is to mount an attack using current computing facilities that takes less than a year.

According to Galbraith, breaking C1 should be interpreted “as causing embarrassment to the author”, while C2 should be considered a “total break”.

3 Modeling Our Vectorial Integer Subset Sum as an Integer Linear Program

In the canonical form of an Integer Linear Program (ILP), one is given linear constraints

$$ \mathbf {A}' {\varvec{x}} \le {\varvec{b}}', {\varvec{x}} \ge 0\text { and }{\varvec{x}} \in \mathbb {Z}^m, $$

for which one has to maximize a linear objective function \(\langle {\varvec{f}}, {\varvec{x}} \rangle \) for some \({\varvec{f}} \in \mathbb {R}^m\) that can be freely chosen.

Notice that it is straightforward to map our vectorial integer subset sum problem \({\varvec{u}}\mathbf {A} = {\varvec{c}}_1\) from Eq. (1) into an ILP. Namely, we define the inequalities

$$\begin{aligned} \begin{aligned} \mathbf {A}^{\mathsf {T}}{\varvec{u}}&\le {\varvec{c}}_1 \\ -\mathbf {A}^{\mathsf {T}}{\varvec{u}}&\le -{\varvec{c}}_1 \text { and} \\ u_i&\le 1 \text { for all } i = 1, \ldots , m.\\ u_i&\ge 0 \text { for all } i = 1, \ldots , m. \end{aligned} \end{aligned}$$
(2)

We can for simplicity choose \({\varvec{f}} = \mathbf {0}\), since we are interested in any feasible solution to Eq. (2). Moreover, it is not hard to see that by our choice of parameters the solution \({\varvec{u}}\) is the unique feasible solution. Namely, look at the map

$$\begin{aligned} \{0,1\}^m&\rightarrow \Bigl ( B \bigl (m, \tfrac{1}{4} \bigr ) \Bigr )^n, \\ {\varvec{u}}&\mapsto {\varvec{u}}\mathbf {A}, \end{aligned}$$

where \(X \sim B(m, \frac{1}{4})\) denotes a binomially distributed random variable with m experiments and success probability \(\frac{1}{4}\) in each experiment. Notice that the \(j^{th}\) entry, \(1 \le j \le n\), of \({\varvec{u}}\mathbf {A}\) can be written as \(u_1 a_{1,j} + \ldots + u_m a_{m, j}\), where the event \(u_i a_{i,j} = 1\) occurs iff \(u_i = a_{i,j} =1\), i.e. with probability \(\frac{1}{4}\). Hence, we can model the entries of \({\varvec{u}}\mathbf {A}\) as random variables from \(B(m, \frac{1}{4})\).

For the usual parameter choice \(q>m\), the solution \({\varvec{u}}\) of Eq. (2) is unique as long as this map is injective, i.e. as long as the entropy of \(\left( B(m, \frac{1}{4}) \right) ^n\) is larger than m. The entropy of \(\left( B(m, \frac{1}{4}) \right) ^n\) is roughly \(\frac{n}{2} \log _2(\frac{3}{8} \pi e m)\). Thus, one can compute for which m we obtain unique solutions \({\varvec{u}}\). Choosing e.g. \(n=256\), we obtain a unique \({\varvec{u}}\) for \(m \le 1500\). Hence, in the remainder of the paper we can safely assume unique solutions to our vectorial integer subset sum problem.
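This uniqueness bound is easy to verify numerically; the following short Python check (the concrete m values are our illustrative choices) reproduces the crossover slightly above \(m = 1500\) for \(n=256\).

```python
# A quick numerical check of the uniqueness heuristic: the map should be
# injective as long as (n/2) * log2(3/8 * pi * e * m) exceeds m.
import numpy as np

n = 256
for m in (640, 1500, 1600):
    entropy = n / 2 * np.log2(3 / 8 * np.pi * np.e * m)
    print(f"m = {m}: image entropy ~ {entropy:.0f} bits vs. {m} bits for u")
```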

4 Attacking \(m \le 2n\): Solving Challenge C1

We ran 100 instances of Eq. (2) on an ordinary 2.8 GHz laptop with \(n=256\) and increasing m. We used the ILP solver from MATLAB 2015, which was stopped whenever it did not find a solution after time \(t_{\max }=10\) s. We found that the success probability of our attack dropped from \(100\%\) at \(m=490\) to approximately \(1\%\) at \(m=590\), cf. Table 1. The largest drop of success probability takes place slightly after \(m=2n\).

For comparison, we also solved the LP relaxation, i.e. Eq. (2) without integrality constraint on \({\varvec{u}}\). This is much faster than ILP, so we solved 1000 instances for each m. We checked whether the returned non-integral solution matched our desired integral solution for \({\varvec{u}}\), in which case we call a run successful. The success rate of LP relaxation is also given in Table 1.

It turns out that Galbraith’s small C1 challenge can already be solved by LP relaxation alone. Since LP relaxation is only the starting point for ILP, it does not come as a surprise that ILP has a slightly larger success rate. However, it is impressive that LP relaxation alone is already powerful enough to solve a significant fraction of all instances.
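For readers who wish to reproduce these experiments without MATLAB, the following sketch models Eq. (2) with SciPy’s HiGHS backend. The solver choice, sample sizes and seed are our illustrative assumptions (SciPy \(\ge \) 1.9 is required for the integrality option).

```python
# A sketch of the experiment: solve Eq. (2) as an ILP or as its LP relaxation
# (toggled via `integrality`) and compare the result to the planted nonce u.
import numpy as np
from scipy.optimize import linprog

def solve_instance(n, m, rng, relax=False):
    A = rng.integers(0, 2, size=(m, n))
    u = rng.integers(0, 2, size=m)                  # planted solution
    c1 = u @ A                                      # holds over Z, no mod q
    res = linprog(np.zeros(m),                      # f = 0: any feasible point
                  A_eq=A.T, b_eq=c1,                # encodes both inequality pairs of Eq. (2)
                  bounds=[(0, 1)] * m,              # 0 <= u_i <= 1
                  integrality=None if relax else np.ones(m),
                  method="highs")
    return res.status == 0 and np.allclose(res.x, u, atol=1e-6)

rng = np.random.default_rng(0)
for m in (400, 512, 560):
    rate = np.mean([solve_instance(256, m, rng, relax=True) for _ in range(50)])
    print(f"m = {m}: LP relaxation success rate ~ {rate:.0%}")
```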

Table 1. Success probability for solving Eq. (2) for \(n=256\). We used MATLAB 2015 and restricted to \(t_{\max }=10\) s for the ILP.

We now give a theoretical justification for the strength of LP relaxation, showing that under some mild heuristic, for \(m\le 2n\), the solution of the LP relaxation is unique. Since, by construction, we know that there is an integral solution \({\varvec{u}}\) to Eq. (2), uniqueness of the solution directly implies that the LP solver has to find the desired \({\varvec{u}}\).

In the following lemma, we replace our linear constraints from \(\mathbf {A}\) by some random linear constraints from some matrix \(\bar{ \mathbf {A}}\) over the reals. This will give us already uniqueness of the solution \({\varvec{u}}\). Afterwards, we will argue why replacing \(\bar{\mathbf {A}}\) back by our LWE matrix \(\mathbf {A}\) should not affect the lemma’s statement.

Lemma 1

Let \({\varvec{u}}\in \{0,1\}^{2n}\). Let \(\bar{\mathbf {A}} \in {\mathbb {R}}^{n \times 2n}\) be a random matrix, whose rows are uniformly distributed on the sphere around \(\mathbf {0}\in {\mathbb {R}}^{2n}\). Then

$$\begin{aligned} \text {Pr}[ \not \exists {\varvec{x}}\in ({\mathbb {R}}\cap [0,1])^{2n} \mid \bar{\mathbf {A}} {\varvec{x}}=\bar{\mathbf {A}} {\varvec{u}}, {\varvec{x}}\ne {\varvec{u}}] = \frac{1}{2}. \end{aligned}$$

Proof

Let us look at the 2n-dimensional unit cube \(U_{2n} = \{ {\varvec{x}}\in ({\mathbb {R}}\cap [0,1])^{2n} \}\). Obviously \(\mathbf {0}, {\varvec{u}}\in U_{2n}\), both lying at corners of \(U_{2n}\). Now, let us assume w.l.o.g. that \({\varvec{u}}= \mathbf {0}\) (which can be achieved by reflections). Let H be the subspace defined by the kernel of \(\bar{\mathbf {A}}\).

Since \(\bar{\mathbf {A}}\) is randomly chosen from \({\mathbb {R}}^{n \times 2n}\), it has full rank n with probability 1: since we chose the entries of \(\bar{\mathbf {A}}\) from the reals \({\mathbb {R}}\), we avoid any problems that might arise from co-linearity. Thus, H as well as its orthogonal complement \(H^{\perp }\) have dimension n. Notice that \(H^\perp = {{\mathrm{\mathrm {Im}}}}(\bar{\mathbf {A}}^{\mathsf {T}})\). By construction, both H and \(H^{\perp }\) intersect \(U_{2n}\) in the corner \(\mathbf {0} = {\varvec{u}}\). We are interested in whether one of these subspaces passes through the interior of \(U_{2n}\).

The answer to this question is given by Farkas’ Lemma [7], which tells us that exactly one of H and \(H^{\perp }\) passes through \(U_{2n}\). Notice first that not both can pass through \(U_{2n}\). Now assume that H intersects \(U_{2n}\) only in the point \(\mathbf {0}\). Then Farkas’ Lemma tells us that there is a vector in its orthogonal complement \(H^{\perp }\) that fully intersects \(U_{2n}\). Notice that, again because we work over the reals, the intersection \(H^{\perp } \cap U_{2n}\) is n-dimensional.

By the randomness of \(\bar{\mathbf {A}}\), the orientation of H in \({\mathbb {R}}^{2n}\) is uniformly random, and hence the same holds for the orientation of \(H^{\perp }\). Since H and \(H^{\perp }\) share exactly the same distribution, and since by Farkas’ Lemma exactly one out of both has a trivial intersection with \(U_{2n}\), we have

$$\begin{aligned} \text {Pr}[H \cap U_{2n} = \{{\varvec{u}}\}] = \text {Pr}[H^\perp \cap U_{2n} = \{{\varvec{u}}\}]= \frac{1}{2}. \end{aligned}$$

Let \({\varvec{b}} = \bar{\mathbf {A}} {\varvec{u}}=\mathbf {0}\). Since \(H = \ker (\bar{\mathbf {A}})\), it follows that \({\varvec{u}}\) is the unique solution to the equation \(\bar{\mathbf {A}} {\varvec{x}}= {\varvec{b}}\) in the case that H has trivial intersection with \(U_{2n}\).   \(\square \)

Theorem 1

Under the heuristic assumption that our matrix \(\mathbf {A}^{\mathsf {T}}\) behaves like a random \((n \times m)\)-matrix whose rows are uniformly distributed on the sphere around \(0^{m}\), LP relaxation solves Eq. (2) in polynomial time for all \(m \le 2n\) with success probability \(\frac{1}{2}\).

Proof

Notice that the case \(m = 2n\) follows directly from Lemma 1, since LP relaxation has to find the unique solution \({\varvec{u}}\), and its running time is polynomial using e.g. the ellipsoid method. For the case \(m < 2n\) we can simply append \(2n-m\) additional columns to \(\mathbf {A}^{\mathsf {T}}\), and add a random subset of these to \({\varvec{c}}_1\).
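The padding step in this proof is straightforward to implement; a minimal sketch (the function name is ours):

```python
# A sketch of the padding step: for m < 2n, append 2n - m fresh binary rows
# to A (i.e. columns to A^T) and add a random subset of them to c1, so that
# the extended system again has a planted binary solution of length 2n.
import numpy as np

def pad_to_2n(A, c1, rng):
    m, n = A.shape
    extra = rng.integers(0, 2, size=(2 * n - m, n))     # 2n - m additional rows
    u_extra = rng.integers(0, 2, size=2 * n - m)        # random subset of them
    return np.vstack([A, extra]), c1 + u_extra @ extra
```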

Now let us say a word about the heuristic assumption from Theorem 1. Our assumption requires that the discretized \(\mathbf {A}^{\mathsf {T}}\) defines a random orientation of a hyperplane just as \(\bar{\mathbf {A}}\). Since \(\mathbf {A}^{\mathsf {T}}\) has by definition only non-negative entries, its columns always have non-negative inner product with the all-one vector \(1^n\). This minor technical problem can be fixed easily by centering the entries of \(\mathbf {A}^{\mathsf {T}}\) around 0 via the following transformation of Eq. (2):

First, guess the Hamming weight \(w = \sum _{i=1}^m u_i\). Then subtract \((\frac{1}{2}, \ldots , \frac{1}{2})\) from every column vector of \(\mathbf {A}^{\mathsf {T}}\) and finally subtract \(\frac{w}{2}\) from every entry of \({\varvec{c}}_1\). After this transformation \(\mathbf {A}^{\mathsf {T}}\) has entries uniform from \(\{\pm \frac{1}{2}\}\) and should fulfill the desired heuristic assumption of Theorem 1.
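A minimal sketch of this centering transformation (looping over the weight guesses w is our reading of the first step):

```python
# A sketch of the centering transformation: for a guessed Hamming weight w,
# shift the entries of A^T to {-1/2, +1/2} and adjust c1 accordingly; one
# then solves the centered system together with the constraint sum(u_i) = w.
import numpy as np

def centered_system(A, c1, w):
    B = A.T - 0.5          # subtract 1/2 from every entry of A^T
    d = c1 - w / 2.0       # subtract w/2 from every entry of c1
    return B, d            # solve B u = d subject to sum(u) = w, 0 <= u_i <= 1

# One would try all weight guesses w = 0, ..., m (or only w near m/2, where
# almost all binary nonces concentrate) and solve the (I)LP for each guess.
```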

5 Attacking \(m=640\): Solving Challenge C2

In order to tackle the \(m=640\) challenge, we could in principle proceed as in the previous section: identify a weak instance for e.g. \(m=590\), brute-force guess 50 coordinates of \({\varvec{u}}\) and run an ILP solver for 10 s on each guess.

However, we found out experimentally that even in dimension \(m=640\) the density of weak instances is not negligible. Hence, it seems to be much more effective to identify weak instances than to brute-force coordinates. So in the following we try to identify what makes particular instances weak.

We follow the paradigm that an ILP is the easier to solve, the more the LP relaxation “knows about the problem”. In particular, we expect that a problem is easy to solve if the solution polytope \(P\) of the LP relaxation of Eq. (2) is small. In the extreme case, if \(P=\{{\varvec{u}}\}\), then the problem can be solved by the LP solver alone (cf. Theorem 1). To quantify the size of the solution space in an easy-to-compute way, we compute the length of a random projection of \(P\). It turns out that this length, henceforth called the score, gives a very good prediction of the hardness of an instance.

More concretely, for an instance \(I=(\mathbf {A},{\varvec{c}})\), we choose a vector \({\varvec{r}}\) with random direction. Then we maximize and minimize the linear objective function \(\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{u}}}\right\rangle \) under the linear constraints given by the LP relaxation of Eq. (2) and consider the difference D of the two optima. Clearly, \(S_{{\varvec{r}}}:=\frac{D}{{||{{\varvec{r}}}||}}\) is the length of the orthogonal projection of \(P\) onto the span of \({\varvec{r}}\). Formally, the score of an instance I with respect to some direction \({\varvec{r}}\) is defined as follows.

Definition 1

Let \(I=(\mathbf {A},{\varvec{c}})\) be an instance. Consider the solution polytope \(P\) of the LP relaxation of Eq. (2), i.e. \(P\) is defined as \(P= [0,1]^m\cap \{{\varvec{x}}\mid \mathbf {A}^{\mathsf {T}}{\varvec{x}}= {\varvec{c}}\}\). Let \({\varvec{r}}\in {\mathbb {R}}^m\). Then the score \(S_{{\varvec{r}}}\) is defined via

$$\begin{aligned} \begin{aligned} f_{\max }&:= \max _{{\varvec{x}}\in P}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle \\ f_{\min }&:= \min _{{\varvec{x}}\in P}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle \\ S_{{\varvec{r}}}&:= \frac{f_{\max }-f_{\min }}{{||{{\varvec{r}}}||}} \end{aligned} \end{aligned}$$
(3)

Note that \(S_{{\varvec{r}}}\) can be computed by solving two LP problems, hence in polynomial time.
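Computing the score is only a few lines on top of the LP setup from Sect. 3; a minimal sketch (again using SciPy’s HiGHS solver as a stand-in for the solver used in our experiments):

```python
# A sketch of the score S_r from Definition 1, computed by two LP solves
# over the relaxed polytope P; the direction r is passed as a parameter.
import numpy as np
from scipy.optimize import linprog

def score(A, c1, r):
    m = A.shape[0]
    kw = dict(A_eq=A.T, b_eq=c1, bounds=[(0, 1)] * m, method="highs")
    f_min = linprog(r, **kw).fun         # minimize <r, x> over P
    f_max = -linprog(-r, **kw).fun       # maximize <r, x> over P
    return (f_max - f_min) / np.linalg.norm(r)
```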

Since \(S_{{\varvec{r}}}\) quantifies the search space for the ILP, instances with small score should be easier to solve. For \(m=640\), we computed the scores of \(2^{19}\) instances, which took approximately 1 s per instance.

Independence of \({\varvec{r}}\) and Reliability of Our Score. We experimentally confirm that for a given instance I, the value of \(S_{{\varvec{r}}}\) is mainly a function of I and does not depend significantly on the particular choice of \({\varvec{r}}\). Therefore, we fix the vector \({\varvec{r}}=(1,\ldots ,1,-1,\ldots ,-1)\) with exactly \(\frac{m}{2}\) ones and \(\frac{m}{2}\) minus-ones. We use the score \(S=S_{{\varvec{r}}}\) for this particular choice of \({\varvec{r}}\) and sort instances according to S.

We confirm that the score S is a very good predictor for the success of ILP solvers and the success probability drops considerably at some cutoff value for S. E.g. for \(m=520\) and within a 10 s time limit, we find that we can solve

  • \({>}99\%\) of instances with \(S\le 1.22\),

  • \(60\%\) of instances with \(1.22 < S \le 1.54\) and

  • \({<}3\%\) of instances with \(S>1.54\).

Distribution of S . Average values for S can be found in Table 2. Figure 1 shows the distribution of S. Note that while the distribution looks suspiciously Gaussian for \(m=640\), there is a considerable negative skewness and the tail distribution towards 0 is much fatter than for a Gaussian (cf. Fig. 2). This fat tail enables us to find a significant fraction of weak instances even for large m.

Notice that a score \(S=0\) basically means that LP relaxation finds the solution.

Table 2. Average values for S for \(n=256\) and varying m. We used 1000 instances for each m.
Fig. 1. pdf’s of S for \(n=256\) and varying values of m. Note that the y-axis is cropped and does not show the true density at \(S=0\) (where the distribution technically does not even have a finite continuous density). We rather give the probability for \(S=0\). For \(m=640\), we never encountered an instance with \(S=0\).

Fig. 2. Comparison of the distribution of S for \(n=256\), \(m=640\) with a normal distribution. The distribution of S has negative skewness and a much fatter tail towards 0. Hence, we obtain more weak instances than we would expect from a normal distribution.

Results for \(m=640\). We generated a large number \(N=2^{19}\) of instances with \(n=256\), \(m=640\), and tried to solve only those 271 instances with the lowest score S, which in our case meant \(S < 3.2\). We were able to solve 16 out of those 271 weakest instances in half an hour each. We found 15 instances with \(S<2.175\), of which we solved 12. The largest value of S for which we could solve an instance was \(S\approx 2.6\).

Fixing Coordinates. Let us explain in some more detail why an ILP solver works well on instances with small score S. Consider some \({\varvec{r}}\in \{0,\pm 1\}^m\) of low Hamming weight \({|{{\varvec{r}}}|}_1 = w\), so \({||{{\varvec{r}}}||}=\sqrt{w}\). Heuristically, we expect that \(S_{{\varvec{r}}}\) should be approximately S, as \(S_{{\varvec{r}}}\) mainly depends on the instance and not on the choice of \({\varvec{r}}\). Of course, for a vector \({\varvec{r}}\in \{0,\pm 1\}^m\) with low Hamming weight we have

$$ S_{{\varvec{r}}} = \frac{1}{\sqrt{w}}\Bigl (\max _{{\varvec{x}}\in P}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle - \min _{{\varvec{x}}\in P}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle \Bigr ) \le \frac{1}{\sqrt{w}}\Bigl (\max _{{\varvec{x}}\in [0,1]^m}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle - \min _{{\varvec{x}}\in [0,1]^m}\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{x}}}\right\rangle \Bigr ) = \sqrt{w}, $$

but that only means we should expect \(S_{{\varvec{r}}}\) to be even smaller. Since we know that for the true integer solution \({\varvec{u}}\), we have \(\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{u}}}\right\rangle \in {\mathbb {Z}}\), we can add the cuts \(\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{u}}}\right\rangle \le \lfloor f_{\max }\rfloor \) and \(\left\langle {{\varvec{r}}}{\,,\,}{{\varvec{u}}}\right\rangle \ge \lceil f_{\min } \rceil \) to the set of equations, where \(f_{\max }\) resp. \(f_{\min }\) are the maximum resp. minimum computed for \(S_{{\varvec{r}}}\).

This is a special case of what is called cut generation in Integer Linear Programming. If \(S_{{\varvec{r}}}<\sqrt{w}\), i.e. \(f_{\max }-f_{\min }<w\), then adding such a new inequality always makes the solution space of the LP relaxation smaller. In fact, such an inequality restricts the set of values that w out of the m variables \(u_i\) can jointly attain. So if \(S_{{\varvec{r}}}<\sqrt{w}\) for many different \({\varvec{r}}\), we get lots of sparse relations between the \(u_i\). Such inequalities are called good cuts.
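A minimal sketch of this cut generation step, on top of the LP setup from Sect. 3 (the list-based bookkeeping of cuts is our choice):

```python
# A sketch of cut generation: for a sparse direction r, compute f_max and
# f_min over P (including previously found cuts) and, if the cut is good,
# record <r, u> <= floor(f_max) and <r, u> >= ceil(f_min).
import numpy as np
from scipy.optimize import linprog

def try_add_cut(cuts, A, c1, r):
    m = A.shape[0]
    A_ub = np.array([row for row, _ in cuts]) if cuts else None
    b_ub = np.array([bnd for _, bnd in cuts]) if cuts else None
    kw = dict(A_ub=A_ub, b_ub=b_ub, A_eq=A.T, b_eq=c1,
              bounds=[(0, 1)] * m, method="highs")
    f_min = linprog(r, **kw).fun
    f_max = -linprog(-r, **kw).fun
    if f_max - f_min < np.count_nonzero(r):       # S_r < sqrt(w): a good cut
        cuts.append((r, np.floor(f_max)))         # <r, u> <= floor(f_max)
        cuts.append((-r, -np.ceil(f_min)))        # <r, u> >= ceil(f_min)
    return cuts
```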

In particular, consider the case \(w=1\) and \({\varvec{r}}= (0,0,\ldots ,0,1,0,\ldots ,0)\), i.e. we maximize/minimize an individual variable \(u_i\) over \(P\). If this maximum is \({<}1\), we know that \(u_i=0\) holds and if the minimum is \({>}0\), we know \(u_i=1\). So if \(S_{{\varvec{r}}}<1\) holds for some \({\varvec{r}}\) with \({|{{\varvec{r}}}|}_1=1\), we can fix one of the \(u_i\)’s and reduce the number of unknowns by one – which makes fixing further \(u_i\)’s even easier. If the score S is small, we expect that the ILP solver can find lots of such good cuts, possibly even cuts with \(w=1\).

Indeed, in all instances that we could solve, some variables could be fixed by such good cuts with \(w=1\). For dimensions \(m\le 550\), most instances that were solved by the ILP could be solved by such cuts alone.

In fact, we preprocessed our 271 weak instances for \(m=640\) by trying to fix each individual coordinate. This alone was sufficient to determine an average of \({>}100\) individual coordinates of the solution \({\varvec{u}}\) for \(S<2.175\), and in one case it was sufficient to completely solve the problem.
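This preprocessing for \(w=1\) can be sketched as follows; iterating until no further variable can be fixed, as well as the numerical tolerances, are our choices.

```python
# A sketch of the w = 1 preprocessing: maximize/minimize each variable u_i
# over P and fix u_i whenever the LP relaxation already forces it to 0 or 1;
# every fixed variable tightens the bounds for the remaining LPs.
import numpy as np
from scipy.optimize import linprog

def fix_coordinates(A, c1):
    m = A.shape[0]
    bounds = [[0.0, 1.0] for _ in range(m)]
    changed = True
    while changed:
        changed = False
        for i in range(m):
            if bounds[i][0] == bounds[i][1]:      # already fixed
                continue
            e = np.zeros(m)
            e[i] = 1.0
            kw = dict(A_eq=A.T, b_eq=c1, bounds=bounds, method="highs")
            hi = -linprog(-e, **kw).fun           # maximize u_i over P
            lo = linprog(e, **kw).fun             # minimize u_i over P
            if hi < 1 - 1e-7:                     # max < 1  =>  u_i = 0
                bounds[i] = [0.0, 0.0]
                changed = True
            elif lo > 1e-7:                       # min > 0  =>  u_i = 1
                bounds[i] = [1.0, 1.0]
                changed = True
    return bounds    # coordinates with equal lower and upper bound are fixed
```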

6 Conclusion

According to Galbraith’s metric for the challenge C2 from Sect. 2.1, the results of Sect. 5 can be seen as a total break of binary matrix LWE. On the other hand, one could easily avoid weak instances I by simply rejecting weak I’s during ciphertext generation. This would however violate the idea of lightweight encryption with binary matrix LWE.

Still, during our experiments we got the feeling that the vectorial integer subset sum problem indeed becomes hard for large m, even for its weakest instances. So Galbraith’s variant might be safely instantiated for large m, but currently we find it hard to determine values of m that fulfill a concrete security level of e.g. 128 bits. One possibility to render our attack inapplicable is to change the parameters such that modular reductions \({}\!\bmod q\) occur in Eq. (1), since our attack crucially relies on the fact that we work over \({\mathbb {Z}}\). Note here that while there are standard ways to model modular reduction in an ILP as \({\varvec{c}}_1 = {\varvec{u}}\mathbf {A}- {\varvec{k}} q\) with integral \({\varvec{k}}\), this renders LP relaxation useless: by allowing non-integral \({\varvec{k}}\), any value of \({\varvec{c}}_1\) becomes feasible for any \({\varvec{u}}\).