C A Hardness of Approximation Result for (DNF ◦ MOD m )-MCSP

The reduction from (DNF◦MODm)-MCSP^∗ to (DNF◦MODm)-MCSPpresented in Section 3 is notapproximation-preserving: given a partial functionf:Z^tm→ {0,1,∗}, it produces a total functiong:Z^O(t^m ^log^m)→ {0,1}such thatDNF_MOD_m(g) =DNF_MOD_m(f) +|f⁻¹(∗)|. The reduction introduces an additive term|f⁻¹(∗)|, and hence a (multiplicative) approximation ofDNF_MOD_m(g) does not give a good approximation ofDNF_MOD_m(f). In order to fix this situation, we give an approximation-preserving reduction. Our approach is inspired by a reduction described in [5].

ITheorem 45 (Approximation-preserving version of Corollary 27). There is a polynomial-time algorithm that, given the truth table of a partial function f:Z^tm → {0,1,∗}, produces the truth table of a total function g:Z^2t+2sm → {0,1}such that

DNFMOD_m(g) =|f⁻¹(∗)| ·(DNFMOD_m(f) + 1), wheres:=d(6t+ 4) logm+ 2e.

Proof. The idea of the proof is to amplify the circuit size forf; that is, we would like to force any circuitC computingg to also compute sub-functions corresponding to|f⁻¹(∗)|

copies off.

We can amplify the circuit size as follows. Let (L_x)_x∈f−1(∗) be a scattered collection of linear subspaces ofZ^sm. Define a function g⁰ by g⁰(x, z, w) := f(x) if z ∈ f⁻¹(∗) and w∈ L_z; otherwise g⁰(x, z, w) := 0. Then, under an appropriate choice of parameters, it can be shown that DNFMOD_m(g⁰) = |f⁻¹(∗)| ·DNFMOD_m(f). By combining an analogous reduction and the idea behind the proof of Theorem 16, we can obtain a total functiong such thatDNFMODm(g) =DNFMODm(g⁰) +|f⁻¹(∗)|=|f⁻¹(∗)| ·(DNFMODm(f) + 1).⁹ Details follow.

9 A black-box application of Corollary 27 produces a functiongsuch thatDNF_MOD_m(g) =DNF_MOD_m(g⁰) +

|g⁰⁻¹(∗)|, which is not sufficient for our purpose because|g⁰⁻¹(∗)|is larger than|f⁻¹(∗)|.

We first obtain a scattered collection (Lx)_x∈f⁻¹_(∗) ofr-dimensional linear subspaces of Z^smby using Theorem 25 forr:= 2t+ 2. Then we defineg:Z^2t+2sm → {0,1}as

g(x, y, z, w) :=











f(x) (iff(x)∈ {0,1} andy= 0^sandf(z) =∗ andw∈L_z) 1 (iff(x) =∗andy∈Lx)

0 (otherwise) for any ((x, y),(z, w))∈(Z^sm×Z^tm)².

IClaim 46(Analogue of Claim 17). DNFMOD_m(g)≤ |f⁻¹(∗)| ·(DNFMOD_m(f) + 1).

Proof. Suppose that aDNF◦MODmcircuitC=WK

k=1Ck computesf. For eachx^∗∈f⁻¹(∗), take anAND◦MOD_mcircuitC_x^∗ accepting{x^∗} ×L_x^∗ (by Lemma 2). Define

C⁰(x, y, z, w) := _

z^∗∈f⁻¹(∗) K

k=1

(Ck(x)∧y1= 0∧ · · · ∧ys= 0∧Cz^∗(z, w))∨ _

x^∗∈f⁻¹(∗)

Cx^∗(x, y).

It is easy to see thatC⁰ computesg. J

The rest of the proof is devoted to the reverse direction.

IClaim 47(Analogue of Claim 20). DNFMOD_m(g)≥ |f⁻¹(∗)| ·(DNFMOD_m(f) + 1).

Let C = WK

k=1C_k be a minimum DNF◦MOD_m circuit computing g. In particular, K = DNFMOD_m(g) ≤ |f⁻¹(∗)| ·(DNFMOD_m(f) + 1) ≤ m^2t+1. For each x ∈ f⁻¹(∗), let l(x)∈[K] be one of the indices such that|C_l(x)⁻¹(1)∩({x} ×Lx×Z^t+sm )|is maximized. Since S

k∈[K]C_k⁻¹(1)⊇ {x} ×L_x×Z^t+sm , there are at least |Lx| ·m^t+s/K ≥m^r+t+s/m^2t+1 ≥2 points in the setC_l(x)⁻¹(1)∩({x} ×Lx×Z^t+sm ).

DefineT₀:={C_l(x)|f(x) =∗ }. For eachz∈f⁻¹(∗), letT_zbe the set of allC_k such that k∈[K] andCkaccepts at least 2 elements from{(x,0^s, z)}×Lzfor somex∈f⁻¹(1). We will show that the setsT₀,{T_z}_z∈f−1(∗)are pairwise disjoint, and henceK≥ |T₀|+P

z∈f⁻¹(∗)|T_z|.

We will also prove that |T0|=|f⁻¹(∗)|and|Tz| ≥DNFMOD_m(f), which completes the proof.

IClaim 48. l:f⁻¹(∗)→[K] is injective (hence |T₀|=|f⁻¹(∗)|).

IClaim 49. T₀∩T_z=∅for any z∈f⁻¹(∗).

Since the proofs of these claims are essentially the same as in Claims 21 and 22, respectively (except that we have extra coordinates taking values inZ^tm×Z^sm), we omit them.

IClaim 50. Tz₁∩Tz₂=∅for any distinct elements z1, z2∈f⁻¹(∗).

Proof. The proof is basically the argument from Claim 21. For completeness, we briefly repeat it here. Towards a contradiction, assume that there exists a circuitCk inTz₁∩Tz₂. By the definition ofTz₁ andTz₂, there exist elementsx1, x2∈f⁻¹(1),a6=b∈Lz₁, andc∈Lz₂

such thatCk(x1,0^s, z1, a) =Ck(x1,0^s, z1, b) =Ck(x2,0^s, z2, c) = 1. SinceC_k⁻¹(1) is an affine subspace, we have (x₁,0^s, z₁, a)−(x1,0^s, z₁, b)+(x₂,0^s, z₂, c) = (x₂,0^s, z₂, a−b+c)∈C_k⁻¹(1).

SinceC_k⁻¹(1)∩({(x2,0^s, z2)} ×Z^sm)⊆ {(x2,0^s, z2)} ×Lz₂, we geta−b+c∈Lz₂. However, given thatc∈L_z₂, we obtain 0^s6=a−b∈L_z₁∩L_z₂, which contradictsL_z₁∩L_z₂ ={0^s}. J Fix any z ∈ f⁻¹(∗). For each C_k ∈ T_z, define an AND◦MOD_m circuit C_k⁰ so that C_k⁰⁻¹(1) ={x∈Z^tm|Ck(x,0^s, z, w) = 1 for somew∈Z^sm}. (Note that a projection of an affine subspaceC_k⁻¹(1) is again an affine subpace because a projection is a homomorphism.) Now defineCz:=W

C_k∈TzC_k⁰.

C C C 2 0 1 8

IClaim 51. Cz computesf for any z∈f⁻¹(∗). (In particular,|Tz| ≥DNFMODm(f).) Proof. Fix any x ∈ f⁻¹(1). Since {(x,0^s, z)} ×Lz is covered by S

k∈[K]C_k⁻¹(1), and

|L_z|=m^r,K≤m^2t+1, andr= 2t+ 2, there existsk∈[K] such that there are at least 2 elements in ({(x,0^s, z)} ×Lz)∩C_k⁻¹(1); hence, by the definition ofTz, we have Ck ∈Tz. Moreover,C_k⁰(x) = 1 by the definition ofC_k⁰; thusCz(x) =W

Ck∈TzC_k⁰(x) = 1.

Now fix anyx∈f⁻¹(0). Sinceg(x,0^s, z, w) = 0 for everyw∈Z^sm, we getC_k(x,0^s, z, w) = 0 for anyCk∈Tz; thusC_k⁰(x) = 0, which implies that Cz(x) = 0. J

Combining the claims above, we obtain DNFMOD_m(g) =K ≥ |T0|+ X

z∈f⁻¹(∗)

|Tz| ≥ |f⁻¹(∗)| ·(DNFMOD_m(f) + 1).

This completes the proof of Theorem 45. J

We can then establish a hardness of approximation result for computingDNF_MOD_m(f).

For a functionf:Z^tm→ {0,1}, define|f|:=m^t, which is the number of entries in the truth table of a functionf.

I Theorem 52. There exists a constant c > 0 such that if there is a quasipolynomial- time algorithm which approximates DNFMOD_m(f) to within a factor of clog log|f|, then

NP⊆DTIME(2^(logⁿ⁾^O(1)).

Proof. As noted by Trevisan [42], by choosing the parameters of Feige’s reduction [17], one can obtain hardness of approximation results for ther-bounded set cover problem. While Trevisan only analyzed the case whenr is constant (cf. Theorem 5), a similar analysis¹⁰ shows that it isNP-hard (under quasipolynomial-time many-one reductions) to approximate ther(n)-bounded set cover problem onnpoints within a factor ofγlogr(n) ( =γlog logn) forr(n) := lognand some small constantγ >0.

Suppose thatDNFMOD_m(g) can be approximated to within a factor of (γ/6) log log|g|by an algorithmA, whereg:Z^tm→ {0,1} is a total function. We show below that ifAruns in quasipolynomial time, thenNP⊆DTIME(2^(logⁿ⁾^O(1)).

First, note that in order to conclude this it is enough to describe a quasipolynomial-time algorithm B that approximates r-Bounded Set Cover to within a factor of γlogr(n) for r(n) = logn. Let ([n],S) be an instance of ther-Bounded Set Cover Problem. Algorithm B applies the deterministicnÔ(r(n))-time reduction provided by Corollary 29 to produce a partial Boolean functionf:ZÔ(r^m ^logⁿ⁾→ {0,1,∗}. It then invokes the deterministic reduction from Theorem 45 to construct fromf a total functiong:ZÔ(rm ^logⁿ⁾→ {0,1}. Finally,B uses the approximation algorithmAto compute a (γ/6) log log|g|approximation toDNF_MOD_m(g).

Leteg∈Nbe the value output byA. AlgorithmB outputsKe := 2eg/|f⁻¹(∗)|.

Note thatB runs in quasipolynomial time under our assumptions. It remains to show that it approximates the solution of the original set cover problem within a factor ofγlog logn.

Let K be the cost of an optimal solution to the initial set cover instance. Recall that

10 Specifically, for the parameters and notation in [17], given a 3CNF-5 formula onnvariables, letkbe a sufficiently large constant,m:=√

logn, and`:=clog logmfor a large constantc. Then the output of Feige’s reduction is an instance of the set cover problem onN :=m(5n)^`

points such that each set is of size at mostm2^O(`)≤r(N) = logN, and the gap between yes instances and no instances is (1−_k⁴) lnm= Ω(log logN).

2DNFMODm(f) is a 2-factor approximation forK; that is,K≤2·DNFMODm(f)≤2K. On the other hand, the guarantees of the algorithmA imply that

DNF_MOD_m(g) ≤ ge ≤ DNF_MOD_m(g)·(γ/6) log log|g|.

SinceDNFMOD_m(g) =|f⁻¹(∗)| ·(DNFMOD_m(f) + 1), we get K ≤ 2eg

|f⁻¹(∗)| ≤ (γ/6) log log|g| ·(K+ 1)

Therefore, for large enoughnand on non-trivial instances (i.e.K≥1), the valueKe output byB approximatesKto within a factor of 2·(γ/6) log log|g| ≤(γ/3)·(logr(n) + log logn+

O(1))≤(γ/3)·3 log logn. J

Finally, we note that whenmis prime, it is possible to design a quasipolynomial-time approximation algorithm forDNF_MOD_m(f) with an approximation factor ofO(log|f|).

ITheorem 53. Letpbe a prime number. There is a quasipolynomial-time algorithm which approximates DNF_MOD_p(f)to within a factor of ln|f|.

Proof. Let |f| =p^t be the number of entries in the truth table of f, the input function.

By the results of Section 2.1, computingDNFMOD_p(f) is equivalent to solving a set cover instance. Recall that set cover admits a polynomial-time approximation algorithm that achieves an approximation factor of lnN on instances over a universe of size N (cf. [40]).

Consequently, in order to prove the result it is enough to verify that computingDNF_MOD_p(f) reduces to a set cover instance with domain sizeNf :=|f⁻¹(1)| ≤ |f|and of size at most quasipolynomial in|f|.

Indeed, for a non-zero function f:Z^tp → {0,1}, DNFMOD_p(f) is exactly the minimum number of affine subspaces that coverf⁻¹(1). Therefore, by relabelling elements, computing DNF_MOD_p(f) reduces to a set cover instance ([N_f],Sf), where a setS ∈S_f if and only ifS viewed as a subset ofZ^tp is an affine subspace contained inf⁻¹(1). Each such affine subspace has dimension at mostt, and can be explicitly described by a basisv₁, . . . , v_`∈Z^tp, where

`≤t, and a vector b∈Z^tp. Hence there are at mostp^O(t²⁾ such spaces, and consequently,

|Sf| ≤p^O(t²⁾. In other words, we get a set cover instance over a ground set of size≤ |f|, and this instance contains at most|f|^O(log^|f|)sets.

Finally, since the sets inSf can be generated in time at most|f|^O(log^|f|), and the set cover approximation algorithm runs in time polynomial in its input length, the result holds. J

C C C 2 0 1 8

No documento LIPIcs – Leibniz International Proceedings in Informatics (páginas 122-126)