5 More Examples of Generation Problems - 13th International Computer Science Symposium in Russi

5.1 Monotone Integer Programming [12–14,36]

Consider the following system of linear inequalities

(*) Ax ≥ b, ¯0 ≤ x ≤ c, where A ∈ R^m×n, b ∈ R^m₊, c ∈ Rⁿ₊ (that is, coordinates of ccan take value +∞),x∈Zⁿ₊, and ¯0∈Rⁿ₊ is the origin.

System (*) is calledmonotoneifAx≥b⇒Ax ≥b ∀x, x|¯0≤x≤x≤c.

Complexity of Generation 9 Non-negativity of the matrix,A∈R^m×n₊ , is suﬃcient for monotonicity, but not necessary. Note that without monotonicity, already veriﬁcation of solvability of (*) is NP-complete, while for monotone systems it is trivial. In this case we will consider monotone generation.

Problem (K): generate all minimal integer solutionsZ(A, b, c) to (*).

If A is a (0,1) matrix, xis a (0,1) n-vector, and b =e is the vector of m ones, then (K) is the dualization of the hypergraphH deﬁned by the rows ofA;

then, a (minimal) solutionxto (*) is a (minimal) transversal toH.

We generalize, replacing a (0,1) vectorxby a non-negative integer one,b=e by an arbitrary b ∈ R^m+, and ﬁnally, H(A, b, c) by Z(A, b, c), thus, replacing the Boolean cube by the direct product of discrete intervals (a box). These replacements keeps all basic properties [9].

Now, let us Consider the dual problem.

Problem (L): generate M IS(Z(A, b, c)) = Z^dc(A, b, c), in other words, all maximalx∈Zⁿ₊not feasible to the monotone system (*), that is,x∈Z(A, b, c).

Problem (K) is uniformly DB, satisfying the inequality inequality

|Z^d(A, b, c)| ≤mn|Z(A, b, c)|,

which is “almost” precise. Namely, there exists an example with

|Z^d(A, b, c)| ≥ mn

2log²m|Z(A, b, c)|.

An incremental QP generation algorithm for Z(A, b, c) is provided by the method of joint generation, while generatingZ^d(A, b, c) is NP-complete [12].

5.2 Maximal Frequent and Minimal Infrequent Sets in Data Mining [1,17,24]

Given a binary m×n matrix A and positive integer t, a set C of columns is called t-frequent if there exist at leasttrows such that in the obtained t× |C|

submatrix all entries are 1s; otherwise C is called t-infrequent. Let α and β denote the numbers of the maximal t-frequent and minimal t-infrequent sets, respectively. The DB inequality α≤(m−t+ 1)β, implies that generating all minimalt-infrequent sets is incremental QP. In contrast, generating all maximal t-frequent sets is NP-complete. See [17] for more details.

5.3 Minimal Strongly Connected Subgraphs and Dicuts [35]

Given a (strongly connected) digraphG= (V, E), generate all

– (K) minimal edge-setsE⊆Esuch that the digraphG= (V, E) is strongly connected;

– (L) minimal dicuts, that is, minimal edge-setsE⊆E such that the digraph G= (V, E\E) is not strongly connected.

These dual problems are not DB; the size of each set may be exponential in the size of the other. Problem (K) is solved in incremental P time by the supergraph method, while (L) is NP-complete [35]. The proof of the latter claim is based on the “sausage lemma” (Sect.2.4and [25]) but requires extra tricks [35].

10 V. Gurvich

5.4 Generating Generalized Paths, Cuts, and Spanning Sets in Graphs [32,33,36,45]

Given a (non-directed connected) graphG= (V, E) andk edge-subsetsE1, . . . , E_k ⊆E, for every I ⊆[k] deﬁne the graph G_I = (V,∪i∈IE_i) and consider the following two pairs of problems. Generate all

– (K1) generalized spanning sets, that is, all minimalI⊆[k] ={1, . . . , k}such that graphG_I is connected;

– (L1) generalized complementary cuts, that is, all maximalI⊆[k] such that graphG_I is not connected.

Given also two poless, t∈V,generate all

– (K2) generalized s-t-paths, that is, all minimal I ⊆ [k] such that s and t belong to one connected component ofG_I.

– (L2) generalized complementarys-t-cuts, that is, all maximal I ⊆[k] such thatsandtare in distinct connected component ofG_I.

Problem (K1) is QP uniformly DB and, hence, can be solved in incremental QP time. Three other problems are NP-complete; all three proofs are based on the sausage lemma.

5.5 Spanning Linear Spaces by Linear Subspaces [10,34]

LetL={L₁, . . . , L_k}be a set of linear subspaces in a spaceL=F^dof dimension dover a ﬁeldF and t≤dbe a positive integer threshold, for every I⊆[k] set L_I =∪i∈IL_i, or in other words,L_I is spanned byL_i, i∈I;

Consider the following pair of dual problems. Generate all – (K) minimal subsetsI⊆[k] such thatdim(L_I)≥t;

– (L) maximal subsetsI⊆[k] such that dim(L_I)< t.

Ift=dthenL_I =L, that is,L_i, i∈I,span the whole space. Problem (K) is QP uniformly DB and, hence, is incremental QP, while (L) is NP-complete.

Let us ﬁx a basisB in L and assume that subspaces L_i, i∈ I, are deﬁned by subsets of B. In this case problem (K) is reduced to generating all minimal covers ofB by these subsets. This problem is equivalent to dualization.

5.6 Polymatroid Functions and Systems of Polymatroid Inequalities [10,34,38,41]

An integer non-negative set-functionf : 2^V →Z+ is calledpolymatroid if it is submodular,f(I∪I) +f(I∩I)≤f(I) +f(I) for anyI, I∈V, monotone non-decreasing, andf(∅) = 0.

For example, f(I) = dim(L_I) from the previous section is a polymatroid function. A special case of this example is the number of vertices minus the

Complexity of Generation 11 number of connected components of a graph. In other words, given a graph G = (V, E) and a family I of its edge-sets {Ei ⊆ E | i ∈ I}, we set f(I) =

|V| −C(G(V, E_I)), where E_I =∪_i∈IE_i and C(G) is the number of connected components of G. Another example is the number of degrees of freedom of a mechanical system. See [38] for more examples and details.

Given a system of polymatroid inequalities

(*) f_j(I) ≥t_j, j ∈ [r] = {1, . . . , r}, where functions f_j are polymatroid and thresholdst_j are QP inn=|V|, generate all

(K) minimalI⊆V satisfying (*); (L) maximalI⊆V not satisfying (*).

Problem (K) is QP uniformly DB, and, hence, it is incremental QP, while (L) is NP-complete.

5.7 Uniformly DB Inequalities for Polymatroid Systems [10,34]

LetH(f, t) be the family of all minimal subsets ofV satisfying (*).

Standardly,M IS(H(f, t)) denote the family of all maximal subsets ofV not satisfying (*). Let β =|H(f, t)| andα=|M IS(H(f, t)| denote the numbers of sets in these families. Then,

α≤β^log^t/c(n,β)forβ≥2 andα≤nwhenβ = 1, where c=c(n, β) is the (unique) root of the equation

2^c(n^c/^log^β−1) = 1.

This is a DB inequality for (K). It is QP whenevertis QP inn=|V|. This DB inequality can be replaced by a much simpler, although slightly weaker, one:

α≤(nβ)^log^t.

This bound can be viewed as a generalization, from graphs to hypergraphs, of an upper bound for the number of MIS. For graphs we have

2^p≤ |M IS(G)| ≤δ^p+ 1,

where p=p(G) is the maximum size of the induced matchings in Gand δ = δ(G) is the maximum number of pairs of vertices at distance 2; in particular, δ≤_n−1

, where the equality is achieved only on stars. This bound was proven in [4]; slightly weaker results were obtained in [2,43].

For the systems ofrpolymatroid inequalities, we generalize as follows:

α≤rmax(n, β^log^t/c(n,β)), where t= max(t₁, . . . , t_r).

Interestingly, the coeﬃcient 1/c= 1/c(n, β) in the exponent is accurate and cannot be reduced.

12 V. Gurvich

No documento 13th International Computer Science Symposium in Russia, CSR 2018 (páginas 38-42)