Top Qs
Timeline
Chat
Perspective

Intersection number (graph theory)

Fewest cliques covering a graph's edges From Wikipedia, the free encyclopedia

Intersection number (graph theory)
Remove ads

In the mathematical field of graph theory, the intersection number of a graph is the smallest number of elements needed to represent as an intersection graph of finite sets. In such a representation, each vertex is represented as a set, and two vertices are connected by an edge whenever their sets have a common element. The intersection number equals the smallest number of cliques needed to cover all of the edges of .[1][2]

Thumb
A graph with intersection number four. The four shaded regions indicate four cliques that cover all the edges of the graph. In an intersection representation, each vertex can be represented by the subset of these cliques that it belongs to.

Intersection numbers have been studied under multiple names. A set of cliques that cover all edges of a graph is called a clique edge cover[3] or edge clique cover,[4] or even just a clique cover, although the last term is ambiguous: a clique cover can also be a set of cliques that cover all vertices of a graph.[5] Sometimes "covering" is used in place of "cover".[6] As well as being called the intersection number, the minimum number of these cliques has been called the R-content,[7] edge clique cover number,[4] or clique cover number.[8] The problem of computing the intersection number has been called the intersection number problem,[9] the intersection graph basis problem,[10] covering by cliques,[10] the edge clique cover problem,[9] and the keyword conflict problem.[2]

Every graph with vertices and edges has intersection number at most . The intersection number is NP-hard to compute or approximate, but fixed-parameter tractable.

Remove ads

Definitions

Intersection graphs

Let be any family of sets, allowing sets in to be repeated. Then the intersection graph of is an undirected graph that has a vertex for each set in and an edge between each two sets that have a nonempty intersection. Every graph can be represented as an intersection graph in this way.[11] The intersection number of the graph is the smallest number such that there exists a representation of this type for which the union of the sets in has elements.[1] The problem of finding an intersection representation of a graph, using a given number of elements, is known as the intersection graph basis problem.[10]

Clique edge covers

An alternative definition of the intersection number of a graph is that it is the smallest number of cliques in (complete subgraphs of ) that together cover all of the edges of .[1][12] A set of cliques with this property is known as a clique edge cover or edge clique cover, and for this reason the intersection number is also sometimes called the edge clique cover number.[4]

Equivalence

The equality of the intersection number and the edge clique cover number has a short proof. In one direction, suppose that is the intersection graph of a family of sets whose union has elements. Then for any element , the sets in that contain form a clique in , because each pair of these sets has a non-empty intersection containing . Further, the cliques formed in this way cover every edge in : if two sets in form an edge by having a non-empty intersection, then that edge is contained in the clique for any element of the intersection. Therefore, the edges of can be covered by cliques, one per element of .[12]

In the other direction, if a graph can be covered by cliques, then each vertex of may be represented by a subset of the cliques, the ones that contain vertex . Two of these subsets, for two vertices and , have a nonempty intersection if and only if there is a clique in the intersection that contains both of them, if and only if there is an edge included in one of the covering cliques.[12]

Remove ads

Applications

Summarize
Perspective

The representation of a graph as an abstract intersection graph of sets can be used to construct more concrete geometric intersection representations of the same graph. In particular, if a graph has intersection number , it can be represented as an intersection graph of -dimensional unit hyperspheres. That is, its sphericity is at most .[4]

A clique cover can be used as a kind of adjacency labelling scheme for a graph, in which each vertex is labeled by a binary value, in a way that allows the existence of an edge between two vertices to be tested quickly by comparing their values. These labels have one bit per clique, which is set to zero if the vertex does not belong to the clique and one if the vertex belongs to the clique. With this labeling scheme, two vertices are adjacent if and only if the bitwise and of their labels is nonzero. The length of the labels is the intersection number of the graph. When this length is small, a computer representation of the graph that uses only these label can use less memory than explicit methods such as adjacency lists, and have faster tests for whether two vertices are adjacent. This method was used in an early application of intersection numbers, for labeling a set of keywords so that conflicting keywords could be quickly detected, by E. Kellerman of IBM. For this reason, another name for the problem of computing intersection numbers is the keyword conflict problem.[13][14] Similarly, in computational geometry, representations based on the intersection number have been considered as a compact representation for visibility graphs, but there exist geometric inputs for which this representation requires a near-quadratic number of cliques.[15]

Another class of applications comes from scheduling problems in which multiple users of a shared resource should be scheduled for time slots, in such a way that incompatible requests are never scheduled for the same time slot but all pairs of compatible requests are given at least one time slot together. The intersection number of a graph of compatibilities gives the minimum number of time slots needed for such a schedule: one slot for each clique in a clique cover of the compatibility graph.[2] In the design of compilers for very long instruction word computers, a small clique cover of a graph of incompatible operations can be used to represent their incompatibilities by a small number of artificial resources (one resource per clique), allowing resource-based scheduling techniques to be used to assign operations to instruction slots.[16]

Shephard and Vetta observe that the intersection number of any network equals the minimum number of constraints needed in an integer programming formulation of the problem of computing maximum independent sets, in which one has a 0-1 variable per vertex and a constraint that in each clique of a clique cover the variables sum to at most one. They argue that, for the intersection graphs of paths in certain fiber optic communications networks, these intersection numbers are small, explaining the relative ease of solving certain optimization problems in allocating bandwidth on the networks.[3]

In statistics and data visualization, edge clique covers of a graph representing statistically indistinguishable pairs of variables are used to produce compact letter displays that assist in visualizing multiple pairwise comparisons. These displays are formed by choosing a letter or other visual marker for each clique, and then labeling each variable by the letters for the cliques that it belongs to. This method provides a graphical representation of which variables are indistinguishable: they are indistinguishable if they share at least one letter in their labels.[17][18]

In the analysis of food webs describing predator-prey relationships among animal species, a competition graph or niche overlap graph is an undirected graph in which the vertices represent species, and edges represent pairs of species that both compete for the same prey. These can be derived from a directed acyclic graph representing predator-prey relations by drawing an edge in the competition graph whenever there exists a prey species such that the predator-prey relation graph has edges and . Every competition graph must have at least one isolated vertex, and the competition number of an arbitrary graph represents the smallest number of isolated vertices that could be added to make it into a competition graph. Biologically, if part of a competition graph is observed, then the competition number represents the smallest possible number of unobserved prey species needed to explain it. The competition number is at most equal to the intersection number: one can transform any undirected graph into a competition graph by adding a prey species for each clique in an edge clique cover. However, this relation is not exact, because it is also possible for the predator species to be prey of other species. In a graph with vertices, at most of them can be the prey of more than one other species, so the competition number is at least the intersection number minus .[19]

Edge clique covers have also been used to infer the existence of protein complexes, systems of mutually interacting proteins, from protein–protein interaction networks describing only the pairwise interactions between proteins.[20] More generally, Guillaume and Latapy have argued that, for complex networks of all types, replacing the network by a bipartite graph connecting its vertices to the cliques in a clique cover highlights the structure in the network.[21]

Remove ads

Upper bounds

Summarize
Perspective

Any graph with edges has intersection number at most . Each edge is itself a two-vertex clique. There are of these cliques and together they cover all the edges.[22] It is also true that every graph with vertices has intersection number at most . More strongly, the edges of every -vertex graph can be covered by at most cliques, all of which are either single edges or triangles. A greedy algorithm can find this cover: remove any two adjacent vertices and inductively cover the remaining graph. Restoring the two removed vertices, cover edges to their shared neighbors by triangles, leaving edges to unshared neighbors as two-vertex cliques. The inductive cover has at most cliques, and the two removed vertices contribute at most cliques, maximized when all other vertices are unshared neighbors and the edge between the two vertices must be used as a clique. Adding these two quantities gives cliques total.[2][12] This generalizes Mantel's theorem that a triangle-free graph has at most edges, for in a triangle-free graph the only optimal clique edge cover has one clique per edge and therefore the intersection number equals the number of edges.[2]

An even tighter bound is possible when the number of edges is strictly greater than . Let be the number of pairs of vertices that are not connected by an edge in the given graph , and let be the unique integer for which . Then the intersection number of is at most .[2][23] Graphs that are the complement of a sparse graph have small intersection numbers: the intersection number of any -vertex graph is at most , where is the base of the natural logarithm and is the maximum degree of the complement graph of .[6]

It follows from deep results on the structure of claw-free graphs that, when a connected -vertex claw-free graph has at least three independent vertices, it has intersection number at most . It remains an unsolved problem whether this is true of all claw-free graphs without requiring them to have large independent sets.[8] An important subclass of the claw-free graphs are the line graphs, graphs representing edges and touching pairs of edges of some other graph . An optimal clique cover of the line graph may be formed with one clique for each triangle in that has two or three degree-2 vertices, and one clique for each vertex that has degree at least two and is not a degree-two vertex of one of these triangles. The intersection number is the number of cliques of these two types.[7]

In the Erdős–Rényi–Gilbert model of random graphs, in which all graphs on labeled vertices are equally likely (or equivalently, each edge is present or absent, independently of other edges, with probability ) the intersection number of an -vertex random graph is with high probability smaller by a factor of than the number of edges. In these graphs, the maximum cliques have (with high probability) only a logarithmic number of vertices, implying that this many of them are needed to cover all edges. The tricker part of the bound is proving that it is possible to find enough logarithmically sized cliques to cover many edges, allowing the remaining leftover edges to be covered by two-vertex cliques.[24][25]

Much of the early research on intersection numbers involved calculating these numbers on various specific graphs, such as the graphs formed by removing a complete subgraph or a perfect matching from a larger complete graph.[26]

Remove ads

Computational complexity

Testing whether a given graph has intersection number at most a given number is NP-complete.[10][7][14] Therefore, it is also NP-hard to compute the intersection number of a given graph. In turn, the hardness of the intersection number has been used to prove that it is NP-complete to recognize the squares of split graphs.[27]

The problem of computing the intersection number is, however, fixed-parameter tractable: that is, it can be solved in an amount of time bounded by a polynomial in multiplied by a larger but computable function of the intersection number .[5][28] This may be shown by observing that there are at most distinct closed neighborhoods in the graph – two vertices that belong to the same set of cliques have the same neighborhood – and that the graph formed by selecting one vertex per closed neighborhood has the same intersection number as the original graph.[5][29] Therefore, in polynomial time the input can be reduced to a smaller kernel with at most vertices. Applying a brute-force search over at most assignments of distinct sets of cliques to the remaining vertices gives time double exponential in .[5][28] The double-exponential dependence on cannot be reduced to single exponential by a kernelization of polynomial size, unless the polynomial hierarchy collapses,[30] and if the exponential time hypothesis is true then double-exponential dependence is necessary regardless of whether kernelization is used.[28] On graphs of bounded treewidth, dynamic programming on a tree decomposition of the graph can find the intersection number in linear time,[31][20] but simpler algorithms based on finite sets of reduction rules do not work.[31]

There exists a constant such that the problem cannot be approximated in polynomial time with an approximation ratio better than .[32] The best approximation ratio that has been found is better than the trivial by only a polylogarithmic factor.[5] Researchers in this area have also investigated the computational efficiency of heuristics, without guarantees on the solution quality they produce, and their behavior on real-world networks.[5][33]

More efficient algorithms are known for certain special classes of graphs. The intersection number of an interval graph is always equal to its number of maximal cliques, which may be computed in polynomial time.[34][35] More generally, in chordal graphs, the intersection number may be computed by an algorithm that considers the vertices in an elimination ordering of the graph (an ordering in which each vertex and its later neighbors form a clique) and that, for each vertex , forms a clique for and its later neighbors whenever at least one of the edges incident to is not covered by any earlier clique.[35] It is also possible to find the intersection number in linear time in circular-arc graphs.[36] However, although these graphs have only a polynomial number of cliques to choose among for the cover, having few cliques alone is not enough to make the problem easy: there exist families of graphs with polynomially many cliques for which the intersection number remains NP-hard.[9] The intersection number can also be found in polynomial time for graphs whose maximum degree is five, but is NP-hard for graphs of maximum degree six.[37][38] On planar graphs, computing the intersection number exactly remains NP-hard, but it has a polynomial-time approximation scheme based on Baker's technique.[20]

Remove ads

See also

  • Bipartite dimension, the smallest number of bicliques needed to cover all edges of a graph
  • Bound graph, a type of graph characterized by clique edge covers of a special form
  • Clique cover, the NP-hard problem of finding a small number of cliques that cover all vertices of a graph

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads