Sets of binary sequences with small total Hamming distances
Introduction
In order to save power and increase speed of data processing, it is preferable to represent data in the form of sequences that are close together. To quantify closeness, we may consider the number of pairs of sequences at distance 1 from each other [1]. Other criteria for closeness such as the diameter, connectivity, and neighborhood of a hypercube representing the data are also considered [2], [3], [4]. In this paper, we consider a different criterion, namely, the total Hamming distance between all pairs of sequences and specify sets of binary sequences as close as possible under this criterion. Although in practice, data is represented by finite sequences, in order to simplify notation and not restrict the lengths of sequences, we consider infinite sequences. Actually, to construct a set of n binary sequences with minimum total Hamming distance, it suffices to consider sequences of lengths no more than .
Let and be binary sequences. The Hamming distance, , between s and is the number of positions i for which [3]. Henceforth we refer to the Hamming distance simply as distance. Since s and are binary sequences, .
Let be a finite set of binary sequences. We define the total distance of , , to be the sum of the distances between pairs of sequences in , i.e.,
Given a positive integer n, we are interested in finding the minimum possible total distance, denoted by , among all sets of n binary sequences and sets that achieve this minimum. For each , we define a set of n binary sequences, , which plays an important role in our investigation. This set consists of the all-0's sequence and the sequences, each having a single 1 and this 1 is in one of the first positions.
The main result is stated in the next section which gives an expression for and sets that achieve this minimum total distance. The proof is provided in Section 3.
Section snippets
Result
Our main result is the following theorem. Theorem 1 For , which is achieved by the set composed of the four sequences , , , and , where the dots stand for 0's. For , which is achieved by the set composed of the eight sequences , , , , , , , and . For all other values of , which is achieved by .
Proof of Theorem 1
Interestingly, as shown next, the total distance of a set of binary sequences can be determined easily from the number of sequences and their sum over the real numbers. Define the set sum of to be .
Lemma 1 Let be a set of n binary sequences with set sum . Then, the total distance of is given by
Proof From the definitions of the distance between two sequences and the total distance of a set, we have
References (4)
Maximum number of edges joining vertices on a cube
Inf. Process. Lett.
(2003)- et al.
A note about some properties of BC graphs
Inf. Process. Lett.
(2008)