1. Introduction
Coding theoretic experience has shown that considering linear codes over finite fields often yields significant complexity advantages over the nonlinear counterparts, particularly when it comes to complex tasks such as encoding and decoding. On the other side, it was recognized early [
1,
2] that the class of binary block codes contains excellent code families, which were not linear (Preparata, Kerdock codes, Goethals and Goethals–Delsarte codes). For a long time, it could not be explained why these families exhibit formal duality properties in terms of their distance enumerators that occur only on those among linear codes and their duals.
A true breakthrough in the understanding of this behavior came in the early 1990s when, after preceding work by Nechaev [
3], the paper by Hammons et al. [
4] discovered that these families allow a representation in terms of
-linear codes.
A crucial condition for this ring-theoretic representation was that was equipped with an alternative metric, the Lee weight, rather than with the traditional Hamming weight, which only distinguishes whether an element is zero or non-zero. The Lee weight is finer, assigning 2 a higher weight than the other non-zero elements of this ring.
The fact that the traditional settings of linear coding theory (finite fields endowed with the Hamming metric) are actually too narrow, which suggests expanding the theory in at least two directions: on the algebraic part, the next more natural algebraic structure serving as alphabet for linear coding is that of finite rings (and modules). On the metrical part, the appropriateness of the Lee weight for -linear coding suggests that the distance function for a generalized coding theory requires generalization as well.
Since these ground-breaking observations, an entire discipline arose within algebraic coding theory. A considerable community of scholars have been developing results in various directions, among them code duality, weight-enumeration, code equivalence, weight functions, homogeneous weights, existence bounds, code optimality and decoding schemes, to mention only a few.
The paper at hand aims at providing a further contribution to this discipline, by introducing the overweight on a finite ring. This weight is a generalization of the Lee weight over
, as well as of the weight introduced in [
5] by Krotov over
for any positive integer
s, which was further generalized to
in [
6].
We study the relations of this new weight to other well-known weights over rings and state several properties of the overweight, such as its extremal property. We also develop a number of standard existence bounds, such as a Singleton bound, a sphere-packing bound, a Plotkin bound and a version of the (assertive) Gilbert–Varshamov bound.
In the final part of this article, we derive a general Johnson bound for the homogeneous weight on a finite Frobenius ring. This result is important, as it is closely connected to list decoding capabilities.
2. Preliminaries
Throughout this paper, we will consider R to be a finite ring with identity, denoted by 1. If R is a finite ring, we denote by its group of invertible elements, also known as units.
Let us recall some preliminaries in coding theory, where we focus on ring-linear coding theory.
For a prime power q, let us denote by the finite field with q elements and, for a positive integer m, we denote by the ring of integers modulo m.
In traditional coding theory, we consider a linear code to be a subspace of a vector space over a finite field.
Definition 1. Let q be a prime power, and let be non-negative integers. A linear subspace C of of dimension k is called a linear code.
We define a weight in a general way.
Definition 2. Let R be a finite ring. A real-valued function w on R is called a weight if it is a non-negative function that maps 0 to 0.
It is natural to identify w with its additive extension to , and so we will always write for all . Every weight w on R induces a distance by . Again, we will identify d with its natural additive extension to .
If the weight additionally is positive definite, symmetric and satisfies the triangular inequality, that is,
- 1.
and for all ,
- 2.
for all ,
- 3.
for all ,
then the induced distance inherits these properties, i.e.,
- 1.
for all and if and only if .
- 2.
for all ,
- 3.
for all .
The most prominent and best studied weight in traditional coding theory is the Hamming weight.
Definition 3. Let . The Hamming weight of a vector is defined as the size of its supportand the Hamming distance between x and is given by The minimum Hamming distance of a code is then defined as the minimum distance between two different codewords
Note that the concept of minimum distance can be applied for any underlying distance d.
In the paper at hand, we focus on a more general setting where the ambient space is a module over a finite ring.
Definition 4. Let and let R be a finite ring. A submodule C of of size is called a left R-linear code.
The most studied ambient space for ring-linear coding theory is the integers modulo 4, denoted by , endowed with the Lee metric.
Definition 5. For , its Lee weight is defined as One of the most prominent generalizations of the Lee weight over is the homogeneous weight.
Definition 6. Let R be a Frobenius ring. A weight is called (left) homogeneous of average value , if and the following conditions hold:
- (i)
For all with , we have that .
- (ii)
For every non-zero ideal , it holds that
We will denote the homogeneous weight with .
The homogeneous weight was first introduced by Constantinescu and Heise in [
7] in the context of coding over integer residue rings. It was later generalized by Greferath and Schmidt [
8] to arbitrary finite rings, where the ideal
I in Definition 6 was assumed to be a principal ideal. In its original form, however, the homogeneous weight only exists on finite Frobenius rings. It can be shown that a left homogeneous weight is at the same time right homogeneous, and for this reason, we will omit the reference to any side for the sequel. In [
9], Honold and Nechaev finally generalized the notion of homogeneous weight to some finite modules, called weighted modules, over a (not necessarily commutative) ring
R with identity.
Since we will establish a Plotkin bound for a new weight, let us recall here the Plotkin bound over finite fields equipped with the Hamming metric.
Theorem 1 (Plotkin bound).
Let C be an block code over with minimum Hamming distance If , then For the homogeneous weight, the following Plotkin bound was established in [
10].
Theorem 2 (Plotkin bound for homogeneous weights, [
10]).
Let be a homogeneous weight of average value γ on R, and let C be an block code over R with minimum homogeneous distance d. If , then 3. Overweight
As the Hamming weight defined over the binary can be generalized to larger ambient spaces in different ways resulting in different metrics, such as the Hamming weight over
or the Lee weight over
; in addition, the Lee weight over
can be generalized in different ways. For example, the weight defined in [
5] over
for any positive integer
m is a possible generalization, but the most prominent generalization is the homogeneous weight (see for example [
10]). In this section, we introduce a new generalization, called the
overweight. This weight shows some interesting properties and relations to the homogeneous weight and can additionally be seen as a generalization of the weight defined in [
5] over
for any positive integer
s and the weight defined in [
6] over
.
Definition 7. Let R be a finite ring. The overweight
on R is defined as We also denote by W its additive expansion to , given by
Let us call the distance which is induced by the overweight the overweight distance, and denote it by D, i.e., .
The motivation of introducing this new weight is twofold: on one hand, it is theoretically interesting to explore a new generalization of the Lee weight over and its connections to other known weights over rings. On the other hand, the overweight would also be perfectly suitable for a channel, where unit errors are more likely.
Note that the overweight is designed to satisfy the following criteria: it is positive definite, symmetric, satisfies the triangular inequality and distinguishes between units and non-zero non-units. Furthermore, it is extremal in the sense that, on a big family of rings, any increase of the weight of non-zero non-units would violate the triangular inequality, thus the name overweight. We will now study this extremal property in more details.
We can consider weights with values in
, for some
, without fixing the subsets of
R where these values are attained. Thus, we are considering the generic weight function
where
and
Such a weight is always positive definite. In addition, the weight is symmetric if and only if
and
contain all additive inverses of their elements. Let us now consider the triangular inequality: if there exist
such that
, then we must have
Thus, in order for f to be an extremal weight, one chooses .
The overweight is a special case of such a weight function f with the choice . The existence of elements such that is satisfied for many rings—for example, for rings with a non-trivial Jacobson radical.
Relations to Other Weights
Clearly, the homogeneous weight and the overweight coincide with the Lee weight on
, with the Hamming metric on finite fields
, and finally with the weight [
6] on
Proposition 1. The overweight over finite chain rings gives an upper bound on the normalized homogeneous weight.
Proof. Over a finite chain ring with socle
S and residue field size
q, we have that the normalized homogeneous weight is defined as
If
, then also
, and
If
, then
and finally, if
, we have that
which implies the result. □
In [
11], Bachoc defines the following weight on
-algebras
A, with units
as follows:
This is in the same spirit as the overweight. The weight of Bachoc is, however, only assuming positive definiteness. We note that, whenever we have a -algebra, the two weights coincide. The overweight can thus also be seen as a generalization of Bachoc’s weight to a general finite ring.
Let us illustrate this connection with some examples: we consider the ring of matrices over and the ring . In both cases, the Bachoc weight only coincides with the homogeneous and the overweight in the case .
Finally, in [
5], Krotov defines the following weight over
, for any positive integer
m:
Clearly, this is a further generalization of the Lee weight over and thus coincides there with the homogeneous and the overweight. However, even more is true: the weight of Krotov and the overweight coincide over , for any positive integer s. Thus, the overweight may be considered as a generalization of Krotov’s weight over for any positive integer s.
Let us give some examples to illustrate the differences between the above-mentioned weights.
Example 1. In the following table, denotes the Hamming weight, the normalized homogeneous weight, denotes the Lee weight, denotes Krotov’s weight, denotes Bachoc’s weight and finally W denotes the overweight. Let us consider two easy but pathological cases, namely for Table 1 and for Table 2. Finally, another interesting connection to the Hamming weight arises by considering the following linear injective isometry.
Lemma 1. The mapis a linear isometry. Recall that, over , the overweight coincides with the weight of Bachoc and the homogeneous weight.
4. Bounds for the Overweight
In this section, we develop several bounds for the overweight, such as a Singleton bound, a sphere-packing bound, a Gilbert–Varshamov bound and a Plotkin bound.
For this, let us first define the minimum overweight distance of a code.
Definition 8. Let be a code. The minimum overweight distance of C is then denoted by and defined as 4.1. A Singleton Bound
The Singleton bound usually follows a puncturing argument, which is possible for the overweight, but gives the same result as applying the following observation:
Remark 1. For all , we have thatwhere denotes the Hamming weight. Hence, using the Singleton bound for the Hamming metric directly gives a Singleton bound for the overweight.
Proposition 2. Let be a code of size M and minimum overweight distance d. Then, Example 2. A trivial example for a code achieving the Singleton bound in Proposition 2 is given by , having and minimum overweight distance .
However, if we define the rank of a linear code
C, denoted by
, to be the minimal number of generators of
C, then the following bound is known for principal ideal rings [
12,
13]
Codes achieving this bound are called Maximum Distance with respect to Rank (MDR) codes, in order to differentiate from MDS codes. This is a sharper bound than the usual Singleton bound, since for non-free codes we have .
In the case of linear codes, the rank thus also leads to a sharper Singleton-like bound for the overweight.
Proposition 3. Let R be a principal ideal ring. Let be a linear code of rank and minimum overweight distance d. Then, Example 3. As an example for a code, we can consider , having minimum overweight distance .
4.2. A Sphere-Packing Bound
The sphere-packing bound as well as the Gilbert–Varshamov bound are generic bounds, and we are able to provide them for the overweight in a simple form involving the volume of the balls in the underlying metric space.
We begin by defining balls with respect to the overweight distance.
Definition 9. For a given radius , the overweight ball
of radius r centered in x is defined as Clearly, the volume of such a ball is invariant under translations, i.e.,
for all
.
Moreover, setting
and
, we have the generating function
for this weight function, so that the generating function for
W on
takes the form
where we have set
and
, and where the condition
is transformed in
,
. Now, setting
, we obtain the simplified expression for the generating function
Lemma 2. The foregoing implies that the ball of radius e (centered in 0) has volume exactly We thus provided an explicit formula for the cardinality of balls in with respect to the overweight distance.
We now obtain the sphere-packing bound for the overweight distance by combining the previous results. As before, R is a finite ring and , whereas represents the number of non-zero non-units.
Corollary 1 (Sphere-Packing Bound).
Let be a (not necessarily linear) code of length n, and minimum overweight distance . Then, we havewhere the cardinality of is given in Equation (
1).
If the minimum distance is even and R is a finite local ring with maximal ideal J, this bound can be adapted as follows.
Corollary 2. Let R be a local ring with maximal ideal J, and be a (not necessarily linear) code of length and minimum overweight distance . Then,where is the overweight ball of radius e in , and its volume is given in Equation (
1).
Proof. Pick
such that the cosets
form a partition of
R. For all
, define the set
Notice that the sets
form a partition of
R and that all elements of
have mutual overweight distance 1. Thus, given
, we denote with
the unique set
that contains
r. Furthermore, let
be the projection that removes the
’th coordinate and
Now, if are two codewords, then and are disjoint. Indeed, if , then as they cannot be disjoint. Hence, . Furthermore, both and are less than or equal to e, implying that . It follows that , which is a contradiction. □
To find non-trivial examples of perfect codes is as notoriously hard as over finite fields in the Hamming metric. Clearly, in the case
, there are non-trivial perfect codes, as the overweight coincides with the Hamming weight. Examples of such codes can be found in [
5] (Section IV). Furthermore, in the case
, linear 1-perfect codes are classified in terms of their parity-check matrix in [
6] (Theorem IV.1).
4.3. A Gilbert–Varshamov Bound
With arguments similar to those for the sphere-packing bound, we can also obtain a lower bound on the maximal size of a code with a fixed minimum distance.
Proposition 4 (Gilbert–Varshamov bound).
Let R be a finite ring, n a positive integer and . Then, there exists a code of minimum overweight distance at least d satisfyingwhere the volume is given in Equation (
1)
for , i.e., Proof. Assume of minimum overweight distance of at least d is a largest code of length n and minimum distance d. Then, the set of balls centered in the codewords must already cover the space . Since, if this were not the case, one would find an element that is not contained in the ball of radius around any element of C. This word y would have distance at least d to each of the words of C, and thus would be a code of properly larger size with distance at least d, a contradiction to the choice of C.
From the covering argument, we then see that
as desired. □
Let us consider the special case where
R is a finite chain ring. Since the overweight is an additive weight, and the conditions of [
14] are easily verified, we can use [
14] (Theorem 22) to obtain that random linear codes over
achieve the (asymptotic) Gilbert–Varshamov bound with high probability.
Example 4. As an easy example, we can consider . The maximal minimum overweight distance is given by . The Gilbert–Varshamov bound states for this example that there exists a code C with as . For example, the code has four elements.
4.4. A Plotkin Bound
Over a local ring, we can use methods similar to the ones used for the classical Plotkin bound to obtain an analogue of the Plotkin bound for (not necessarily linear) codes equipped with the overweight.
For the rest of this section, R is a finite local ring with maximal ideal J. The notation stems from the Jacobson radical of the ring R. Note that the factor ring is a finite field, whose cardinality will be denoted by q.
Similarly to the Hamming case, for a subset
, we will denote by
the average weight of the subset
A.
Lemma 3. Let be a left or right ideal. Then, Proof. Note that the last case is trivial as . If , then all non-zero elements of I have weight 2, so this case follows as well.
Finally, if , then there are elements of weight 1 and elements of weight 2. Hence, the total weight is and dividing by yields the claim. □
Corollary 3. Let R be a local ring with maximal ideal J and assume that . Then, we have that for all left or right ideals .
Proof. We immediately see that
for all
. Now, consider the case
. We have that
where we used that
. □
To ease the notation, let us denote by
the following
In what follows, we provide a Plotkin bound for the overweight over a local ring R with maximal ideal J. The case is already well studied, since, in this case, R is a field and D is simply the Hamming distance. Hence, we will assume that .
We start with a lemma for the Hamming weight. The proof of it follows the idea of the classical Plotkin bound, which can be found in [
15], and for the homogeneous weight in [
10].
Lemma 4. Let be a subset and P be a probability distribution on I. Then, we have that Proof. If we apply the Cauchy–Schwarz inequality to the latter sum, we obtain that
From which we can conclude. □
We are now ready for the most important step of the Plotkin bound. As before, R is a local ring with non-zero maximal ideal J and .
Proposition 5. Let P be a probability distribution on R. Then, it holds that Proof. Let
and pick
such that
if
. Then, it follows that the cosets
form a partition of
R. For all
, we denote by
It follows that
. By rewriting the initial sum as sum over all cosets, we obtain that
If
, then
defines a probability distribution on
. In this case, we apply Lemma 4 to obtain that
Note that the same inequality also trivially holds if
. Applying this and using that
, we obtain that
where we used that
since
in the last inequality. □
To complete the Plotkin bound for the overweight, we now follow the steps in [
10]. Using Proposition 5, we obtain the following result:
Proposition 6. Let be a (not necessarily linear) code of minimum overweight distance d. Then, Proof. The first inequality follows since the distance between all distinct pairs of C is at least d.
For the second inequality, let
be the projection onto the
ith coordinate. Note that
defines a probability distribution on
R for all
. Using Proposition 5, we obtain that
Thus, we obtain the claim. □
From this inequality, we obtain a Plotkin bound for the overweight distance. As before, R is a local ring with non-zero maximal ideal J and .
Theorem 3 (Plotkin bound for the overweight distance).
Let be a (not necessarily linear) code of minimum overweight distance and assume that . Then, Proof. We divide both sides of the inequality in Proposition 6 by
to obtain that
The result then follows from the assumption that . □
By rearranging the same inequality, we also obtain the following version of the Plotkin bound, which does not require the assumption that .
Corollary 4. Let be a (not necessarily linear) code with and let . Then, Proof. We obtain this by dividing both sides of the inequality in Proposition 6 by , which is non-zero by assumption. □
Remark 2. Note that W is a homogeneous weight on , and thus our bound coincides with the bound from [10] for the homogeneous weight on . Example 5. If we consider codes over and fix , . We obtain that and hence by the integrality that . The linear codeattains this bound. 5. A Johnson Bound for the Homogeneous Weight
Another interesting bound is the Johnson bound due to its relation with list-decodability. In the classical form, the Johnson bound gives an upper bound on the largest size
of a constant-weight
w code over
of length
n and minimum Hamming distance
d. However, for the list-decodability of a code, we are interested in codes having codewords of weight
at mostw. In fact, if the largest size of such a code
is small, e.g., at most a constant
L, then every ball of radius
w contains at most
L codewords and hence one can decode a list of a size at most
L. In more detail, the Johnson bound for list-decodability in the Hamming metric states that, if
where
denotes the relative minimum distance, then
This famous bound is still missing for the well-studied homogeneous weight, which is, like the overweight, a generalization of the Lee weight over
. In this section, we prove a Johnson bound for the homogeneous weight from Definition 6, denoted by
and let
be its average weight on
R. By abuse of notation, we denote with
also the extension of
to
, that is,
Note that
does not necessarily satisfy the triangle inequality. In [
7] (Theorem 2), it is shown that the homogeneous weight on
satisfies the triangle inequality if and only if
m is not divisible by 6.
We define the ball of radius r with respect to a homogeneous weight to be the set of all elements having distance less than or equal to r.
Definition 10. Let and . The ball of radius r centered in y is defined as Our aim is to provide a Johnson bound for the homogeneous weight over Frobenius rings. Thus, we begin by defining list-decodability.
Definition 11. Let R be a finite ring. Given , a code is called list-decodable (with respect to ) if, for every , it holds that Over Frobenius rings, the following result holds, which will play an important role in the proof of the Johnson bound.
Proposition 7 ([
10] (Corollary 3.3)).
Let R be a Frobenius ring, a (not necessarily linear) code of minimum distance d and If , then With this, we obtain an analogue of the Johnson bound for the homogeneous weight.
Theorem 4. Let R be a Frobenius ring and be a (not necessarily linear) code of minimum distance d. Assume that . Then, it holds that C is list-decodable if one of the following conditions is satisfied:
- (i)
We have that .
- (ii)
It holds that .
Proof. Assume that and let . We have to show that, under the given conditions, .
Note first that we may assume that
; otherwise, simply consider the translate
Assume that
are in
We have that
for
, thus using Proposition 7 and
, we obtain that
If we assume that
, then we clearly have
If this is not the case, we see that
is well-defined. Thus, if
then
and hence
It follows that □
Remark 3. Note that the second condition already forces .
Example 6. As an easy example, we can consider the code of minimum homogeneous distance 2 and . The second condition of Theorem 4 is clearly satisfied by choosing sinceimplying that the code is list decodable. For example, when setting , we see thatso the bound is attained. 6. Open Problems
We conclude this paper with some interesting open questions for the newly defined overweight that we have encountered.
Problem 1. Classify the codes that attain the bounds derived in this paper.
Problem 2. Give a Griesmer bound, an Elias-Bassalygo and a Johnson bound for the overweight.
Proving an analogue of a Griesmer, Elias-Bassalygo and Johnson bound poses a difficult challenge over rings and in particular for the overweight, due to the necessity of an effective upper bound on the sum of the distances.