Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Jump to content

Generics in Java

From Wikipedia, the free encyclopedia

Generics are a facility of generic programming that were added to the Java programming language in 2004 within version J2SE 5.0. They were designed to extend Java's type system to allow "a type or method to operate on objects of various types while providing compile-time type safety".[1] The aspect compile-time type safety required that parametrically polymorphic functions are not implemented in the Java virtual machine, since type safety is impossible in this case.[2][3]

The Java collections framework supports generics to specify the type of objects stored in a collection instance.

In 1998, Gilad Bracha, Martin Odersky, David Stoutamire and Philip Wadler created Generic Java, an extension to the Java language to support generic types.[4] Generic Java was incorporated in Java with the addition of wildcards.

Hierarchy and classification

[edit]

According to Java Language Specification:[5]

  • A type variable is an unqualified identifier. Type variables are introduced by generic class declarations, generic interface declarations, generic method declarations, and by generic constructor declarations.
  • A class is generic if it declares one or more type variables.[6] It defines one or more type variables that act as parameters.[7] A generic class declaration defines a set of parameterized types, one for each possible invocation of the type parameter section. All of these parameterized types share the same class at runtime.
  • An interface is generic if it declares one or more type variables.[7] It defines one or more type variables that act as parameters.[7] A generic interface declaration defines a set of types, one for each possible invocation of the type parameter section. All parameterized types share the same interface at runtime.
  • A method is generic if it declares one or more type variables.[8] These type variables are known as the formal type parameters of the method. The form of the formal type parameter list is identical to a type parameter list of a class or interface.
  • A constructor can be declared as generic, independently of whether the class that the constructor is declared in is itself generic. A constructor is generic if it declares one or more type variables. These type variables are known as the formal type parameters of the constructor. The form of the formal type parameter list is identical to a type parameter list of a generic class or interface.

Motivation

[edit]

The following block of Java code illustrates a problem that exists when not using generics. First, it declares an ArrayList of type Object. Then, it adds a String to the ArrayList. Finally, it attempts to retrieve the added String and cast it to an Integer—an error in logic, as it is not generally possible to cast an arbitrary string to an integer.

final List v = new ArrayList();
v.add("test"); // A String that cannot be cast to an Integer
final Integer i = (Integer) v.get(0); // Run time error

Although the code is compiled without error, it throws a runtime exception (java.lang.ClassCastException) when executing the third line of code. This type of logic error can be detected during compile time by using generics[7] and is the primary motivation for using them.[6] It defines one or more type variables that act as parameters.

The above code fragment can be rewritten using generics as follows:

final List<String> v = new ArrayList<String>();
v.add("test");
final Integer i = (Integer) v.get(0); // (type error)  compilation-time error

The type parameter String within the angle brackets declares the ArrayList to be constituted of String (a descendant of the ArrayList's generic Object constituents). With generics, it is no longer necessary to cast the third line to any particular type, because the result of v.get(0) is defined as String by the code generated by the compiler.

The logical flaw in the third line of this fragment will be detected as a compile-time error (with J2SE 5.0 or later) because the compiler will detect that v.get(0) returns String instead of Integer.[7] For a more elaborate example, see reference.[9]

Here is a small excerpt from the definition of the interfaces java.util.List and java.util.Iterator in package java.util:

interface List<E> {

    void add(E x);
    Iterator<E> iterator();

}

interface Iterator<E> {

    E next();
    boolean hasNext();

}

Generic class definitions

[edit]

Here is an example of a generic Java class, which can be used to represent individual entries (key to value mappings) in a map:

public class Entry<KeyType, ValueType> {
  
    private final KeyType key;
    private final ValueType value;

    public Entry(KeyType key, ValueType value) {  
        this.key = key;
        this.value = value;
    }

    public KeyType getKey() {
        return key;
    }

    public ValueType getValue() {
        return value;
    }

    public String toString() { 
        return "(" + key + ", " + value + ")";  
    }

}

This generic class could be used in the following ways, for example:

final Entry<String, String> grade = new Entry<String, String>("Mike", "A");
final Entry<String, Integer> mark = new Entry<String, Integer>("Mike", 100);
System.out.println("grade: " + grade);
System.out.println("mark: " + mark);

final Entry<Integer, Boolean> prime = new Entry<Integer, Boolean>(13, true);
if (prime.getValue()) {
    System.out.println(prime.getKey() + " is prime.");
}
else {
    System.out.println(prime.getKey() + " is not prime.");
}

It outputs:

grade: (Mike, A)
mark: (Mike, 100)
13 is prime.

Generic method definitions

[edit]

Here is an example of a generic method using the generic class above:

public static <Type> Entry<Type, Type> twice(Type value) {
    return new Entry<Type, Type>(value, value);
}

Note: If we remove the first <Type> in the above method, we will get compilation error (cannot find symbol "Type"), since it represents the declaration of the symbol.

In many cases, the user of the method need not indicate the type parameters, as they can be inferred:

final Entry<String, String> pair = Entry.twice("Hello");

The parameters can be explicitly added if needed:

final Entry<String, String> pair = Entry.<String>twice("Hello");

The use of primitive types is not allowed, and boxed versions must be used instead:

final Entry<int, int> pair; // Fails compilation. Use Integer instead.

There is also the possibility to create generic methods based on given parameters.

public <Type> Type[] toArray(Type... elements) {
    return elements;
}

In such cases you can't use primitive types either, e.g.:

Integer[] array = toArray(1, 2, 3, 4, 5, 6);

Diamond operator

[edit]

Thanks to type inference, Java SE 7 and above allow the programmer to substitute an empty pair of angle brackets (<>, called the diamond operator) for a pair of angle brackets containing the one or more type parameters that a sufficiently close context implies.[10] Thus, the above code example using Entry can be rewritten as:

final Entry<String, String> grade = new Entry<>("Mike", "A");
final Entry<String, Integer> mark = new Entry<>("Mike", 100);
System.out.println("grade: " + grade);
System.out.println("mark: " + mark);

final Entry<Integer, Boolean> prime = new Entry<>(13, true);
if (prime.getValue()) System.out.println(prime.getKey() + " is prime.");
else System.out.println(prime.getKey() + " is not prime.");

Type wildcards

[edit]

A type argument for a parameterized type is not limited to a concrete class or interface. Java allows the use of "type wildcards" to serve as type arguments for parameterized types. Wildcards are type arguments in the form "<?>"; optionally with an upper or lower bound. Given that the exact type represented by a wildcard is unknown, restrictions are placed on the type of methods that may be called on an object that uses parameterized types.

Here is an example where the element type of a Collection<E> is parameterized by a wildcard:

final Collection<?> c = new ArrayList<String>();
c.add(new Object()); // compile-time error
c.add(null); // allowed

Since we don't know what the element type of c stands for, we cannot add objects to it. The add() method takes arguments of type E, the element type of the Collection<E> generic interface. When the actual type argument is ?, it stands for some unknown type. Any method argument value we pass to the add() method would have to be a subtype of this unknown type. Since we don't know what type that is, we cannot pass anything in. The sole exception is null; which is a member of every type.[11]

To specify the upper bound of a type wildcard, the extends keyword is used to indicate that the type argument is a subtype of the bounding class.[12] So List<? extends Number> means that the given list contains objects of some unknown type which extends the Number class. For example, the list could be List<Float> or List<Number>. Reading an element from the list will return a Number. Adding null elements is, again, also allowed.[13]

The use of wildcards above adds flexibility[12] since there is not any inheritance relationship between any two parameterized types with concrete type as type argument. Neither List<Number> nor List<Integer> is a subtype of the other; even though Integer is a subtype of Number.[12] So, any method that takes List<Number> as a parameter does not accept an argument of List<Integer>. If it did, it would be possible to insert a Number that is not an Integer into it; which violates type safety. Here is an example that demonstrates how type safety would be violated if List<Integer> were a subtype of List<Number>:

final List<Integer> ints = new ArrayList<>();
ints.add(2);
final List<Number> nums = ints;  // valid if List<Integer> were a subtype of List<Number> according to substitution rule. 
nums.add(3.14);  
final Integer x = ints.get(1); // now 3.14 is assigned to an Integer variable!

The solution with wildcards works because it disallows operations that would violate type safety:

final List<? extends Number> nums = ints;  // OK
nums.add(3.14); // compile-time error
nums.add(null); // allowed

To specify the lower bounding class of a type wildcard, the super keyword is used. This keyword indicates that the type argument is a supertype of the bounding class. So, List<? super Number> could represent List<Number> or List<Object>. Reading from a list defined as List<? super Number> returns elements of type Object. Adding to such a list requires either elements of type Number, any subtype of Number or null (which is a member of every type).

The mnemonic PECS (Producer Extends, Consumer Super) from the book Effective Java by Joshua Bloch gives an easy way to remember when to use wildcards (corresponding to covariance and contravariance) in Java.[12]

Generics in throws clause

[edit]

Although exceptions themselves cannot be generic, generic parameters can appear in a throws clause:

public <T extends Throwable> void throwMeConditional(boolean conditional, T exception) throws T {
    if (conditional) {
        throw exception;
    }
}

Problems with type erasure

[edit]

Generics are checked at compile-time for type-correctness.[7] The generic type information is then removed in a process called type erasure.[6] For example, List<Integer> will be converted to the non-generic type List, which ordinarily contains arbitrary objects. The compile-time check guarantees that the resulting code uses the correct type.[7]

Because of type erasure, type parameters cannot be determined at run-time.[6] For example, when an ArrayList is examined at runtime, there is no general way to determine whether, before type erasure, it was an ArrayList<Integer> or an ArrayList<Float>. Many people are dissatisfied with this restriction.[14] There are partial approaches. For example, individual elements may be examined to determine the type they belong to; for example, if an ArrayList contains an Integer, that ArrayList may have been parameterized with Integer (however, it may have been parameterized with any parent of Integer, such as Number or Object).

Demonstrating this point, the following code outputs "Equal":

final List<Integer> li = new ArrayList<>();
final List<Float> lf = new ArrayList<>();
if (li.getClass() == lf.getClass()) { // evaluates to true
    System.out.println("Equal");
}

Another effect of type erasure is that a generic class cannot extend the Throwable class in any way, directly or indirectly:[15]

public class GenericException<T> extends Exception

The reason why this is not supported is due to type erasure:

try {
    throw new GenericException<Integer>();
}
catch (GenericException<Integer> e) {
    System.err.println("Integer");
}
catch (GenericException<String> e) {
    System.err.println("String");
}

Due to type erasure, the runtime will not know which catch block to execute, so this is prohibited by the compiler.

Java generics differ from C++ templates. Java generics generate only one compiled version of a generic class or function regardless of the number of parameterizing types used. Furthermore, the Java run-time environment does not need to know which parameterized type is used because the type information is validated at compile-time and is not included in the compiled code. Consequently, instantiating a Java class of a parameterized type is impossible because instantiation requires a call to a constructor, which is unavailable if the type is unknown.

For example, the following code cannot be compiled:

<T> T instantiateElementType(List<T> arg) {
     return new T(); //causes a compile error
}

Because there is only one copy per generic class at runtime, static variables are shared among all the instances of the class, regardless of their type parameter. Consequently, the type parameter cannot be used in the declaration of static variables or in static methods.

Type erasure was implemented in Java to maintain backward compatibility with programs written prior to Java SE5.[7]

Differences from Arrays

[edit]

There are several important differences between arrays (both primitive arrays and Object arrays), and generics in Java. Two of the major differences, namely, differences in terms of variance and reification.

Covariance, contravariance and invariance

[edit]

Generics are invariant, whereas arrays are covariant.[6] This is a benefit of using generic when compared to non-generic objects such as arrays.[6] Specifically, generics can help prevent run time exceptions by throwing a compile-time exception to force the developer to fix the code.

For example, if a developer declares an Object[] object and instantiates the object as a new Long[] object, no compile-time exception is thrown (since arrays are covariant).[6] This may give the false impression that the code is correctly written. However, if the developer attempts to add a String to this Long[] object, the program will throw an ArrayStoreException.[6] This run-time exception can be completely avoided if the developer uses generics.

If the developer declares a Collection<Object> object an creates a new instance of this object with return type ArrayList<Long>, the Java compiler will (correctly) throw a compile-time exception to indicate the presence of incompatible types (since generics are invariant).[6] Hence, this avoids potential run-time exceptions. This problem can be fixed by creating an instance of Collection<Object> using ArrayList<Object> object instead. For code using Java SE7 or later versions, the Collection<Object> can be instantiated with an ArrayList<> object using the diamond operator

Reification

[edit]

Arrays are reified, meaning that an array object enforces its type information at run-time, whereas generics in Java are not reified.[6]

More formally speaking, objects with generic type in Java are non-reifiable types.[6] A non-reifiable type is type whose representation at run-time has less information than its representation at compile-time.[6]

Objects with generic type in Java are non-reifiable due to type erasure.[6] Java only enforces type information at compile-time. After the type information is verified at compile-time, the type information is discarded, and at run-time, the type information will not be available.[6]

Examples of non-reifiable types include List<T> and List<String>, where T is a generic formal parameter. [6]

Project on generics

[edit]

Project Valhalla is an experimental project to incubate improved Java generics and language features, for future versions potentially from Java 10 onwards. Potential enhancements include:[16]

See also

[edit]

Citations

[edit]
  1. ^ Java Programming Language
  2. ^ A ClassCastException can be thrown even in the absence of casts or nulls."Java and Scala's Type Systems are Unsound" (PDF).
  3. ^ Bloch 2018, pp. 123–125, Chapter §5 Item 27: Eliminate unchecked warnings.
  4. ^ GJ: Generic Java
  5. ^ Java Language Specification, Third Edition by James Gosling, Bill Joy, Guy Steele, Gilad Bracha – Prentice Hall PTR 2005
  6. ^ a b c d e f g h i j k l m n o Bloch 2018, pp. 126–129, Chapter §5 Item 28: Prefer lists to arrays.
  7. ^ a b c d e f g h Bloch 2018, pp. 117–122, Chapter §5 Item 26: Don't use raw types.
  8. ^ Bloch 2018, pp. 135–138, Chapter §5 Item 30: Favor generic methods.
  9. ^ Gilad Bracha (July 5, 2004). "Generics in the Java Programming Language" (PDF). www.oracle.com.
  10. ^ "Type Inference for Generic Instance Creation".
  11. ^ Gilad Bracha (July 5, 2004). "Generics in the Java Programming Language" (PDF). www.oracle.com. p. 5.
  12. ^ a b c d Bloch 2018, pp. 139–145, Chapter §5 Item 31: Use bounded wildcards to increase API flexibility.
  13. ^ Bracha, Gilad. "Wildcards > Bonus > Generics". The Java™ Tutorials. Oracle. ...The sole exception is null, which is a member of every type...
  14. ^ Gafter, Neal (2006-11-05). "Reified Generics for Java". Retrieved 2010-04-20.
  15. ^ "Java Language Specification, Section 8.1.2". Oracle. Retrieved 24 October 2015.
  16. ^ Goetz, Brian. "Welcome to Valhalla!". OpenJDK mail archive. OpenJDK. Retrieved 12 August 2014.

References

[edit]
  • Bloch, Joshua (2018). "Effective Java: Programming Language Guide" (third ed.). Addison-Wesley. ISBN 978-0134685991.