Introduction To Java Threads: Section 1. About This Tutorial
Introduction To Java Threads: Section 1. About This Tutorial
Brian Goetz (brian@quiotix.com) Consultant Quiotix Skill Level: Introductory Date: 26 Sep 2002
The Java thread facility and API is deceptively simple; however, writing complex programs that use threading effectively is not. This tutorial explores threading basics: what threads are, why they are useful, and how to get started writing simple programs that use them. You will also learn about exchanging data between threads, controlling threads, and how threads can communicate with each other.
developerWorks
ibm.com/developerWorks/
ibm.com/developerWorks/
developerWorks
Some of the reasons for using threads are that they can help to: Make the UI more responsive Take advantage of multiprocessor systems Simplify modeling Perform asynchronous or background processing
More responsive UI
Event-driven UI toolkits, such as AWT and Swing, have an event thread that processes UI events such as keystrokes and mouse clicks. AWT and Swing programs attach event listeners to UI objects. These listeners are notified when a specific event occurs, such as a button being clicked. Event listeners are called from within the AWT event thread. If an event listener were to perform a lengthy task, such as checking spelling in a large document, the event thread would be busy running the spelling checker, and thus would not be able to process additional UI events until the event listener completed. This would make the program appear to freeze, which is disconcerting to the user. To avoid stalling the UI, the event listener should hand off long tasks to another thread so that the AWT thread can continue processing UI events (including requests to cancel the long-running task being performed) while the task is in progress.
Simplicity of modeling
In some cases, using threads can make your programs simpler to write and maintain. Consider a simulation application, where you simulate the interaction between
Introduction to Java threads Page 3 of 27
developerWorks
ibm.com/developerWorks/
multiple entities. Giving each entity its own thread can greatly simplify many simulation and modeling applications. Another example where it is convenient to use separate threads to simplify a program is when an application has multiple independent event-driven components. for example, an application might have a component that counts down the number of seconds since some event and updates a display on the screen. Rather than having a main loop check the time periodically and update the display, it is much simpler -and less error-prone -- to have a thread that does nothing but sleep until a certain amount of time has elapsed and then update the on-screen counter. This way the main thread doesn't need to worry about the timer at all.
ibm.com/developerWorks/
developerWorks
If you are going to use synchronization to protect access to shared variables, you must make sure to use it everywhere in your program where the variable is accessed.
Don't overdo it
While threads can greatly simplify many types of applications, overuse of threads can be hazardous to your program's performance and its maintainability. Threads consume resources. Therefore, there is a limit on how many threads you can create without degrading performance. In particular, using multiple threads will not make a CPU-bound program run any faster on a single-processor system.
Page 5 of 27
developerWorks
} } public static void main(String[] args) { CalculatePrimes calculator = new CalculatePrimes(); calculator.start(); try { Thread.sleep(TEN_SECONDS); } catch (InterruptedException e) { // fall through } calculator.finished = true; } }
ibm.com/developerWorks/
When you use multiple threads, you must be careful to follow the rules for sharing data between threads, which we'll cover in Sharing access to data . All these rules boil down to one basic principle: Don't forget to synchronize.
ibm.com/developerWorks/
developerWorks
Ending threads
A thread will end in one of three ways: The thread comes to the end of its run() method. The thread throws an Exception or Error that is not caught. Another thread calls one of the deprecated stop() methods. Deprecated means they still exist, but you shouldn't use them in new code and should strive to eliminate them in existing code. When all the threads within a Java program complete, the program exits.
Scheduling
Except when using Thread.join() and Object.wait(), the timing of thread scheduling and execution is nondeterministic. If two threads are running at the same time and neither is waiting, you must assume that between any two instructions, other threads may be running and modifying program variables. If your thread will be accessing data that may be visible to other threads, such as data referenced directly or indirectly from static fields (global variables), you must use synchronization to ensure data consistency. In the simple example below, we'll create and start two threads, each of which prints two lines to System.out:
Introduction to Java threads Page 7 of 27
developerWorks
public class TwoThreads { public static class Thread1 extends Thread { public void run() { System.out.println("A"); System.out.println("B"); } } public static class Thread2 extends Thread { public void run() { System.out.println("1"); System.out.println("2"); } } public static void main(String[] args) { new Thread1().start(); new Thread2().start(); } }
ibm.com/developerWorks/
We have no idea in what order the lines will execute, except that "1" will be printed before "2" and "A" before "B." The output could be any one of the following: 12AB 1A2B 1AB2 A12B A1B2 AB12
Not only may the results vary from machine to machine, but running the same program multiple times on the same machine may produce different results. Never assume one thread will do something before another thread does, unless you've used synchronization to force a specific ordering of execution.
Sleeping
The Thread API includes a sleep() method, which will cause the current thread to go into a wait state until the specified amount of time has elapsed or until the thread is interrupted by another thread calling Thread.interrupt() on the current thread's Thread object. When the specified time elapses, the thread again becomes runnable and goes back onto the scheduler's queue of runnable threads. If a thread is interrupted by a call to Thread.interrupt(), the sleeping thread will throw an InterruptedException so that the thread will know that it was awakened by an interrupt and won't have to check to see if the timer expired. The Thread.yield() method is like Thread.sleep(), but instead of sleeping, it simply pauses the current thread momentarily so that other threads can run. In most implementations, threads with lower priority will not run when a thread of higher priority calls Thread.yield().
Introduction to Java threads Page 8 of 27
ibm.com/developerWorks/
developerWorks
The CalculatePrimes example used a background thread to calculate primes, then slept for ten seconds. When the timer expired, it set a flag to indicate that the ten seconds had expired.
Daemon threads
We mentioned that a Java program exits when all of its threads have completed, but this is not exactly correct. What about the hidden system threads, such as the garbage collection thread and others created by the JVM? We have no way of stopping these. If those threads are running, how does any Java program ever exit? These system threads are called daemon threads. A Java program actually exits when all its non-daemon threads have completed. Any thread can become a daemon thread. You can indicate a thread is a daemon thread by calling the Thread.setDaemon() method. You might want to use daemon threads for background threads that you create in your programs, such as timer threads or other deferred event threads, which are only useful while there are other non-daemon threads running.
Page 9 of 27
developerWorks
for (int i=0; i < 10; i++) { threads[i] = new WorkerThread(bigMatrix[i]); threads[i].start(); } // Wait for each thread to finish try { for (int i=0; i < 10; i++) { threads[i].join(); max = Math.max(max, threads[i].getMax()); } } catch (InterruptedException e) { // fall through } System.out.println("Maximum value was " + max); } }
ibm.com/developerWorks/
ibm.com/developerWorks/
developerWorks
Not only do you have to worry about synchronizing access to data items shared between event listeners and other threads, but you have to find a way for longrunning tasks triggered by event listeners -- such as checking spelling in a large document or searching a file system for a file -- to run in a background thread so the UI doesn't freeze while the task is running (which would also prevent the user from canceling the operation). A good example of a framework for doing this is the SwingWorker class (see Resources). The AWT event thread is not a daemon thread; this is why System.exit() is often used to end AWT and Swing apps.
Using TimerTask
The TimerTask facility was introduced to the Java language in JDK 1.3. This convenient facility allows you to execute a task at a later time (that is, for example, run a task once ten seconds from now), or to execute a task periodically (that is, run a task every ten seconds). Implementing the Timer class is quite straightforward: it creates a timer thread and builds a queue of waiting events sorted by execution time. The TimerTask thread is marked as a daemon thread so it doesn't prevent the program from exiting. Because timer events execute in the timer thread, you must make sure that access to any data items used from within a timer task is properly synchronized. In the CalculatePrimes example, instead of having the main thread sleep, we could have used a TimerTask as follows:
public static void main(String[] args) { Timer timer = new Timer(); final CalculatePrimes calculator = new CalculatePrimes(); calculator.start(); timer.schedule( new TimerTask() { public void run() { calculator.finished = true; } }, TEN_SECONDS); }
developerWorks
ibm.com/developerWorks/
When writing servlets or JavaServer Pages (JSP) files, you must assume at all times that the same servlet or JSP file may be executing concurrently in multiple threads. Any shared data accessed by a servlet or JSP file must be appropriately synchronized; this includes fields of the servlet object itself.
ibm.com/developerWorks/
developerWorks
one thread to another and to prevent threads from seeing inconsistent intermediate results while another thread is updating several related data items. The example that calculated prime numbers in Thread basics used a shared boolean variable to indicate that the specified time period had elapsed. This illustrates the simplest form of sharing data between threads: polling a shared variable to see if another thread has finished performing a certain task.
Without synchronization, it is easy for data to be left in an inconsistent state. For example, if one thread is updating two related values (say, the position and velocity of a particle), and another thread is reading those two values, it is possible that the second thread could be scheduled to run after the first thread has written one value but not the other, thus seeing one old and one new value. Synchronization allows us to define blocks of code that must run atomically, in which they appear to execute in an all-or-nothing manner, as far as other threads can tell. The atomic execution or mutual exclusion aspect of synchronization is similar to the concept of critical sections in other operating environments.
developerWorks
ibm.com/developerWorks/
Processors can use caches to speed up access to memory (or compilers may store values in registers for faster access). On some multiprocessor architectures, if a memory location is modified in the cache on one processor, it is not necessarily visible to other processors until the writer's cache is flushed and the reader's cache is invalidated. This means that on such systems, it is possible for two threads executing on two different processors to see two different values for the same variable! This sounds scary, but it is normal. It just means that you have to follow some rules when accessing data used or modified by other threads. is simpler than synchronization and is suitable only for controlling access to single instances of primitive variables -- integers, booleans, and so on. When a variable is declared volatile, any write to that variable will go directly to main memory, bypassing the cache, while any read of that variable will come directly from main memory, bypassing the cache. This means that all threads see the same value for a volatile variable at all times.
Volatile
Without proper synchronization, it is possible for threads to see stale values of variables or experience other forms of data corruption.
Synchronization uses the concepts of monitors, or locks, to coordinate access to particular blocks of code. Every Java object has an associated lock. Java locks can be held by no more than one thread at a time. When a thread enters a synchronized block of code, the thread blocks and waits until the lock is available, acquires the lock when it becomes available, and then executes the block of code. It releases the lock when control exits the protected block of code, either by reaching the end of the block or when an exception is thrown that is not caught within the synchronized block. In this way, only one thread can execute a block protected by a given monitor at one time. The block can be considered atomic because, from the perspective of other threads, it appears to either have executed entirely or not at all.
ibm.com/developerWorks/
developerWorks
public class SyncExample { private static Object lockObject = new Object(); private static class Thread1 extends Thread { public void run() { synchronized (lockObject) { x = y = 0; System.out.println(x); } } } private static class Thread2 extends Thread { public void run() { synchronized (lockObject) { x = y = 1; System.out.println(y); } } } public static void main(String[] args) { new Thread1().start(); new Thread2().start(); } }
You must use synchronization in both threads for this program to work properly.
Java locking
Java locking incorporates a form of mutual exclusion. Only one thread may hold a lock at one time. Locks are used to protect blocks of code or entire methods, but it is important to remember that it is the identity of the lock that protects a block of code, not the block itself. One lock may protect many blocks of code or methods. Conversely, just because a block of code is protected by a lock does not mean that two threads cannot execute that block at once. It only means that two threads cannot execute that block at once if they are waiting on the same lock. In the following example, the two threads are free to execute the synchronized block in setLastAccess() simultaneously because each thread has a different value for thingie. Therefore, the synchronized block is protected by different locks in the two executing threads.
public class SyncExample { public static class Thingie { private Date lastAccess; public synchronized void setLastAccess(Date date) { this.lastAccess = date; } } public static class MyThread extends Thread { private Thingie thingie; public MyThread(Thingie thingie) { this.thingie = thingie;
Page 15 of 27
developerWorks
} public void run() { thingie.setLastAccess(new Date()); } } public static void main() { Thingie thingie1 = new Thingie(), thingie2 = new Thingie(); new MyThread(thingie1).start(); new MyThread(thingie2).start(); } }
ibm.com/developerWorks/
Synchronized methods
The simplest way to create a synchronized block is to declare a method as synchronized. This means that before entering the method body, the caller must acquire a lock:
public class Point { public synchronized void setXY(int x, int y) { this.x = x; this.y = y; } }
For ordinary synchronized methods, this lock will be the object on which the method is being invoked. For static synchronized methods, this lock will be the monitor associated with the Class object in which the method is declared. Just because setXY() is declared as synchronized doesn't mean that two different threads can't still execute setXY() at the same time, as long as they are invoking setXY() on different Point instances. Only one thread can execute setXY(), or any other synchronized method of Point, on a single Point instance at one time.
Synchronized blocks
The syntax for synchronized blocks is a little more complicated than for synchronized methods because you also need to explicitly specify what lock is being protected by the block. The following version of Point is equivalent to the version shown above:
public class Point { public void setXY(int x, int y) { synchronized (this) { this.x = x; this.y = y; } } }
It is common, but not required, to use the this reference as the lock. This means that the block will use the same lock as synchronized methods in that class.
Introduction to Java threads Page 16 of 27
ibm.com/developerWorks/
developerWorks
Because synchronization prevents multiple threads from executing a block at once, it has performance implications, even on uniprocessor systems. It is a good practice to use synchronization around the smallest possible block of code that needs to be protected. Access to local (stack-based) variables never need to be protected, because they are only accessible from the owning thread.
Page 17 of 27
developerWorks
synchronized (cache) { cache.clear(); } } public Object getObject(String objectName) { synchronized (cache) { Object o = cache.get(objectName); if (o == null) { o = load(objectName); cache.put(objectName, o); } } return o; } }
ibm.com/developerWorks/
Visibility
In addition to mutual exclusion, synchronization, like volatile, enforces certain visibility constraints. When an object acquires a lock, it first invalidates its cache, so that it is guaranteed to load variables directly from main memory.
Introduction to Java threads Page 18 of 27
ibm.com/developerWorks/
developerWorks
Similarly, before an object releases a lock, it flushes its cache, forcing any changes made to appear in main memory. In this way, two threads that synchronize on the same lock are guaranteed to see the same values in variables modified inside a synchronized block.
What happens if more than one thread tries to use this class at the same time? It could be a disaster. Because there's no synchronization, multiple threads could execute push() and pop() at the same time. What if one thread calls push() and another thread calls push() right between the time top is incremented and it is used as an index into values? Then both threads would store their new value in the same location! This is just one of many forms of data corruption that can occur when multiple threads rely on a known relationship between data values, but don't ensure that only one thread is manipulating those values at a given time.
Introduction to Java threads Page 19 of 27
developerWorks
ibm.com/developerWorks/
In this case, the cure is simple: synchronize both push() and pop(), and you'll prevent one thread from stepping on another. Note that using volatile would not have been enough -- you need to use synchronized to ensure that the relationship between top and values remains consistent.
What happens when we want to increment the counter? Look at the code for increment(). It is clear, but it is not thread-safe. What happens if two threads try to execute increment() at the same time? The counter might be incremented by 1 or by 2. Surprisingly, marking counter as volatile won't help, nor will making both get() and set() synchronized. Imagine that the counter is zero and two threads execute the increment code at exactly the same time. Both threads call Counter.get() and see that the counter is zero. Now both add one to this and then call Counter.set(). If our timing is unlucky, neither will see the other's update, even if counter is volatile or get() and set() are synchronized. Now, even though we've incremented the counter twice, the resulting value may only be one instead of two. For increment to work properly, not only do get() and set() have to be synchronized, but increment() needs to be synchronized, too! Otherwise, a thread calling increment() could interrupt another thread calling increment(). If you're unlucky, the end result would be that the counter is incremented once instead of twice. Synchronizing increment() prevents this from happening, because the entire increment operation is atomic. The same is true when you are iterating through the elements of a Vector. Even though the methods of Vector are synchronized, the contents of Vector could still change while you are iterating. If you want to ensure that the contents of Vector don't change while you're iterating through it, you have to wrap the entire block with synchronization on it.
Introduction to Java threads Page 20 of 27
ibm.com/developerWorks/
developerWorks
Deadlock
Whenever you have multiple processes contending for exclusive access to multiple locks, there is the possibility of deadlock. A set of processes or threads is said to be deadlocked when each is waiting for an action that only one of the others can perform. The most common form of deadlock is when Thread 1 holds a lock on Object A and is waiting for the lock on Object B, and Thread 2 holds the lock on Object B and is waiting for the lock on Object A. Neither thread will ever acquire the second lock or relinquish the first lock. They will simply wait forever. To avoid deadlock, you should ensure that when you acquire multiple locks, you always acquire the locks in the same order in all threads.
Performance considerations
There has been a lot written -- much of it wrong -- on the performance costs of synchronization. It is true that synchronization, especially contended synchronization, has performance implications, but these may not be as large as is widely suspected.
Introduction to Java threads Page 21 of 27
developerWorks
ibm.com/developerWorks/
Many people have gotten themselves in trouble by using fancy but ineffective tricks to try to avoid having to synchronize. One classic example is the double-checked locking pattern (see Resources for several articles on what's wrong with it). This harmless-looking construct purported to avoid synchronization on a common code path, but was subtly broken, and all attempts to fix it were also broken. When writing concurrent code, don't worry so much about performance until you've actually seen evidence of performance problems. Bottlenecks appear in the places we often least suspect. Speculatively optimizing one code path that may not even turn out to be a performance problem -- at the cost of program correctness -- is a false economy.
wakes it up
When notify() is invoked on an object, if there are any threads waiting on that object via wait(), then one thread will be awakened. When notifyAll() is invoked on an object, all threads waiting on that object will be awakened.
Introduction to Java threads Page 22 of 27
ibm.com/developerWorks/
developerWorks
These methods are the building blocks of more sophisticated locking, queuing, and concurrency code. However, the use of notify() and notifyAll() is complicated. In particular, using notify() instead of notifyAll() is risky. Use notifyAll() unless you really know what you're doing. Rather than use wait() and notify() to write your own schedulers, thread pools, queues, and locks, you should use the util.concurrent package (see Resources), a widely used open source toolkit full of useful concurrency utilities. JDK 1.5 will include the java.util.concurrent package; many of its classes are derived from util.concurrent.
Thread priorities
The Thread API allows you to associate an execution priority with each thread. However, how these are mapped to the underlying operating system scheduler is implementation-dependent. In some implementations, multiple -- or even all -priorities may be mapped to the same underlying operating system priority. Many people are tempted to tinker with thread priorities when they encounter a problem like deadlock, starvation, or other undesired scheduling characteristics. More often than not, however, this just moves the problem somewhere else. Most programs should simply avoid changing thread priority.
Thread groups
The ThreadGroup class was originally intended to be useful in structuring collections of threads into groups. However, it turns out that ThreadGroup is not all that useful. You are better off simply using the equivalent methods in Thread.
ThreadGroup does offer one useful feature not (yet) present in Thread: the uncaughtException() method. When a thread within a thread group exits because it threw an uncaught exception, the ThreadGroup.uncaughtException() method is
called. This gives you an opportunity to shut down the system, write a message to a log file, or restart a failed service.
SwingUtilities
Although it is not part of the Thread API, the SwingUtilities class deserves a brief mention. As mentioned earlier, Swing applications have a single UI thread (sometimes called the event thread) in which all UI activity must occur. Sometimes another thread may want to update the appearance of something on the screen or fire an event on a Swing object. The SwingUtilities.invokeLater() method allows you to pass a Runnable object to it, and the specified Runnable will be executed in the event thread. Its cousin,
Introduction to Java threads Page 23 of 27
developerWorks
invokeAndWait(), will invoke Runnable in the block until Runnable is finished executing.
void showHelloThereDialog() throws Exception { Runnable showModalDialog = new Runnable() { public void run() { JOptionPane.showMessageDialog(myMainFrame, "Hello There"); } }; SwingUtilities.invokeLater(showModalDialog); }
Section 8. Wrapup
Summary
Every Java program uses threads, whether you know it or not. If you are using either of the Java UI toolkits (AWT or Swing), Java Servlets, RMI, or JavaServer Pages or Enterprise JavaBeans technologies, you may be using threads without realizing it. There are a number of situations where you might want to explicitly use threads to improve the performance, responsiveness, or organization of your programs. These include: Making the user interface more responsive when performing long tasks Exploiting multiprocessor systems to handle multiple tasks in parallel Simplifying modeling of simulations or agent-based systems Performing asynchronous or background processing
While the thread API is simple, writing thread-safe programs is not. When variables are shared across threads, you must take great care to ensure that you have properly synchronized both read and write access to them. When writing a variable that may next be read by another thread, or reading a variable that may have been written by another thread, you must use synchronization to ensure that changes to data are visible across threads. When using synchronization to protect shared variables, you must ensure that not only are you using synchronization, but the reader and writer are synchronizing on the same monitor. Furthermore, if you rely on an object's state remaining the same across multiple operations, or rely on multiple variables staying consistent with each other (or consistent with their own past values), you must use synchronization to enforce this. But simply synchronizing every method in a class does not make it thread safe -- it just makes it more prone to deadlock.
Introduction to Java threads Page 24 of 27
ibm.com/developerWorks/
developerWorks
Resources
Explore Doug Lea's util.concurrent package (http://gee.cs.oswego.edu/dl/ classes/EDU/oswego/cs/dl/util/concurrent/intro.html), which contains a wealth of useful classes for building efficient concurrent applications. "Synchronization and the Java Memory Model " (http://gee.cs.oswego.edu/dl/ cpj/jmm.html) is an excerpt from Doug Lea's book that focuses on the actual meaning of synchronized. In his article "Writing multithreading Java applications " (developerWorks, February 2001, http://www.ibm.com/developerworks/library/j-thread.html), Alex Roetter outlines issues involved in Java multithreading and offers solutions to common problems. "Threading lightly, Part 1: Synchronization is not the enemy " by Brian Goetz (developerWorks, July 2001, http://www.ibm.com/developerworks/java/library/ j-threads1.html) explores how to manage the performance of concurrent applications. "Achieve strong performance with threads " by Jeff Friesen (JavaWorld, May 2002, http://www.javaworld.com/javaworld/jw-05-2002/jw-0503-java101.html) is a four-part tutorial on using threads. "Double-checked locking: Clever, but broken " (JavaWorld, February 2001, http://www.javaworld.com/jw-02-2001/jw-0209-double.html), explores the Java Memory Model in detail and the surprising consequences of failing to synchronize in certain situations. Thread safety is tricky stuff. "Java theory and practice: Safe construction techniques " (developerWorks, June 2002, http://www.ibm.com/developerworks/ library/j-jtp0618.html) offers some tips for safely constructing objects. In "Threads and Swing " (http://java.sun.com/products/jfc/tsc/articles/threads/ threads1.html), the Sun folks explore the rules of safe Swinging and introduce the useful SwingWorker class. Doug Lea's Concurrent Programming in Java, Second Edition (AddisonWesley, 1999, http://www.amazon.com/exec/obidos/ASIN/0201310090/ none0b69) is a masterful book on the subtle issues surrounding multithreaded programming in Java applications. Paul Hyde's Java Thread Programming (http://www.amazon.com/exec/obidos/ ASIN/0672315858/none0b69) is a nice tutorial and reference for many realworld multithreading issues. Allen Holub's book Taming Java Threads (http://www.amazon.com/exec/ obidos/ASIN/1893115100/none0b69) is an enjoyable introduction to the challenges of Java thread programming. The util.concurrent package is being formalized under Java Community Process JSR 166 (http://www.jcp.org/jsr/detail/166.jsp) for inclusion in the 1.5 release of the JDK. The Foxtrot project (http://foxtrot.sourceforge.net/) is an alternative -- and possibly simpler -- approach to threading in Swing applications.
Introduction to Java threads Page 25 of 27
developerWorks
ibm.com/developerWorks/
You'll find hundreds of articles about every aspect of Java programming in the developerWorks Java technology zone (http://www.ibm.com/developerworks/ java/). See the developerWorks Java technology tutorials page (http://www.ibm.com/ developerworks/views/java/libraryview.jsp?type_by=Tutorials) for a complete listing of free tutorials from developerWorks.
Page 26 of 27
ibm.com/developerWorks/
developerWorks
Page 27 of 27