High Speed Java

This document discusses techniques for improving the performance of Java programs. It begins by describing the key components of the Java Virtual Machine (JVM) and different techniques for executing Java code, including interpreters, just-in-time compilers, and ahead-of-time compilers. It then focuses on specific before-compilation and after-compilation optimization techniques like object inlining, method inlining, escape analysis, bound-check elimination, and loop-invariant code motion that can improve Java program speed. The goal is to analyze how to develop high performance Java programs that approach the speed of C programs.


Research & Whitepaper Volume 1.

High Speed Java

Java at the speed of C – an analysis of High
Speed Java programming environment
Author: Bikramjit Debnath

November 06
High Speed Java 2006

Contents
Alternative execution techniques
After-Compilation (AC) & Before-Compilation (BC) Techniques
Object Inlining
Method Inlining
Escape Analysis
Bound-check elimination/deactivation
Closed-world Assumption
Common-subexpression elimination
Loop-invariant code motion
Minimal synchronization, RMI and serialization
Conclusion


1
Deep down the JVM
The Java Virtual Machine (JVM) is one of the most neglected subjects during
the estimation and development of high-performance Java applications, yet it
provides the vital interface between Java code and the target platform.
The following section explains the crucial JVM internals in an
easy-to-understand way.

The JVM executes Java byte-code. Implementing the JVM on many different platforms is what makes Java
portable. It is called virtual because it is implemented in software rather than in hardware. The JVM
interprets a stream of byte-code as a sequence of instructions for execution. The JVM is stack-based, and
each instruction consists of a one-byte opcode followed by zero or more operands. The JVM instruction set
defines more than 200 standard opcodes, around 25 quick variations of some opcodes, and three reserved
opcodes. Operands provide the additional information needed to execute the action.

The JVM is divided into five basic components, as shown:


As shown in the above figure, registers, a stack, a garbage-collected heap, a method area, and an execution
engine are implemented in every JVM. The byte-code is stored in the method area. The program counter
points to the next byte to be executed in the method area. All operations in the JVM occur through the
stack. Data is pushed onto the stack from the constant pool in the method area and from the local-variable
section of the stack. Each stack frame has three sections. The first is the local-variables section, which
contains all the local variables for the current method invocation; the vars register points to this
section. The second is the execution environment, which maintains the stack operations; the frame register
points to this section. The final section is the operand stack, which is used for storing parameters and
temporary data for expression evaluation; the optop register points to this section.
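
To make the stack-based model more concrete, the following is a minimal sketch of a toy interpreter. It is not the real JVM instruction set: the opcode names and values (PUSH, LOAD, STORE, ADD, MUL, HALT) and the class name ToyStackMachine are assumptions made purely for illustration. It shows a program counter stepping through one-byte opcodes with optional operands, a local-variable array, and an operand stack.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Toy stack machine: NOT the real JVM instruction set, only an illustration.
class ToyStackMachine {
    // Hypothetical one-byte opcodes; some are followed by one operand byte.
    static final byte PUSH = 1;   // PUSH <constant>
    static final byte LOAD = 2;   // LOAD <local index>
    static final byte STORE = 3;  // STORE <local index>
    static final byte ADD = 4;
    static final byte MUL = 5;
    static final byte HALT = 6;

    // Executes the code and returns the local-variable array afterwards.
    static int[] run(byte[] code, int[] locals) {
        Deque<Integer> operandStack = new ArrayDeque<>();
        int pc = 0; // program counter: index of the next byte to execute
        while (true) {
            byte opcode = code[pc++];
            switch (opcode) {
                case PUSH:  operandStack.push((int) code[pc++]); break;
                case LOAD:  operandStack.push(locals[code[pc++]]); break;
                case STORE: locals[code[pc++]] = operandStack.pop(); break;
                case ADD:   operandStack.push(operandStack.pop() + operandStack.pop()); break;
                case MUL:   operandStack.push(operandStack.pop() * operandStack.pop()); break;
                case HALT:  return locals;
                default:    throw new IllegalStateException("unknown opcode " + opcode);
            }
        }
    }

    public static void main(String[] args) {
        // Computes locals[2] = (locals[0] + locals[1]) * 3
        byte[] code = { LOAD, 0, LOAD, 1, ADD, PUSH, 3, MUL, STORE, 2, HALT };
        int[] locals = run(code, new int[] { 4, 5, 0 });
        System.out.println(locals[2]); // prints 27
    }
}
```

Every intermediate value passes through the operand stack, which is why mapping such code onto a register-based CPU takes extra work.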

Memory is allocated dynamically from the garbage-collected heap when an executing program uses the new
operator. The JVM specification does not allow the user to free allocated memory explicitly. Instead, the
garbage-collection process monitors memory and returns unused objects to the memory pool. A
large variety of garbage-collection algorithms have been developed, including reference counting,
mark-sweep, mark-compact, copying, and non-copying implicit collection.

The core of the JVM is the execution engine. This is a virtual processor that interacts with the method
area to execute byte-code. Various implementations of the execution engine are described in the following
section.

Alternative execution techniques

A disadvantage of byte-code compilation is the computational model of the byte-code, which implements a
stack machine. Mapping this (virtual) stack machine onto existing CPUs, which are register-based, is harder
than directly generating register-oriented code. Until recently, many high-performance codes have been
implemented in the C language, mainly for reasons of efficiency. From a software-development point of
view, it is desirable to use Java instead of C, because features like threads, objects, exceptions,
runtime type checks, and array-bounds checks ease programming and debugging.
Unfortunately, several of these features introduce performance overheads, making it difficult to obtain the
same sequential speed for Java as for C. Hence, as Java has come into widespread use, the need for a more
efficient execution mode has become apparent.

Therefore, apart from traditional interpretation, a variety of execution techniques have been proposed to
reduce the execution time of Java programs. The following figure shows these alternatives alongside the
execution of programs written in typical languages like C:


The alternatives for executing Java code as shown above are:

• Java Interpreters
• Java Compilers
• Java Compiler Optimizations
• Hot-spot JVM (Dynamic compiler)
• Just-In-Time (JIT) Compilers
• Direct Compilers
• Byte-Code to Source Translators
• Java Processors

These implementations vary greatly in performance when tested on CPU- and memory-hungry
problems such as the Iterative Deepening A* algorithm (15-puzzle), the Traveling Salesperson Problem (TSP),
and Successive Over-Relaxation (SOR). The results show that choosing the most suitable JVM for a specific
kind of application (distributed, embedded, etc.) can bring its speed close to that of a C version. For
example, the Manta system (http://www.cs.vu.nl/manta) is such a fast native Java compiler.

Note

A detailed discussion of the above techniques is outside the scope of this document.


2
High Speed Java – Tips and Techniques
The last section identified the key areas of the JVM involved in executing Java
code. High performance in Java code execution can be achieved mainly in two
ways: (1) After-compilation (AC) techniques and (2) Before-compilation (BC)
techniques. AC techniques mainly involve selecting a suitable JVM
implementation rather than depending on whatever interpreter is available
for high-performance application development. The following section explores
some BC techniques to improve performance.

After-Compilation (AC) & Before-Compilation (BC) Techniques

Optimizations during the development phase (BC) and the execution phase (AC) are crucial for high-speed
execution of Java code. The common optimization techniques are:

• Object Inlining (AC)
• Method Inlining (AC)
• Escape Analysis (AC)
• Bound-check elimination/deactivation (AC)
• Closed-world Assumption (BC)
• Common-subexpression elimination (BC)
• Loop-invariant code motion (BC)
• Minimal serialization, RMI and synchronization (BC)

A basic understanding of the AC techniques helps in selecting a JVM version that supports those
optimization features.


Object Inlining
Java's object model leads to many small objects with references to other objects. Performance is improved
by reducing the overheads of object creation, garbage collection, and pointer dereferencing. The following
code fragment shows an example of object inlining: when the compiler can derive that the array "a" is never
reassigned in objects of the original class A, the array can be statically inlined into objects of class A.

Original class A with separate array object:

class A{
    int[] a = new int[10];
}

Class A with inlined array (conceptual only, not valid Java):

class A{
    int a[10];
}

Note that this optimization cannot be implemented manually, because Java lacks a corresponding
syntactic construct for arrays.

Method Inlining
In C-like languages, function inlining is a well-known optimization for avoiding the cost of function
invocation. Object-oriented programs commonly have many small methods, so method inlining is
desirable for Java. However, in the presence of polymorphism and dynamic class loading, only methods
that cannot be overridden (those declared static, private, or final) can be safely inlined. For all other
methods, subclasses may override the implementation of a superclass. These program semantics, which are
at the core of object-oriented programming, prevent efficient method inlining. In the following example,
the method inc would be an ideal candidate for inlining because of its small size. However, it can only be
inlined if the compiler can safely derive that no subclass of A replaces the implementation of inc.

class A{
int a;
void inc(){ a++; }
void other(){ inc(); }
}
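
The hazard can be sketched with a subclass that overrides the small method. The class names Counter, DoubleCounter, and InliningDemo are hypothetical stand-ins for class A and are not part of the original example. If a compiler had inlined the body of Counter.inc into other, the second call would wrongly yield 1.

```java
// Hypothetical classes illustrating why inc() cannot be inlined blindly.
class Counter {
    int a;
    void inc() { a++; }        // small method: an ideal inlining candidate
    void other() { inc(); }    // virtual call: the target depends on the runtime type
}

class DoubleCounter extends Counter {
    @Override
    void inc() { a += 2; }     // replaces the implementation that other() invokes
}

class InliningDemo {
    static int callOther(Counter c) {
        c.other();
        return c.a;
    }

    public static void main(String[] args) {
        System.out.println(callOther(new Counter()));       // prints 1
        System.out.println(callOther(new DoubleCounter())); // prints 2
    }
}
```

Declaring inc as final in Counter would rule out the override and make the call target known at compile time.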

Escape Analysis
Escape analysis considers the objects created by a given method. When the compiler can derive that such
an object can never escape the scope of its creating thread (as it would, for example, through assignment
to a static variable), the object becomes unreachable after the method has terminated. In this case, object
allocation and garbage collection can be avoided altogether by creating the object on the stack of the
running thread rather than in general-purpose (heap) memory. When created on the stack,
method-local objects can be as efficient as function-local variables in C.
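
As a rough illustration, the sketch below contrasts an object that stays local to its method with one that escapes through a static variable. The names EscapeDemo, Point, and lastSeen are assumptions for this example. Whether a given JVM actually avoids the heap allocation depends on its implementation; the code only shows the property that escape analysis looks for.

```java
// Hypothetical example: which objects could a compiler keep off the heap?
class EscapeDemo {
    static final class Point {
        final int x, y;
        Point(int x, int y) { this.x = x; this.y = y; }
    }

    static Point lastSeen; // static field: anything stored here escapes

    // 'p' is used only inside this method, so it does not escape.
    // A compiler performing escape analysis may avoid the heap allocation.
    static int distanceSquared(int x, int y) {
        Point p = new Point(x, y);
        return p.x * p.x + p.y * p.y;
    }

    // 'p' is assigned to a static variable, so it escapes and must stay on the heap.
    static int distanceSquaredAndRemember(int x, int y) {
        Point p = new Point(x, y);
        lastSeen = p;
        return p.x * p.x + p.y * p.y;
    }

    public static void main(String[] args) {
        System.out.println(distanceSquared(3, 4));            // prints 25
        System.out.println(distanceSquaredAndRemember(3, 4)); // prints 25
        System.out.println(lastSeen.x);                       // prints 3
    }
}
```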


Bound-check elimination/deactivation
The violation of array boundaries is a frequently occurring programming mistake in C-like languages. To
avoid these mistakes, Java requires array bounds to be checked at runtime, possibly causing runtime
exceptions. This additional safety comes at the price of a performance penalty. A simple-minded but
unsafe optimization is to suppress code generation for array-bounds checks altogether. The idea is that
boundary violations will not occur after some successful initial program testing with bounds checks
activated. Completely deactivating array-bounds checks thus gives the unsafety of C at the speed of C. A
safer alternative is to eliminate only those checks that are provably redundant. For example, if a method
accesses a[i] in more than one statement, then only the first access needs a bounds check. For all other
accesses, the checks can safely be omitted as long as the compiler (Manta, for instance) can derive from
the code that neither the array base "a" nor the index "i" has been changed in the meantime. For this
purpose, the compiler can perform a data-flow analysis, keeping track of the array bases and related sets
of index identifiers for which bounds checks have already been issued.
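
A small sketch of the situation described above, with hypothetical names (BoundsCheckDemo, cube, isRejected): the method reads a[i] three times without changing "a" or "i", so only the first access needs a check. The second method shows the runtime check rejecting an invalid index.

```java
class BoundsCheckDemo {
    // Three accesses to a[i]; neither 'a' nor 'i' changes between them,
    // so a compiler only needs to check the bounds on the first access.
    static int cube(int[] a, int i) {
        int v = a[i];     // bounds check required here
        v = v * a[i];     // check is redundant
        v = v * a[i];     // check is redundant
        return v;
    }

    // Returns true when the runtime bounds check rejects the index.
    static boolean isRejected(int[] a, int i) {
        try {
            cube(a, i);
            return false;
        } catch (ArrayIndexOutOfBoundsException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        int[] data = { 2, 3, 4 };
        System.out.println(cube(data, 1));       // prints 27
        System.out.println(isRejected(data, 3)); // prints true
    }
}
```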

Closed-world Assumption
Many compiler optimizations require knowledge of the complete set of Java classes that make up an
application. Java's polymorphism, in combination with dynamic class loading, prevents such optimizations.
In this case, the programmer has to explicitly declare methods as final in order to enable a large set of
optimizations. However, the final declaration has only limited applicability because it selectively disables
polymorphism. Moreover, using it to improve application performance contradicts its original intention
as a means of class-hierarchy design. Many high-performance (scientific) applications consist of a fixed set
of classes and do not use dynamic class loading at all. Such applications can be compiled under a
closed-world assumption: all classes are available at compile time, which allows excellent runtime performance.
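
The following sketch shows the kind of dynamic class loading that breaks the closed-world assumption. The class and method names (DynamicLoadingDemo, createList) are hypothetical; Class.forName and getDeclaredConstructor().newInstance() are standard reflection calls. Because the class name is an ordinary string that could come from a file or the network, a static compiler cannot know which implementation will receive the calls.

```java
import java.util.List;

class DynamicLoadingDemo {
    // The class name is known only at run time, so a compiler cannot
    // determine in advance which List implementation will be used.
    @SuppressWarnings("unchecked")
    static List<String> createList(String className) throws ReflectiveOperationException {
        Class<?> cls = Class.forName(className);
        return (List<String>) cls.getDeclaredConstructor().newInstance();
    }

    public static void main(String[] args) throws ReflectiveOperationException {
        List<String> list = createList("java.util.LinkedList");
        list.add("loaded at run time");
        System.out.println(list.getClass().getName()); // prints java.util.LinkedList
        System.out.println(list.size());               // prints 1
    }
}
```

An application that never does this, and whose classes are all present at compile time, is a candidate for closed-world compilation.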

Common-subexpression elimination

Calculating a common sub-expression once, before the loop, and then substituting the calculated value for
every occurrence inside the loop improves runtime performance by reducing the load on the JVM stack.

Original loop with multiple uses of a common expression:

for(…){
    a[i] = a[j]*3*CONST_VAL;
    b[i] = a[i]+ a[j]*3*CONST_VAL;
    a[i] = b[i] - CONST_VAL;
}

Loop with common sub-expression elimination:

int sub = a[j]*3*CONST_VAL;
for(…){
    a[i] = sub;
    b[i] = a[i]+ sub;
    a[i] = b[i] - CONST_VAL;
}
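
A runnable version of this transformation might look as follows. The loop bounds, the value of CONST_VAL, and the class name CseDemo are assumptions, since the original leaves the loop header elided. The loop starts at j + 1 so that a[j] is never overwritten; without such a guarantee the sub-expression would not be constant across iterations and the rewrite would change the result.

```java
import java.util.Arrays;

class CseDemo {
    static final int CONST_VAL = 7; // assumed value, for illustration only

    // Original form: a[j]*3*CONST_VAL is evaluated twice per iteration.
    static int[] original(int[] a, int[] b, int j) {
        for (int i = j + 1; i < a.length; i++) {
            a[i] = a[j] * 3 * CONST_VAL;
            b[i] = a[i] + a[j] * 3 * CONST_VAL;
            a[i] = b[i] - CONST_VAL;
        }
        return a;
    }

    // Rewritten form: the common sub-expression is computed once and reused.
    static int[] optimized(int[] a, int[] b, int j) {
        int sub = a[j] * 3 * CONST_VAL;
        for (int i = j + 1; i < a.length; i++) {
            a[i] = sub;
            b[i] = a[i] + sub;
            a[i] = b[i] - CONST_VAL;
        }
        return a;
    }

    public static void main(String[] args) {
        int[] a1 = { 2, 0, 0, 0 };
        int[] a2 = a1.clone();
        int[] r1 = original(a1, new int[4], 0);
        int[] r2 = optimized(a2, new int[4], 0);
        System.out.println(Arrays.equals(r1, r2)); // prints true
        System.out.println(r1[1]);                 // prints 77
    }
}
```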


Loop-invariant code motion

Moving sub-expressions or pieces of code that do not depend on the loop's iteration values out of
the loop improves execution performance.

Original loop with improper use of a common expression:

for(…){
    iVal = a[j]*3*CONST_VAL;
    a[i] = iVal;
    b[i] = a[i]+ iVal;
    a[i] = b[i] - iVal;
}

Loop with loop-invariant code motion:

iVal = a[j]*3*CONST_VAL;
for(…){
    a[i] = iVal;
    b[i] = a[i]+ iVal;
    a[i] = b[i] - iVal;
}
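
To show the saving, the sketch below counts how often the invariant expression is evaluated in each form. The helper invariant, the counter evaluations, the loop bounds, and the value of CONST_VAL are all assumptions added for measurement; they are not part of the original fragment.

```java
import java.util.Arrays;

class LoopInvariantDemo {
    static final int CONST_VAL = 7; // assumed value, for illustration only
    static int evaluations;         // counts evaluations of the invariant expression

    // Stands in for the invariant expression a[j]*3*CONST_VAL.
    static int invariant(int[] a, int j) {
        evaluations++;
        return a[j] * 3 * CONST_VAL;
    }

    // Invariant expression recomputed on every iteration.
    static int[] insideLoop(int[] a, int[] b, int j) {
        for (int i = j + 1; i < a.length; i++) {
            int iVal = invariant(a, j);
            a[i] = iVal;
            b[i] = a[i] + iVal;
            a[i] = b[i] - iVal;
        }
        return a;
    }

    // Invariant expression moved out of the loop and computed once.
    static int[] hoisted(int[] a, int[] b, int j) {
        int iVal = invariant(a, j);
        for (int i = j + 1; i < a.length; i++) {
            a[i] = iVal;
            b[i] = a[i] + iVal;
            a[i] = b[i] - iVal;
        }
        return a;
    }

    public static void main(String[] args) {
        evaluations = 0;
        int[] r1 = insideLoop(new int[] { 2, 0, 0, 0 }, new int[4], 0);
        int inside = evaluations;
        evaluations = 0;
        int[] r2 = hoisted(new int[] { 2, 0, 0, 0 }, new int[4], 0);
        int outside = evaluations;
        System.out.println(inside + " vs " + outside); // prints 3 vs 1
        System.out.println(Arrays.equals(r1, r2));     // prints true
    }
}
```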

Minimal synchronization, RMI and serialization

In Java, synchronization is provided through monitors, which are language-level constructs used to
guarantee mutually exclusive access to shared data. Since Java allows any object to be synchronizable (with
or without synchronized methods), using a lock structure for each object can be very costly in terms of
memory. A JVM may therefore keep monitors outside the objects, in a monitor cache. This requires the
runtime system to first look up a monitor in the monitor cache before it is used, which is quite
inefficient. Furthermore, the monitor cache itself must be locked to avoid race conditions. Thus the
monitor-cache approach is not scalable.

The thin-lock and thick-lock approach used in some JVMs optimizes the locking mechanism by avoiding
heavyweight locks up to an excessive nesting depth, which improves performance. However, careful use of
synchronization remains necessary to improve execution performance in any application.
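
One common way to reduce synchronization cost at the source level is to shrink the number of lock acquisitions. The sketch below, with hypothetical names (SyncDemo, sumLockPerElement, sumLockOnce), contrasts taking the monitor once per element with accumulating locally and taking it once. It illustrates the idea and is not a benchmark.

```java
import java.util.Arrays;

class SyncDemo {
    private long total;

    synchronized void add(long value) { total += value; }
    synchronized long total() { return total; }

    // Acquires the monitor once per element.
    static void sumLockPerElement(SyncDemo shared, int[] values) {
        for (int v : values) {
            shared.add(v);
        }
    }

    // Accumulates in a thread-local variable and acquires the monitor only once.
    static void sumLockOnce(SyncDemo shared, int[] values) {
        long local = 0;
        for (int v : values) {
            local += v;
        }
        shared.add(local);
    }

    public static void main(String[] args) throws InterruptedException {
        SyncDemo shared = new SyncDemo();
        int[] values = new int[1000];
        Arrays.fill(values, 1);
        Thread t1 = new Thread(() -> sumLockPerElement(shared, values));
        Thread t2 = new Thread(() -> sumLockOnce(shared, values));
        t1.start();
        t2.start();
        t1.join();
        t2.join();
        System.out.println(shared.total()); // prints 2000
    }
}
```

Both variants produce the same total; the second simply enters the synchronized method 1 time instead of 1000.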

The current Java RMI is designed to support client-server applications that communicate over TCP-based
networks. Some RMI design goals result in severe performance limitations for high-performance
applications in closely connected environments such as clusters and distributed-memory processors.

Similarly, serialization is a costly operation. Making classes serializable selectively and using
RMI methods judiciously can improve the overall performance of an application.
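
Selective serialization can be sketched with the transient keyword, which excludes a field from Java's default serialization. The class names (SerializationDemo, FullRecord, LeanRecord) and the field layout are assumptions for illustration. The example compares the serialized size of a record that writes a derived cache with one that skips it.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;
import java.io.Serializable;

class SerializationDemo {
    static class FullRecord implements Serializable {
        private static final long serialVersionUID = 1L;
        int id = 42;
        int[] cache = new int[1024]; // derived data, written to the stream as well
    }

    static class LeanRecord implements Serializable {
        private static final long serialVersionUID = 1L;
        int id = 42;
        transient int[] cache = new int[1024]; // skipped by default serialization
    }

    // Serializes the object into memory and returns the number of bytes written.
    static int serializedSize(Serializable obj) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(obj);
        }
        return bytes.size();
    }

    public static void main(String[] args) throws IOException {
        int full = serializedSize(new FullRecord());
        int lean = serializedSize(new LeanRecord());
        System.out.println(lean < full); // prints true
    }
}
```

A transient field is not restored on deserialization, so the receiving side has to rebuild it if it is needed.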

Conclusion
From a software-development point of view, it is desirable to use Java instead of C, because its
object-oriented features ease programming and debugging. Unfortunately, several of these features
introduce performance overheads, making it difficult to obtain the same sequential speed for Java as for C.
This paper discusses various ways of improving the runtime performance of Java applications with an
inside view of the JVM, and describes a range of existing optimization techniques for Java, both during
development and compilation, and their performance impact on applications.
