Hibernate Reference3.2
Hibernate Reference3.2
Hibernate Reference3.2
Version: 3.2.7
Table of Contents
Preface ............................................................................................................................................ ix
1. Introduction to Hibernate ........................................................................................................... 1
1.1. Preface .............................................................................................................................. 1
1.2. Part 1 - The first Hibernate Application ............................................................................... 1
1.2.1. The first class .......................................................................................................... 1
1.2.2. The mapping file ..................................................................................................... 3
1.2.3. Hibernate configuration ........................................................................................... 4
1.2.4. Building with Ant .................................................................................................... 6
1.2.5. Startup and helpers .................................................................................................. 6
1.2.6. Loading and storing objects ...................................................................................... 8
1.3. Part 2 - Mapping associations ............................................................................................ 10
1.3.1. Mapping the Person class ....................................................................................... 10
1.3.2. A unidirectional Set-based association .................................................................... 11
1.3.3. Working the association ......................................................................................... 12
1.3.4. Collection of values ............................................................................................... 13
1.3.5. Bi-directional associations ..................................................................................... 14
1.3.6. Working bi-directional links ................................................................................... 15
1.4. Part 3 - The EventManager web application ....................................................................... 16
1.4.1. Writing the basic servlet ........................................................................................ 16
1.4.2. Processing and rendering ....................................................................................... 17
1.4.3. Deploying and testing ............................................................................................ 18
1.5. Summary ......................................................................................................................... 19
2. Architecture .............................................................................................................................. 20
2.1. Overview ......................................................................................................................... 20
2.2. Instance states .................................................................................................................. 22
2.3. JMX Integration ............................................................................................................... 22
2.4. JCA Support .................................................................................................................... 23
2.5. Contextual Sessions .......................................................................................................... 23
3. Configuration ............................................................................................................................ 25
3.1. Programmatic configuration .............................................................................................. 25
3.2. Obtaining a SessionFactory ............................................................................................... 25
3.3. JDBC connections ............................................................................................................ 26
3.4. Optional configuration properties ...................................................................................... 27
3.4.1. SQL Dialects ......................................................................................................... 33
3.4.2. Outer Join Fetching ............................................................................................... 34
3.4.3. Binary Streams ...................................................................................................... 34
3.4.4. Second-level and query cache ................................................................................. 34
3.4.5. Query Language Substitution ................................................................................. 34
3.4.6. Hibernate statistics ................................................................................................ 34
3.5. Logging ........................................................................................................................... 34
3.6. Implementing a NamingStrategy ....................................................................................... 35
3.7. XML configuration file ..................................................................................................... 36
3.8. J2EE Application Server integration .................................................................................. 37
3.8.1. Transaction strategy configuration .......................................................................... 37
3.8.2. JNDI-bound SessionFactory ................................................................................... 38
3.8.3. Current Session context management with JTA ....................................................... 39
3.8.4. JMX deployment ................................................................................................... 39
4. Persistent Classes ...................................................................................................................... 41
Hibernate 3.2.7 ii
HIBERNATE - Relational Persistence for Idiomatic Java
Hibernate 3.2.7 iv
HIBERNATE - Relational Persistence for Idiomatic Java
Hibernate 3.2.7 v
HIBERNATE - Relational Persistence for Idiomatic Java
Hibernate 3.2.7 vi
HIBERNATE - Relational Persistence for Idiomatic Java
Hibernate not only takes care of the mapping from Java classes to database tables (and from Java data types to
SQL data types), but also provides data query and retrieval facilities and can significantly reduce development
time otherwise spent with manual data handling in SQL and JDBC.
Hibernates goal is to relieve the developer from 95 percent of common data persistence related programming
tasks. Hibernate may not be the best solution for data-centric applications that only use stored-procedures to
implement the business logic in the database, it is most useful with object-oriented domain models and business
logic in the Java-based middle-tier. However, Hibernate can certainly help you to remove or encapsulate
vendor-specific SQL code and will help with the common task of result set translation from a tabular represent-
ation to a graph of objects.
If you are new to Hibernate and Object/Relational Mapping or even Java, please follow these steps:
1. Read Chapter 1, Introduction to Hibernate for a tutorial with step-by-step instructions. The source code
for the tutorial is included in the distribution in the doc/reference/tutorial/ directory.
2. Read Chapter 2, Architecture to understand the environments where Hibernate can be used.
3. Have a look at the eg/ directory in the Hibernate distribution, it contains a simple standalone application.
Copy your JDBC driver to the lib/ directory and edit etc/hibernate.properties, specifying correct val-
ues for your database. From a command prompt in the distribution directory, type ant eg (using Ant), or
under Windows, type build eg.
4. Use this reference documentation as your primary source of information. Consider reading Hibernate in
Action (http://www.manning.com/bauer) if you need more help with application design or if you prefer a
step-by-step tutorial. Also visit http://caveatemptor.hibernate.org and download the example application
for Hibernate in Action.
6. Third party demos, examples, and tutorials are linked on the Hibernate website.
7. The Community Area on the Hibernate website is a good resource for design patterns and various integra-
tion solutions (Tomcat, JBoss AS, Struts, EJB, etc.).
If you have questions, use the user forum linked on the Hibernate website. We also provide a JIRA issue track-
ings system for bug reports and feature requests. If you are interested in the development of Hibernate, join the
developer mailing list. If you are interested in translating this documentation into your language, contact us on
the developer mailing list.
Commercial development support, production support, and training for Hibernate is available through JBoss
Inc. (see http://www.hibernate.org/SupportTraining/). Hibernate is a Professional Open Source project and a
critical component of the JBoss Enterprise Middleware System (JEMS) suite of products.
Hibernate 3.2.7 ix
Chapter 1. Introduction to Hibernate
1.1. Preface
This chapter is an introductory tutorial for new users of Hibernate. We start with a simple command line applic-
ation using an in-memory database and develop it in easy to understand steps.
This tutorial is intended for new users of Hibernate but requires Java and SQL knowledge. It is based on a tu-
torial by Michael Gloegl, the third-party libraries we name are for JDK 1.4 and 5.0. You might need others for
JDK 1.3.
The source code for the tutorial is included in the distribution in the doc/reference/tutorial/ directory.
Let's assume we need a small database application that can store events we want to attend, and information
about the hosts of these events.
The first thing we do, is set up our development directory and put all the Java libraries we need into it. Down-
load the Hibernate distribution from the Hibernate website. Extract the package and place all required libraries
found in /lib into into the /lib directory of your new development working directory. It should look like this:
.
+lib
antlr.jar
cglib.jar
asm.jar
asm-attrs.jars
commons-collections.jar
commons-logging.jar
hibernate3.jar
jta.jar
dom4j.jar
log4j.jar
This is the minimum set of required libraries (note that we also copied hibernate3.jar, the main archive) for Hi-
bernate at the time of writing. The Hibernate release you are using might require more or less libraries. See the
README.txt file in the lib/ directory of the Hibernate distribution for more information about required and op-
tional third-party libraries. (Actually, Log4j is not required but preferred by many developers.)
Next we create a class that represents the event we want to store in database.
Our first persistent class is a simple JavaBean class with some properties:
package events;
import java.util.Date;
Hibernate 3.2.7 1
Introduction to Hibernate
public Event() {}
You can see that this class uses standard JavaBean naming conventions for property getter and setter methods,
as well as private visibility for the fields. This is a recommended design - but not required. Hibernate can also
access fields directly, the benefit of accessor methods is robustness for refactoring. The no-argument construct-
or is required to instantiate an object of this class through reflection.
The id property holds a unique identifier value for a particular event. All persistent entity classes (there are less
important dependent classes as well) will need such an identifier property if we want to use the full feature set
of Hibernate. In fact, most applications (esp. web applications) need to distinguish objects by identifier, so you
should consider this a feature rather than a limitation. However, we usually don't manipulate the identity of an
object, hence the setter method should be private. Only Hibernate will assign identifiers when an object is
saved. You can see that Hibernate can access public, private, and protected accessor methods, as well as
(public, private, protected) fields directly. The choice is up to you and you can match it to fit your application
design.
The no-argument constructor is a requirement for all persistent classes; Hibernate has to create objects for you,
using Java Reflection. The constructor can be private, however, package visibility is required for runtime proxy
generation and efficient data retrieval without bytecode instrumentation.
Place this Java source file in a directory called src in the development folder, and in its correct package. The
directory should now look like this:
.
+lib
<Hibernate and third-party libraries>
+src
+events
Event.java
Hibernate 3.2.7 2
Introduction to Hibernate
Hibernate needs to know how to load and store objects of the persistent class. This is where the Hibernate map-
ping file comes into play. The mapping file tells Hibernate what table in the database it has to access, and what
columns in that table it should use.
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD 3.0//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd">
<hibernate-mapping>
[...]
</hibernate-mapping>
Note that the Hibernate DTD is very sophisticated. You can use it for auto-completion of XML mapping ele-
ments and attributes in your editor or IDE. You also should open up the DTD file in your text editor - it's the
easiest way to get an overview of all elements and attributes and to see the defaults, as well as some comments.
Note that Hibernate will not load the DTD file from the web, but first look it up from the classpath of the ap-
plication. The DTD file is included in hibernate3.jar as well as in the src/ directory of the Hibernate distri-
bution.
We will omit the DTD declaration in future examples to shorten the code. It is of course not optional.
Between the two hibernate-mapping tags, include a class element. All persistent entity classes (again, there
might be dependent classes later on, which are not first-class entities) need such a mapping, to a table in the
SQL database:
<hibernate-mapping>
</class>
</hibernate-mapping>
So far we told Hibernate how to persist and load object of class Event to the table EVENTS, each instance repres-
ented by a row in that table. Now we continue with a mapping of the unique identifier property to the tables
primary key. In addition, as we don't want to care about handling this identifier, we configure Hibernate's iden-
tifier generation strategy for a surrogate primary key column:
<hibernate-mapping>
</hibernate-mapping>
The id element is the declaration of the identifer property, name="id" declares the name of the Java property -
Hibernate will use the getter and setter methods to access the property. The column attribute tells Hibernate
which column of the EVENTS table we use for this primary key. The nested generator element specifies the
identifier generation strategy, in this case we used native, which picks the best strategy depending on the con-
figured database (dialect). Hibernate supports database generated, globally unique, as well as application as-
signed identifiers (or any strategy you have written an extension for).
Hibernate 3.2.7 3
Introduction to Hibernate
Finally we include declarations for the persistent properties of the class in the mapping file. By default, no
properties of the class are considered persistent:
<hibernate-mapping>
</hibernate-mapping>
Just as with the id element, the name attribute of the property element tells Hibernate which getter and setter
methods to use. So, in this case, Hibernate will look for getDate()/setDate(), as well as get-
Title()/setTitle().
Why does the date property mapping include the column attribute, but the title doesn't? Without the column
attribute Hibernate by default uses the property name as the column name. This works fine for title. However,
date is a reserved keyword in most database, so we better map it to a different name.
The next interesting thing is that the title mapping also lacks a type attribute. The types we declare and use in
the mapping files are not, as you might expect, Java data types. They are also not SQL database types. These
types are so called Hibernate mapping types, converters which can translate from Java to SQL data types and
vice versa. Again, Hibernate will try to determine the correct conversion and mapping type itself if the type at-
tribute is not present in the mapping. In some cases this automatic detection (using Reflection on the Java class)
might not have the default you expect or need. This is the case with the date property. Hibernate can't know if
the property (which is of java.util.Date) should map to a SQL date, timestamp, or time column. We pre-
serve full date and time information by mapping the property with a timestamp converter.
This mapping file should be saved as Event.hbm.xml, right in the directory next to the Event Java class source
file. The naming of mapping files can be arbitrary, however the hbm.xml suffix is a convention in the Hibernate
developer community. The directory structure should now look like this:
.
+lib
<Hibernate and third-party libraries>
+src
+events
Event.java
Event.hbm.xml
We now have a persistent class and its mapping file in place. It is time to configure Hibernate. Before we do
this, we will need a database. HSQL DB, a java-based SQL DBMS, can be downloaded from the HSQL DB
website. Actually, you only need the hsqldb.jar from this download. Place this file in the lib/ directory of the
development folder.
Create a directory called data in the root of the development directory - this is where HSQL DB will store its
data files. Now start the database by running java -classpath ../lib/hsqldb.jar org.hsqldb.Server in
this data directory. You can see it start up and bind to a TCP/IP socket, this is where our application will con-
Hibernate 3.2.7 4
Introduction to Hibernate
nect later. If you want to start with a fresh database during this tutorial, shutdown HSQL DB (press CTRL + C in
the window), delete all files in the data/ directory, and start HSQL DB again.
Hibernate is the layer in your application which connects to this database, so it needs connection information.
The connections are made through a JDBC connection pool, which we also have to configure. The Hibernate
distribution contains several open source JDBC connection pooling tools, but will use the Hibernate built-in
connection pool for this tutorial. Note that you have to copy the required library into your classpath and use dif-
ferent connection pooling settings if you want to use a production-quality third party JDBC pooling software.
For Hibernate's configuration, we can use a simple hibernate.properties file, a slightly more sophisticated
hibernate.cfg.xml file, or even complete programmatic setup. Most users prefer the XML configuration file:
<hibernate-configuration>
<session-factory>
<mapping resource="events/Event.hbm.xml"/>
</session-factory>
</hibernate-configuration>
Note that this XML configuration uses a different DTD. We configure Hibernate's SessionFactory - a global
factory responsible for a particular database. If you have several databases, use several <session-factory>
configurations, usually in several configuration files (for easier startup).
The first four property elements contain the necessary configuration for the JDBC connection. The dialect
property element specifies the particular SQL variant Hibernate generates. Hibernate's automatic session man-
agement for persistence contexts will come in handy as you will soon see. The hbm2ddl.auto option turns on
automatic generation of database schemas - directly into the database. This can of course also be turned off (by
removing the config option) or redirected to a file with the help of the SchemaExport Ant task. Finally, we add
the mapping file(s) for persistent classes to the configuration.
Copy this file into the source directory, so it will end up in the root of the classpath. Hibernate automatically
Hibernate 3.2.7 5
Introduction to Hibernate
looks for a file called hibernate.cfg.xml in the root of the classpath, on startup.
We'll now build the tutorial with Ant. You will need to have Ant installed - get it from the Ant download page
[http://ant.apache.org/bindownload.cgi]. How to install Ant will not be covered here. Please refer to the Ant
manual [http://ant.apache.org/manual/index.html]. After you have installed Ant, we can start to create the build-
file. It will be called build.xml and placed directly in the development directory.
<path id="libraries">
<fileset dir="${librarydir}">
<include name="*.jar"/>
</fileset>
</path>
<target name="clean">
<delete dir="${targetdir}"/>
<mkdir dir="${targetdir}"/>
</target>
<target name="copy-resources">
<copy todir="${targetdir}">
<fileset dir="${sourcedir}">
<exclude name="**/*.java"/>
</fileset>
</copy>
</target>
</project>
This will tell Ant to add all files in the lib directory ending with .jar to the classpath used for compilation. It
will also copy all non-Java source files to the target directory, e.g. configuration and Hibernate mapping files. If
you now run Ant, you should get this output:
C:\hibernateTutorial\>ant
Buildfile: build.xml
copy-resources:
[copy] Copying 2 files to C:\hibernateTutorial\bin
compile:
[javac] Compiling 1 source file to C:\hibernateTutorial\bin
BUILD SUCCESSFUL
Total time: 1 second
Hibernate 3.2.7 6
Introduction to Hibernate
It's time to load and store some Event objects, but first we have to complete the setup with some infrastructure
code. We have to startup Hibernate. This startup includes building a global SessionFactory object and to store
it somewhere for easy access in application code. A SessionFactory can open up new Session's. A Session
represents a single-threaded unit of work, the SessionFactory is a thread-safe global object, instantiated once.
We'll create a HibernateUtil helper class which takes care of startup and makes accessing a SessionFactory
convenient. Let's have a look at the implementation:
package util;
import org.hibernate.*;
import org.hibernate.cfg.*;
static {
try {
// Create the SessionFactory from hibernate.cfg.xml
sessionFactory = new Configuration().configure().buildSessionFactory();
} catch (Throwable ex) {
// Make sure you log the exception, as it might be swallowed
System.err.println("Initial SessionFactory creation failed." + ex);
throw new ExceptionInInitializerError(ex);
}
}
This class does not only produce the global SessionFactory in its static initializer (called once by the JVM
when the class is loaded), but also hides the fact that it uses a static singleton. It might as well lookup the Ses-
sionFactory from JNDI in an application server.
If you give the SessionFactory a name in your configuration file, Hibernate will in fact try to bind it to JNDI
after it has been built. To avoid this code completely you could also use JMX deployment and let the JMX-
capable container instantiate and bind a HibernateService to JNDI. These advanced options are discussed in
the Hibernate reference documentation.
.
+lib
<Hibernate and third-party libraries>
+src
+events
Event.java
Event.hbm.xml
+util
HibernateUtil.java
hibernate.cfg.xml
+data
build.xml
This should again compile without problems. We finally need to configure a logging system - Hibernate uses
commons logging and leaves you the choice between Log4j and JDK 1.4 logging. Most developers prefer
Log4j: copy log4j.properties from the Hibernate distribution (it's in the etc/ directory) to your src direct-
ory, next to hibernate.cfg.xml. Have a look at the example configuration and change the settings if you like
Hibernate 3.2.7 7
Introduction to Hibernate
to have more verbose output. By default, only Hibernate startup message are shown on stdout.
The tutorial infrastructure is complete - and we are ready to do some real work with Hibernate.
Finally, we can use Hibernate to load and store objects. We write an EventManager class with a main() meth-
od:
package events;
import org.hibernate.Session;
import java.util.Date;
import util.HibernateUtil;
if (args[0].equals("store")) {
mgr.createAndStoreEvent("My Event", new Date());
}
HibernateUtil.getSessionFactory().close();
}
session.beginTransaction();
session.save(theEvent);
session.getTransaction().commit();
}
We create a new Event object, and hand it over to Hibernate. Hibernate now takes care of the SQL and ex-
ecutes INSERTs on the database. Let's have a look at the Session and Transaction-handling code before we run
this.
A Session is a single unit of work. For now we'll keep things simple and assume a one-to-one granularity
between a Hibernate Session and a database transaction. To shield our code from the actual underlying transac-
tion system (in this case plain JDBC, but it could also run with JTA) we use the Transaction API that is avail-
able on the Hibernate Session.
What does sessionFactory.getCurrentSession() do? First, you can call it as many times and anywhere you
like, once you get hold of your SessionFactory (easy thanks to HibernateUtil). The getCurrentSession()
method always returns the "current" unit of work. Remember that we switched the configuration option for this
mechanism to "thread" in hibernate.cfg.xml? Hence, the current unit of work is bound to the current Java
thread that executes our application. However, this is not the full picture, you also have to consider scope, when
a unit of work begins and when it ends.
A Session begins when it is first needed, when the first call to getCurrentSession() is made. It is then bound
Hibernate 3.2.7 8
Introduction to Hibernate
by Hibernate to the current thread. When the transaction ends, either through commit or rollback, Hibernate
automatically unbinds the Session from the thread and closes it for you. If you call getCurrentSession()
again, you get a new Session and can start a new unit of work. This thread-bound programming model is the
most popular way of using Hibernate, as it allows flexible layering of your code (transaction demarcation code
can be separated from data access code, we'll do this later in this tutorial).
Related to the unit of work scope, should the Hibernate Session be used to execute one or several database op-
erations? The above example uses one Session for one operation. This is pure coincidence, the example is just
not complex enough to show any other approach. The scope of a Hibernate Session is flexible but you should
never design your application to use a new Hibernate Session for every database operation. So even if you see
it a few more times in the following (very trivial) examples, consider session-per-operation an anti-pattern. A
real (web) application is shown later in this tutorial.
Have a look at Chapter 11, Transactions And Concurrency for more information about transaction handling and
demarcation. We also skipped any error handling and rollback in the previous example.
To run this first routine we have to add a callable target to the Ant build file:
The value of the action argument is set on the command line when calling the target:
You should see, after compilation, Hibernate starting up and, depending on your configuration, lots of log out-
put. At the end you will find the following line:
[java] Hibernate: insert into EVENTS (EVENT_DATE, title, EVENT_ID) values (?, ?, ?)
This is the INSERT executed by Hibernate, the question marks represent JDBC bind parameters. To see the val-
ues bound as arguments, or to reduce the verbosity of the log, check your log4j.properties.
Now we'd like to list stored events as well, so we add an option to the main method:
if (args[0].equals("store")) {
mgr.createAndStoreEvent("My Event", new Date());
}
else if (args[0].equals("list")) {
List events = mgr.listEvents();
for (int i = 0; i < events.size(); i++) {
Event theEvent = (Event) events.get(i);
System.out.println("Event: " + theEvent.getTitle() +
" Time: " + theEvent.getDate());
}
}
session.beginTransaction();
Hibernate 3.2.7 9
Introduction to Hibernate
session.getTransaction().commit();
return result;
}
What we do here is use an HQL (Hibernate Query Language) query to load all existing Event objects from the
database. Hibernate will generate the appropriate SQL, send it to the database and populate Event objects with
the data. You can create more complex queries with HQL, of course.
• Run ant run -Daction=store to store something into the database and, of course, to generate the database
schema before through hbm2ddl.
• Now disable hbm2ddl by commenting out the property in your hibernate.cfg.xml file. Usually you only
leave it turned on in continous unit testing, but another run of hbm2ddl would drop everything you have
stored - the create configuration setting actually translates into "drop all tables from the schema, then re-
create all tables, when the SessionFactory is build".
If you now call Ant with -Daction=list, you should see the events you have stored so far. You can of course
also call the store action a few times more.
Note: Most new Hibernate users fail at this point and we see questions about Table not found error messages
regularly. However, if you follow the steps outlined above you will not have this problem, as hbm2ddl creates
the database schema on the first run, and subsequent application restarts will use this schema. If you change the
mapping and/or database schema, you have to re-enable hbm2ddl once again.
package events;
public Person() {}
Create a new mapping file called Person.hbm.xml (don't forget the DTD reference at the top):
<hibernate-mapping>
Hibernate 3.2.7 10
Introduction to Hibernate
</hibernate-mapping>
<mapping resource="events/Event.hbm.xml"/>
<mapping resource="events/Person.hbm.xml"/>
We'll now create an association between these two entities. Obviously, persons can participate in events, and
events have participants. The design questions we have to deal with are: directionality, multiplicity, and collec-
tion behavior.
We'll add a collection of events to the Person class. That way we can easily navigate to the events for a particu-
lar person, without executing an explicit query - by calling aPerson.getEvents(). We use a Java collection, a
Set, because the collection will not contain duplicate elements and the ordering is not relevant for us.
We need a unidirectional, many-valued associations, implemented with a Set. Let's write the code for this in
the Java classes and then map it:
Before we map this association, think about the other side. Clearly, we could just keep this unidirectional. Or,
we could create another collection on the Event, if we want to be able to navigate it bi-directional, i.e. an-
Event.getParticipants(). This is not necessary, from a functional perspective. You could always execute an
explicit query to retrieve the participants for a particular event. This is a design choice left to you, but what is
clear from this discussion is the multiplicity of the association: "many" valued on both sides, we call this a
many-to-many association. Hence, we use Hibernate's many-to-many mapping:
Hibernate 3.2.7 11
Introduction to Hibernate
</class>
Hibernate supports all kinds of collection mappings, a <set> being most common. For a many-to-many associ-
ation (or n:m entity relationship), an association table is needed. Each row in this table represents a link
between a person and an event. The table name is configured with the table attribute of the set element. The
identifier column name in the association, for the person's side, is defined with the <key> element, the column
name for the event's side with the column attribute of the <many-to-many>. You also have to tell Hibernate the
class of the objects in your collection (correct: the class on the other side of the collection of references).
_____________ __________________
| | | | _____________
| EVENTS | | PERSON_EVENT | | |
|_____________| |__________________| | PERSON |
| | | | |_____________|
| *EVENT_ID | <--> | *EVENT_ID | | |
| EVENT_DATE | | *PERSON_ID | <--> | *PERSON_ID |
| TITLE | |__________________| | AGE |
|_____________| | FIRSTNAME |
| LASTNAME |
|_____________|
Let's bring some people and events together in a new method in EventManager:
aPerson.getEvents().add(anEvent);
session.getTransaction().commit();
}
After loading a Person and an Event, simply modify the collection using the normal collection methods. As
you can see, there is no explicit call to update() or save(), Hibernate automatically detects that the collection
has been modified and needs to be updated. This is called automatic dirty checking, and you can also try it by
modifying the name or the date property of any of your objects. As long as they are in persistent state, that is,
bound to a particular Hibernate Session (i.e. they have been just loaded or saved in a unit of work), Hibernate
monitors any changes and executes SQL in a write-behind fashion. The process of synchronizing the memory
state with the database, usually only at the end of a unit of work, is called flushing. In our code, the unit of work
ends with a commit (or rollback) of the database transaction - as defined by the thread configuration option for
the CurrentSessionContext class.
You might of course load person and event in different units of work. Or you modify an object outside of a
Session, when it is not in persistent state (if it was persistent before, we call this state detached). You can even
modify a collection when it is detached:
Hibernate 3.2.7 12
Introduction to Hibernate
session.beginTransaction();
session.getTransaction().commit();
session2.getTransaction().commit();
}
The call to update makes a detached object persistent again, you could say it binds it to a new unit of work, so
any modifications you made to it while detached can be saved to the database. This includes any modifications
(additions/deletions) you made to a collection of that entity object.
Well, this is not much use in our current situation, but it's an important concept you can design into your own
application. For now, complete this exercise by adding a new action to the EventManager's main method and
call it from the command line. If you need the identifiers of a person and an event - the save() method returns
it (you might have to modify some of the previous methods to return that identifier):
else if (args[0].equals("addpersontoevent")) {
Long eventId = mgr.createAndStoreEvent("My Event", new Date());
Long personId = mgr.createAndStorePerson("Foo", "Bar");
mgr.addPersonToEvent(personId, eventId);
System.out.println("Added person " + personId + " to event " + eventId);
This was an example of an association between two equally important classes, two entities. As mentioned earli-
er, there are other classes and types in a typical model, usually "less important". Some you have already seen,
like an int or a String. We call these classes value types, and their instances depend on a particular entity. In-
stances of these types don't have their own identity, nor are they shared between entities (two persons don't ref-
erence the same firstname object, even if they have the same first name). Of course, value types can not only
be found in the JDK (in fact, in a Hibernate application all JDK classes are considered value types), but you can
also write dependent classes yourself, Address or MonetaryAmount, for example.
You can also design a collection of value types. This is conceptually very different from a collection of refer-
ences to other entities, but looks almost the same in Java.
We add a collection of value typed objects to the Person entity. We want to store email addresses, so the type
we use is String, and the collection is again a Set:
Hibernate 3.2.7 13
Introduction to Hibernate
The difference compared with the earlier mapping is the element part, which tells Hibernate that the collection
does not contain references to another entity, but a collection of elements of type String (the lowercase name
tells you it's a Hibernate mapping type/converter). Once again, the table attribute of the set element determ-
ines the table name for the collection. The key element defines the foreign-key column name in the collection
table. The column attribute in the element element defines the column name where the String values will actu-
ally be stored.
_____________ __________________
| | | | _____________
| EVENTS | | PERSON_EVENT | | | ___________________
|_____________| |__________________| | PERSON | | |
| | | | |_____________| | PERSON_EMAIL_ADDR |
| *EVENT_ID | <--> | *EVENT_ID | | | |___________________|
| EVENT_DATE | | *PERSON_ID | <--> | *PERSON_ID | <--> | *PERSON_ID |
| TITLE | |__________________| | AGE | | *EMAIL_ADDR |
|_____________| | FIRSTNAME | |___________________|
| LASTNAME |
|_____________|
You can see that the primary key of the collection table is in fact a composite key, using both columns. This
also implies that there can't be duplicate email addresses per person, which is exactly the semantics we need for
a set in Java.
You can now try and add elements to this collection, just like we did before by linking persons and events. It's
the same code in Java:
session.getTransaction().commit();
}
This time we didnt' use a fetch query to initialize the collection. Hence, the call to its getter method will trigger
an additional select to initialize it, so we can add an element to it. Monitor the SQL log and try to optimize this
with an eager fetch.
Hibernate 3.2.7 14
Introduction to Hibernate
Next we are going to map a bi-directional association - making the association between person and event work
from both sides in Java. Of course, the database schema doesn't change, we still have many-to-many multipli-
city. A relational database is more flexible than a network programming language, so it doesn't need anything
like a navigation direction - data can be viewed and retrieved in any possible way.
As you see, these are normal set mappings in both mapping documents. Notice that the column names in key
and many-to-many are swapped in both mapping documents. The most important addition here is the in-
verse="true" attribute in the set element of the Event's collection mapping.
What this means is that Hibernate should take the other side - the Person class - when it needs to find out in-
formation about the link between the two. This will be a lot easier to understand once you see how the bi-
directional link between our two entities is created .
First, keep in mind that Hibernate does not affect normal Java semantics. How did we create a link between a
Person and an Event in the unidirectional example? We added an instance of Event to the collection of event
references, of an instance of Person. So, obviously, if we want to make this link working bi-directional, we
have to do the same on the other side - adding a Person reference to the collection in an Event. This "setting the
link on both sides" is absolutely necessary and you should never forget doing it.
Many developers program defensive and create a link management methods to correctly set both sides, e.g. in
Person:
Hibernate 3.2.7 15
Introduction to Hibernate
Notice that the get and set methods for the collection are now protected - this allows classes in the same pack-
age and subclasses to still access the methods, but prevents everybody else from messing with the collections
directly (well, almost). You should probably do the same with the collection on the other side.
What about the inverse mapping attribute? For you, and for Java, a bi-directional link is simply a matter of set-
ting the references on both sides correctly. Hibernate however doesn't have enough information to correctly ar-
range SQL INSERT and UPDATE statements (to avoid constraint violations), and needs some help to handle bi-
directional associations properly. Making one side of the association inverse tells Hibernate to basically ignore
it, to consider it a mirror of the other side. That's all that is necessary for Hibernate to work out all of the issues
when transformation a directional navigation model to a SQL database schema. The rules you have to remem-
ber are straightforward: All bi-directional associations need one side as inverse. In a one-to-many association
it has to be the many-side, in many-to-many association you can pick either side, there is no difference.
package events;
// Imports
// Servlet code
}
The servlet handles HTTP GET requests only, hence, the method we implement is doGet():
try {
// Begin unit of work
HibernateUtil.getSessionFactory()
.getCurrentSession().beginTransaction();
Hibernate 3.2.7 16
Introduction to Hibernate
The pattern we are applying here is called session-per-request. When a request hits the servlet, a new Hibernate
Session is opened through the first call to getCurrentSession() on the SessionFactory. Then a database
transaction is started—all data access as to occur inside a transaction, no matter if data is read or written (we
don't use the auto-commit mode in applications).
Do not use a new Hibernate Session for every database operation. Use one Hibernate Session that is scoped to
the whole request. Use getCurrentSession(), so that it is automatically bound to the current Java thread.
Next, the possible actions of the request are processed and the response HTML is rendered. We'll get to that
part soon.
Finally, the unit of work ends when processing and rendering is complete. If any problem occured during pro-
cessing or rendering, an exception will be thrown and the database transaction rolled back. This completes the
session-per-request pattern. Instead of the transaction demarcation code in every servlet you could also
write a servlet filter. See the Hibernate website and Wiki for more information about this pattern, called Open
Session in View—you'll need it as soon as you consider rendering your view in JSP, not in a servlet.
Let's implement the processing of the request and rendering of the page.
// Handle actions
if ( "store".equals(request.getParameter("action")) ) {
if ( "".equals(eventTitle) || "".equals(eventDate) ) {
out.println("<b><i>Please enter event title and date.</i></b>");
} else {
createAndStoreEvent(eventTitle, dateFormatter.parse(eventDate));
out.println("<b><i>Added event.</i></b>");
}
}
// Print page
printEventForm(out);
listEvents(out, dateFormatter);
Granted, this coding style with a mix of Java and HTML would not scale in a more complex application—keep
in mind that we are only illustrating basic Hibernate concepts in this tutorial. The code prints an HTML header
and a footer. Inside this page, an HTML form for event entry and a list of all events in the database are printed.
The first method is trivial and only outputs HTML:
Hibernate 3.2.7 17
Introduction to Hibernate
out.println("</form>");
}
The listEvents() method uses the Hibernate Session bound to the current thread to execute a query:
Finally, the store action is dispatched to the createAndStoreEvent() method, which also uses the Session of
the current thread:
HibernateUtil.getSessionFactory()
.getCurrentSession().save(theEvent);
}
That's it, the servlet is complete. A request to the servlet will be processed in a single Session and Transac-
tion. As earlier in the standalone application, Hibernate can automatically bind these ojects to the current
thread of execution. This gives you the freedom to layer your code and access the SessionFactory in any way
you like. Usually you'd use a more sophisticated design and move the data access code into data access objects
(the DAO pattern). See the Hibernate Wiki for more examples.
To deploy this application you have to create a web archive, a WAR. Add the following Ant target to your
build.xml:
<classes dir="${targetdir}"/>
</war>
</target>
This target creates a file called hibernate-tutorial.war in your project directory. It packages all libraries and
Hibernate 3.2.7 18
Introduction to Hibernate
the web.xml descriptor, which is expected in the base directory of your project:
<servlet>
<servlet-name>Event Manager</servlet-name>
<servlet-class>events.EventManagerServlet</servlet-class>
</servlet>
<servlet-mapping>
<servlet-name>Event Manager</servlet-name>
<url-pattern>/eventmanager</url-pattern>
</servlet-mapping>
</web-app>
Before you compile and deploy the web application, note that an additional library is required: jsdk.jar. This
is the Java servlet development kit, if you don't have this library already, get it from the Sun website and copy it
to your library directory. However, it will be only used for compliation and excluded from the WAR package.
To build and deploy call ant war in your project directory and copy the hibernate-tutorial.war file into
your Tomcat webapp directory. If you don't have Tomcat installed, download it and follow the installation in-
structions. You don't have to change any Tomcat configuration to deploy this application though.
1.5. Summary
This tutorial covered the basics of writing a simple standalone Hibernate application and a small web applica-
tion.
If you already feel confident with Hibernate, continue browsing through the reference documentation table of
contents for topics you find interesting - most asked are transactional processing (Chapter 11, Transactions And
Concurrency), fetch performance (Chapter 19, Improving performance), or the usage of the API (Chapter 10,
Working with objects) and the query features (Section 10.4, “Querying”).
Don't forget to check the Hibernate website for more (specialized) tutorials.
Hibernate 3.2.7 19
Chapter 2. Architecture
2.1. Overview
A (very) high-level view of the Hibernate architecture:
This diagram shows Hibernate using the database and configuration data to provide persistence services (and
persistent objects) to the application.
We would like to show a more detailed view of the runtime architecture. Unfortunately, Hibernate is flexible
and supports several approaches. We will show the two extremes. The "lite" architecture has the application
provide its own JDBC connections and manage its own transactions. This approach uses a minimal subset of
Hibernate's APIs:
The "full cream" architecture abstracts the application away from the underlying JDBC/JTA APIs and lets Hi-
Hibernate 3.2.7 20
Architecture
SessionFactory (org.hibernate.SessionFactory)
A threadsafe (immutable) cache of compiled mappings for a single database. A factory for Session and a
client of ConnectionProvider. Might hold an optional (second-level) cache of data that is reusable
between transactions, at a process- or cluster-level.
Session (org.hibernate.Session)
A single-threaded, short-lived object representing a conversation between the application and the persistent
store. Wraps a JDBC connection. Factory for Transaction. Holds a mandatory (first-level) cache of per-
sistent objects, used when navigating the object graph or looking up objects by identifier.
Transaction (org.hibernate.Transaction)
(Optional) A single-threaded, short-lived object used by the application to specify atomic units of work.
Abstracts application from underlying JDBC, JTA or CORBA transaction. A Session might span several
Transactions in some cases. However, transaction demarcation, either using the underlying API or Trans-
action, is never optional!
Hibernate 3.2.7 21
Architecture
ConnectionProvider (org.hibernate.connection.ConnectionProvider)
(Optional) A factory for (and pool of) JDBC connections. Abstracts application from underlying Data-
source or DriverManager. Not exposed to application, but can be extended/implemented by the developer.
TransactionFactory (org.hibernate.TransactionFactory)
(Optional) A factory for Transaction instances. Not exposed to the application, but can be extended/
implemented by the developer.
Extension Interfaces
Hibernate offers many optional extension interfaces you can implement to customize the behavior of your
persistence layer. See the API documentation for details.
Given a "lite" architecture, the application bypasses the Transaction/TransactionFactory and/or Connec-
tionProvider APIs to talk to JTA or JDBC directly.
transient
The instance is not, and has never been associated with any persistence context. It has no persistent identity
(primary key value).
persistent
The instance is currently associated with a persistence context. It has a persistent identity (primary key
value) and, perhaps, a corresponding row in the database. For a particular persistence context, Hibernate
guarantees that persistent identity is equivalent to Java identity (in-memory location of the object).
detached
The instance was once associated with a persistence context, but that context was closed, or the instance
was serialized to another process. It has a persistent identity and, perhaps, a corrsponding row in the data-
base. For detached instances, Hibernate makes no guarantees about the relationship between persistent
identity and Java identity.
For an example how to deploy Hibernate as a JMX service on the JBoss Application Server, please see the
JBoss User Guide. On JBoss AS, you also get these benefits if you deploy using JMX:
• Session Management: The Hibernate Session's lifecycle can be automatically bound to the scope of a JTA
transaction. This means you no longer have to manually open and close the Session, this becomes the job
of a JBoss EJB interceptor. You also don't have to worry about transaction demarcation in your code any-
more (unless you'd like to write a portable persistence layer of course, use the optional Hibernate Transac-
tion API for this). You call the HibernateContext to access a Session.
Hibernate 3.2.7 22
Architecture
• HAR deployment: Usually you deploy the Hibernate JMX service using a JBoss service deployment
descriptor (in an EAR and/or SAR file), it supports all the usual configuration options of a Hibernate Ses-
sionFactory. However, you still have to name all your mapping files in the deployment descriptor. If you
decide to use the optional HAR deployment, JBoss will automatically detect all mapping files in your HAR
file.
Consult the JBoss AS user guide for more information about these options.
Another feature available as a JMX service are runtime Hibernate statistics. See Section 3.4.6, “Hibernate stat-
istics”.
Starting with version 3.0.1, Hibernate added the SessionFactory.getCurrentSession() method. Initially, this
assumed usage of JTA transactions, where the JTA transaction defined both the scope and context of a current
session. The Hibernate team maintains that, given the maturity of the numerous stand-alone JTA Transaction-
Manager implementations out there, most (if not all) applications should be using JTA transaction management
whether or not they are deployed into a J2EE container. Based on that, the JTA-based contextual sessions is all
you should ever need to use.
See the Javadocs for the org.hibernate.context.CurrentSessionContext interface for a detailed discussion
of its contract. It defines a single method, currentSession(), by which the implementation is responsible for
tracking the current contextual session. Out-of-the-box, Hibernate comes with three implementations of this in-
terface.
Hibernate 3.2.7 23
Architecture
The first two implementations provide a "one session - one database transaction" programming model, also
known and used as session-per-request. The beginning and end of a Hibernate session is defined by the dura-
tion of a database transaction. If you use programatic transaction demarcation in plain JSE without JTA, you
are adviced to use the Hibernate Transaction API to hide the underlying transaction system from your code. If
you use JTA, use the JTA interfaces to demarcate transactions. If you execute in an EJB container that supports
CMT, transaction boundaries are defined declaratively and you don't need any transaction or session demarca-
tion operations in your code. Refer to Chapter 11, Transactions And Concurrency for more information and
code examples.
Hibernate 3.2.7 24
Chapter 3. Configuration
Because Hibernate is designed to operate in many different environments, there are a large number of configur-
ation parameters. Fortunately, most have sensible default values and Hibernate is distributed with an example
hibernate.properties file in etc/ that shows the various options. Just put the example file in your classpath
and customize it.
You may obtain a Configuration instance by instantiating it directly and specifying XML mapping docu-
ments. If the mapping files are in the classpath, use addResource():
An alternative (sometimes better) way is to specify the mapped class, and let Hibernate find the mapping docu-
ment for you:
Then Hibernate will look for mapping files named /org/hibernate/auction/Item.hbm.xml and /
org/hibernate/auction/Bid.hbm.xml in the classpath. This approach eliminates any hardcoded filenames.
This is not the only way to pass configuration properties to Hibernate. The various options include:
Hibernate 3.2.7 25
Configuration
Hibernate does allow your application to instantiate more than one SessionFactory. This is useful if you are
using more than one database.
As soon as you do something that requires access to the database, a JDBC connection will be obtained from the
pool.
For this to work, we need to pass some JDBC connection properties to Hibernate. All Hibernate property names
and semantics are defined on the class org.hibernate.cfg.Environment. We will now describe the most im-
portant settings for JDBC connection configuration.
Hibernate will obtain (and pool) connections using java.sql.DriverManager if you set the following proper-
ties:
Hibernate's own connection pooling algorithm is however quite rudimentary. It is intended to help you get star-
ted and is not intended for use in a production system or even for performance testing. You should use a third
party pool for best performance and stability. Just replace the hibernate.connection.pool_size property with
connection pool specific settings. This will turn off Hibernate's internal pool. For example, you might like to
use C3P0.
C3P0 is an open source JDBC connection pool distributed along with Hibernate in the lib directory. Hibernate
will use its C3P0ConnectionProvider for connection pooling if you set hibernate.c3p0.* properties. If you'd
like to use Proxool refer to the packaged hibernate.properties and the Hibernate web site for more informa-
tion.
hibernate.connection.driver_class = org.postgresql.Driver
hibernate.connection.url = jdbc:postgresql://localhost/mydatabase
hibernate.connection.username = myuser
hibernate.connection.password = secret
Hibernate 3.2.7 26
Configuration
hibernate.c3p0.min_size=5
hibernate.c3p0.max_size=20
hibernate.c3p0.timeout=1800
hibernate.c3p0.max_statements=50
hibernate.dialect = org.hibernate.dialect.PostgreSQLDialect
For use inside an application server, you should almost always configure Hibernate to obtain connections from
an application server Datasource registered in JNDI. You'll need to set at least one of the following properties:
Here's an example hibernate.properties file for an application server provided JNDI datasource:
hibernate.connection.datasource = java:/comp/env/jdbc/test
hibernate.transaction.factory_class = \
org.hibernate.transaction.JTATransactionFactory
hibernate.transaction.manager_lookup_class = \
org.hibernate.transaction.JBossTransactionManagerLookup
hibernate.dialect = org.hibernate.dialect.PostgreSQLDialect
JDBC connections obtained from a JNDI datasource will automatically participate in the container-managed
transactions of the application server.
Arbitrary connection properties may be given by prepending "hibernate.connnection" to the property name.
For example, you may specify a charSet using hibernate.connection.charSet.
You may define your own plugin strategy for obtaining JDBC connections by implementing the interface
org.hibernate.connection.ConnectionProvider. You may select a custom implementation by setting hi-
bernate.connection.provider_class.
Warning: some of these properties are "system-level" only. System-level properties can be set only via java -
Dproperty=value or hibernate.properties. They may not be set by the other techniques described above.
Hibernate 3.2.7 27
Configuration
eg. full.classname.of.Dialect
eg. SCHEMA_NAME
eg. CATALOG_NAME
eg. jndi/composite/name
hibernate.max_fetch_depth Set a maximum "depth" for the outer join fetch tree
for single-ended associations (one-to-one, many-
to-one). A 0 disables default outer join fetching.
Hibernate 3.2.7 28
Configuration
eg. classname.of.Batcher
Hibernate 3.2.7 29
Configuration
eg. true|false
eg. classname.of.ConnectionProvider
eg. 1, 2, 4, 8
Hibernate 3.2.7 30
Configuration
eg. classname.of.CacheProvider
eg. true|false
eg. true|false
eg. true|false
eg. classname.of.QueryCache
eg. prefix
eg. true|false
eg. classname.of.TransactionFactory
eg. jndi/composite/name
Hibernate 3.2.7 31
Configuration
eg. classname.of.TransactionManagerLookup
eg.
org.hibernate.hql.ast.ASTQueryTranslatorFacto
ry or
org.hibernate.hql.classic.ClassicQueryTransla
torFactory
Hibernate 3.2.7 32
Configuration
You should always set the hibernate.dialect property to the correct org.hibernate.dialect.Dialect sub-
class for your database. If you specify a dialect, Hibernate will use sensible defaults for some of the other prop-
erties listed above, saving you the effort of specifying them manually.
RDBMS Dialect
DB2 org.hibernate.dialect.DB2Dialect
PostgreSQL org.hibernate.dialect.PostgreSQLDialect
MySQL org.hibernate.dialect.MySQLDialect
Oracle 8i org.hibernate.dialect.Oracle8iDialect
Oracle 9i org.hibernate.dialect.Oracle9iDialect
Sybase org.hibernate.dialect.SybaseDialect
SAP DB org.hibernate.dialect.SAPDBDialect
Informix org.hibernate.dialect.InformixDialect
HypersonicSQL org.hibernate.dialect.HSQLDialect
Ingres org.hibernate.dialect.IngresDialect
Progress org.hibernate.dialect.ProgressDialect
Interbase org.hibernate.dialect.InterbaseDialect
Pointbase org.hibernate.dialect.PointbaseDialect
FrontBase org.hibernate.dialect.FrontbaseDialect
Hibernate 3.2.7 33
Configuration
RDBMS Dialect
Firebird org.hibernate.dialect.FirebirdDialect
If your database supports ANSI, Oracle or Sybase style outer joins, outer join fetching will often increase per-
formance by limiting the number of round trips to and from the database (at the cost of possibly more work per-
formed by the database itself). Outer join fetching allows a whole graph of objects connected by many-to-one,
one-to-many, many-to-many and one-to-one associations to be retrieved in a single SQL SELECT.
Outer join fetching may be disabled globally by setting the property hibernate.max_fetch_depth to 0. A set-
ting of 1 or higher enables outer join fetching for one-to-one and many-to-one associations which have been
mapped with fetch="join".
Oracle limits the size of byte arrays that may be passed to/from its JDBC driver. If you wish to use large in-
stances of binary or serializable type, you should enable hibernate.jdbc.use_streams_for_binary. This
is a system-level setting only.
The properties prefixed by hibernate.cache allow you to use a process or cluster scoped second-level cache
system with Hibernate. See the Section 19.2, “The Second Level Cache” for more details.
You may define new Hibernate query tokens using hibernate.query.substitutions. For example:
would cause the tokens true and false to be translated to integer literals in the generated SQL.
hibernate.query.substitutions toLowercase=LOWER
If you enable hibernate.generate_statistics, Hibernate will expose a number of metrics that are useful
when tuning a running system via SessionFactory.getStatistics(). Hibernate can even be configured to ex-
pose these statistics via JMX. Read the Javadoc of the interfaces in org.hibernate.stats for more informa-
tion.
3.5. Logging
Hibernate 3.2.7 34
Configuration
The commons-logging service will direct output to either Apache Log4j (if you include log4j.jar in your
classpath) or JDK1.4 logging (if running under JDK1.4 or above). You may download Log4j from ht-
tp://jakarta.apache.org. To use Log4j you will need to place a log4j.properties file in your classpath, an
example properties file is distributed with Hibernate in the src/ directory.
We strongly recommend that you familiarize yourself with Hibernate's log messages. A lot of work has been
put into making the Hibernate log as detailed as possible, without making it unreadable. It is an essential
troubleshooting device. The most interesting log categories are the following:
Category Function
org.hibernate.pretty Log the state of all entities (max 20 entities) associated with the session at
flush time
org.hibernate Log everything (a lot of information, but very useful for troubleshooting)
When developing applications with Hibernate, you should almost always work with debug enabled for the cat-
egory org.hibernate.SQL, or, alternatively, the property hibernate.show_sql enabled.
You may provide rules for automatically generating database identifiers from Java identifiers or for processing
"logical" column and table names given in the mapping file into "physical" table and column names. This fea-
ture helps reduce the verbosity of the mapping document, eliminating repetitive noise (TBL_ prefixes, for ex-
ample). The default strategy used by Hibernate is quite minimal.
You may specify a different strategy by calling Configuration.setNamingStrategy() before adding map-
pings:
Hibernate 3.2.7 35
Configuration
.addFile("Bid.hbm.xml")
.buildSessionFactory();
The XML configuration file is by default expected to be in the root o your CLASSPATH. Here is an example:
<hibernate-configuration>
</session-factory>
</hibernate-configuration>
As you can see, the advantage of this approach is the externalization of the mapping file names to configura-
tion. The hibernate.cfg.xml is also more convenient once you have to tune the Hibernate cache. Note that is
your choice to use either hibernate.properties or hibernate.cfg.xml, both are equivalent, except for the
above mentioned benefits of using the XML syntax.
Hibernate 3.2.7 36
Configuration
• Container-managed datasources: Hibernate can use JDBC connections managed by the container and
provided through JNDI. Usually, a JTA compatible TransactionManager and a ResourceManager take care
of transaction management (CMT), esp. distributed transaction handling across several datasources. You
may of course also demarcate transaction boundaries programatically (BMT) or you might want to use the
optional Hibernate Transaction API for this to keep your code portable.
• Automatic JNDI binding: Hibernate can bind its SessionFactory to JNDI after startup.
• JTA Session binding: The Hibernate Session may be automatically bound to the scope of JTA transactions.
Simply lookup the SessionFactory from JNDI and get the current Session. Let Hibernate take care of
flushing and closing the Session when your JTA transaction completes. Transaction demarcation is either
declarative (CMT) or programmatic (BMT/UserTransaction).
• JMX deployment: If you have a JMX capable application server (e.g. JBoss AS), you can chose to deploy
Hibernate as a managed MBean. This saves you the one line startup code to build your SessionFactory
from a Configuration. The container will startup your HibernateService, and ideally also take care of
service dependencies (Datasource has to be available before Hibernate starts, etc).
Depending on your environment, you might have to set the configuration option hibern-
ate.connection.aggressive_release to true if your application server shows "connection containment" ex-
ceptions.
The Hibernate Session API is independent of any transaction demarcation system in your architecture. If you
let Hibernate use JDBC directly, through a connection pool, you may begin and end your transactions by call-
ing the JDBC API. If you run in a J2EE application server, you might want to use bean-managed transactions
and call the JTA API and UserTransaction when needed.
To keep your code portable between these two (and other) environments we recommend the optional Hibernate
Transaction API, which wraps and hides the underlying system. You have to specify a factory class for
Transaction instances by setting the Hibernate configuration property hibern-
ate.transaction.factory_class.
org.hibernate.transaction.JDBCTransactionFactory
delegates to database (JDBC) transactions (default)
org.hibernate.transaction.JTATransactionFactory
delegates to container-managed transaction if an existing transaction is underway in this context (e.g. EJB
session bean method), otherwise a new transaction is started and bean-managed transaction are used.
org.hibernate.transaction.CMTTransactionFactory
delegates to container-managed JTA transactions
Hibernate 3.2.7 37
Configuration
You may also define your own transaction strategies (for a CORBA transaction service, for example).
Some features in Hibernate (i.e. the second level cache, Contextual Sessions with JTA, etc.) require access to
the JTA TransactionManager in a managed environment. In an application server you have to specify how Hi-
bernate should obtain a reference to the TransactionManager, since J2EE does not standardize a single mech-
anism:
org.hibernate.transaction.JBossTransactionManagerLookup JBoss
org.hibernate.transaction.WeblogicTransactionManagerLookup Weblogic
org.hibernate.transaction.WebSphereTransactionManagerLookup WebSphere
org.hibernate.transaction.WebSphereExtendedJTATransactionLookup WebSphere 6
org.hibernate.transaction.OrionTransactionManagerLookup Orion
org.hibernate.transaction.ResinTransactionManagerLookup Resin
org.hibernate.transaction.JOTMTransactionManagerLookup JOTM
org.hibernate.transaction.JOnASTransactionManagerLookup JOnAS
org.hibernate.transaction.JRun4TransactionManagerLookup JRun4
org.hibernate.transaction.BESTransactionManagerLookup Borland ES
A JNDI bound Hibernate SessionFactory can simplify the lookup of the factory and the creation of new Ses-
sions. Note that this is not related to a JNDI bound Datasource, both simply use the same registry!
If you wish to have the SessionFactory bound to a JNDI namespace, specify a name (eg.
java:hibernate/SessionFactory) using the property hibernate.session_factory_name. If this property is
omitted, the SessionFactory will not be bound to JNDI. (This is especially useful in environments with a read-
only JNDI default implementation, e.g. Tomcat.)
When binding the SessionFactory to JNDI, Hibernate will use the values of hibernate.jndi.url, hibern-
ate.jndi.class to instantiate an initial context. If they are not specified, the default InitialContext will be
used.
Hibernate will automatically place the SessionFactory in JNDI after you call cfg.buildSessionFactory().
This means you will at least have this call in some startup code (or utility class) in your application, unless you
use JMX deployment with the HibernateService (discussed later).
If you use a JNDI SessionFactory, an EJB or any other class may obtain the SessionFactory using a JNDI
lookup.
We recommend that you bind the SessionFactory to JNDI in a managend environment and use a static
singleton otherwise. To shield your application code from these details, we also recommend to hide the actual
lookup code for a SessionFactory in a helper class, such as HibernateUtil.getSessionFactory(). Note that
such a class is also a convenient way to startup Hibernate—see chapter 1.
Hibernate 3.2.7 38
Configuration
The easiest way to handle Sessions and transactions is Hibernates automatic "current" Session management.
See the discussion of Section 2.5, “Contextual Sessions”. Using the "jta" session context, if there is no Hi-
bernate Session associated with the current JTA transaction, one will be started and associated with that JTA
transaction the first time you call sessionFactory.getCurrentSession(). The Sessions retrieved via getCur-
rentSession() in "jta" context will be set to automatically flush before the transaction completes, close after
the transaction completes, and aggressively release JDBC connections after each statement. This allows the
Sessions to be managed by the lifecycle of the JTA transaction to which it is associated, keeping user code
clean of such management concerns. Your code can either use JTA programmatically through UserTransac-
tion, or (recommended for portable code) use the Hibernate Transaction API to set transaction boundaries. If
you run in an EJB container, declarative transaction demarcation with CMT is preferred.
The line cfg.buildSessionFactory() still has to be executed somewhere to get a SessionFactory into JNDI.
You can do this either in a static initializer block (like the one in HibernateUtil) or you deploy Hibernate as
a managed service.
<?xml version="1.0"?>
<server>
<mbean code="org.hibernate.jmx.HibernateService"
name="jboss.jca:service=HibernateFactory,name=HibernateFactory">
</mbean>
Hibernate 3.2.7 39
Configuration
</server>
This file is deployed in a directory called META-INF and packaged in a JAR file with the extension .sar (service
archive). You also need to package Hibernate, its required third-party libraries, your compiled persistent
classes, as well as your mapping files in the same archive. Your enterprise beans (usually session beans) may
be kept in their own JAR file, but you may include this EJB JAR file in the main service archive to get a single
(hot-)deployable unit. Consult the JBoss AS documentation for more information about JMX service and EJB
deployment.
Hibernate 3.2.7 40
Chapter 4. Persistent Classes
Persistent classes are classes in an application that implement the entities of the business problem (e.g. Custom-
er and Order in an E-commerce application). Not all instances of a persistent class are considered to be in the
persistent state - an instance may instead be transient or detached.
Hibernate works best if these classes follow some simple rules, also known as the Plain Old Java Object
(POJO) programming model. However, none of these rules are hard requirements. Indeed, Hibernate3 assumes
very little about the nature of your persistent objects. You may express a domain model in other ways: using
trees of Map instances, for example.
package eg;
import java.util.Set;
import java.util.Date;
Hibernate 3.2.7 41
Persistent Classes
Cat has a no-argument constructor. All persistent classes must have a default constructor (which may be non-
public) so that Hibernate can instantiate them using Constructor.newInstance(). We strongly recommend
having a default constructor with at least package visibility for runtime proxy generation in Hibernate.
Cat has a property called id. This property maps to the primary key column of a database table. The property
might have been called anything, and its type might have been any primitive type, any primitive "wrapper"
type, java.lang.String or java.util.Date. (If your legacy database table has composite keys, you can even
use a user-defined class with properties of these types - see the section on composite identifiers later.)
The identifier property is strictly optional. You can leave them off and let Hibernate keep track of object identi-
fiers internally. We do not recommend this, however.
In fact, some functionality is available only to classes which declare an identifier property:
• Transitive reattachment for detached objects (cascade update or cascade merge) - see Section 10.11,
“Transitive persistence”
• Session.saveOrUpdate()
• Session.merge()
We recommend you declare consistently-named identifier properties on persistent classes. We further recom-
mend that you use a nullable (ie. non-primitive) type.
Hibernate 3.2.7 42
Persistent Classes
A central feature of Hibernate, proxies, depends upon the persistent class being either non-final, or the imple-
mentation of an interface that declares all public methods.
You can persist final classes that do not implement an interface with Hibernate, but you won't be able to use
proxies for lazy association fetching - which will limit your options for performance tuning.
You should also avoid declaring public final methods on the non-final classes. If you want to use a class
with a public final method, you must explicitly disable proxying by setting lazy="false".
Cat declares accessor methods for all its persistent fields. Many other ORM tools directly persist instance vari-
ables. We believe it is better to provide an indirection between the relational schema and internal data structures
of the class. By default, Hibernate persists JavaBeans style properties, and recognizes method names of the
form getFoo, isFoo and setFoo. You may switch to direct field access for particular properties, if needed.
Properties need not be declared public - Hibernate can persist a property with a default, protected or private
get / set pair.
package eg;
• intend to put instances of persistent classes in a Set (the recommended way to represent many-valued asso-
ciations) and
• intend to use reattachment of detached instances
Hibernate guarantees equivalence of persistent identity (database row) and Java identity only inside a particular
session scope. So as soon as we mix instances retrieved in different sessions, we must implement equals() and
hashCode() if we wish to have meaningful semantics for Sets.
The most obvious way is to implement equals()/hashCode() by comparing the identifier value of both objects.
If the value is the same, both must be the same database row, they are therefore equal (if both are added to a
Set, we will only have one element in the Set). Unfortunately, we can't use that approach with generated iden-
tifiers! Hibernate will only assign identifier values to objects that are persistent, a newly created instance will
Hibernate 3.2.7 43
Persistent Classes
not have any identifier value! Furthermore, if an instance is unsaved and currently in a Set, saving it will assign
an identifier value to the object. If equals() and hashCode() are based on the identifier value, the hash code
would change, breaking the contract of the Set. See the Hibernate website for a full discussion of this problem.
Note that this is not a Hibernate issue, but normal Java semantics of object identity and equality.
We recommend implementing equals() and hashCode() using Business key equality. Business key equality
means that the equals() method compares only the properties that form the business key, a key that would
identify our instance in the real world (a natural candidate key):
...
public boolean equals(Object other) {
if (this == other) return true;
if ( !(other instanceof Cat) ) return false;
return true;
}
Note that a business key does not have to be as solid as a database primary key candidate (see Section 11.1.3,
“Considering object identity”). Immutable or unique properties are usually good candidates for a business key.
Persistent entities don't necessarily have to be represented as POJO classes or as JavaBean objects at runtime.
Hibernate also supports dynamic models (using Maps of Maps at runtime) and the representation of entities as
DOM4J trees. With this approach, you don't write persistent classes, only mapping files.
By default, Hibernate works in normal POJO mode. You may set a default entity representation mode for a par-
ticular SessionFactory using the default_entity_mode configuration option (see Table 3.3, “Hibernate Con-
figuration Properties”.
The following examples demonstrates the representation using Maps. First, in the mapping file, an entity-name
has to be declared instead of (or in addition to) a class name:
<hibernate-mapping>
<class entity-name="Customer">
<id name="id"
type="long"
column="ID">
<generator class="sequence"/>
</id>
<property name="name"
Hibernate 3.2.7 44
Persistent Classes
column="NAME"
type="string"/>
<property name="address"
column="ADDRESS"
type="string"/>
<many-to-one name="organization"
column="ORGANIZATION_ID"
class="Organization"/>
<bag name="orders"
inverse="true"
lazy="false"
cascade="all">
<key column="CUSTOMER_ID"/>
<one-to-many class="Order"/>
</bag>
</class>
</hibernate-mapping>
Note that even though associations are declared using target class names, the target type of an associations may
also be a dynamic entity instead of a POJO.
After setting the default entity mode to dynamic-map for the SessionFactory, we can at runtime work with
Maps of Maps:
Session s = openSession();
Transaction tx = s.beginTransaction();
Session s = openSession();
// Create a customer
Map david = new HashMap();
david.put("name", "David");
// Create an organization
Map foobar = new HashMap();
foobar.put("name", "Foobar Inc.");
// Link both
david.put("organization", foobar);
// Save both
s.save("Customer", david);
s.save("Organization", foobar);
tx.commit();
s.close();
The advantages of a dynamic mapping are quick turnaround time for prototyping without the need for entity
class implementation. However, you lose compile-time type checking and will very likely deal with many ex-
ceptions at runtime. Thanks to the Hibernate mapping, the database schema can easily be normalized and
sound, allowing to add a proper domain model implementation on top later on.
// Create a customer
Map david = new HashMap();
david.put("name", "David");
dynamicSession.save("Customer", david);
...
dynamicSession.flush();
Hibernate 3.2.7 45
Persistent Classes
dynamicSession.close()
...
// Continue on pojoSession
Please note that the call to getSession() using an EntityMode is on the Session API, not the SessionFactory.
That way, the new Session shares the underlying JDBC connection, transaction, and other context information.
This means you don't have tocall flush() and close() on the secondary Session, and also leave the transac-
tion and connection handling to the primary unit of work.
More information about the XML representation capabilities can be found in Chapter 18, XML Mapping.
4.5. Tuplizers
org.hibernate.tuple.Tuplizer, and its sub-interfaces, are responsible for managing a particular representa-
tion of a piece of data, given that representation's org.hibernate.EntityMode. If a given piece of data is
thought of as a data structure, then a tuplizer is the thing which knows how to create such a data structure and
how to extract values from and inject values into such a data structure. For example, for the POJO entity mode,
the correpsonding tuplizer knows how create the POJO through its constructor and how to access the POJO
properties using the defined property accessors. There are two high-level types of Tuplizers, represented by the
org.hibernate.tuple.EntityTuplizer and org.hibernate.tuple.ComponentTuplizer interfaces. En-
tityTuplizers are responsible for managing the above mentioned contracts in regards to entities, while Com-
ponentTuplizers do the same for components.
Users may also plug in their own tuplizers. Perhaps you require that a java.util.Map implementation other
than java.util.HashMap be used while in the dynamic-map entity-mode; or perhaps you need to define a dif-
ferent proxy generation strategy than the one used by default. Both would be achieved by defining a custom
tuplizer implementation. Tuplizers definitions are attached to the entity or component mapping they are meant
to manage. Going back to the example of our customer entity:
<hibernate-mapping>
<class entity-name="Customer">
<!--
Override the dynamic-map entity-mode
tuplizer for the customer entity
-->
<tuplizer entity-mode="dynamic-map"
class="CustomMapTuplizerImpl"/>
Hibernate 3.2.7 46
Persistent Classes
4.6. EntityNameResolvers
The org.hibernate.EntityNameResolver interface is a contract for resolving the entity name of a given entity
instance. The interface defines a single method resolveEntityName which is passed the entity instance and is
expected to return the appropriate entity name (null is allowed and would indicate that the resolver does not
know how to resolve the entity name of the given entity instance). Generally speaking, an
org.hibernate.EntityNameResolver is going to be most useful in the case of dynamic models. One example
might be using proxied interfaces as your domain model. The hibernate test suite has an example of this exact
style of usage under the
<package>org.hibernate.test.dynamicentity.tuplizer2</package>
. Here is some of the code from that package for illustration.
/**
* A very trivial JDK Proxy InvocationHandler implementation where we proxy an interface as
* the domain model and simply store persistent state in an internal Map. This is an extremely
* trivial example meant only for illustration.
*/
public final class DataProxyHandler implements InvocationHandler {
private String entityName;
private HashMap data = new HashMap();
public Object invoke(Object proxy, Method method, Object[] args) throws Throwable {
String methodName = method.getName();
if ( methodName.startsWith( "set" ) ) {
String propertyName = methodName.substring( 3 );
data.put( propertyName, args[0] );
}
else if ( methodName.startsWith( "get" ) ) {
String propertyName = methodName.substring( 3 );
return data.get( propertyName );
}
else if ( "toString".equals( methodName ) ) {
return entityName + "#" + data.get( "Id" );
}
else if ( "hashCode".equals( methodName ) ) {
return new Integer( this.hashCode() );
}
return null;
}
/**
*
*/
public class ProxyHelper {
Hibernate 3.2.7 47
Persistent Classes
/**
* The EntityNameResolver implementation.
* IMPL NOTE : An EntityNameResolver really defines a strategy for how entity names should be
* resolved. Since this particular impl can handle resolution for all of our entities we want to
* take advantage of the fact that SessionFactoryImpl keeps these in a Set so that we only ever
* have one instance registered. Why? Well, when it comes time to resolve an entity name,
* Hibernate must iterate over all the registered resolvers. So keeping that number down
* helps that process be as speedy as possible. Hence the equals and hashCode impls
*/
public class MyEntityNameResolver implements EntityNameResolver {
public static final MyEntityNameResolver INSTANCE = new MyEntityNameResolver();
...
}
Hibernate 3.2.7 48
Persistent Classes
Hibernate 3.2.7 49
Chapter 5. Basic O/R Mapping
Note that, even though many Hibernate users choose to write the XML by hand, a number of tools exist to gen-
erate the mapping document, including XDoclet, Middlegen and AndroMDA.
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD 3.0//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd">
<hibernate-mapping package="eg">
<class name="Cat"
table="cats"
discriminator-value="C">
<id name="id">
<generator class="native"/>
</id>
<discriminator column="subclass"
type="character"/>
<property name="weight"/>
<property name="birthdate"
type="date"
not-null="true"
update="false"/>
<property name="color"
type="eg.types.ColorUserType"
not-null="true"
update="false"/>
<property name="sex"
not-null="true"
update="false"/>
<property name="litterId"
column="litterId"
update="false"/>
<many-to-one name="mother"
column="mother_id"
update="false"/>
<set name="kittens"
inverse="true"
order-by="litter_id">
<key column="mother_id"/>
<one-to-many class="Cat"/>
</set>
<subclass name="DomesticCat"
discriminator-value="D">
Hibernate 3.2.7 50
Basic O/R Mapping
<property name="name"
type="string"/>
</subclass>
</class>
<class name="Dog">
<!-- mapping for Dog could go here -->
</class>
</hibernate-mapping>
We will now discuss the content of the mapping document. We will only describe the document elements and
attributes that are used by Hibernate at runtime. The mapping document also contains some extra optional at-
tributes and elements that affect the database schemas exported by the schema export tool. (For example the
not-null attribute.)
5.1.1. Doctype
All XML mappings should declare the doctype shown. The actual DTD may be found at the URL above, in the
directory hibernate-x.x.x/src/org/hibernate or in hibernate3.jar. Hibernate will always look for the
DTD in its classpath first. If you experience lookups of the DTD using an Internet connection, check your DTD
declaration against the contents of your claspath.
EntityResolver
As mentioned previously, Hibernate will first attempt to resolve DTDs in its classpath. The manner in which it
does this is by registering a custom org.xml.sax.EntityResolver implementation with the SAXReader it uses
to read in the xml files. This custom EntityResolver recognizes two different systemId namespaces.
• a hibernate namespace is recognized whenever the resolver encounteres a systemId starting with ht-
tp://hibernate.sourceforge.net/; the resolver attempts to resolve these entities via the classlaoder
which loaded the Hibernate classes.
• a user namespace is recognized whenever the resolver encounteres a systemId using a classpath:// URL
protocol; the resolver will attempt to resolve these entities via (1) the current thread context classloader and
(2) the classloader which loaded the Hibernate classes.
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD 3.0//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd" [
<!ENTITY types SYSTEM "classpath://your/domain/types.xml">
]>
<hibernate-mapping package="your.domain">
<class name="MyEntity">
<id name="id" type="my-custom-id-type">
...
</id>
<class>
&types;
</hibernate-mapping>
Hibernate 3.2.7 51
Basic O/R Mapping
Where types.xml is a resource in the your.domain package and contains a custom Section 5.2.3, “Custom
value types”.
5.1.2. hibernate-mapping
This element has several optional attributes. The schema and catalog attributes specify that tables referred to in
this mapping belong to the named schema and/or catalog. If specified, tablenames will be qualified by the given
schema and catalog names. If missing, tablenames will be unqualified. The default-cascade attribute specifies
what cascade style should be assumed for properties and collections which do not specify a cascade attribute.
The auto-import attribute lets us use unqualified class names in the query language, by default.
<hibernate-mapping
schema="schemaName" (1)
catalog="catalogName" (2)
default-cascade="cascade_style" (3)
default-access="field|property|ClassName" (4)
default-lazy="true|false" (5)
auto-import="true|false" (6)
package="package.name" (7)
/>
If you have two persistent classes with the same (unqualified) name, you should set auto-import="false". Hi-
bernate will throw an exception if you attempt to assign two classes to the same "imported" name.
Note that the hibernate-mapping element allows you to nest several persistent <class> mappings, as shown
above. It is however good practice (and expected by some tools) to map only a single persistent class (or a
single class hierarchy) in one mapping file and name it after the persistent superclass, e.g. Cat.hbm.xml,
Dog.hbm.xml, or if using inheritance, Animal.hbm.xml.
5.1.3. class
<class
name="ClassName" (1)
table="tableName" (2)
discriminator-value="discriminator_value" (3)
mutable="true|false" (4)
schema="owner" (5)
catalog="catalog" (6)
proxy="ProxyInterface" (7)
dynamic-update="true|false" (8)
dynamic-insert="true|false" (9)
select-before-update="true|false" (10)
polymorphism="implicit|explicit" (11)
Hibernate 3.2.7 52
Basic O/R Mapping
(1) name (optional): The fully qualified Java class name of the persistent class (or interface). If this attribute is
missing, it is assumed that the mapping is for a non-POJO entity.
(2) table (optional - defaults to the unqualified class name): The name of its database table.
(3) discriminator-value (optional - defaults to the class name): A value that distiguishes individual sub-
classes, used for polymorphic behaviour. Acceptable values include null and not null.
(4) mutable (optional, defaults to true): Specifies that instances of the class are (not) mutable.
(5) schema (optional): Override the schema name specified by the root <hibernate-mapping> element.
(6) catalog (optional): Override the catalog name specified by the root <hibernate-mapping> element.
(7) proxy (optional): Specifies an interface to use for lazy initializing proxies. You may specify the name of
the class itself.
(8) dynamic-update (optional, defaults to false): Specifies that UPDATE SQL should be generated at runtime
and contain only those columns whose values have changed.
(9) dynamic-insert (optional, defaults to false): Specifies that INSERT SQL should be generated at runtime
and contain only the columns whose values are not null.
(10) select-before-update (optional, defaults to false): Specifies that Hibernate should never perform an
SQL UPDATE unless it is certain that an object is actually modified. In certain cases (actually, only when a
transient object has been associated with a new session using update()), this means that Hibernate will
perform an extra SQL SELECT to determine if an UPDATE is actually required.
(11) polymorphism (optional, defaults to implicit): Determines whether implicit or explicit query polymorph-
ism is used.
(12) where (optional) specify an arbitrary SQL WHERE condition to be used when retrieving objects of this class
(13) persister (optional): Specifies a custom ClassPersister.
(14) batch-size (optional, defaults to 1) specify a "batch size" for fetching instances of this class by identifier.
(15) optimistic-lock (optional, defaults to version): Determines the optimistic locking strategy.
(16) lazy (optional): Lazy fetching may be completely disabled by setting lazy="false".
(17) entity-name (optional, defaults to the class name): Hibernate3 allows a class to be mapped multiple times
(to different tables, potentially), and allows entity mappings that are represented by Maps or XML at the
Java level. In these cases, you should provide an explicit arbitrary name for the entity. See Section 4.4,
“Dynamic models” and Chapter 18, XML Mapping for more information.
(18) check (optional): A SQL expression used to generate a multi-row check constraint for automatic schema
generation.
(19) rowid (optional): Hibernate can use so called ROWIDs on databases which support. E.g. on Oracle, Hi-
bernate can use the rowid extra column for fast updates if you set this option to rowid. A ROWID is an
implementation detail and represents the physical location of a stored tuple.
(20) subselect (optional): Maps an immutable and read-only entity to a database subselect. Useful if you want
to have a view instead of a base table, but don't. See below for more information.
(21) abstract (optional): Used to mark abstract superclasses in <union-subclass> hierarchies.
It is perfectly acceptable for the named persistent class to be an interface. You would then declare implement-
ing classes of that interface using the <subclass> element. You may persist any static inner class. You should
specify the class name using the standard form ie. eg.Foo$Bar.
Hibernate 3.2.7 53
Basic O/R Mapping
Immutable classes, mutable="false", may not be updated or deleted by the application. This allows Hibernate
to make some minor performance optimizations.
The optional proxy attribute enables lazy initialization of persistent instances of the class. Hibernate will ini-
tially return CGLIB proxies which implement the named interface. The actual persistent object will be loaded
when a method of the proxy is invoked. See "Proxies for Lazy Initialization" below.
Implicit polymorphism means that instances of the class will be returned by a query that names any superclass
or implemented interface or the class and that instances of any subclass of the class will be returned by a query
that names the class itself. Explicit polymorphism means that class instances will be returned only by queries
that explicitly name that class and that queries that name the class will return only instances of subclasses
mapped inside this <class> declaration as a <subclass> or <joined-subclass>. For most purposes the default,
polymorphism="implicit", is appropriate. Explicit polymorphism is useful when two different classes are
mapped to the same table (this allows a "lightweight" class that contains a subset of the table columns).
The persister attribute lets you customize the persistence strategy used for the class. You may, for example,
specify your own subclass of org.hibernate.persister.EntityPersister or you might even provide a com-
pletely new implementation of the interface org.hibernate.persister.ClassPersister that implements per-
sistence via, for example, stored procedure calls, serialization to flat files or LDAP. See
org.hibernate.test.CustomPersister for a simple example (of "persistence" to a Hashtable).
Note that the dynamic-update and dynamic-insert settings are not inherited by subclasses and so may also be
specified on the <subclass> or <joined-subclass> elements. These settings may increase performance in
some cases, but might actually decrease performance in others. Use judiciously.
Use of select-before-update will usually decrease performance. It is very useful to prevent a database update
trigger being called unnecessarily if you reattach a graph of detached instances to a Session.
If you enable dynamic-update, you will have a choice of optimistic locking strategies:
We very strongly recommend that you use version/timestamp columns for optimistic locking with Hibernate.
This is the optimal strategy with respect to performance and is the only strategy that correctly handles modific-
ations made to detached instances (ie. when Session.merge() is used).
There is no difference between a view and a base table for a Hibernate mapping, as expected this is transparent
at the database level (note that some DBMS don't support views properly, especially with updates). Sometimes
you want to use a view, but can't create one in the database (ie. with a legacy schema). In this case, you can
map an immutable and read-only entity to a given SQL subselect expression:
<class name="Summary">
<subselect>
select item.name, max(bid.amount), count(*)
from item
join bid on bid.item_id = item.id
group by item.name
</subselect>
<synchronize table="item"/>
<synchronize table="bid"/>
<id name="name"/>
Hibernate 3.2.7 54
Basic O/R Mapping
...
</class>
Declare the tables to synchronize this entity with, ensuring that auto-flush happens correctly, and that queries
against the derived entity do not return stale data. The <subselect> is available as both as an attribute and a
nested mapping element.
5.1.4. id
Mapped classes must declare the primary key column of the database table. Most classes will also have a Java-
Beans-style property holding the unique identifier of an instance. The <id> element defines the mapping from
that property to the primary key column.
<id
name="propertyName" (1)
type="typename" (2)
column="column_name" (3)
unsaved-value="null|any|none|undefined|id_value" (4)
access="field|property|ClassName"> (5)
node="element-name|@attribute-name|element/@attribute|."
<generator class="generatorClass"/>
</id>
If the name attribute is missing, it is assumed that the class has no identifier property.
There is an alternative <composite-id> declaration to allow access to legacy data with composite keys. We
strongly discourage its use for anything else.
Generator
The optional <generator> child element names a Java class used to generate unique identifiers for instances of
the persistent class. If any parameters are required to configure or initialize the generator instance, they are
passed using the <param> element.
All generators implement the interface org.hibernate.id.IdentifierGenerator. This is a very simple inter-
face; some applications may choose to provide their own specialized implementations. However, Hibernate
provides a range of built-in implementations. There are shortcut names for the built-in generators:
Hibernate 3.2.7 55
Basic O/R Mapping
increment
generates identifiers of type long, short or int that are unique only when no other process is inserting data
into the same table. Do not use in a cluster.
identity
supports identity columns in DB2, MySQL, MS SQL Server, Sybase and HypersonicSQL. The returned
identifier is of type long, short or int.
sequence
uses a sequence in DB2, PostgreSQL, Oracle, SAP DB, McKoi or a generator in Interbase. The returned
identifier is of type long, short or int
hilo
uses a hi/lo algorithm to efficiently generate identifiers of type long, short or int, given a table and
column (by default hibernate_unique_key and next_hi respectively) as a source of hi values. The hi/lo al-
gorithm generates identifiers that are unique only for a particular database.
seqhilo
uses a hi/lo algorithm to efficiently generate identifiers of type long, short or int, given a named database
sequence.
uuid
uses a 128-bit UUID algorithm to generate identifiers of type string, unique within a network (the IP ad-
dress is used). The UUID is encoded as a string of hexadecimal digits of length 32.
guid
uses a database-generated GUID string on MS SQL Server and MySQL.
native
picks identity, sequence or hilo depending upon the capabilities of the underlying database.
assigned
lets the application to assign an identifier to the object before save() is called. This is the default strategy if
no <generator> element is specified.
select
retrieves a primary key assigned by a database trigger by selecting the row by some unique key and retriev-
ing the primary key value.
foreign
uses the identifier of another associated object. Usually used in conjunction with a <one-to-one> primary
key association.
sequence-identity
a specialized sequence generation strategy which utilizes a database sequence for the actual value genera-
tion, but combines this with JDBC3 getGeneratedKeys to actually return the generated identifier value as
part of the insert statement execution. This strategy is only known to be supported on Oracle 10g drivers
targetted for JDK 1.4. Note comments on these insert statements are disabled due to a bug in the Oracle
drivers.
Hi/lo algorithm
The hilo and seqhilo generators provide two alternate implementations of the hi/lo algorithm, a favorite ap-
Hibernate 3.2.7 56
Basic O/R Mapping
proach to identifier generation. The first implementation requires a "special" database table to hold the next
available "hi" value. The second uses an Oracle-style sequence (where supported).
Unfortunately, you can't use hilo when supplying your own Connection to Hibernate. When Hibernate is using
an application server datasource to obtain connections enlisted with JTA, you must properly configure the hi-
bernate.transaction.manager_lookup_class.
UUID algorithm
The UUID contains: IP address, startup time of the JVM (accurate to a quarter second), system time and a
counter value (unique within the JVM). It's not possible to obtain a MAC address or memory address from Java
code, so this is the best we can do without using JNI.
For databases which support identity columns (DB2, MySQL, Sybase, MS SQL), you may use identity key
generation. For databases that support sequences (DB2, Oracle, PostgreSQL, Interbase, McKoi, SAP DB) you
may use sequence style key generation. Both these strategies require two SQL queries to insert a new object.
For cross-platform development, the native strategy will choose from the identity, sequence and hilo
strategies, dependant upon the capabilities of the underlying database.
Assigned identifiers
If you want the application to assign identifiers (as opposed to having Hibernate generate them), you may use
the assigned generator. This special generator will use the identifier value already assigned to the object's iden-
tifier property. This generator is used when the primary key is a natural key instead of a surrogate key. This is
the default behavior if you do no specify a <generator> element.
Choosing the assigned generator makes Hibernate use unsaved-value="undefined", forcing Hibernate to go
to the database to determine if an instance is transient or detached, unless there is a version or timestamp prop-
erty, or you define Interceptor.isUnsaved().
Hibernate 3.2.7 57
Basic O/R Mapping
For legacy schemas only (Hibernate does not generate DDL with triggers).
In the above example, there is a unique valued property named socialSecurityNumber defined by the class, as
a natural key, and a surrogate key named person_id whose value is generated by a trigger.
Starting with release 3.2.3, there are 2 new generators which represent a re-thinking of 2 different aspects of
identifier generation. The first aspect is database portability; the second is optimization (not having to query the
database for every request for a new identifier value). These two new generators are intended to take the place
of some of the named generators described above (starting in 3.3.x); however, they are included in the current
releases and can be referenced by FQN.
• sequence_name (optional, defaults to hibernate_sequence): The name of the sequence (or table) to be
used.
• initial_value (optional, defaults to 1): The initial value to be retrieved from the sequence/table. In se-
quence creation terms, this is analogous to the clause typical named "STARTS WITH".
• increment_size (optional, defaults to 1): The value by which subsequent calls to the sequence/table should
differ. In sequence creation terms, this is analogous to the clause typical named "INCREMENT BY".
• force_table_use (optional, defaults to false): Should we force the use of a table as the backing structure
even though the dialect might support sequence?
• value_column (optional, defaults to next_val): Only relevant for table structures! The name of the column
on the table which is used to hold the value.
• optimizer (optional, defaults to none): See Section 5.1.6, “Identifier generator optimization”
Hibernate 3.2.7 58
Basic O/R Mapping
• segment_column_name (optional, defaults to sequence_name): The name of the column on the table which is
used to hold the "segement key". This is the value which distinctly identifies which increment value to use.
• segment_value (optional, defaults to default): The "segment key" value for the segment from which we
want to pull increment values for this generator.
• segment_value_length (optional, defaults to 255): Used for schema generation; the column size to create
this segment key column.
• initial_value (optional, defaults to 1): The initial value to be retrieved from the table.
• increment_size (optional, defaults to 1): The value by which subsequent calls to the table should differ.
• optimizer (optional, defaults to ): See Section 5.1.6, “Identifier generator optimization”
For identifier generators which store values in the database, it is inefficient for them to hit the database on each
and every call to generate a new identifier value. Instead, you'd ideally want to group a bunch of them in
memory and only hit the database when you have exhausted your in-memory value group. This is the role of
the pluggable optimizers. Currently only the two enhanced generators (Section 5.1.5, “Enhanced identifier gen-
erators” support this notion.
• none (generally this is the default if no optimizer was specified): This says to not perform any optimiza-
tions, and hit the database each and every request.
• hilo: applies a hi/lo algorithm around the database retrieved values. The values from the database for this
optimizer are expected to be sequential. The values retrieved from the database structure for this optimizer
indicates the "group number"; the increment_size is multiplied by that value in memory to define a group
"hi value".
• pooled: like was discussed for hilo, this optimizers attempts to minimize the number of hits to the data-
base. Here, however, we simply store the starting value for the "next group" into the database structure
rather than a sequential value in combination with an in-memory grouping algorithm. increment_size here
refers to the values coming from the database.
5.1.7. composite-id
<composite-id
name="propertyName"
class="ClassName"
mapped="true|false"
access="field|property|ClassName">
node="element-name|."
For a table with a composite key, you may map multiple properties of the class as identifier properties. The
<composite-id> element accepts <key-property> property mappings and <key-many-to-one> mappings as
child elements.
<composite-id>
<key-property name="medicareNumber"/>
<key-property name="dependent"/>
</composite-id>
Your persistent class must override equals() and hashCode() to implement composite identifier equality. It
must also implements Serializable.
Unfortunately, this approach to composite identifiers means that a persistent object is its own identifier. There
Hibernate 3.2.7 59
Basic O/R Mapping
is no convenient "handle" other than the object itself. You must instantiate an instance of the persistent class it-
self and populate its identifier properties before you can load() the persistent state associated with a composite
key. We call this approach an embedded composite identifier, and discourage it for serious applications.
A second approach is what we call a mapped composite identifier, where the identifier properties named inside
the <composite-id> element are duplicated on both the persistent class and a separate identifier class.
In this example, both the composite identifier class, MedicareId, and the entity class itself have properties
named medicareNumber and dependent. The identifier class must override equals() and hashCode() and im-
plement. Serializable. The disadvantage of this approach is quite obvious—code duplication.
• mapped (optional, defaults to false): indicates that a mapped composite identifier is used, and that the con-
tained property mappings refer to both the entity class and the composite identifier class.
• class (optional, but required for a mapped composite identifier): The class used as a composite identifier.
We will describe a third, even more convenient approach where the composite identifier is implemented as a
component class in Section 8.4, “Components as composite identifiers”. The attributes described below apply
only to this alternative approach:
• name (optional, required for this approach): A property of component type that holds the composite identifi-
er (see chapter 9).
• access (optional - defaults to property): The strategy Hibernate should use for accessing the property
value.
• class (optional - defaults to the property type determined by reflection): The component class used as a
composite identifier (see next section).
This third approach, an identifier component is the one we recommend for almost all applications.
5.1.8. discriminator
The <discriminator> element is required for polymorphic persistence using the table-per-class-hierarchy map-
ping strategy and declares a discriminator column of the table. The discriminator column contains marker val-
ues that tell the persistence layer what subclass to instantiate for a particular row. A restricted set of types may
be used: string, character, integer, byte, short, boolean, yes_no, true_false.
<discriminator
column="discriminator_column" (1)
type="discriminator_type" (2)
force="true|false" (3)
insert="true|false" (4)
formula="arbitrary sql expression" (5)
/>
(1) column (optional - defaults to class) the name of the discriminator column.
(2) type (optional - defaults to string) a name that indicates the Hibernate type
(3) force (optional - defaults to false) "force" Hibernate to specify allowed discriminator values even when
retrieving all instances of the root class.
(4) insert (optional - defaults to true) set this to false if your discriminator column is also part of a mapped
Hibernate 3.2.7 60
Basic O/R Mapping
composite identifier. (Tells Hibernate to not include the column in SQL INSERTs.)
(5) formula (optional) an arbitrary SQL expression that is executed when a type has to be evaluated. Allows
content-based discrimination.
Actual values of the discriminator column are specified by the discriminator-value attribute of the <class>
and <subclass> elements.
The force attribute is (only) useful if the table contains rows with "extra" discriminator values that are not
mapped to a persistent class. This will not usually be the case.
Using the formula attribute you can declare an arbitrary SQL expression that will be used to evaluate the type
of a row:
<discriminator
formula="case when CLASS_TYPE in ('a', 'b', 'c') then 0 else 1 end"
type="integer"/>
The <version> element is optional and indicates that the table contains versioned data. This is particularly use-
ful if you plan to use long transactions (see below).
<version
column="version_column" (1)
name="propertyName" (2)
type="typename" (3)
access="field|property|ClassName" (4)
unsaved-value="null|negative|undefined" (5)
generated="never|always" (6)
insert="true|false" (7)
node="element-name|@attribute-name|element/@attribute|."
/>
(1) column (optional - defaults to the property name): The name of the column holding the version number.
(2) name: The name of a property of the persistent class.
(3) type (optional - defaults to integer): The type of the version number.
(4) access (optional - defaults to property): The strategy Hibernate should use for accessing the property
value.
(5) unsaved-value (optional - defaults to undefined): A version property value that indicates that an instance
is newly instantiated (unsaved), distinguishing it from detached instances that were saved or loaded in a
previous session. (undefined specifies that the identifier property value should be used.)
(6) generated (optional - defaults to never): Specifies that this version property value is actually generated
by the database. See the discussion of Section 5.6, “Generated Properties”.
(7) insert (optional - defaults to true): Specifies whether the version column should be included in SQL in-
sert statements. May be set to false if and only if the database column is defined with a default value of
0.
Version numbers may be of Hibernate type long, integer, short, timestamp or calendar.
A version or timestamp property should never be null for a detached instance, so Hibernate will detact any in-
stance with a null version or timestamp as transient, no matter what other unsaved-value strategies are spe-
cified. Declaring a nullable version or timestamp property is an easy way to avoid any problems with transitive
reattachment in Hibernate, especially useful for people using assigned identifiers or composite keys!
Hibernate 3.2.7 61
Basic O/R Mapping
The optional <timestamp> element indicates that the table contains timestamped data. This is intended as an al-
ternative to versioning. Timestamps are by nature a less safe implementation of optimistic locking. However,
sometimes the application might use the timestamps in other ways.
<timestamp
column="timestamp_column" (1)
name="propertyName" (2)
access="field|property|ClassName" (3)
unsaved-value="null|undefined" (4)
source="vm|db" (5)
generated="never|always" (6)
node="element-name|@attribute-name|element/@attribute|."
/>
(1) column (optional - defaults to the property name): The name of a column holding the timestamp.
(2) name: The name of a JavaBeans style property of Java type Date or Timestamp of the persistent class.
(3) access (optional - defaults to property): The strategy Hibernate should use for accessing the property
value.
(4) unsaved-value (optional - defaults to null): A version property value that indicates that an instance is
newly instantiated (unsaved), distinguishing it from detached instances that were saved or loaded in a pre-
vious session. (undefined specifies that the identifier property value should be used.)
(5) source (optional - defaults to vm): From where should Hibernate retrieve the timestamp value? From the
database, or from the current JVM? Database-based timestamps incur an overhead because Hibernate
must hit the database in order to determine the "next value", but will be safer for use in clustered environ-
ments. Note also, that not all Dialects are known to support retrieving of the database's current
timestamp, while others might be unsafe for usage in locking due to lack of precision (Oracle 8 for ex-
ample).
(6) generated (optional - defaults to never): Specifies that this timestamp property value is actually gener-
ated by the database. See the discussion of Section 5.6, “Generated Properties”.
5.1.11. property
The <property> element declares a persistent, JavaBean style property of the class.
<property
name="propertyName" (1)
column="column_name" (2)
type="typename" (3)
update="true|false" (4)
insert="true|false" (4)
formula="arbitrary SQL expression" (5)
access="field|property|ClassName" (6)
lazy="true|false" (7)
unique="true|false" (8)
not-null="true|false" (9)
optimistic-lock="true|false" (10)
generated="never|insert|always" (11)
node="element-name|@attribute-name|element/@attribute|."
index="index_name"
unique_key="unique_key_id"
length="L"
precision="P"
scale="S"
/>
(1) name: the name of the property, with an initial lowercase letter.
Hibernate 3.2.7 62
Basic O/R Mapping
(2) column (optional - defaults to the property name): the name of the mapped database table column. This
may also be specified by nested <column> element(s).
(3) type (optional): a name that indicates the Hibernate type.
(4) update, insert (optional - defaults to true) : specifies that the mapped columns should be included in
SQL UPDATE and/or INSERT statements. Setting both to false allows a pure "derived" property whose
value is initialized from some other property that maps to the same colum(s) or by a trigger or other ap-
plication.
(5) formula (optional): an SQL expression that defines the value for a computed property. Computed proper-
ties do not have a column mapping of their own.
(6) access (optional - defaults to property): The strategy Hibernate should use for accessing the property
value.
(7) lazy (optional - defaults to false): Specifies that this property should be fetched lazily when the instance
variable is first accessed (requires build-time bytecode instrumentation).
(8) unique (optional): Enable the DDL generation of a unique constraint for the columns. Also, allow this to
be the target of a property-ref.
(9) not-null (optional): Enable the DDL generation of a nullability constraint for the columns.
(10) optimistic-lock (optional - defaults to true): Specifies that updates to this property do or do not require
acquisition of the optimistic lock. In other words, determines if a version increment should occur when
this property is dirty.
(11) generated (optional - defaults to never): Specifies that this property value is actually generated by the
database. See the discussion of Section 5.6, “Generated Properties”.
1. The name of a Hibernate basic type (eg. integer, string, character, date, timestamp, float,
binary, serializable, object, blob).
2. The name of a Java class with a default basic type (eg. int, float, char, java.lang.String,
java.util.Date, java.lang.Integer, java.sql.Clob).
3. The name of a serializable Java class.
4. The class name of a custom type (eg. com.illflow.type.MyCustomType).
If you do not specify a type, Hibernate will use reflection upon the named property to take a guess at the correct
Hibernate type. Hibernate will try to interpret the name of the return class of the property getter using rules 2, 3,
4 in that order. However, this is not always enough. In certain cases you will still need the type attribute. (For
example, to distinguish between Hibernate.DATE and Hibernate.TIMESTAMP, or to specify a custom type.)
The access attribute lets you control how Hibernate will access the property at runtime. By default, Hibernate
will call the property get/set pair. If you specify access="field", Hibernate will bypass the get/set pair and ac-
cess the field directly, using reflection. You may specify your own strategy for property access by naming a
class that implements the interface org.hibernate.property.PropertyAccessor.
An especially powerful feature are derived properties. These properties are by definition read-only, the property
value is computed at load time. You declare the computation as a SQL expression, this translates to a SELECT
clause subquery in the SQL query that loads an instance:
<property name="totalPrice"
formula="( SELECT SUM (li.quantity*p.price) FROM LineItem li, Product p
WHERE li.productId = p.productId
AND li.customerId = customerId
AND li.orderNumber = orderNumber )"/>
Note that you can reference the entities own table by not declaring an alias on a particular column (customerId
in the given example). Also note that you can use the nested <formula> mapping element if you don't like to
use the attribute.
Hibernate 3.2.7 63
Basic O/R Mapping
5.1.12. many-to-one
An ordinary association to another persistent class is declared using a many-to-one element. The relational
model is a many-to-one association: a foreign key in one table is referencing the primary key column(s) of the
target table.
<many-to-one
name="propertyName" (1)
column="column_name" (2)
class="ClassName" (3)
cascade="cascade_style" (4)
fetch="join|select" (5)
update="true|false" (6)
insert="true|false" (6)
property-ref="propertyNameFromAssociatedClass" (7)
access="field|property|ClassName" (8)
unique="true|false" (9)
not-null="true|false" (10)
optimistic-lock="true|false" (11)
lazy="proxy|no-proxy|false" (12)
not-found="ignore|exception" (13)
entity-name="EntityName" (14)
formula="arbitrary SQL expression" (15)
node="element-name|@attribute-name|element/@attribute|."
embed-xml="true|false"
index="index_name"
unique_key="unique_key_id"
foreign-key="foreign_key_name"
/>
Hibernate 3.2.7 64
Basic O/R Mapping
Setting a value of the cascade attribute to any meaningful value other than none will propagate certain opera-
tions to the associated object. The meaningful values are the names of Hibernate's basic operations, persist,
merge, delete, save-update, evict, replicate, lock, refresh, as well as the special values delete-
orphan and all and comma-separated combinations of operation names, for example, cas-
cade="persist,merge,evict" or cascade="all,delete-orphan". See Section 10.11, “Transitive persistence”
for a full explanation. Note that single valued associations (many-to-one and one-to-one associations) do not
support orphan delete.
The property-ref attribute should only be used for mapping legacy data where a foreign key refers to a unique
key of the associated table other than the primary key. This is an ugly relational model. For example, suppose
the Product class had a unique serial number, that is not the primary key. (The unique attribute controls Hi-
bernate's DDL generation with the SchemaExport tool.)
If the referenced unique key comprises multiple properties of the associated entity, you should map the refer-
enced properties inside a named <properties> element.
If the referenced unique key is the property of a component, you may specify a property path:
5.1.13. one-to-one
<one-to-one
name="propertyName" (1)
class="ClassName" (2)
cascade="cascade_style" (3)
constrained="true|false" (4)
fetch="join|select" (5)
property-ref="propertyNameFromAssociatedClass" (6)
access="field|property|ClassName" (7)
formula="any SQL expression" (8)
lazy="proxy|no-proxy|false" (9)
entity-name="EntityName" (10)
node="element-name|@attribute-name|element/@attribute|."
embed-xml="true|false"
foreign-key="foreign_key_name"
/>
Hibernate 3.2.7 65
Basic O/R Mapping
(3) cascade (optional) specifies which operations should be cascaded from the parent object to the associated
object.
(4) constrained (optional) specifies that a foreign key constraint on the primary key of the mapped table ref-
erences the table of the associated class. This option affects the order in which save() and delete() are
cascaded, and determines whether the association may be proxied (it is also used by the schema export
tool).
(5) fetch (optional - defaults to select): Chooses between outer-join fetching or sequential select fetching.
(6) property-ref: (optional) The name of a property of the associated class that is joined to the primary key
of this class. If not specified, the primary key of the associated class is used.
(7) access (optional - defaults to property): The strategy Hibernate should use for accessing the property
value.
(8) formula (optional): Almost all one to one associations map to the primary key of the owning entity. In the
rare case that this is not the case, you may specify a some other column, columns or expression to join on
using an SQL formula. (See org.hibernate.test.onetooneformula for an example.)
(9) lazy (optional - defaults to proxy): By default, single point associations are proxied. lazy="no-proxy"
specifies that the property should be fetched lazily when the instance variable is first accessed (requires
build-time bytecode instrumentation). lazy="false" specifies that the association will always be eagerly
fetched. Note that if constrained="false", proxying is impossible and Hibernate will eager fetch the as-
sociation!
(10) entity-name (optional): The entity name of the associated class.
Primary key associations don't need an extra table column; if two rows are related by the association then the
two table rows share the same primary key value. So if you want two objects to be related by a primary key as-
sociation, you must make sure that they are assigned the same identifier value!
For a primary key association, add the following mappings to Employee and Person, respectively.
Now we must ensure that the primary keys of related rows in the PERSON and EMPLOYEE tables are equal.
We use a special Hibernate identifier generation strategy called foreign:
A newly saved instance of Person is then assigned the same primary key value as the Employee instance refered
with the employee property of that Person.
Alternatively, a foreign key with a unique constraint, from Employee to Person, may be expressed as:
Hibernate 3.2.7 66
Basic O/R Mapping
And this association may be made bidirectional by adding the following to the Person mapping:
5.1.14. natural-id
<natural-id mutable="true|false"/>
<property ... />
<many-to-one ... />
......
</natural-id>
Even though we recommend the use of surrogate keys as primary keys, you should still try to identify natural
keys for all entities. A natural key is a property or combination of properties that is unique and non-null. If it is
also immutable, even better. Map the properties of the natural key inside the <natural-id> element. Hibernate
will generate the necessary unique key and nullability constraints, and your mapping will be more self-
documenting.
We strongly recommend that you implement equals() and hashCode() to compare the natural key properties
of the entity.
This mapping is not intended for use with entities with natural primary keys.
• mutable (optional, defaults to false): By default, natural identifier properties as assumed to be immutable
(constant).
The <component> element maps properties of a child object to columns of the table of a parent class. Compon-
ents may, in turn, declare their own properties, components or collections. See "Components" below.
<component
name="propertyName" (1)
class="className" (2)
insert="true|false" (3)
update="true|false" (4)
access="field|property|ClassName" (5)
lazy="true|false" (6)
optimistic-lock="true|false" (7)
unique="true|false" (8)
node="element-name|."
>
<property ...../>
<many-to-one .... />
........
</component>
Hibernate 3.2.7 67
Basic O/R Mapping
(6) lazy (optional - defaults to false): Specifies that this component should be fetched lazily when the in-
stance variable is first accessed (requires build-time bytecode instrumentation).
(7) optimistic-lock (optional - defaults to true): Specifies that updates to this component do or do not re-
quire acquisition of the optimistic lock. In other words, determines if a version increment should occur
when this property is dirty.
(8) unique (optional - defaults to false): Specifies that a unique constraint exists upon all mapped columns
of the component.
The child <property> tags map properties of the child class to table columns.
The <component> element allows a <parent> subelement that maps a property of the component class as a ref-
erence back to the containing entity.
The <dynamic-component> element allows a Map to be mapped as a component, where the property names refer
to keys of the map, see Section 8.5, “Dynamic components”.
5.1.16. properties
The <properties> element allows the definition of a named, logical grouping of properties of a class. The most
important use of the construct is that it allows a combination of properties to be the target of a property-ref. It
is also a convenient way to define a multi-column unique constraint.
<properties
name="logicalName" (1)
insert="true|false" (2)
update="true|false" (3)
optimistic-lock="true|false" (4)
unique="true|false" (5)
>
<property ...../>
<many-to-one .... />
........
</properties>
(1) name: The logical name of the grouping - not an actual property name.
(2) insert: Do the mapped columns appear in SQL INSERTs?
(3) update: Do the mapped columns appear in SQL UPDATEs?
(4) optimistic-lock (optional - defaults to true): Specifies that updates to these properties do or do not re-
quire acquisition of the optimistic lock. In other words, determines if a version increment should occur
when these properties are dirty.
(5) unique (optional - defaults to false): Specifies that a unique constraint exists upon all mapped columns
of the component.
<class name="Person">
<id name="personNumber"/>
...
<properties name="name"
unique="true" update="false">
<property name="firstName"/>
<property name="initial"/>
<property name="lastName"/>
</properties>
</class>
Then we might have some legacy data association which refers to this unique key of the Person table, instead
of to the primary key:
Hibernate 3.2.7 68
Basic O/R Mapping
<many-to-one name="person"
class="Person" property-ref="name">
<column name="firstName"/>
<column name="initial"/>
<column name="lastName"/>
</many-to-one>
We don't recommend the use of this kind of thing outside the context of mapping legacy data.
5.1.17. subclass
Finally, polymorphic persistence requires the declaration of each subclass of the root persistent class. For the
table-per-class-hierarchy mapping strategy, the <subclass> declaration is used.
<subclass
name="ClassName" (1)
discriminator-value="discriminator_value" (2)
proxy="ProxyInterface" (3)
lazy="true|false" (4)
dynamic-update="true|false"
dynamic-insert="true|false"
entity-name="EntityName"
node="element-name"
extends="SuperclassName">
Each subclass should declare its own persistent properties and subclasses. <version> and <id> properties are
assumed to be inherited from the root class. Each subclass in a heirarchy must define a unique discriminator-
value. If none is specified, the fully qualified Java class name is used.
5.1.18. joined-subclass
Alternatively, each subclass may be mapped to its own table (table-per-subclass mapping strategy). Inherited
state is retrieved by joining with the table of the superclass. We use the <joined-subclass> element.
<joined-subclass
name="ClassName" (1)
table="tablename" (2)
proxy="ProxyInterface" (3)
lazy="true|false" (4)
dynamic-update="true|false"
dynamic-insert="true|false"
schema="schema"
catalog="catalog"
extends="SuperclassName"
persister="ClassName"
subselect="SQL expression"
entity-name="EntityName"
node="element-name">
Hibernate 3.2.7 69
Basic O/R Mapping
No discriminator column is required for this mapping strategy. Each subclass must, however, declare a table
column holding the object identifier using the <key> element. The mapping at the start of the chapter would be
re-written as:
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd">
<hibernate-mapping package="eg">
<class name="eg.Dog">
<!-- mapping for Dog could go here -->
</class>
</hibernate-mapping>
5.1.19. union-subclass
A third option is to map only the concrete classes of an inheritance hierarchy to tables, (the table-
per-concrete-class strategy) where each table defines all persistent state of the class, including inherited state. In
Hibernate, it is not absolutely necessary to explicitly map such inheritance hierarchies. You can simply map
each class with a separate <class> declaration. However, if you wish use polymorphic associations (e.g. an as-
sociation to the superclass of your hierarchy), you need to use the <union-subclass> mapping.
<union-subclass
name="ClassName" (1)
table="tablename" (2)
proxy="ProxyInterface" (3)
lazy="true|false" (4)
Hibernate 3.2.7 70
Basic O/R Mapping
dynamic-update="true|false"
dynamic-insert="true|false"
schema="schema"
catalog="catalog"
extends="SuperclassName"
abstract="true|false"
persister="ClassName"
subselect="SQL expression"
entity-name="EntityName"
node="element-name">
5.1.20. join
Using the <join> element, it is possible to map properties of one class to several tables.
<join
table="tablename" (1)
schema="owner" (2)
catalog="catalog" (3)
fetch="join|select" (4)
inverse="true|false" (5)
optional="true|false"> (6)
For example, the address information for a person can be mapped to a separate table (while preserving value
type semantics for all properties):
<class name="Person"
table="PERSON">
Hibernate 3.2.7 71
Basic O/R Mapping
<join table="ADDRESS">
<key column="ADDRESS_ID"/>
<property name="address"/>
<property name="zip"/>
<property name="country"/>
</join>
...
This feature is often only useful for legacy data models, we recommend fewer tables than classes and a fine-
grained domain model. However, it is useful for switching between inheritance mapping strategies in a single
hierarchy, as explained later.
5.1.21. key
We've seen the <key> element crop up a few times now. It appears anywhere the parent mapping element
defines a join to a new table, and defines the foreign key in the joined table, that references the primary key of
the original table.
<key
column="columnname" (1)
on-delete="noaction|cascade" (2)
property-ref="propertyName" (3)
not-null="true|false" (4)
update="true|false" (5)
unique="true|false" (6)
/>
(1) column (optional): The name of the foreign key column. This may also be specified by nested <column>
element(s).
(2) on-delete (optional, defaults to noaction): Specifies whether the foreign key constraint has database-
level cascade delete enabled.
(3) property-ref (optional): Specifies that the foreign key refers to columns that are not the primary key of
the orginal table. (Provided for legacy data.)
(4) not-null (optional): Specifies that the foreign key columns are not nullable (this is implied whenever the
foreign key is also part of the primary key).
(5) update (optional): Specifies that the foreign key should never be updated (this is implied whenever the
foreign key is also part of the primary key).
(6) unique (optional): Specifies that the foreign key should have a unique constraint (this is implied whenever
the foreign key is also the primary key).
We recommend that for systems where delete performance is important, all keys should be defined on-de-
lete="cascade", and Hibernate will use a database-level ON CASCADE DELETE constraint, instead of many indi-
vidual DELETE statements. Be aware that this feature bypasses Hibernate's usual optimistic locking strategy for
versioned data.
The not-null and update attributes are useful when mapping a unidirectional one to many association. If you
map a unidirectional one to many to a non-nullable foreign key, you must declare the key column using <key
not-null="true">.
Any mapping element which accepts a column attribute will alternatively accept a <column> subelement. Like-
wise, <formula> is an alternative to the formula attribute.
Hibernate 3.2.7 72
Basic O/R Mapping
<column
name="column_name"
length="N"
precision="N"
scale="N"
not-null="true|false"
unique="true|false"
unique-key="multicolumn_unique_key_name"
index="index_name"
sql-type="sql_type_name"
check="SQL expression"
default="SQL expression"/>
<formula>SQL expression</formula>
column and formula attributes may even be combined within the same property or association mapping to ex-
press, for example, exotic join conditions.
5.1.23. import
Suppose your application has two persistent classes with the same name, and you don't want to specify the fully
qualified (package) name in Hibernate queries. Classes may be "imported" explicitly, rather than relying upon
auto-import="true". You may even import classes and interfaces that are not explicitly mapped.
<import
class="ClassName" (1)
rename="ShortName" (2)
/>
(1) class: The fully qualified class name of of any Java class.
(2) rename (optional - defaults to the unqualified class name): A name that may be used in the query lan-
guage.
5.1.24. any
There is one further type of property mapping. The <any> mapping element defines a polymorphic association
to classes from multiple tables. This type of mapping always requires more than one column. The first column
holds the type of the associated entity. The remaining columns hold the identifier. It is impossible to specify a
foreign key constraint for this kind of association, so this is most certainly not meant as the usual way of map-
ping (polymorphic) associations. You should use this only in very special cases (eg. audit logs, user session
data, etc).
The meta-type attribute lets the application specify a custom type that maps database column values to persist-
ent classes which have identifier properties of the type specified by id-type. You must specify the mapping
from values of the meta-type to class names.
Hibernate 3.2.7 73
Basic O/R Mapping
<any
name="propertyName" (1)
id-type="idtypename" (2)
meta-type="metatypename" (3)
cascade="cascade_style" (4)
access="field|property|ClassName" (5)
optimistic-lock="true|false" (6)
>
<meta-value ... />
<meta-value ... />
.....
<column .... />
<column .... />
.....
</any>
To understand the behaviour of various Java language-level objects with respect to the persistence service, we
need to classify them into two groups:
An entity exists independently of any other objects holding references to the entity. Contrast this with the usual
Java model where an unreferenced object is garbage collected. Entities must be explicitly saved and deleted
(except that saves and deletions may be cascaded from a parent entity to its children). This is different from the
ODMG model of object persistence by reachablity - and corresponds more closely to how application objects
are usually used in large systems. Entities support circular and shared references. They may also be versioned.
An entity's persistent state consists of references to other entities and instances of value types. Values are prim-
itives, collections (not what's inside a collection), components and certain immutable objects. Unlike entities,
values (in particular collections and components) are persisted and deleted by reachability. Since value objects
(and primitives) are persisted and deleted along with their containing entity they may not be independently ver-
sioned. Values have no independent identity, so they cannot be shared by two entities or collections.
Up until now, we've been using the term "persistent class" to refer to entities. We will continue to do that.
Strictly speaking, however, not all user-defined classes with persistent state are entities. A component is a user
defined class with value semantics. A Java property of type java.lang.String also has value semantics. Given
this definition, we can say that all types (classes) provided by the JDK have value type semantics in Java, while
user-defined types may be mapped with entity or value type semantics. This decision is up to the application
Hibernate 3.2.7 74
Basic O/R Mapping
developer. A good hint for an entity class in a domain model are shared references to a single instance of that
class, while composition or aggregation usually translates to a value type.
The challenge is to map the Java type system (and the developers' definition of entities and value types) to the
SQL/database type system. The bridge between both systems is provided by Hibernate: for entities we use
<class>, <subclass> and so on. For value types we use <property>, <component>, etc, usually with a type at-
tribute. The value of this attribute is the name of a Hibernate mapping type. Hibernate provides many mappings
(for standard JDK value types) out of the box. You can write your own mapping types and implement your cus-
tom conversion strategies as well, as you'll see later.
integer, long, short, float, double, character, byte, boolean, yes_no, true_false
Type mappings from Java primitives or wrapper classes to appropriate (vendor-specific) SQL column
types. boolean, yes_no and true_false are all alternative encodings for a Java boolean or
java.lang.Boolean.
string
A type mapping from java.lang.String to VARCHAR (or Oracle VARCHAR2).
calendar, calendar_date
Type mappings from java.util.Calendar to SQL types TIMESTAMP and DATE (or equivalent).
big_decimal, big_integer
Type mappings from java.math.BigDecimal and java.math.BigInteger to NUMERIC (or Oracle NUMBER).
class
A type mapping from java.lang.Class to VARCHAR (or Oracle VARCHAR2). A Class is mapped to its fully
qualified name.
binary
Maps byte arrays to an appropriate SQL binary type.
text
Maps long Java strings to a SQL CLOB or TEXT type.
serializable
Maps serializable Java types to an appropriate SQL binary type. You may also indicate the Hibernate type
serializable with the name of a serializable Java class or interface that does not default to a basic type.
Hibernate 3.2.7 75
Basic O/R Mapping
clob, blob
Type mappings for the JDBC classes java.sql.Clob and java.sql.Blob. These types may be inconveni-
ent for some applications, since the blob or clob object may not be reused outside of a transaction.
(Furthermore, driver support is patchy and inconsistent.)
Unique identifiers of entities and collections may be of any basic type except binary, blob and clob.
(Composite identifiers are also allowed, see below.)
The basic value types have corresponding Type constants defined on org.hibernate.Hibernate. For example,
Hibernate.STRING represents the string type.
It is relatively easy for developers to create their own value types. For example, you might want to persist prop-
erties of type java.lang.BigInteger to VARCHAR columns. Hibernate does not provide a built-in type for this.
But custom types are not limited to mapping a property (or collection element) to a single table column. So, for
example, you might have a Java property getName()/setName() of type java.lang.String that is persisted to
the columns FIRST_NAME, INITIAL, SURNAME.
You may even supply parameters to a UserType in the mapping file. To do this, your UserType must implement
the org.hibernate.usertype.ParameterizedType interface. To supply parameters to your custom type, you
can use the <type> element in your mapping files.
<property name="priority">
<type name="com.mycompany.usertypes.DefaultValueIntegerType">
<param name="default">0</param>
</type>
</property>
The UserType can now retrieve the value for the parameter named default from the Properties object passed
to it.
If you use a certain UserType very often, it may be useful to define a shorter name for it. You can do this using
Hibernate 3.2.7 76
Basic O/R Mapping
the <typedef> element. Typedefs assign a name to a custom type, and may also contain a list of default para-
meter values if the type is parameterized.
It is also possible to override the parameters supplied in a typedef on a case-by-case basis by using type para-
meters on the property mapping.
Even though Hibernate's rich range of built-in types and support for components means you will very rarely
need to use a custom type, it is nevertheless considered good form to use custom types for (non-entity) classes
that occur frequently in your application. For example, a MonetaryAmount class is a good candidate for a Com-
positeUserType, even though it could easily be mapped as a component. One motivation for this is abstraction.
With a custom type, your mapping documents would be future-proofed against possible changes in your way of
representing monetary values.
Notice how associations are now specified using entity-name instead of class.
Hibernate 3.2.7 77
Basic O/R Mapping
Many Hibernate users prefer to embed mapping information directly in sourcecode using XDoclet
@hibernate.tags. We will not cover this approach in this document, since strictly it is considered part of
XDoclet. However, we include the following example of the Cat class with XDoclet mappings.
package eg;
import java.util.Set;
import java.util.Date;
/**
* @hibernate.class
* table="CATS"
*/
public class Cat {
private Long id; // identifier
private Date birthdate;
private Cat mother;
private Set kittens
private Color color;
private char sex;
private float weight;
/*
* @hibernate.id
* generator-class="native"
* column="CAT_ID"
*/
public Long getId() {
return id;
}
private void setId(Long id) {
this.id=id;
}
/**
* @hibernate.many-to-one
* column="PARENT_ID"
*/
public Cat getMother() {
return mother;
}
void setMother(Cat mother) {
this.mother = mother;
}
/**
* @hibernate.property
* column="BIRTH_DATE"
*/
public Date getBirthdate() {
return birthdate;
}
void setBirthdate(Date date) {
birthdate = date;
}
/**
* @hibernate.property
* column="WEIGHT"
*/
public float getWeight() {
return weight;
Hibernate 3.2.7 78
Basic O/R Mapping
}
void setWeight(float weight) {
this.weight = weight;
}
/**
* @hibernate.property
* column="COLOR"
* not-null="true"
*/
public Color getColor() {
return color;
}
void setColor(Color color) {
this.color = color;
}
/**
* @hibernate.set
* inverse="true"
* order-by="BIRTH_DATE"
* @hibernate.collection-key
* column="PARENT_ID"
* @hibernate.collection-one-to-many
*/
public Set getKittens() {
return kittens;
}
void setKittens(Set kittens) {
this.kittens = kittens;
}
// addKitten not needed by Hibernate
public void addKitten(Cat kitten) {
kittens.add(kitten);
}
/**
* @hibernate.property
* column="SEX"
* not-null="true"
* update="false"
*/
public char getSex() {
return sex;
}
void setSex(char sex) {
this.sex=sex;
}
}
See the Hibernate web site for more examples of XDoclet and Hibernate.
JDK 5.0 introduced XDoclet-style annotations at the language level, type-safe and checked at compile time.
This mechnism is more powerful than XDoclet annotations and better supported by tools and IDEs. IntelliJ
IDEA, for example, supports auto-completion and syntax highlighting of JDK 5.0 annotations. The new revi-
sion of the EJB specification (JSR-220) uses JDK 5.0 annotations as the primary metadata mechanism for en-
tity beans. Hibernate3 implements the EntityManager of JSR-220 (the persistence API), support for mapping
metadata is available via the Hibernate Annotations package, as a separate download. Both EJB3 (JSR-220)
and Hibernate3 metadata is supported.
@Entity(access = AccessType.FIELD)
public class Customer implements Serializable {
Hibernate 3.2.7 79
Basic O/R Mapping
@Id;
Long id;
String firstName;
String lastName;
Date birthday;
@Transient
Integer age;
@Embedded
private Address homeAddress;
@OneToMany(cascade=CascadeType.ALL)
@JoinColumn(name="CUSTOMER_ID")
Set<Order> orders;
Note that support for JDK 5.0 Annotations (and JSR-220) is still work in progress and not completed. Please
refer to the Hibernate Annotations module for more details.
Properties marked as generated must additionally be non-insertable and non-updateable. Only Section 5.1.9,
“version (optional)”, Section 5.1.10, “timestamp (optional)”, and Section 5.1.11, “property” can be marked as
generated.
never (the default) - means that the given property value is not generated within the database.
insert - states that the given property value is generated on insert, but is not regenerated on subsequent up-
dates. Things like created-date would fall into this category. Note that even thought Section 5.1.9, “version
(optional)” and Section 5.1.10, “timestamp (optional)” properties can be marked as generated, this option is not
available there...
always - states that the property value is generated both on insert and on update.
The first mode is to explicitly list the CREATE and DROP commands out in the mapping file:
<hibernate-mapping>
Hibernate 3.2.7 80
Basic O/R Mapping
...
<database-object>
<create>CREATE TRIGGER my_trigger ...</create>
<drop>DROP TRIGGER my_trigger</drop>
</database-object>
</hibernate-mapping>
The second mode is to supply a custom class which knows how to construct the CREATE and DROP com-
mands. This custom class must implement the org.hibernate.mapping.AuxiliaryDatabaseObject interface.
<hibernate-mapping>
...
<database-object>
<definition class="MyTriggerDefinition"/>
</database-object>
</hibernate-mapping>
Additionally, these database objects can be optionally scoped such that they only apply when certain dialects
are used.
<hibernate-mapping>
...
<database-object>
<definition class="MyTriggerDefinition"/>
<dialect-scope name="org.hibernate.dialect.Oracle9iDialect"/>
<dialect-scope name="org.hibernate.dialect.Oracle10gDialect"/>
</database-object>
</hibernate-mapping>
Hibernate 3.2.7 81
Chapter 6. Collection Mapping
Notice how we initialized the instance variable with an instance of HashSet. This is the best way to initialize
collection valued properties of newly instantiated (non-persistent) instances. When you make the instance per-
sistent - by calling persist(), for example - Hibernate will actually replace the HashSet with an instance of
Hibernate's own implementation of Set. Watch out for errors like this:
The persistent collections injected by Hibernate behave like HashMap, HashSet, TreeMap, TreeSet or
ArrayList, depending upon the interface type.
Collections instances have the usual behavior of value types. They are automatically persisted when referenced
by a persistent object and automatically deleted when unreferenced. If a collection is passed from one persistent
object to another, its elements might be moved from one table to another. Two entities may not share a refer-
ence to the same collection instance. Due to the underlying relational model, collection-valued properties do
not support null value semantics; Hibernate does not distinguish between a null collection reference and an
empty collection.
You shouldn't have to worry much about any of this. Use persistent collections the same way you use ordinary
Java collections. Just make sure you understand the semantics of bidirectional associations (discussed later).
<class name="Product">
<id name="serialNumber" column="productSerialNumber"/>
Hibernate 3.2.7 82
Collection Mapping
<set name="parts">
<key column="productSerialNumber" not-null="true"/>
<one-to-many class="Part"/>
</set>
</class>
Apart from <set>, there is also <list>, <map>, <bag>, <array> and <primitive-array> mapping elements.
The <map> element is representative:
<map
name="propertyName" (1)
table="table_name" (2)
schema="schema_name" (3)
lazy="true|extra|false" (4)
inverse="true|false" (5)
cascade="all|none|save-update|delete|all-delete-orphan|delet(6)e-orphan"
sort="unsorted|natural|comparatorClass" (7)
order-by="column_name asc|desc" (8)
where="arbitrary sql where condition" (9)
fetch="join|select|subselect" (10)
batch-size="N" (11)
access="field|property|ClassName" (12)
optimistic-lock="true|false" (13)
mutable="true|false" (14)
node="element-name|."
embed-xml="true|false"
>
Hibernate 3.2.7 83
Collection Mapping
Collection instances are distinguished in the database by the foreign key of the entity that owns the collection.
This foreign key is referred to as the collection key column (or columns) of the collection table. The collection
key column is mapped by the <key> element.
There may be a nullability constraint on the foreign key column. For most collections, this is implied. For uni-
directional one to many associations, the foreign key column is nullable by default, so you might need to spe-
cify not-null="true".
See the previous chapter for a full definition of the <key> element.
Collections may contain almost any other Hibernate type, including all basic types, custom types, components,
and of course, references to other entities. This is an important distinction: an object in a collection might be
handled with "value" semantics (its lifecycle fully depends on the collection owner) or it might be a reference
to another entity, with its own lifecycle. In the latter case, only the "link" between the two objects is considered
to be state held by the collection.
The contained type is referred to as the collection element type. Collection elements are mapped by <element>
or <composite-element>, or in the case of entity references, with <one-to-many> or <many-to-many>. The first
two map elements with value semantics, the next two are used to map entity associations.
All collection mappings, except those with set and bag semantics, need an index column in the collection table -
a column that maps to an array index, or List index, or Map key. The index of a Map may be of any basic type,
mapped with <map-key>, it may be an entity reference mapped with <map-key-many-to-many>, or it may be a
composite type, mapped with <composite-map-key>. The index of an array or list is always of type integer
and is mapped using the <list-index> element. The mapped column contains sequential integers (numbered
from zero, by default).
<list-index
column="column_name" (1)
base="0|1|..."/>
(1) column_name (required): The name of the column holding the collection index values.
(1) base (optional, defaults to 0): The value of the index column that corresponds to the first element of the
list or array.
<map-key
column="column_name" (1)
formula="any SQL expression" (2)
type="type_name" (3)
node="@attribute-name"
length="N"/>
Hibernate 3.2.7 84
Collection Mapping
(1) column (optional): The name of the column holding the collection index values.
(2) formula (optional): A SQL formula used to evaluate the key of the map.
(3) type (reguired): The type of the map keys.
<map-key-many-to-many
column="column_name" (1)
formula="any SQL expression" (2)(3)
class="ClassName"
/>
(1) column (optional): The name of the foreign key column for the collection index values.
(2) formula (optional): A SQL formula used to evaluate the foreign key of the map key.
(3) class (required): The entity class used as the map key.
If your table doesn't have an index column, and you still wish to use List as the property type, you should map
the property as a Hibernate <bag>. A bag does not retain its order when it is retrieved from the database, but it
may be optionally sorted or ordered.
There are quite a range of mappings that can be generated for collections, covering many common relational
models. We suggest you experiment with the schema generation tool to get a feeling for how various mapping
declarations translate to database tables.
Any collection of values or many-to-many association requires a dedicated collection table with a foreign key
column or columns, collection element column or columns and possibly an index column or columns.
<element
column="column_name" (1)
formula="any SQL expression" (2)
type="typename" (3)
length="L"
precision="P"
scale="S"
not-null="true|false"
unique="true|false"
node="element-name"
/>
(1) column (optional): The name of the column holding the collection element values.
(2) formula (optional): An SQL formula used to evaluate the element.
(3) type (required): The type of the collection element.
<many-to-many
column="column_name" (1)
formula="any SQL expression" (2)
class="ClassName" (3)
fetch="select|join" (4)
unique="true|false" (5)
not-found="ignore|exception" (6)
entity-name="EntityName" (7)
property-ref="propertyNameFromAssociatedClass" (8)
node="element-name"
embed-xml="true|false"
/>
Hibernate 3.2.7 85
Collection Mapping
(1) column (optional): The name of the element foreign key column.
(2) formula (optional): An SQL formula used to evaluate the element foreign key value.
(3) class (required): The name of the associated class.
(4) fetch (optional - defaults to join): enables outer-join or sequential select fetching for this association.
This is a special case; for full eager fetching (in a single SELECT) of an entity and its many-to-many rela-
tionships to other entities, you would enable join fetching not only of the collection itself, but also with
this attribute on the <many-to-many> nested element.
(5) unique (optional): Enable the DDL generation of a unique constraint for the foreign-key column. This
makes the association multiplicity effectively one to many.
(6) not-found (optional - defaults to exception): Specifies how foreign keys that reference missing rows will
be handled: ignore will treat a missing row as a null association.
(7) entity-name (optional): The entity name of the associated class, as an alternative to class.
(8) property-ref: (optional) The name of a property of the associated class that is joined to this foreign key.
If not specified, the primary key of the associated class is used.
A bag containing integers (with an iteration order determined by the order-by attribute):
<bag name="sizes"
table="item_sizes"
order-by="size asc">
<key column="item_id"/>
<element column="size" type="integer"/>
</bag>
<array name="addresses"
table="PersonAddress"
cascade="persist">
<key column="personId"/>
<list-index column="sortOrder"/>
<many-to-many column="addressId" class="Address"/>
</array>
<map name="holidays"
table="holidays"
schema="dbo"
order-by="hol_name asc">
<key column="id"/>
<map-key column="hol_name" type="string"/>
<element column="hol_date" type="date"/>
</map>
<list name="carComponents"
table="CarComponents">
<key column="carId"/>
<list-index column="sortOrder"/>
<composite-element class="CarComponent">
<property name="price"/>
<property name="type"/>
<property name="serialNumber" column="serialNum"/>
Hibernate 3.2.7 86
Collection Mapping
</composite-element>
</list>
A one to many association links the tables of two classes via a foreign key, with no intervening collection table.
This mapping loses certain semantics of normal Java collections:
• An instance of the contained entity class may not belong to more than one instance of the collection
• An instance of the contained entity class may not appear at more than one value of the collection index
An association from Product to Part requires existence of a foreign key column and possibly an index column
to the Part table. A <one-to-many> tag indicates that this is a one to many association.
<one-to-many
class="ClassName" (1)
not-found="ignore|exception" (2)
entity-name="EntityName" (3)
node="element-name"
embed-xml="true|false"
/>
Notice that the <one-to-many> element does not need to declare any columns. Nor is it necessary to specify the
table name anywhere.
Very important note: If the foreign key column of a <one-to-many> association is declared NOT NULL, you must
declare the <key> mapping not-null="true" or use a bidirectional association with the collection mapping
marked inverse="true". See the discussion of bidirectional associations later in this chapter.
This example shows a map of Part entities by name (where partName is a persistent property of Part). Notice
the use of a formula-based index.
<map name="parts"
cascade="all">
<key column="productId" not-null="true"/>
<map-key formula="partName"/>
<one-to-many class="Part"/>
</map>
Hibernate supports collections implementing java.util.SortedMap and java.util.SortedSet. You must spe-
cify a comparator in the mapping file:
<set name="aliases"
table="person_aliases"
sort="natural">
<key column="person"/>
Hibernate 3.2.7 87
Collection Mapping
Allowed values of the sort attribute are unsorted, natural and the name of a class implementing
java.util.Comparator.
If you want the database itself to order the collection elements use the order-by attribute of set, bag or map
mappings. This solution is only available under JDK 1.4 or higher (it is implemented using LinkedHashSet or
LinkedHashMap). This performs the ordering in the SQL query, not in memory.
Note that the value of the order-by attribute is an SQL ordering, not a HQL ordering!
Associations may even be sorted by some arbitrary criteria at runtime using a collection filter().
A bidirectional association allows navigation from both "ends" of the association. Two kinds of bidirectional
association are supported:
one-to-many
set or bag valued at one end, single-valued at the other
many-to-many
set or bag valued at both ends
You may specify a bidirectional many-to-many association simply by mapping two many-to-many associations
to the same database table and declaring one end as inverse (which one is your choice, but it can not be an in-
dexed collection).
Here's an example of a bidirectional many-to-many association; each category can have many items and each
item can be in many categories:
<class name="Category">
<id name="id" column="CATEGORY_ID"/>
...
<bag name="items" table="CATEGORY_ITEM">
<key column="CATEGORY_ID"/>
Hibernate 3.2.7 88
Collection Mapping
<class name="Item">
<id name="id" column="CATEGORY_ID"/>
...
Changes made only to the inverse end of the association are not persisted. This means that Hibernate has two
representations in memory for every bidirectional association, one link from A to B and another link from B to
A. This is easier to understand if you think about the Java object model and how we create a many-to-many re-
lationship in Java:
The non-inverse side is used to save the in-memory representation to the database.
You may define a bidirectional one-to-many association by mapping a one-to-many association to the same ta-
ble column(s) as a many-to-one association and declaring the many-valued end inverse="true".
<class name="Parent">
<id name="id" column="parent_id"/>
....
<set name="children" inverse="true">
<key column="parent_id"/>
<one-to-many class="Child"/>
</set>
</class>
<class name="Child">
<id name="id" column="child_id"/>
....
<many-to-one name="parent"
class="Parent"
column="parent_id"
not-null="true"/>
</class>
Mapping one end of an association with inverse="true" doesn't affect the operation of cascades, these are or-
thogonal concepts!
A bidirectional association where one end is represented as a <list> or <map> requires special consideration. If
there is a property of the child class which maps to the index column, no problem, we can continue using in-
verse="true" on the collection mapping:
<class name="Parent">
<id name="id" column="parent_id"/>
....
Hibernate 3.2.7 89
Collection Mapping
<class name="Child">
<id name="id" column="child_id"/>
....
<property name="name"
not-null="true"/>
<many-to-one name="parent"
class="Parent"
column="parent_id"
not-null="true"/>
</class>
But, if there is no such property on the child class, we can't think of the association as truly bidirectional (there
is information available at one end of the association that is not available at the other end). In this case, we can't
map the collection inverse="true". Instead, we could use the following mapping:
<class name="Parent">
<id name="id" column="parent_id"/>
....
<map name="children">
<key column="parent_id"
not-null="true"/>
<map-key column="name"
type="string"/>
<one-to-many class="Child"/>
</map>
</class>
<class name="Child">
<id name="id" column="child_id"/>
....
<many-to-one name="parent"
class="Parent"
column="parent_id"
insert="false"
update="false"
not-null="true"/>
</class>
Note that in this mapping, the collection-valued end of the association is responsible for updates to the foreign
key. TODO: Does this really result in some unnecessary update statements?
There are three possible approaches to mapping a ternary association. One is to use a Map with an association as
its index:
<map name="contracts">
<key column="employer_id" not-null="true"/>
<map-key-many-to-many column="employee_id" class="Employee"/>
<one-to-many class="Contract"/>
</map>
<map name="connections">
<key column="incoming_node_id"/>
<map-key-many-to-many column="outgoing_node_id" class="Node"/>
<many-to-many column="connection_id" class="Connection"/>
Hibernate 3.2.7 90
Collection Mapping
</map>
A second approach is to simply remodel the association as an entity class. This is the approach we use most
commonly.
If you've fully embraced our view that composite keys are a bad thing and that entities should have synthetic
identifiers (surrogate keys), then you might find it a bit odd that the many to many associations and collections
of values that we've shown so far all map to tables with composite keys! Now, this point is quite arguable; a
pure association table doesn't seem to benefit much from a surrogate key (though a collection of composite val-
ues might). Nevertheless, Hibernate provides a feature that allows you to map many to many associations and
collections of values to a table with a surrogate key.
The <idbag> element lets you map a List (or Collection) with bag semantics.
As you can see, an <idbag> has a synthetic id generator, just like an entity class! A different surrogate key is
assigned to each collection row. Hibernate does not provide any mechanism to discover the surrogate key value
of a particular row, however.
Note that the update performance of an <idbag> is much better than a regular <bag>! Hibernate can locate indi-
vidual rows efficiently and update or delete them individually, just like a list, map or set.
In the current implementation, the native identifier generation strategy is not supported for <idbag> collection
identifiers.
package eg;
import java.util.Set;
....
....
}
Hibernate 3.2.7 91
Collection Mapping
has a collection of Child instances. If each child has at most one parent, the most natural mapping is a one-
to-many association:
<hibernate-mapping>
<class name="Parent">
<id name="id">
<generator class="sequence"/>
</id>
<set name="children">
<key column="parent_id"/>
<one-to-many class="Child"/>
</set>
</class>
<class name="Child">
<id name="id">
<generator class="sequence"/>
</id>
<property name="name"/>
</class>
</hibernate-mapping>
<hibernate-mapping>
<class name="Parent">
<id name="id">
<generator class="sequence"/>
</id>
<set name="children" inverse="true">
<key column="parent_id"/>
<one-to-many class="Child"/>
</set>
</class>
<class name="Child">
<id name="id">
<generator class="sequence"/>
</id>
<property name="name"/>
<many-to-one name="parent" class="Parent" column="parent_id" not-null="true"/>
</class>
</hibernate-mapping>
Alternatively, if you absolutely insist that this association should be unidirectional, you can declare the NOT
NULL constraint on the <key> mapping:
Hibernate 3.2.7 92
Collection Mapping
<hibernate-mapping>
<class name="Parent">
<id name="id">
<generator class="sequence"/>
</id>
<set name="children">
<key column="parent_id" not-null="true"/>
<one-to-many class="Child"/>
</set>
</class>
<class name="Child">
<id name="id">
<generator class="sequence"/>
</id>
<property name="name"/>
</class>
</hibernate-mapping>
On the other hand, if a child might have multiple parents, a many-to-many association is appropriate:
<hibernate-mapping>
<class name="Parent">
<id name="id">
<generator class="sequence"/>
</id>
<set name="children" table="childset">
<key column="parent_id"/>
<many-to-many class="Child" column="child_id"/>
</set>
</class>
<class name="Child">
<id name="id">
<generator class="sequence"/>
</id>
<property name="name"/>
</class>
</hibernate-mapping>
Table definitions:
For more examples and a complete walk-through a parent/child relationship mapping, see Chapter 21, Ex-
ample: Parent/Child.
Even more exotic association mappings are possible, we will catalog all possibilities in the next chapter.
Hibernate 3.2.7 93
Chapter 7. Association Mappings
7.1. Introduction
Association mappings are the often most difficult thing to get right. In this section we'll go through the canonic-
al cases one by one, starting with unidirectional mappings, and then considering the bidirectional cases. We'll
use Person and Address in all the examples.
We'll classify associations by whether or not they map to an intervening join table, and by multiplicity.
Nullable foreign keys are not considered good practice in traditional data modelling, so all our examples use
not null foreign keys. This is not a requirement of Hibernate, and the mappings will all work if you drop the
nullability constraints.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<many-to-one name="address"
column="addressId"
not-null="true"/>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
create table Person ( personId bigint not null primary key, addressId bigint not null )
create table Address ( addressId bigint not null primary key )
A unidirectional one-to-one association on a foreign key is almost identical. The only difference is the column
unique constraint.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<many-to-one name="address"
column="addressId"
unique="true"
not-null="true"/>
</class>
Hibernate 3.2.7 94
Association Mappings
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
create table Person ( personId bigint not null primary key, addressId bigint not null unique )
create table Address ( addressId bigint not null primary key )
A unidirectional one-to-one association on a primary key usually uses a special id generator. (Notice that we've
reversed the direction of the association in this example.)
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
</class>
<class name="Address">
<id name="id" column="personId">
<generator class="foreign">
<param name="property">person</param>
</generator>
</id>
<one-to-one name="person" constrained="true"/>
</class>
A unidirectional one-to-many association on a foreign key is a very unusual case, and is not really recommen-
ded.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<set name="addresses">
<key column="personId"
not-null="true"/>
<one-to-many class="Address"/>
</set>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
We think it's better to use a join table for this kind of association.
Hibernate 3.2.7 95
Association Mappings
A unidirectional one-to-many association on a join table is much preferred. Notice that by specifying
unique="true", we have changed the multiplicity from many-to-many to one-to-many.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<set name="addresses" table="PersonAddress">
<key column="personId"/>
<many-to-many column="addressId"
unique="true"
class="Address"/>
</set>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
A unidirectional many-to-one association on a join table is quite common when the association is optional.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<join table="PersonAddress"
optional="true">
<key column="personId" unique="true"/>
<many-to-one name="address"
column="addressId"
not-null="true"/>
</join>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
Hibernate 3.2.7 96
Association Mappings
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<join table="PersonAddress"
optional="true">
<key column="personId"
unique="true"/>
<many-to-one name="address"
column="addressId"
not-null="true"
unique="true"/>
</join>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<set name="addresses" table="PersonAddress">
<key column="personId"/>
<many-to-many column="addressId"
class="Address"/>
</set>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
</class>
Hibernate 3.2.7 97
Association Mappings
A bidirectional many-to-one association is the most common kind of association. (This is the standard parent/
child relationship.)
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<many-to-one name="address"
column="addressId"
not-null="true"/>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
<set name="people" inverse="true">
<key column="addressId"/>
<one-to-many class="Person"/>
</set>
</class>
create table Person ( personId bigint not null primary key, addressId bigint not null )
create table Address ( addressId bigint not null primary key )
If you use a List (or other indexed collection) you need to set the key column of the foreign key to not null,
and let Hibernate manage the association from the collections side to maintain the index of each element
(making the other side virtually inverse by setting update="false" and insert="false"):
<class name="Person">
<id name="id"/>
...
<many-to-one name="address"
column="addressId"
not-null="true"
insert="false"
update="false"/>
</class>
<class name="Address">
<id name="id"/>
...
<list name="people">
<key column="addressId" not-null="true"/>
<list-index column="peopleIdx"/>
<one-to-many class="Person"/>
</list>
</class>
It is important that you define not-null="true" on the <key> element of the collection mapping if the underly-
ing foreign key column is NOT NULL. Don't only declare not-null="true" on a possible nested <column> ele-
ment, but on the <key> element.
<class name="Person">
<id name="id" column="personId">
Hibernate 3.2.7 98
Association Mappings
<generator class="native"/>
</id>
<many-to-one name="address"
column="addressId"
unique="true"
not-null="true"/>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
<one-to-one name="person"
property-ref="address"/>
</class>
create table Person ( personId bigint not null primary key, addressId bigint not null unique )
create table Address ( addressId bigint not null primary key )
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<one-to-one name="address"/>
</class>
<class name="Address">
<id name="id" column="personId">
<generator class="foreign">
<param name="property">person</param>
</generator>
</id>
<one-to-one name="person"
constrained="true"/>
</class>
A bidirectional one-to-many association on a join table. Note that the inverse="true" can go on either end of
the association, on the collection, or on the join.
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<set name="addresses"
table="PersonAddress">
<key column="personId"/>
<many-to-many column="addressId"
unique="true"
class="Address"/>
Hibernate 3.2.7 99
Association Mappings
</set>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
<join table="PersonAddress"
inverse="true"
optional="true">
<key column="addressId"/>
<many-to-one name="person"
column="personId"
not-null="true"/>
</join>
</class>
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<join table="PersonAddress"
optional="true">
<key column="personId"
unique="true"/>
<many-to-one name="address"
column="addressId"
not-null="true"
unique="true"/>
</join>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
<join table="PersonAddress"
optional="true"
inverse="true">
<key column="addressId"
unique="true"/>
<many-to-one name="person"
column="personId"
not-null="true"
unique="true"/>
</join>
</class>
<class name="Person">
<id name="id" column="personId">
<generator class="native"/>
</id>
<set name="addresses" table="PersonAddress">
<key column="personId"/>
<many-to-many column="addressId"
class="Address"/>
</set>
</class>
<class name="Address">
<id name="id" column="addressId">
<generator class="native"/>
</id>
<set name="people" inverse="true" table="PersonAddress">
<key column="addressId"/>
<many-to-many column="personId"
class="Person"/>
</set>
</class>
<properties name="currentAccountKey">
<property name="accountNumber" type="string" not-null="true"/>
<property name="currentAccount" type="boolean">
<formula>case when effectiveEndDate is null then 1 else 0 end</formula>
</property>
</properties>
<property name="effectiveEndDate" type="date"/>
<property name="effectiveStateDate" type="date" not-null="true"/>
Then we can map an association to the current instance (the one with null effectiveEndDate) using:
<many-to-one name="currentAccountInfo"
property-ref="currentAccountKey"
class="AccountInfo">
<column name="accountNumber"/>
<formula>'1'</formula>
</many-to-one>
In a more complex example, imagine that the association between Employee and Organization is maintained in
an Employment table full of historical employment data. Then an association to the employee's most recent em-
ployer (the one with the most recent startDate) might be mapped this way:
<join>
<key column="employeeId"/>
<subselect>
select employeeId, orgId
from Employments
group by orgId
having startDate = max(startDate)
</subselect>
<many-to-one name="mostRecentEmployer"
class="Organization"
column="orgId"/>
</join>
You can get quite creative with this functionality, but it is usually more practical to handle these kinds of cases
using HQL or a criteria query.
Now Name may be persisted as a component of Person. Notice that Name defines getter and setter methods for
its persistent properties, but doesn't need to declare any interfaces or identifier properties.
The person table would have the columns pid, birthday, initial, first and last.
Like all value types, components do not support shared references. In other words, two persons could have the
same name, but the two person objects would contain two independent name ojects, only "the same" by value.
The null value semantics of a component are ad hoc. When reloading the containing object, Hibernate will as-
sume that if all component columns are null, then the entire component is null. This should be okay for most
purposes.
The properties of a component may be of any Hibernate type (collections, many-to-one associations, other
components, etc). Nested components should not be considered an exotic usage. Hibernate is intended to sup-
port a very fine-grained object model.
The <component> element allows a <parent> subelement that maps a property of the component class as a ref-
erence back to the containing entity.
Note: if you define a Set of composite elements, it is very important to implement equals() and hashCode()
correctly.
Composite elements may contain components but not collections. If your composite element itself contains
components, use the <nested-composite-element> tag. This is a pretty exotic case - a collection of compon-
ents which themselves have components. By this stage you should be asking yourself if a one-to-many associ-
ation is more appropriate. Try remodelling the composite element as an entity - but note that even though the
Java model is the same, the relational model and persistence semantics are still slightly different.
Please note that a composite element mapping doesn't support null-able properties if you're using a <set>. Hi-
bernate has to use each columns value to identify a record when deleting objects (there is no separate primary
key column in the composite element table), which is not possible with null values. You have to either use only
not-null properties in a composite-element or choose a <list>, <map>, <bag> or <idbag>.
A special case of a composite element is a composite element with a nested <many-to-one> element. A map-
ping like this allows you to map extra columns of a many-to-many association table to the composite element
class. The following is a many-to-many association from Order to Item where purchaseDate, price and
quantity are properties of the association:
Of course, there can't be a reference to the purchae on the other side, for bidirectional association navigation.
Remember that components are value types and don't allow shared references. A single Purchase can be in the
set of an Order, but it can't be referenced by the Item at the same time.
Composite elements may appear in queries using the same syntax as associations to other entities.
ments:
Note: in Hibernate3, the second requirement is not an absolutely hard requirement of Hibernate. But do it any-
way.
You can't use an IdentifierGenerator to generate composite keys. Instead the application must assign its own
identifiers.
Use the <composite-id> tag (with nested <key-property> elements) in place of the usual <id> declaration. For
example, the OrderLine class has a primary key that depends upon the (composite) primary key of Order.
<class name="OrderLine">
<property name="name"/>
</class>
Now, any foreign keys referencing the OrderLine table are also composite. You must declare this in your map-
pings for other classes. An association to OrderLine would be mapped like this:
(Note that the <column> tag is an alternative to the column attribute everywhere.)
<set name="undeliveredOrderLines">
<key column name="warehouseId"/>
<many-to-many class="OrderLine">
<column name="lineId"/>
<column name="orderId"/>
<column name="customerId"/>
</many-to-many>
</set>
</key>
<one-to-many class="OrderLine"/>
</set>
<class name="OrderLine">
....
....
<list name="deliveryAttempts">
<key> <!-- a collection inherits the composite key type -->
<column name="lineId"/>
<column name="orderId"/>
<column name="customerId"/>
</key>
<list-index column="attemptId" base="1"/>
<composite-element class="DeliveryAttempt">
...
</composite-element>
</set>
</class>
<dynamic-component name="userAttributes">
<property name="foo" column="FOO" type="string"/>
<property name="bar" column="BAR" type="integer"/>
<many-to-one name="baz" class="Baz" column="BAZ_ID"/>
</dynamic-component>
The semantics of a <dynamic-component> mapping are identical to <component>. The advantage of this kind of
mapping is the ability to determine the actual properties of the bean at deployment time, just by editing the
mapping document. Runtime manipulation of the mapping document is also possible, using a DOM parser.
Even better, you can access (and change) Hibernate's configuration-time metamodel via the Configuration ob-
ject.
• implicit polymorphism
It is possible to use different mapping strategies for different branches of the same inheritance hierarchy, and
then make use of implicit polymorphism to achieve polymorphism across the whole hierarchy. However, Hi-
bernate does not support mixing <subclass>, and <joined-subclass> and <union-subclass> mappings under
the same root <class> element. It is possible to mix together the table per hierarchy and table per subclass
strategies, under the the same <class> element, by combining the <subclass> and <join> elements (see be-
low).
It is possible to define subclass, union-subclass, and joined-subclass mappings in separate mapping docu-
ments, directly beneath hibernate-mapping. This allows you to extend a class hierachy just by adding a new
mapping file. You must specify an extends attribute in the subclass mapping, naming a previously mapped su-
perclass. Note: Previously this feature made the ordering of the mapping documents important. Since Hibern-
ate3, the ordering of mapping files does not matter when using the extends keyword. The ordering inside a
single mapping file still needs to be defined as superclasses before subclasses.
<hibernate-mapping>
<subclass name="DomesticCat" extends="Cat" discriminator-value="D">
<property name="name" type="string"/>
</subclass>
</hibernate-mapping>
</subclass>
<subclass name="ChequePayment" discriminator-value="CHEQUE">
...
</subclass>
</class>
Exactly one table is required. There is one big limitation of this mapping strategy: columns declared by the sub-
classes, such as CCTYPE, may not have NOT NULL constraints.
Four tables are required. The three subclass tables have primary key associations to the superclass table (so the
relational model is actually a one-to-one association).
Note that Hibernate's implementation of table per subclass requires no discriminator column. Other object/
relational mappers use a different implementation of table per subclass which requires a type discriminator
column in the superclass table. The approach taken by Hibernate is much more difficult to implement but argu-
ably more correct from a relational point of view. If you would like to use a discriminator column with the table
per subclass strategy, you may combine the use of <subclass> and <join>, as follow:
...
</join>
</subclass>
<subclass name="ChequePayment" discriminator-value="CHEQUE">
<join table="CHEQUE_PAYMENT" fetch="select">
<key column="PAYMENT_ID"/>
...
</join>
</subclass>
</class>
The optional fetch="select" declaration tells Hibernate not to fetch the ChequePayment subclass data using an
outer join when querying the superclass.
9.1.4. Mixing table per class hierarchy with table per subclass
You may even mix the table per hierarchy and table per subclass strategies using this approach:
For any of these mapping strategies, a polymorphic association to the root Payment class is mapped using
<many-to-one>.
There are two ways we could go about mapping the table per concrete class strategy. The first is to use
<union-subclass>.
<class name="Payment">
<id name="id" type="long" column="PAYMENT_ID">
<generator class="sequence"/>
</id>
<property name="amount" column="AMOUNT"/>
...
<union-subclass name="CreditCardPayment" table="CREDIT_PAYMENT">
<property name="creditCardType" column="CCTYPE"/>
...
</union-subclass>
<union-subclass name="CashPayment" table="CASH_PAYMENT">
...
</union-subclass>
Three tables are involved for the subclasses. Each table defines columns for all properties of the class, includ-
ing inherited properties.
The limitation of this approach is that if a property is mapped on the superclass, the column name must be the
same on all subclass tables. (We might relax this in a future release of Hibernate.) The identity generator
strategy is not allowed in union subclass inheritance, indeed the primary key seed has to be shared accross all
unioned subclasses of a hierarchy.
If your superclass is abstract, map it with abstract="true". Of course, if it is not abstract, an additional table
(defaults to PAYMENT in the example above) is needed to hold instances of the superclass.
Notice that nowhere do we mention the Payment interface explicitly. Also notice that properties of Payment are
mapped in each of the subclasses. If you want to avoid duplication, consider using XML entities (e.g. [
<!ENTITY allproperties SYSTEM "allproperties.xml"> ] in the DOCTYPE declartion and &allproperties;
in the mapping).
The disadvantage of this approach is that Hibernate does not generate SQL UNIONs when performing poly-
morphic queries.
For this mapping strategy, a polymorphic association to Payment is usually mapped using <any>.
There is one further thing to notice about this mapping. Since the subclasses are each mapped in their own
<class> element (and since Payment is just an interface), each of the subclasses could easily be part of another
inheritance hierarchy! (And you can still use polymorphic queries against the Payment interface.)
Once again, we don't mention Payment explicitly. If we execute a query against the Payment interface - for ex-
ample, from Payment - Hibernate automatically returns instances of CreditCardPayment (and its subclasses,
since they also implement Payment), CashPayment and ChequePayment but not instances of Nonelectronic-
Transaction.
9.2. Limitations
There are certain limitations to the "implicit polymorphism" approach to the table per concrete-class mapping
strategy. There are somewhat less restrictive limitations to <union-subclass> mappings.
The following table shows the limitations of table per concrete-class mappings, and of implicit polymorphism,
in Hibernate.
table per <many-to-o <one-to-on <one-to-ma <many-to-m s.get(Paym from Pay- from Order
class- ne> e> ny> any> ent.class, ment p o join
hierarchy id) o.payment
p
table per <many-to-o <one-to-on <one-to-ma <many-to-m s.get(Paym from Pay- from Order
subclass ne> e> ny> any> ent.class, ment p o join
id) o.payment
p
table per <many-to-o <one-to-on <one-to-ma <many-to-m s.get(Paym from Pay- from Order
concrete- ne> e> ny> (for any> ent.class, ment p o join
class in- id) o.payment
(union-subc verse="tru p
lass) e" only)
table per <any> not suppor- not suppor- <many-to-a s.createCr from Pay- not suppor-
concrete ted ted ny> iter- ment p ted
class ia(Payment
(implicit .class).ad
polymorph- d( Re-
ism) stric-
tions.idEq
(id)
).uniqueRe
sult()
In other words, Hibernate application developers should always think about the state of their objects, and not
necessarily about the execution of SQL statements. This part is taken care of by Hibernate and is only relevant
for the application developer when tuning the performance of the system.
• Transient - an object is transient if it has just been instantiated using the new operator, and it is not associ-
ated with a Hibernate Session. It has no persistent representation in the database and no identifier value has
been assigned. Transient instances will be destroyed by the garbage collector if the application doesn't hold
a reference anymore. Use the Hibernate Session to make an object persistent (and let Hibernate take care of
the SQL statements that need to be executed for this transition).
• Persistent - a persistent instance has a representation in the database and an identifier value. It might just
have been saved or loaded, however, it is by definition in the scope of a Session. Hibernate will detect any
changes made to an object in persistent state and synchronize the state with the database when the unit of
work completes. Developers don't execute manual UPDATE statements, or DELETE statements when an object
should be made transient.
• Detached - a detached instance is an object that has been persistent, but its Session has been closed. The
reference to the object is still valid, of course, and the detached instance might even be modified in this
state. A detached instance can be reattached to a new Session at a later point in time, making it (and all the
modifications) persistent again. This feature enables a programming model for long running units of work
that require user think-time. We call them application transactions, i.e. a unit of work from the point of
view of the user.
We'll now discuss the states and state transitions (and the Hibernate methods that trigger a transition) in more
detail.
If Cat has a generated identifier, the identifier is generated and assigned to the cat when save() is called. If
Cat has an assigned identifier, or a composite key, the identifier should be assigned to the cat instance before
calling save(). You may also use persist() instead of save(), with the semantics defined in the EJB3 early
draft.
Alternatively, you may assign the identifier using an overloaded version of save().
If the object you make persistent has associated objects (e.g. the kittens collection in the previous example),
these objects may be made persistent in any order you like unless you have a NOT NULL constraint upon a for-
eign key column. There is never a risk of violating foreign key constraints. However, you might violate a NOT
NULL constraint if you save() the objects in the wrong order.
Usually you don't bother with this detail, as you'll very likely use Hibernate's transitive persistence feature to
save the associated objects automatically. Then, even NOT NULL constraint violations don't occur - Hibernate
will take care of everything. Transitive persistence is discussed later in this chapter.
Note that load() will throw an unrecoverable exception if there is no matching database row. If the class is
mapped with a proxy, load() just returns an uninitialized proxy and does not actually hit the database until you
invoke a method of the proxy. This behaviour is very useful if you wish to create an association to an object
without actually loading it from the database. It also allows multiple instances to be loaded as a batch if batch-
size is defined for the class mapping.
If you are not certain that a matching row exists, you should use the get() method, which hits the database im-
mediately and returns null if there is no matching row.
You may even load an object using an SQL SELECT ... FOR UPDATE, using a LockMode. See the API docu-
Note that any associated instances or contained collections are not selected FOR UPDATE, unless you decide to
specify lock or all as a cascade style for the association.
It is possible to re-load an object and all its collections at any time, using the refresh() method. This is useful
when database triggers are used to initialize some of the properties of the object.
sess.save(cat);
sess.flush(); //force the SQL INSERT
sess.refresh(cat); //re-read the state (after the trigger executes)
An important question usually appears at this point: How much does Hibernate load from the database and how
many SQL SELECTs will it use? This depends on the fetching strategy and is explained in Section 19.1,
“Fetching strategies”.
10.4. Querying
If you don't know the identifiers of the objects you are looking for, you need a query. Hibernate supports an
easy-to-use but powerful object oriented query language (HQL). For programmatic query creation, Hibernate
supports a sophisticated Criteria and Example query feature (QBC and QBE). You may also express your query
in the native SQL of your database, with optional support from Hibernate for result set conversion into objects.
HQL and native SQL queries are represented with an instance of org.hibernate.Query. This interface offers
methods for parameter binding, result set handling, and for the execution of the actual query. You always ob-
tain a Query using the current Session:
A query is usually executed by invoking list(), the result of the query will be loaded completely into a collec-
tion in memory. Entity instances retrieved by a query are in persistent state. The uniqueResult() method offers
a shortcut if you know your query will only return a single object. Note that queries that make use of eager
fetching of collections usually return duplicates of the root objects (but with their collections initialized). You
can filter these duplicates simply through a Set.
Iterating results
Occasionally, you might be able to achieve better performance by executing the query using the iterate()
method. This will only usually be the case if you expect that the actual entity instances returned by the query
will already be in the session or second-level cache. If they are not already cached, iterate() will be slower
than list() and might require many database hits for a simple query, usually 1 for the initial select which only
returns identifiers, and n additional selects to initialize the actual instances.
// fetch ids
Iterator iter = sess.createQuery("from eg.Qux q order by q.likeliness").iterate();
while ( iter.hasNext() ) {
Qux qux = (Qux) iter.next(); // fetch the object
// something we couldnt express in the query
if ( qux.calculateComplicatedAlgorithm() ) {
// delete the current instance
iter.remove();
// dont need to process the rest
break;
}
}
Hibernate queries sometimes return tuples of objects, in which case each tuple is returned as an array:
while ( kittensAndMothers.hasNext() ) {
Object[] tuple = (Object[]) kittensAndMothers.next();
Cat kitten = tuple[0];
Cat mother = tuple[1];
....
}
Scalar results
Queries may specify a property of a class in the select clause. They may even call SQL aggregate functions.
Properties or aggregates are considered "scalar" results (and not entities in persistent state).
while ( results.hasNext() ) {
Object[] row = (Object[]) results.next();
Color type = (Color) row[0];
Date oldest = (Date) row[1];
Integer count = (Integer) row[2];
.....
}
Bind parameters
Methods on Query are provided for binding values to named parameters or JDBC-style ? parameters. Contrary
to JDBC, Hibernate numbers parameters from zero. Named parameters are identifiers of the form :name in the
query string. The advantages of named parameters are:
• named parameters are insensitive to the order they occur in the query string
• they may occur multiple times in the same query
• they are self-documenting
//positional parameter
Query q = sess.createQuery("from DomesticCat cat where cat.name = ?");
q.setString(0, "Izi");
Iterator cats = q.iterate();
Pagination
If you need to specify bounds upon your result set (the maximum number of rows you want to retrieve and / or
the first row you want to retrieve) you should use methods of the Query interface:
Hibernate knows how to translate this limit query into the native SQL of your DBMS.
Scrollable iteration
If your JDBC driver supports scrollable ResultSets, the Query interface may be used to obtain a Scrolla-
bleResults object, which allows flexible navigation of the query results.
// find the first name on each page of an alphabetical list of cats by name
firstNamesOfPages = new ArrayList();
do {
String name = cats.getString(0);
firstNamesOfPages.add(name);
}
while ( cats.scroll(PAGE_SIZE) );
}
cats.close()
Note that an open database connection (and cursor) is required for this functionality, use setMaxResult()/set-
FirstResult() if you need offline pagination functionality.
You may also define named queries in the mapping document. (Remember to use a CDATA section if your query
contains characters that could be interpreted as markup.)
<query name="ByNameAndMaximumWeight"><![CDATA[
from eg.DomesticCat as cat
where cat.name = ?
and cat.weight > ?
] ]></query>
Query q = sess.getNamedQuery("ByNameAndMaximumWeight");
q.setString(0, name);
q.setInt(1, minWeight);
List cats = q.list();
Note that the actual program code is independent of the query language that is used, you may also define native
SQL queries in metadata, or migrate existing queries to Hibernate by placing them in mapping files.
Also note that a query declaration inside a <hibernate-mapping> element requires a global unique name for the
query, while a query declaration inside a <class> element is made unique automatically by prepending the
fully qualified name of the class, for example eg.Cat.ByNameAndMaximumWeight.
A collection filter is a special type of query that may be applied to a persistent collection or array. The query
string may refer to this, meaning the current collection element.
The returned collection is considered a bag, and it's a copy of the given collection. The original collection is not
modified (this is contrary to the implication of the name "filter", but consistent with expected behavior).
Observe that filters do not require a from clause (though they may have one if required). Filters are not limited
to returning the collection elements themselves.
Even an empty filter query is useful, e.g. to load a subset of elements in a huge collection:
.setFirstResult(0).setMaxResults(10)
.list();
HQL is extremely powerful but some developers prefer to build queries dynamically, using an object-oriented
API, rather than building query strings. Hibernate provides an intuitive Criteria query API for these cases:
The Criteria and the associated Example API are discussed in more detail in Chapter 15, Criteria Queries.
You may express a query in SQL, using createSQLQuery() and let Hibernate take care of the mapping from
result sets to objects. Note that you may at any time call session.connection() and use the JDBC Connection
directly. If you chose to use the Hibernate API, you must enclose SQL aliases in braces:
SQL queries may contain named and positional parameters, just like Hibernate queries. More information about
native SQL queries in Hibernate can be found in Chapter 16, Native SQL.
Sometimes this programming model is inefficient since it would require both an SQL SELECT (to load an ob-
ject) and an SQL UPDATE (to persist its updated state) in the same session. Therefore Hibernate offers an altern-
ate approach, using detached instances.
Note that Hibernate does not offer its own API for direct execution of UPDATE or DELETE statements. Hibernate
is a state management service, you don't have to think in statements to use it. JDBC is a perfect API for execut-
ing SQL statements, you can get a JDBC Connection at any time by calling session.connection(). Further-
more, the notion of mass operations conflicts with object/relational mapping for online transaction processing-
oriented applications. Future versions of Hibernate may however provide special mass operation functions. See
Chapter 13, Batch processing for some possible batch operation tricks.
Hibernate supports this model by providing for reattachment of detached instances using the Ses-
sion.update() or Session.merge() methods:
If the Cat with identifier catId had already been loaded by secondSession when the application tried to reat-
tach it, an exception would have been thrown.
Use update() if you are sure that the session does not contain an already persistent instance with the same
identifier, and merge() if you want to merge your modifications at any time without consideration of the state
of the session. In other words, update() is usually the first method you would call in a fresh session, ensuring
that reattachment of your detached instances is the first operation that is executed.
The application should individually update() detached instances reachable from the given detached instance if
and only if it wants their state also updated. This can be automated of course, using transitive persistence, see
Section 10.11, “Transitive persistence”.
The lock() method also allows an application to reassociate an object with a new session. However, the de-
tached instance has to be unmodified!
//just reassociate:
sess.lock(fritz, LockMode.NONE);
//do a version check, then reassociate:
sess.lock(izi, LockMode.READ);
//do a version check, using SELECT ... FOR UPDATE, then reassociate:
sess.lock(pk, LockMode.UPGRADE);
Note that lock() can be used with various LockModes, see the API documentation and the chapter on transac-
tion handling for more information. Reattachment is not the only usecase for lock().
Other models for long units of work are discussed in Section 11.3, “Optimistic concurrency control”.
The usage and semantics of saveOrUpdate() seems to be confusing for new users. Firstly, so long as you are
not trying to use instances from one session in another new session, you should not need to use update(), sa-
veOrUpdate(), or merge(). Some whole applications will never use either of these methods.
• if there is a persistent instance with the same identifier currently associated with the session, copy the state
of the given object onto the persistent instance
• if there is no persistent instance currently associated with the session, try to load it from the database, or
create a new persistent instance
• the persistent instance is returned
• the given instance does not become associated with the session, it remains detached
sess.delete(cat);
You may delete objects in any order you like, without risk of foreign key constraint violations. It is still pos-
sible to violate a NOT NULL constraint on a foreign key column by deleting objects in the wrong order, e.g. if
you delete the parent, but forget to delete the children.
The ReplicationMode determines how replicate() will deal with conflicts with existing rows in the database.
• ReplicationMode.IGNORE - ignore the object when there is an existing database row with the same identifi-
er
• ReplicationMode.OVERWRITE - overwrite any existing database row with the same identifier
• ReplicationMode.EXCEPTION - throw an exception if there is an existing database row with the same identi-
fier
• ReplicationMode.LATEST_VERSION - overwrite the row if its version number is earlier than the version
number of the object, or ignore the object otherwise
Usecases for this feature include reconciling data entered into different database instances, upgrading system
configuration information during product upgrades, rolling back changes made during non-ACID transactions
and more.
1. all entity insertions, in the same order the corresponding objects were saved using Session.save()
2. all entity updates
3. all collection deletions
4. all collection element deletions, updates and insertions
5. all collection insertions
6. all entity deletions, in the same order the corresponding objects were deleted using Session.delete()
(An exception is that objects using native ID generation are inserted when they are saved.)
Except when you explicity flush(), there are absolutely no guarantees about when the Session executes the
JDBC calls, only the order in which they are executed. However, Hibernate does guarantee that the
Query.list(..) will never return stale data; nor will they return the wrong data.
It is possible to change the default behavior so that flush occurs less frequently. The FlushMode class defines
three different modes: only flush at commit time (and only when the Hibernate Transaction API is used), flush
automatically using the explained routine, or never flush unless flush() is called explicitly. The last mode is
useful for long running units of work, where a Session is kept open and disconnected for a long time (see Sec-
tion 11.3.2, “Extended session and automatic versioning”).
sess = sf.openSession();
Transaction tx = sess.beginTransaction();
sess.setFlushMode(FlushMode.COMMIT); // allow queries to return stale state
During flush, an exception might occur (e.g. if a DML operation violates a constraint). Since handling excep-
tions involves some understanding of Hibernate's transactional behavior, we discuss it in Chapter 11, Transac-
tions And Concurrency.
If the children in a parent/child relationship would be value typed (e.g. a collection of addresses or strings),
their lifecycle would depend on the parent and no further action would be required for convenient "cascading"
of state changes. When the parent is saved, the value-typed child objects are saved as well, when the parent is
deleted, the children will be deleted, etc. This even works for operations such as the removal of a child from the
collection; Hibernate will detect this and, since value-typed objects can't have shared references, delete the
child from the database.
Now consider the same scenario with parent and child objects being entities, not value-types (e.g. categories
and items, or parent and child cats). Entities have their own lifecycle, support shared references (so removing
an entity from the collection does not mean it can be deleted), and there is by default no cascading of state from
one entity to any other associated entities. Hibernate does not implement persistence by reachability by default.
For each basic operation of the Hibernate session - including persist(), merge(), saveOrUpdate(), de-
lete(), lock(), refresh(), evict(), replicate() - there is a corresponding cascade style. Respectively,
the cascade styles are named create, merge, save-update, delete, lock, refresh, evict, replicate.
If you want an operation to be cascaded along an association, you must indicate that in the mapping document.
For example:
You may even use cascade="all" to specify that all operations should be cascaded along the association. The
default cascade="none" specifies that no operations are to be cascaded.
A special cascade style, delete-orphan, applies only to one-to-many associations, and indicates that the de-
lete() operation should be applied to any child object that is removed from the association.
Recommendations:
• It doesn't usually make sense to enable cascade on a <many-to-one> or <many-to-many> association. Cas-
cade is often useful for <one-to-one> and <one-to-many> associations.
• If the child object's lifespan is bounded by the lifespan of the of the parent object make it a lifecycle object
by specifying cascade="all,delete-orphan".
• Otherwise, you might not need cascade at all. But if you think that you will often be working with the par-
ent and children together in the same transaction, and you want to save yourself some typing, consider us-
ing cascade="persist,merge,save-update".
Mapping an association (either a single valued association, or a collection) with cascade="all" marks the as-
sociation as a parent/child style relationship where save/update/delete of the parent results in save/update/delete
of the child or children.
Futhermore, a mere reference to a child from a persistent parent will result in save/update of the child. This
metaphor is incomplete, however. A child which becomes unreferenced by its parent is not automatically de-
leted, except in the case of a <one-to-many> association mapped with cascade="delete-orphan". The precise
semantics of cascading operations for a parent/child relationship are as follows:
Finally, note that cascading of operations can be applied to an object graph at call time or at flush time. All op-
erations, if enabled, are cascaded to associated entities reachable when the operation is executed. However,
save-upate and delete-orphan are transitive for all associated entities reachable during flush of the Session.
Hibernate exposes metadata via the ClassMetadata and CollectionMetadata interfaces and the Type hier-
archy. Instances of the metadata interfaces may be obtained from the SessionFactory.
Hibernate does not lock objects in memory. Your application can expect the behavior as defined by the isola-
tion level of your database transactions. Note that thanks to the Session, which is also a transaction-scoped
cache, Hibernate provides repeatable reads for lookup by identifier and entity queries (not reporting queries that
return scalar values).
In addition to versioning for automatic optimistic concurrency control, Hibernate also offers a (minor) API for
pessimistic locking of rows, using the SELECT FOR UPDATE syntax. Optimistic concurrency control and this API
are discussed later in this chapter.
We start the discussion of concurrency control in Hibernate with the granularity of Configuration, Session-
Factory, and Session, as well as database transactions and long conversations.
A Session is an inexpensive, non-threadsafe object that should be used once, for a single request, a conversa-
tion, single unit of work, and then discarded. A Session will not obtain a JDBC Connection (or a Datasource)
unless it is needed, hence consume no resources until used.
To complete this picture you also have to think about database transactions. A database transaction has to be as
short as possible, to reduce lock contention in the database. Long database transactions will prevent your ap-
plication from scaling to highly concurrent load. Hence, it is almost never good design to hold a database trans-
action open during user think time, until the unit of work is complete.
What is the scope of a unit of work? Can a single Hibernate Session span several database transactions or is
this a one-to-one relationship of scopes? When should you open and close a Session and how do you demarc-
ate the database transaction boundaries?
First, don't use the session-per-operation antipattern, that is, don't open and close a Session for every simple
database call in a single thread! Of course, the same is true for database transactions. Database calls in an ap-
plication are made using a planned sequence, they are grouped into atomic units of work. (Note that this also
means that auto-commit after every single SQL statement is useless in an application, this mode is intended for
ad-hoc SQL console work. Hibernate disables, or expects the application server to do so, auto-commit mode
immediately.) Database transactions are never optional, all communication with a database has to occur inside a
transaction, no matter if you read or write data. As explained, auto-commit behavior for reading data should be
avoided, as many small transactions are unlikely to perform better than one clearly defined unit of work. The
latter is also much more maintainable and extensible.
The most common pattern in a multi-user client/server application is session-per-request. In this model, a re-
quest from the client is send to the server (where the Hibernate persistence layer runs), a new Hibernate Ses-
sion is opened, and all database operations are executed in this unit of work. Once the work has been com-
pleted (and the response for the client has been prepared), the session is flushed and closed. You would also use
a single database transaction to serve the clients request, starting and committing it when you open and close
the Session. The relationship between the two is one-to-one and this model is a perfect fit for many applica-
tions.
The challenge lies in the implementation. Hibernate provides built-in management of the "current session" to
simplify this pattern. All you have to do is start a transaction when a server request has to be processed, and end
the transaction before the response is send to the client. You can do this in any way you like, common solutions
are ServletFilter, AOP interceptor with a pointcut on the service methods, or a proxy/interception container.
An EJB container is a standardized way to implement cross-cutting aspects such as transaction demarcation on
EJB session beans, declaratively with CMT. If you decide to use programmatic transaction demarcation, prefer
the Hibernate Transaction API shown later in this chapter, for ease of use and code portability.
Your application code can access a "current session" to process the request by simply calling sessionFact-
ory.getCurrentSession() anywhere and as often as needed. You will always get a Session scoped to the cur-
rent database transaction. This has to be configured for either resource-local or JTA environments, see Sec-
tion 2.5, “Contextual Sessions”.
Sometimes it is convenient to extend the scope of a Session and database transaction until the "view has been
rendered". This is especially useful in servlet applications that utilize a separate rendering phase after the re-
quest has been processed. Extending the database transaction until view rendering is complete is easy to do if
you implement your own interceptor. However, it is not easily doable if you rely on EJBs with container-man-
aged transactions, as a transaction will be completed when an EJB method returns, before rendering of any
view can start. See the Hibernate website and forum for tips and examples around this Open Session in View
pattern.
The session-per-request pattern is not the only useful concept you can use to design units of work. Many busi-
ness processes require a whole series of interactions with the user interleaved with database accesses. In web
and enterprise applications it is not acceptable for a database transaction to span a user interaction. Consider the
following example:
• The first screen of a dialog opens, the data seen by the user has been loaded in a particular Session and
database transaction. The user is free to modify the objects.
• The user clicks "Save" after 5 minutes and expects his modifications to be made persistent; he also expects
that he was the only person editing this information and that no conflicting modification can occur.
We call this unit of work, from the point of view of the user, a long running conversation (or application trans-
action). There are many ways how you can implement this in your application.
A first naive implementation might keep the Session and database transaction open during user think time,
with locks held in the database to prevent concurrent modification, and to guarantee isolation and atomicity.
This is of course an anti-pattern, since lock contention would not allow the application to scale with the number
of concurrent users.
Clearly, we have to use several database transactions to implement the converastion. In this case, maintaining
isolation of business processes becomes the partial responsibility of the application tier. A single conversation
usually spans several database transactions. It will be atomic if only one of these database transactions (the last
one) stores the updated data, all others simply read data (e.g. in a wizard-style dialog spanning several request/
response cycles). This is easier to implement than it might sound, especially if you use Hibernate's features:
• Automatic Versioning - Hibernate can do automatic optimistic concurrency control for you, it can automat-
ically detect if a concurrent modification occured during user think time. Usually we only check at the end
of the conversation.
• Detached Objects - If you decide to use the already discussed session-per-request pattern, all loaded in-
stances will be in detached state during user think time. Hibernate allows you to reattach the objects and
persist the modifications, the pattern is called session-per-request-with-detached-objects. Automatic ver-
sioning is used to isolate concurrent modifications.
• Extended (or Long) Session - The Hibernate Session may be disconnected from the underlying JDBC con-
nection after the database transaction has been committed, and reconnected when a new client request oc-
curs. This pattern is known as session-per-conversation and makes even reattachment unnecessary. Auto-
matic versioning is used to isolate concurrent modifications and the Session is usually not allowed to be
flushed automatically, but explicitely.
An application may concurrently access the same persistent state in two different Sessions. However, an in-
stance of a persistent class is never shared between two Session instances. Hence there are two different no-
tions of identity:
Database Identity
foo.getId().equals( bar.getId() )
JVM Identity
foo==bar
Then for objects attached to a particular Session (i.e. in the scope of a Session) the two notions are equival-
ent, and JVM identity for database identity is guaranteed by Hibernate. However, while the application might
concurrently access the "same" (persistent identity) business object in two different sessions, the two instances
will actually be "different" (JVM identity). Conflicts are resolved using (automatic versioning) at flush/commit
time, using an optimistic approach.
This approach leaves Hibernate and the database to worry about concurrency; it also provides the best scalabil-
ity, since guaranteeing identity in single-threaded units of work only doesn't need expensive locking or other
means of synchronization. The application never needs to synchronize on any business object, as long as it
sticks to a single thread per Session. Within a Session the application may safely use == to compare objects.
However, an application that uses == outside of a Session, might see unexpected results. This might occur even
in some unexpected places, for example, if you put two detached instances into the same Set. Both might have
the same database identity (i.e. they represent the same row), but JVM identity is by definition not guaranteed
for instances in detached state. The developer has to override the equals() and hashCode() methods in persist-
ent classes and implement his own notion of object equality. There is one caveat: Never use the database identi-
fier to implement equality, use a business key, a combination of unique, usually immutable, attributes. The
database identifier will change if a transient object is made persistent. If the transient instance (usually together
with detached instances) is held in a Set, changing the hashcode breaks the contract of the Set. Attributes for
business keys don't have to be as stable as database primary keys, you only have to guarantee stability as long
as the objects are in the same Set. See the Hibernate website for a more thorough discussion of this issue. Also
note that this is not a Hibernate issue, but simply how Java object identity and equality has to be implemented.
Never use the anti-patterns session-per-user-session or session-per-application (of course, there are rare excep-
tions to this rule). Note that some of the following issues might also appear with the recommended patterns,
make sure you understand the implications before making a design decision:
• A Session is not thread-safe. Things which are supposed to work concurrently, like HTTP requests, session
beans, or Swing workers, will cause race conditions if a Session instance would be shared. If you keep
your Hibernate Session in your HttpSession (discussed later), you should consider synchronizing access
to your Http session. Otherwise, a user that clicks reload fast enough may use the same Session in two con-
currently running threads.
• An exception thrown by Hibernate means you have to rollback your database transaction and close the Ses-
sion immediately (discussed later in more detail). If your Session is bound to the application, you have to
stop the application. Rolling back the database transaction doesn't put your business objects back into the
state they were at the start of the transaction. This means the database state and the business objects do get
out of sync. Usually this is not a problem, because exceptions are not recoverable and you have to start over
after rollback anyway.
• The Session caches every object that is in persistent state (watched and checked for dirty state by Hibern-
ate). This means it grows endlessly until you get an OutOfMemoryException, if you keep it open for a long
time or simply load too much data. One solution for this is to call clear() and evict() to manage the Ses-
sion cache, but you most likely should consider a Stored Procedure if you need mass data operations. Some
solutions are shown in Chapter 13, Batch processing. Keeping a Session open for the duration of a user
session also means a high probability of stale data.
A Hibernate application can run in non-managed (i.e. standalone, simple Web- or Swing applications) and man-
aged J2EE environments. In a non-managed environment, Hibernate is usually responsible for its own database
connection pool. The application developer has to manually set transaction boundaries, in other words, begin,
commit, or rollback database transactions himself. A managed environment usually provides container-man-
aged transactions (CMT), with the transaction assembly defined declaratively in deployment descriptors of EJB
session beans, for example. Programmatic transaction demarcation is then no longer necessary.
However, it is often desirable to keep your persistence layer portable between non-managed resource-local en-
vironments, and systems that can rely on JTA but use BMT instead of CMT. In both cases you'd use program-
matic transaction demaracation. Hibernate offers a wrapper API called Transaction that translates into the nat-
ive transaction system of your deployment environment. This API is actually optional, but we strongly encour-
age its use unless you are in a CMT session bean.
Flushing the session has been discussed earlier, we'll now have a closer look at transaction demarcation and ex-
ception handling in both managed- and non-managed environments.
If a Hibernate persistence layer runs in a non-managed environment, database connections are usually handled
by simple (i.e. non-DataSource) connection pools from which Hibernate obtains connections as needed. The
session/transaction handling idiom looks like this:
// do some work
...
tx.commit();
}
catch (RuntimeException e) {
if (tx != null) tx.rollback();
throw e; // or display error message
}
finally {
sess.close();
}
You don't have to flush() the Session explicitly - the call to commit() automatically triggers the synchroniza-
tion (depending upon the Section 10.10, “Flushing the Session” for the session. A call to close() marks the
end of a session. The main implication of close() is that the JDBC connection will be relinquished by the ses-
sion. This Java code is portable and runs in both non-managed and JTA environments.
A much more flexible solution is Hibernate's built-in "current session" context management, as described earli-
er:
// do some work
...
factory.getCurrentSession().getTransaction().commit();
}
catch (RuntimeException e) {
factory.getCurrentSession().getTransaction().rollback();
throw e; // or display error message
}
You will very likely never see these code snippets in a regular application; fatal (system) exceptions should al-
ways be caught at the "top". In other words, the code that executes Hibernate calls (in the persistence layer) and
the code that handles RuntimeException (and usually can only clean up and exit) are in different layers. The
current context management by Hibernate can significantly simplify this design, as all you need is access to a
SessionFactory. Exception handling is discussed later in this chapter.
If your persistence layer runs in an application server (e.g. behind EJB session beans), every datasource con-
nection obtained by Hibernate will automatically be part of the global JTA transaction. You can also install a
standalone JTA implementation and use it without EJB. Hibernate offers two strategies for JTA integration.
If you use bean-managed transactions (BMT) Hibernate will tell the application server to start and end a BMT
transaction if you use the Transaction API. So, the transaction management code is identical to the non-
managed environment.
// BMT idiom
Session sess = factory.openSession();
Transaction tx = null;
try {
tx = sess.beginTransaction();
// do some work
...
tx.commit();
}
catch (RuntimeException e) {
if (tx != null) tx.rollback();
throw e; // or display error message
}
finally {
sess.close();
}
If you want to use a transaction-bound Session, that is, the getCurrentSession() functionality for easy con-
text propagation, you will have to use the JTA UserTransaction API directly:
tx.begin();
tx.commit();
}
catch (RuntimeException e) {
tx.rollback();
throw e; // or display error message
}
With CMT, transaction demarcation is done in session bean deployment descriptors, not programatically,
hence, the code is reduced to:
// CMT idiom
Session sess = factory.getCurrentSession();
// do some work
...
In a CMT/EJB even rollback happens automatically, since an unhandled RuntimeException thrown by a ses-
sion bean method tells the container to set the global transaction to rollback. This means you do not need to use
the Hibernate Transaction API at all with BMT or CMT, and you get automatic propagation of the "current"
Note that you should choose org.hibernate.transaction.JTATransactionFactory if you use JTA directly
(BMT), and org.hibernate.transaction.CMTTransactionFactory in a CMT session bean, when you config-
ure Hibernate's transaction factory. Remember to also set hibernate.transaction.manager_lookup_class.
Furthermore, make sure that your hibernate.current_session_context_class is either unset (backwards
compatiblity), or set to "jta".
The getCurrentSession() operation has one downside in a JTA environment. There is one caveat to the use of
after_statement connection release mode, which is then used by default. Due to a silly limitation of the JTA
spec, it is not possible for Hibernate to automatically clean up any unclosed ScrollableResults or Iterator
instances returned by scroll() or iterate(). You must release the underlying database cursor by calling
ScrollableResults.close() or Hibernate.close(Iterator) explicity from a finally block. (Of course,
most applications can easily avoid using scroll() or iterate() at all from the JTA or CMT code.)
If the Session throws an exception (including any SQLException), you should immediately rollback the data-
base transaction, call Session.close() and discard the Session instance. Certain methods of Session will not
leave the session in a consistent state. No exception thrown by Hibernate can be treated as recoverable. Ensure
that the Session will be closed by calling close() in a finally block.
The HibernateException, which wraps most of the errors that can occur in a Hibernate persistence layer, is an
unchecked exception (it wasn't in older versions of Hibernate). In our opinion, we shouldn't force the applica-
tion developer to catch an unrecoverable exception at a low layer. In most systems, unchecked and fatal excep-
tions are handled in one of the first frames of the method call stack (i.e. in higher layers) and an error message
is presented to the application user (or some other appropriate action is taken). Note that Hibernate might also
throw other unchecked exceptions which are not a HibernateException. These are, again, not recoverable and
appropriate action should be taken.
Hibernate wraps SQLExceptions thrown while interacting with the database in a JDBCException. In fact, Hi-
bernate will attempt to convert the eexception into a more meningful subclass of JDBCException. The underly-
ing SQLException is always available via JDBCException.getCause(). Hibernate converts the SQLException
into an appropriate JDBCException subclass using the SQLExceptionConverter attached to the SessionFact-
ory. By default, the SQLExceptionConverter is defined by the configured dialect; however, it is also possible
to plug in a custom implementation (see the javadocs for the SQLExceptionConverterFactory class for de-
tails). The standard JDBCException subtypes are:
One extremely important feature provided by a managed environment like EJB that is never provided for non-
managed code is transaction timeout. Transaction timeouts ensure that no misbehaving transaction can indefin-
itely tie up resources while returning no response to the user. Outside a managed (JTA) environment, Hibernate
cannot fully provide this functionality. However, Hibernate can at least control data access operations, ensuring
that database level deadlocks and queries with huge result sets are limited by a defined timeout. In a managed
environment, Hibernate can delegate transaction timeout to JTA. This functioanlity is abstracted by the Hibern-
// do some work
...
sess.getTransaction().commit()
}
catch (RuntimeException e) {
sess.getTransaction().rollback();
throw e; // or display error message
}
finally {
sess.close();
}
Note that setTimeout() may not be called in a CMT bean, where transaction timeouts must be defined declar-
atively.
In an implementation without much help from Hibernate, each interaction with the database occurs in a new
Session and the developer is responsible for reloading all persistent instances from the database before manip-
ulating them. This approach forces the application to carry out its own version checking to ensure conversation
transaction isolation. This approach is the least efficient in terms of database access. It is the approach most
similar to entity EJBs.
t.commit();
session.close();
The version property is mapped using <version>, and Hibernate will automatically increment it during flush if
the entity is dirty.
Of course, if you are operating in a low-data-concurrency environment and don't require version checking, you
may use this approach and just skip the version check. In that case, last commit wins will be the default strategy
for your long conversations. Keep in mind that this might confuse the users of the application, as they might ex-
perience lost updates without error messages or a chance to merge conflicting changes.
Clearly, manual version checking is only feasible in very trivial circumstances and not practical for most ap-
plications. Often not only single instances, but complete graphs of modified ojects have to be checked. Hibern-
ate offers automatic version checking with either an extended Session or detached instances as the design
paradigm.
A single Session instance and its persistent instances are used for the whole conversation, known as session-
per-conversation. Hibernate checks instance versions at flush time, throwing an exception if concurrent modi-
fication is detected. It's up to the developer to catch and handle this exception (common options are the oppor-
tunity for the user to merge changes or to restart the business conversation with non-stale data).
The Session is disconnected from any underlying JDBC connection when waiting for user interaction. This ap-
proach is the most efficient in terms of database access. The application need not concern itself with version
checking or with reattaching detached instances, nor does it have to reload instances in every database transac-
tion.
foo.setProperty("bar");
The foo object still knows which Session it was loaded in. Beginning a new database transaction on an old ses-
sion obtains a new connection and resumes the session. Committing a database transaction disconnects a ses-
sion from the JDBC connection and returns the connection to the pool. After reconnection, to force a version
check on data you aren't updating, you may call Session.lock() with LockMode.READ on any objects that
might have been updated by another transaction. You don't need to lock any data that you are updating. Usually
you would set FlushMode.NEVER on an extended Session, so that only the last database transaction cycle is al-
lowed to actually persist all modifications made in this conversation. Hence, only this last database transaction
would include the flush() operation, and then also close() the session to end the conversation.
This pattern is problematic if the Session is too big to be stored during user think time, e.g. an HttpSession
should be kept as small as possible. As the Session is also the (mandatory) first-level cache and contains all
loaded objects, we can probably use this strategy only for a few request/response cycles. You should use a Ses-
sion only for a single conversation, as it will soon also have stale data.
(Note that earlier Hibernate versions required explicit disconnection and reconnection of a Session. These
methods are deprecated, as beginning and ending a transaction has the same effect.)
Also note that you should keep the disconnected Session close to the persistence layer. In other words, use an
EJB stateful session bean to hold the Session in a three-tier environment, and don't transfer it to the web layer
(or even serialize it to a separate tier) to store it in the HttpSession.
The extended session pattern, or session-per-conversation, is more difficult to implement with automatic cur-
rent session context management. You need to supply your own implementation of the CurrentSessionCon-
text for this, see the Hibernate Wiki for examples.
Each interaction with the persistent store occurs in a new Session. However, the same persistent instances are
reused for each interaction with the database. The application manipulates the state of detached instances ori-
ginally loaded in another Session and then reattaches them using Session.update(), Ses-
sion.saveOrUpdate(), or Session.merge().
Again, Hibernate will check instance versions during flush, throwing an exception if conflicting updates oc-
cured.
You may also call lock() instead of update() and use LockMode.READ (performing a version check, bypassing
all caches) if you are sure that the object has not been modified.
You may disable Hibernate's automatic version increment for particular properties and collections by setting the
optimistic-lock mapping attribute to false. Hibernate will then no longer increment versions if the property
is dirty.
Legacy database schemas are often static and can't be modified. Or, other applications might also access the
same database and don't know how to handle version numbers or even timestamps. In both cases, versioning
can't rely on a particular column in a table. To force a version check without a version or timestamp property
mapping, with a comparison of the state of all fields in a row, turn on optimistic-lock="all" in the <class>
mapping. Note that this concepetually only works if Hibernate can compare the old and new state, i.e. if you
use a single long Session and not session-per-request-with-detached-objects.
Sometimes concurrent modification can be permitted as long as the changes that have been made don't overlap.
If you set optimistic-lock="dirty" when mapping the <class>, Hibernate will only compare dirty fields dur-
ing flush.
In both cases, with dedicated version/timestamp columns or with full/dirty field comparison, Hibernate uses a
single UPDATE statement (with an appropriate WHERE clause) per entity to execute the version check and update
the information. If you use transitive persistence to cascade reattachment to associated entities, Hibernate might
execute uneccessary updates. This is usually not a problem, but on update triggers in the database might be ex-
ecuted even when no changes have been made to detached instances. You can customize this behavior by set-
ting select-before-update="true" in the <class> mapping, forcing Hibernate to SELECT the instance to en-
sure that changes did actually occur, before updating the row.
Hibernate will always use the locking mechanism of the database, never lock objects in memory!
The LockMode class defines the different lock levels that may be acquired by Hibernate. A lock is obtained by
the following mechanisms:
If Session.load() is called with UPGRADE or UPGRADE_NOWAIT, and the requested object was not yet loaded by
the session, the object is loaded using SELECT ... FOR UPDATE. If load() is called for an object that is already
loaded with a less restrictive lock than the one requested, Hibernate calls lock() for that object.
Session.lock() performs a version number check if the specified lock mode is READ, UPGRADE or UP-
GRADE_NOWAIT. (In the case of UPGRADE or UPGRADE_NOWAIT, SELECT ... FOR UPDATE is used.)
If the database does not support the requested lock mode, Hibernate will use an appropriate alternate mode
(instead of throwing an exception). This ensures that applications will be portable.
• ON_CLOSE - is essentially the legacy behavior described above. The Hibernate session obatins a connection
when it first needs to perform some JDBC access and holds unto that connection until the session is closed.
• AFTER_TRANSACTION - says to release connections after a org.hibernate.Transaction has completed.
• AFTER_STATEMENT (also referred to as aggressive release) - says to release connections after each and every
statement execution. This aggressive releasing is skipped if that statement leaves open resources associated
with the given session; currently the only situation where this occurs is through the use of
org.hibernate.ScrollableResults.
12.1. Interceptors
The Interceptor interface provides callbacks from the session to the application allowing the application to in-
spect and/or manipulate properties of a persistent object before it is saved, updated, deleted or loaded. One pos-
sible use for this is to track auditing information. For example, the following Interceptor automatically sets
the createTimestamp when an Auditable is created and updates the lastUpdateTimestamp property when an
Auditable is updated.
package org.hibernate.test;
import java.io.Serializable;
import java.util.Date;
import java.util.Iterator;
import org.hibernate.EmptyInterceptor;
import org.hibernate.Transaction;
import org.hibernate.type.Type;
loads++;
}
return false;
}
A Session-scoped interceptor is specified when a session is opened using one of the overloaded SessionFact-
ory.openSession() methods accepting an Interceptor.
A SessionFactory-scoped interceptor is registered with the Configuration object prior to building the Ses-
sionFactory. In this case, the supplied interceptor will be applied to all sessions opened from that Session-
Factory; this is true unless a session is opened explicitly specifying the interceptor to use. SessionFactory-
scoped interceptors must be thread safe, taking care to not store session-specific state since multiple sessions
will use this interceptor (potentially) concurrently.
Essentially all of the methods of the Session interface correlate to an event. You have a LoadEvent, a
FlushEvent, etc (consult the XML configuration-file DTD or the org.hibernate.event package for the full
list of defined event types). When a request is made of one of these methods, the Hibernate Session generates
an appropriate event and passes it to the configured event listeners for that type. Out-of-the-box, these listeners
implement the same processing in which those methods always resulted. However, you are free to implement a
customization of one of the listener interfaces (i.e., the LoadEvent is processed by the registered implemenation
of the LoadEventListener interface), in which case their implementation would be responsible for processing
any load() requests made of the Session.
The listeners should be considered effectively singletons; meaning, they are shared between requests, and thus
should not save any state as instance variables.
A custom listener should implement the appropriate interface for the event it wants to process and/or extend
one of the convenience base classes (or even the default event listeners used by Hibernate out-of-the-box as
these are declared non-final for this purpose). Custom listeners can either be registered programmatically
through the Configuration object, or specified in the Hibernate configuration XML (declarative configuration
through the properties file is not supported). Here's an example of a custom load event listener:
You also need a configuration entry telling Hibernate to use the listener in addition to the default listener:
<hibernate-configuration>
<session-factory>
...
<event type="load">
<listener class="com.eg.MyLoadListener"/>
<listener class="org.hibernate.event.def.DefaultLoadEventListener"/>
</event>
</session-factory>
</hibernate-configuration>
Listeners registered declaratively cannot share instances. If the same class name is used in multiple
<listener/> elements, each reference will result in a separate instance of that class. If you need the capability
to share listener instances between listener types you must use the programmatic registration approach.
Why implement an interface and define the specific type during configuration? Well, a listener implementation
could implement multiple event listener interfaces. Having the type additionally defined during registration
makes it easier to turn custom listeners on or off during configuration.
First, you must configure the appropriate event listeners, to enable the use of JAAS authorization.
Note that <listener type="..." class="..."/> is just a shorthand for <event type="..."><listener
class="..."/></event> when there is exactly one listener for a particular event type.
The role names are the roles understood by your JACC provider.
This would fall over with an OutOfMemoryException somewhere around the 50 000th row. That's because Hi-
bernate caches all the newly inserted Customer instances in the session-level cache.
In this chapter we'll show you how to avoid this problem. First, however, if you are doing batch processing, it is
absolutely critical that you enable the use of JDBC batching, if you intend to achieve reasonable performance.
Set the JDBC batch size to a reasonable number (say, 10-50):
hibernate.jdbc.batch_size 20
Note that Hibernate disables insert batching at the JDBC level transparently if you use an identiy identifier
generator.
You also might like to do this kind of work in a process where interaction with the second-level cache is com-
pletely disabled:
hibernate.cache.use_second_level_cache false
However, this is not absolutely necessary, since we can explicitly set the CacheMode to disable interaction with
the second-level cache.
tx.commit();
session.close();
For retrieving and updating data the same ideas apply. In addition, you need to use scroll() to take advantage
of server-side cursors for queries that return many rows of data.
tx.commit();
session.close();
tx.commit();
session.close();
Note that in this code example, the Customer instances returned by the query are immediately detached. They
are never associated with any persistence context.
The insert(), update() and delete() operations defined by the StatelessSession interface are considered
to be direct database row-level operations, which result in immediate execution of a SQL INSERT, UPDATE or
DELETE respectively. Thus, they have very different semantics to the save(), saveOrUpdate() and delete()
operations defined by the Session interface.
As already discussed, automatic and transparent object/relational mapping is concerned with the management
of object state. This implies that the object state is available in memory, hence manipulating (using the SQL
Data Manipulation Language (DML) statements: INSERT, UPDATE, DELETE) data directly in the database will
not affect in-memory state. However, Hibernate provides methods for bulk SQL-style DML statement execu-
tion which are performed through the Hibernate Query Language (Chapter 14, HQL: The Hibernate Query
Language).
The pseudo-syntax for UPDATE and DELETE statements is: ( UPDATE | DELETE ) FROM? EntityName (WHERE
where_conditions)?. Some points to note:
As an example, to execute an HQL UPDATE, use the Query.executeUpdate() method (the method is named for
those familiar with JDBC's PreparedStatement.executeUpdate()):
String hqlUpdate = "update Customer c set c.name = :newName where c.name = :oldName";
// or String hqlUpdate = "update Customer set name = :newName where name = :oldName";
int updatedEntities = s.createQuery( hqlUpdate )
.setString( "newName", newName )
.setString( "oldName", oldName )
.executeUpdate();
tx.commit();
session.close();
HQL UPDATE statements, by default do not effect the Section 5.1.9, “version (optional)” or the Section 5.1.10,
“timestamp (optional)” property values for the affected entities; this is in keeping with the EJB3 specification.
However, you can force Hibernate to properly reset the version or timestamp property values through the use
of a versioned update. This is achieved by adding the VERSIONED keyword after the UPDATE keyword.
Note that custom version types (org.hibernate.usertype.UserVersionType) are not allowed in conjunction
with a update versioned statement.
.executeUpdate();
tx.commit();
session.close();
The int value returned by the Query.executeUpdate() method indicate the number of entities effected by the
operation. Consider this may or may not correlate to the number of rows effected in the database. An HQL bulk
operation might result in multiple actual SQL statements being executed, for joined-subclass, for example. The
returned number indicates the number of actual entities affected by the statement. Going back to the example of
joined-subclass, a delete against one of the subclasses may actually result in deletes against not just the table to
which that subclass is mapped, but also the "root" table and potentially joined-subclass tables further down the
inheritence hierarchy.
The pseudo-syntax for INSERT statements is: INSERT INTO EntityName properties_list select_statement.
Some points to note:
• Only the INSERT INTO ... SELECT ... form is supported; not the INSERT INTO ... VALUES ... form.
The properties_list is analogous to the column speficiation in the SQL INSERT statement. For entities in-
volved in mapped inheritence, only properties directly defined on that given class-level can be used in the
properties_list. Superclass properties are not allowed; and subclass properties do not make sense. In other
words, INSERT statements are inherently non-polymorphic.
• select_statement can be any valid HQL select query, with the caveat that the return types must match the
types expected by the insert. Currently, this is checked during query compilation rather than allowing the
check to relegate to the database. Note however that this might cause problems between Hibernate Types
which are equivalent as opposed to equal. This might cause issues with mismatches between a property
defined as a org.hibernate.type.DateType and a property defined as a
org.hibernate.type.TimestampType, even though the database might not make a distinction or might be
able to handle the conversion.
• For the id property, the insert statement gives you two options. You can either explicitly specify the id prop-
erty in the properties_list (in which case its value is taken from the corresponding select expression) or omit
it from the properties_list (in which case a generated value is used). This later option is only available when
using id generators that operate in the database; attempting to use this option with any "in memory" type
generators will cause an exception during parsing. Note that for the purposes of this discussion, in-database
generators are considered to be org.hibernate.id.SequenceGenerator (and its subclasses) and any imple-
mentors of org.hibernate.id.PostInsertIdentifierGenerator. The most notable exception here is
org.hibernate.id.TableHiLoGenerator, which cannot be used because it does not expose a selectable
way to get its values.
• For properties mapped as either version or timestamp, the insert statement gives you two options. You can
either specify the property in the properties_list (in which case its value is taken from the corresponding se-
lect expressions) or omit it from the properties_list (in which case the seed value defined by the
org.hibernate.type.VersionType is used).
String hqlInsert = "insert into DelinquentAccount (id, name) select c.id, c.name from Customer c where
int createdEntities = s.createQuery( hqlInsert )
.executeUpdate();
tx.commit();
session.close();
This manual uses lowercase HQL keywords. Some users find queries with uppercase keywords more readable,
but we find this convention ugly when embedded in Java code.
from eg.Cat
which simply returns all instances of the class eg.Cat. We don't usually need to qualify the class name, since
auto-import is the default. So we almost always just write:
from Cat
Most of the time, you will need to assign an alias, since you will want to refer to the Cat in other parts of the
query.
This query assigns the alias cat to Cat instances, so we could use that alias later in the query. The as keyword
is optional; we could also write:
It is considered good practice to name query aliases using an initial lowercase, consistent with Java naming
standards for local variables (eg. domesticCat).
• inner join
• left outer join
• right outer join
• full join (not usually useful)
The inner join, left outer join and right outer join constructs may be abbreviated.
You may supply extra join conditions using the HQL with keyword.
In addition, a "fetch" join allows associations or collections of values to be initialized along with their parent
objects, using a single select. This is particularly useful in the case of a collection. It effectively overrides the
outer join and lazy declarations of the mapping file for associations and collections. See Section 19.1,
“Fetching strategies” for more information.
A fetch join does not usually need to assign an alias, because the associated objects should not be used in the
where clause (or any other clause). Also, the associated objects are not returned directly in the query results. In-
stead, they may be accessed via the parent object. The only reason we might need an alias is if we are recurs-
ively join fetching a further collection:
Note that the fetch construct may not be used in queries called using iterate() (though scroll() can be
used). Nor should fetch be used together with setMaxResults() or setFirstResult() as these operations are
based on the result rows, which usually contain duplicates for eager collection fetching, hence, the number of
rows is not what you'd expect. Nor may fetch be used together with an ad hoc with condition. It is possible to
create a cartesian product by join fetching more than one collection in a query, so take care in this case. Join
fetching multiple collection roles also sometimes gives unexpected results for bag mappings, so be careful
about how you formulate your queries in this case. Finally, note that full join fetch and right join fetch
are not meaningful.
If you are using property-level lazy fetching (with bytecode instrumentation), it is possible to force Hibernate to
fetch the lazy properties immediately (in the first query) using fetch all properties.
from Document doc fetch all properties where lower(doc.name) like '%cats%'
The queries shown in the previous section all use the explicit form where the join keyword is explicitly used
in the from clause. This is the recommended form.
The implicit form does not use the join keyword. Instead, the associations are "dereferenced" using dot-
notation. implicit joins can appear in any of the HQL clauses. implicit join result in inner joins in the result-
ing SQL statement.
• The special property (lowercase) id may be used to reference the identifier property of an entity provided
that entity does not define a non-identifier property named id.
• If the entity defines a named identifier property, you may use that property name.
References to composite identifier properties follow the same naming rules. If the entity has a non-identifier
property named id, the composite identifier property can only be referenced by its defined named; otherwise,
the special id property can be used to rerference the identifier property.
Note: this has changed significantly starting in version 3.2.2. In previous versions, id always referred to the
identifier property no matter what its actual name. A ramification of that decision was that non-identifier prop-
erties named id could never be referenced in Hibernate queries.
select mate
from Cat as cat
inner join cat.mate as mate
The query will select mates of other Cats. Actually, you may express this query more compactly as:
Queries may return properties of any value type including properties of component type:
Queries may return multiple objects and/or properties as an array of type Object[],
or as a List,
This is most useful when used together with select new map:
You may use arithmetic operators, concatenation, and recognized SQL functions in the select clause:
The distinct and all keywords may be used and have the same semantics as in SQL.
returns instances not only of Cat, but also of subclasses like DomesticCat. Hibernate queries may name any
Java class or interface in the from clause. The query will return instances of all persistent classes that extend
that class or implement the interface. The following query would return all persistent objects:
from java.lang.Object o
Note that these last two queries will require more than one SQL SELECT. This means that the order by clause
does not correctly order the whole result set. (It also means you can't call these queries using Query.scroll().)
select foo
from Foo foo, Bar bar
where foo.startDate = bar.date
will return all instances of Foo for which there exists an instance of bar with a date property equal to the
startDate property of the Foo. Compound path expressions make the where clause extremely powerful. Con-
sider:
This query translates to an SQL query with a table (inner) join. If you were to write something like
you would end up with a query that would require four table joins in SQL.
The = operator may be used to compare not only properties, but also instances:
The special property (lowercase) id may be used to reference the unique identifier of an object. See Sec-
tion 14.5, “Refering to identifier property” for more information.
Properties of composite identifiers may also be used. Suppose Person has a composite identifier consisting of
country and medicareNumber. Again, see Section 14.5, “Refering to identifier property” for more information
regarding referencing identifier properties.
Likewise, the special property class accesses the discriminator value of an instance in the case of polymorphic
persistence. A Java class name embedded in the where clause will be translated to its discriminator value.
You may also use components or composite user types, or properties of said component types. See ??? for more
details.
An "any" type has the special properties id and class, allowing us to express a join in the following way
(where AuditLog.item is a property mapped with <any>).
Notice that log.item.class and payment.class would refer to the values of completely different database
columns in the above query.
14.10. Expressions
Expressions allowed in the where clause include most of the kind of things you could write in SQL:
• mathematical operators +, -, *, /
• binary comparison operators =, >=, <=, <>, !=, like
• logical operations and, or, not
• Parentheses ( ), indicating grouping
• in, not in, between, is null, is not null, is empty, is not empty, member of and not member of
• "Simple" case, case ... when ... then ... else ... end, and "searched" case, case when ... then
... else ... end
• string concatenation ...||... or concat(...,...)
• current_date(), current_time(), current_timestamp()
• second(...), minute(...), hour(...), day(...), month(...), year(...),
• Any function or operator defined by EJB-QL 3.0: substring(), trim(), lower(), upper(), length(),
locate(), abs(), sqrt(), bit_length(), mod()
• coalesce() and nullif()
• str() for converting numeric or temporal values to a readable string
• cast(... as ...), where the second argument is the name of a Hibernate type, and extract(... from
...) if ANSI cast() and extract() is supported by the underlying database
• the HQL index() function, that applies to aliases of a joined indexed collection
• HQL functions that take collection-valued path expressions: size(), minelement(), maxelement(),
minindex(), maxindex(), along with the special elements() and indices functions which may be quanti-
fied using some, all, exists, any, in.
• Any database-supported SQL scalar function like sign(), trunc(), rtrim(), sin()
• JDBC-style positional parameters ?
• named parameters :name, :start_date, :x1
• SQL literals 'foo', 69, 6.66E+2, '1970-01-01 10:00:01.0'
• Java public static final constants eg.Color.TABBY
from DomesticCat cat where cat.name not between 'A' and 'B'
Likewise, is null and is not null may be used to test for null values.
Booleans may be easily used in expressions by declaring HQL query substitutions in Hibernate configuration:
This will replace the keywords true and false with the literals 1 and 0 in the translated SQL from this HQL:
You may test the size of a collection with the special property size, or the special size() function.
For indexed collections, you may refer to the minimum and maximum indices using minindex and maxindex
functions. Similarly, you may refer to the minimum and maximum elements of a collection of basic type using
the minelement and maxelement functions.
The SQL functions any, some, all, exists, in are supported when passed the element or index set of a col-
lection (elements and indices functions) or the result of a subquery (see below).
Note that these constructs - size, elements, indices, minindex, maxindex, minelement, maxelement - may
only be used in the where clause in Hibernate3.
Elements of indexed collections (arrays, lists, maps) may be referred to by index (in a where clause only):
HQL also provides the built-in index() function, for elements of a one-to-many association or collection of
values.
If you are not yet convinced by all this, think how much longer and less readable the following query would be
in SQL:
select cust
from Product prod,
Store store
inner join store.customers cust
where prod.name = 'widget'
and store.location.name in ( 'Melbourne', 'Sydney' )
and prod = all elements(cust.currentOrder.lineItems)
SQL functions and aggregate functions are allowed in the having and order by clauses, if supported by the un-
derlying database (eg. not in MySQL).
select cat
from Cat cat
join cat.kittens kitten
group by cat.id, cat.name, cat.other, cat.properties
having avg(kitten.weight) > 100
order by count(kitten) asc, sum(kitten.weight) desc
Note that neither the group by clause nor the order by clause may contain arithmetic expressions. Also note
that Hibernate currently does not expand a grouped entity, so you can't write group by cat if all properties of
cat are non-aggregated. You have to list all non-aggregated properties explicitly.
14.13. Subqueries
For databases that support subselects, Hibernate supports subqueries within queries. A subquery must be sur-
rounded by parentheses (often by an SQL aggregate function call). Even correlated subqueries (subqueries that
refer to an alias in the outer query) are allowed.
Note that HQL subqueries may occur only in the select or where clauses.
Note that subqueries can also utilize row value constructor syntax. See Section 14.18, “Row value con-
structor syntax” for more details.
Hibernate queries can be quite powerful and complex. In fact, the power of the query language is one of Hi-
bernate's main selling points. Here are some example queries very similar to queries that I used on a recent
project. Note that most queries you will write are much simpler than these!
The following query returns the order id, number of items and total value of the order for all unpaid orders for a
particular customer and given minimum total value, ordering the results by total value. In determining the
prices, it uses the current catalog. The resulting SQL query, against the ORDER, ORDER_LINE, PRODUCT, CATALOG
and PRICE tables has four inner joins and an (uncorrelated) subselect.
What a monster! Actually, in real life, I'm not very keen on subqueries, so my query was really more like this:
The next query counts the number of payments in each status, excluding all payments in the AWAIT-
ING_APPROVAL status where the most recent status change was made by the current user. It translates to an SQL
query with two inner joins and a correlated subselect against the PAYMENT, PAYMENT_STATUS and PAY-
MENT_STATUS_CHANGE tables.
If I would have mapped the statusChanges collection as a list, instead of a set, the query would have been
much simpler to write.
The next query uses the MS SQL Server isNull() function to return all the accounts and unpaid payments for
the organization to which the current user belongs. It translates to an SQL query with three inner joins, an outer
join and a subselect against the ACCOUNT, PAYMENT, PAYMENT_STATUS, ACCOUNT_TYPE, ORGANIZATION and
ORG_USER tables.
For some databases, we would need to do away with the (correlated) subselect.
If your database supports subselects, you can place a condition upon selection size in the where clause of your
query:
As this solution can't return a User with zero messages because of the inner join, the following form is also use-
ful:
14.17. Components
Components might be used in just about every way that simple value types can be used in HQL queries. They
can appear in the select clause:
where the Person's name property is a component. Components can also be used in the where clause:
Another common use of components is in Section 14.18, “Row value constructor syntax”.
That's valid syntax, although a little verbose. It be nice to make this a bit more concise and use row value con-
structor syntax:
Another time using row value constructor syntax can be beneficial is when using subqueries needing to
compare against multiple values:
One thing to consider when deciding if you want to use this syntax is that the query will be dependent upon the
ordering of the component sub-properties in the metadata.
There are quite a range of built-in criterion types (Restrictions subclasses), but one that is especially useful
lets you specify SQL directly.
The {alias} placeholder with be replaced by the row alias of the queried entity.
An alternative approach to obtaining a criterion is to get it from a Property instance. You can create a Prop-
erty by calling Property.forName().
15.4. Associations
You may easily specify constraints upon related entities by navigating associations using createCriteria().
note that the second createCriteria() returns a new instance of Criteria, which refers to the elements of the
kittens collection.
Note that the kittens collections held by the Cat instances returned by the previous two queries are not pre-
filtered by the criteria! If you wish to retrieve just the kittens that match the criteria, you must use a Result-
Transformer.
.createCriteria("kittens", "kt")
.add( Restrictions.eq("name", "F%") )
.setResultTransformer(Criteria.ALIAS_TO_ENTITY_MAP)
.list();
Iterator iter = cats.iterator();
while ( iter.hasNext() ) {
Map map = (Map) iter.next();
Cat cat = (Cat) map.get(Criteria.ROOT_ALIAS);
Cat kitten = (Cat) map.get("kt");
}
This query will fetch both mate and kittens by outer join. See Section 19.1, “Fetching strategies” for more in-
formation.
Version properties, identifiers and associations are ignored. By default, null valued properties are excluded.
You can even use examples to place criteria upon associated objects.
There is no explicit "group by" necessary in a criteria query. Certain projection types are defined to be grouping
projections, which also appear in the SQL group by clause.
An alias may optionally be assigned to a projection, so that the projected value may be referred to in restrictions
or orderings. Here are two different ways to do this:
The alias() and as() methods simply wrap a projection instance in another, aliased, instance of Projection.
As a shortcut, you can assign an alias when you add the projection to a projection list:
A DetachedCriteria may also be used to express a subquery. Criterion instances involving subqueries may be
obtained via Subqueries or Property.
idation algorithm: lookups by a constant natural key. In some applications, this kind of query occurs frequently.
The criteria API provides special provision for this use case.
First, you should map the natural key of your entity using <natural-id>, and enable use of the second-level
cache.
<class name="User">
<cache usage="read-write"/>
<id name="id">
<generator class="increment"/>
</id>
<natural-id>
<property name="name"/>
<property name="org"/>
</natural-id>
<property name="password"/>
</class>
Note that this functionality is not intended for use with entities with mutable natural keys.
Now, Restrictions.naturalId() allows us to make use of the more efficient cache algorithm.
session.createCriteria(User.class)
.add( Restrictions.naturalId()
.set("name", "gavin")
.set("org", "hb")
).setCacheable(true)
.uniqueResult();
Hibernate3 allows you to specify handwritten SQL (including stored procedures) for all create, update, delete,
and load operations.
These will both return a List of Object arrays (Object[]) with scalar values for each column in the CATS table.
Hibernate will use ResultSetMetadata to deduce the actual order and types of the returned scalar values.
To avoid the overhead of using ResultSetMetadata or simply to be more explicit in what is returned one can
use addScalar().
This will still return Object arrays, but now it will not use ResultSetMetdata but will instead explicitly get the
ID, NAME and BIRTHDATE column as respectively a Long, String and a Short from the underlying resultset.
This also means that only these three columns will be returned, even though the query is using * and could re-
turn more than the three listed columns.
It is possible to leave out the type information for all or some of the scalars.
This is essentially the same query as before, but now ResultSetMetaData is used to decide the type of NAME
and BIRTHDATE where as the type of ID is explicitly specified.
How the java.sql.Types returned from ResultSetMetaData is mapped to Hibernate types is controlled by the
Dialect. If a specific type is not mapped or does not result in the expected type it is possible to customize it via
calls to registerHibernateType in the Dialect.
The above queries were all about returning scalar values, basically returning the "raw" values from the result-
set. The following shows how to get entity objects from a native sql query via addEntity().
Assuming that Cat is mapped as a class with the columns ID, NAME and BIRTHDATE the above queries will
both return a List where each element is a Cat entity.
If the entity is mapped with a many-to-one to another entity it is required to also return this when performing
the native query, otherwise a database specific "column not found" error will occur. The additional columns
will automatically be returned when using the * notation, but we prefer to be explicit as in the following ex-
ample for a many-to-one to a Dog:
It is possible to eagerly join in the Dog to avoid the possible extra roundtrip for initializing the proxy. This is
done via the addJoin() method, which allows you to join in an association or collection.
sess.createSQLQuery("SELECT c.ID, NAME, BIRTHDATE, DOG_ID, D_ID, D_NAME FROM CATS c, DOGS d WHERE c.DO
.addEntity("cat", Cat.class)
.addJoin("cat.dog");
In this example the returned Cat's will have their dog property fully initialized without any extra roundtrip to
the database. Notice that we added a alias name ("cat") to be able to specify the target property path of the join.
It is possible to do the same eager joining for collections, e.g. if the Cat had a one-to-many to Dog instead.
sess.createSQLQuery("SELECT ID, NAME, BIRTHDATE, D_ID, D_NAME, CAT_ID FROM CATS c, DOGS d WHERE c.ID =
.addEntity("cat", Cat.class)
.addJoin("cat.dogs");
<p>At this stage we are reaching the limits of what is possible with native queries without starting to enhance
the sql queries to make them usable in Hibernate; the problems starts to arise when returning multiple entities
of the same type or when the default alias/column names are not enough.</p>
Until now the result set column names are assumed to be the same as the column names specified in the map-
ping document. This can be problematic for SQL queries which join multiple tables, since the same column
names may appear in more than one table.
Column alias injection is needed in the following query (which most likely will fail):
The intention for this query is to return two Cat instances per row, a cat and its mother. This will fail since there
is a conflict of names since they are mapped to the same column names and on some databases the returned
column aliases will most likely be on the form "c.ID", "c.NAME", etc. which are not equal to the columns spe-
cificed in the mappings ("ID" and "NAME").
• the SQL query string, with placeholders for Hibernate to inject column aliases
The {cat.*} and {mother.*} notation used above is a shorthand for "all properties". Alternatively, you may list
the columns explicity, but even in this case we let Hibernate inject the SQL column aliases for each property.
The placeholder for a column alias is just the property name qualified by the table alias. In the following ex-
ample, we retrieve Cats and their mothers from a different table (cat_log) to the one declared in the mapping
metadata. Notice that we may even use the property aliases in the where clause if we like.
For most cases the above alias injection is needed, but for queries relating to more complex mappings like com-
posite properties, inheritance discriminators, collections etc. there are some specific aliases to use to allow Hi-
bernate to inject the proper aliases.
The following table shows the different possibilities of using the alias injection. Note: the alias names in the
result are examples, each alias will have a unique and probably different name when used.
It is possible to apply a ResultTransformer to native sql queries. Allowing it to e.g. return non-managed entit-
ies.
• a result transformer
The above query will return a list of CatDTO which has been instantiated and injected the values of NAME and
BIRTHNAME into its corresponding properties or fields.
Native sql queries which query for entities that is mapped as part of an inheritance must include all properties
for the baseclass and all it subclasses.
16.1.7. Parameters
<sql-query name="persons">
<return alias="person" class="eg.Person"/>
SELECT person.NAME AS {person.name},
person.AGE AS {person.age},
person.SEX AS {person.sex}
FROM PERSON person
WHERE person.NAME LIKE :namePattern
</sql-query>
The <return-join> and <load-collection> elements are used to join associations and define queries which
initialize collections, respectively.
<sql-query name="personsWith">
<return alias="person" class="eg.Person"/>
<return-join alias="address" property="person.mailingAddress"/>
SELECT person.NAME AS {person.name},
person.AGE AS {person.age},
person.SEX AS {person.sex},
adddress.STREET AS {address.street},
adddress.CITY AS {address.city},
adddress.STATE AS {address.state},
adddress.ZIP AS {address.zip}
FROM PERSON person
JOIN ADDRESS adddress
ON person.ID = address.PERSON_ID AND address.TYPE='MAILING'
WHERE person.NAME LIKE :namePattern
</sql-query>
A named SQL query may return a scalar value. You must declare the column alias and Hibernate type using the
<return-scalar> element:
<sql-query name="mySqlQuery">
<return-scalar column="name" type="string"/>
<return-scalar column="age" type="long"/>
SELECT p.NAME AS name,
p.AGE AS age,
FROM PERSON p WHERE p.NAME LIKE 'Hiber%'
</sql-query>
You can externalize the resultset mapping informations in a <resultset> element to either reuse them accross
several named queries or through the setResultSetMapping() API.
<resultset name="personAddress">
You can alternatively use the resultset mapping information in your hbm files directly in java code.
With <return-property> you can explicitly tell Hibernate what column aliases to use, instead of using the {}-
syntax to let Hibernate inject its own aliases.
<sql-query name="mySqlQuery">
<return alias="person" class="eg.Person">
<return-property name="name" column="myName"/>
<return-property name="age" column="myAge"/>
<return-property name="sex" column="mySex"/>
</return>
SELECT person.NAME AS myName,
person.AGE AS myAge,
person.SEX AS mySex,
FROM PERSON person WHERE person.NAME LIKE :name
</sql-query>
<return-property> also works with multiple columns. This solves a limitation with the {}-syntax which can
not allow fine grained control of multi-column properties.
<sql-query name="organizationCurrentEmployments">
<return alias="emp" class="Employment">
<return-property name="salary">
<return-column name="VALUE"/>
<return-column name="CURRENCY"/>
</return-property>
<return-property name="endDate" column="myEndDate"/>
</return>
SELECT EMPLOYEE AS {emp.employee}, EMPLOYER AS {emp.employer},
STARTDATE AS {emp.startDate}, ENDDATE AS {emp.endDate},
REGIONCODE as {emp.regionCode}, EID AS {emp.id}, VALUE, CURRENCY
FROM EMPLOYMENT
WHERE EMPLOYER = :id AND ENDDATE IS NULL
ORDER BY STARTDATE ASC
</sql-query>
Notice that in this example we used <return-property> in combination with the {}-syntax for injection. Al-
lowing users to choose how they want to refer column and properties.
If your mapping has a discriminator you must use <return-discriminator> to specify the discriminator
column.
Hibernate 3 introduces support for queries via stored procedures and functions. Most of the following docu-
mentation is equivalent for both. The stored procedure/function must return a resultset as the first out-parameter
to be able to work with Hibernate. An example of such a stored function in Oracle 9 and higher is as follows:
To use this query in Hibernate you need to map it via a named query.
Notice stored procedures currently only return scalars and entities. <return-join> and <load-collection> are
not supported.
To use stored procedures with Hibernate the procedures/functions have to follow some rules. If they do not fol-
low those rules they are not usable with Hibernate. If you still want to use these procedures you have to execute
them via session.connection(). The rules are different for each database, since database vendors have differ-
ent stored procedure semantics/syntax.
• A function must return a result set. The first parameter of a procedure must be an OUT that returns a result
set. This is done by using a SYS_REFCURSOR type in Oracle 9 or 10. In Oracle you need to define a REF
CURSOR type, see Oracle literature.
• The procedure must return a result set. Note that since these servers can/will return multiple result sets and
update counts, Hibernate will iterate the results and take the first result that is a result set as its return value.
Everything else will be discarded.
• If you can enable SET NOCOUNT ON in your procedure it will probably be more efficient, but this is not a re-
quirement.
<class name="Person">
<id name="id">
<generator class="increment"/>
</id>
<property name="name" not-null="true"/>
<sql-insert>INSERT INTO PERSON (NAME, ID) VALUES ( UPPER(?), ? )</sql-insert>
<sql-update>UPDATE PERSON SET NAME=UPPER(?) WHERE ID=?</sql-update>
<sql-delete>DELETE FROM PERSON WHERE ID=?</sql-delete>
</class>
The SQL is directly executed in your database, so you are free to use any dialect you like. This will of course
reduce the portability of your mapping if you use database specific SQL.
<class name="Person">
<id name="id">
<generator class="increment"/>
</id>
<property name="name" not-null="true"/>
<sql-insert callable="true">{call createPerson (?, ?)}</sql-insert>
<sql-delete callable="true">{? = call deletePerson (?)}</sql-delete>
<sql-update callable="true">{? = call updatePerson (?, ?)}</sql-update>
</class>
The order of the positional parameters are currently vital, as they must be in the same sequence as Hibernate
expects them.
You can see the expected order by enabling debug logging for the org.hibernate.persister.entity level.
With this level enabled Hibernate will print out the static SQL that is used to create, update, delete etc. entities.
(To see the expected sequence, remember to not include your custom SQL in the mapping files as that will
override the Hibernate generated static sql.)
The stored procedures are in most cases (read: better do it than not) required to return the number of rows inser-
ted/updated/deleted, as Hibernate has some runtime checks for the success of the statement. Hibernate always
registers the first statement parameter as a numeric output parameter for the CUD operations:
update PERSON
set
NAME = uname,
where
ID = uid;
return SQL%ROWCOUNT;
END updatePerson;
<sql-query name="person">
<return alias="pers" class="Person" lock-mode="upgrade"/>
SELECT NAME AS {pers.name}, ID AS {pers.id}
FROM PERSON
WHERE ID=?
FOR UPDATE
</sql-query>
This is just a named query declaration, as discussed earlier. You may reference this named query in a class
mapping:
<class name="Person">
<id name="id">
<generator class="increment"/>
</id>
<property name="name" not-null="true"/>
<loader query-ref="person"/>
</class>
<sql-query name="employments">
<load-collection alias="emp" role="Person.employments"/>
SELECT {emp.*}
FROM EMPLOYMENT emp
WHERE EMPLOYER = :id
ORDER BY STARTDATE ASC, EMPLOYEE ASC
</sql-query>
You could even define an entity loader that loads a collection by join fetching:
<sql-query name="person">
<return alias="pers" class="Person"/>
<return-join alias="emp" property="pers.employments"/>
SELECT NAME AS {pers.*}, {emp.*}
FROM PERSON pers
LEFT OUTER JOIN EMPLOYMENT emp
ON pers.ID = emp.PERSON_ID
WHERE ID=?
</sql-query>
In order to use filters, they must first be defined and then attached to the appropriate mapping elements. To
define a filter, use the <filter-def/> element within a <hibernate-mapping/> element:
<filter-def name="myFilter">
<filter-param name="myFilterParam" type="string"/>
</filter-def>
or, to a collection:
<set ...>
<filter name="myFilter" condition=":myFilterParam = MY_FILTERED_COLUMN"/>
</set>
session.enableFilter("myFilter").setParameter("myFilterParam", "some-value");
Note that methods on the org.hibernate.Filter interface do allow the method-chaining common to much of Hi-
bernate.
A full example, using temporal data with an effective record date pattern:
<filter-def name="effectiveDate">
<filter-param name="asOfDate" type="date"/>
</filter-def>
...
<!--
Note that this assumes non-terminal records have an eff_end_dt set to
a max db date for simplicity-sake
-->
<filter name="effectiveDate"
condition=":asOfDate BETWEEN eff_start_dt and eff_end_dt"/>
</class>
Then, in order to ensure that you always get back currently effective records, simply enable the filter on the ses-
sion prior to retrieving employee data:
In the HQL above, even though we only explicitly mentioned a salary constraint on the results, because of the
enabled filter the query will return only currently active employees who have a salary greater than a million
dollars.
Note: if you plan on using filters with outer joining (either through HQL or load fetching) be careful of the dir-
ection of the condition expression. Its safest to set this up for left outer joining; in general, place the parameter
first followed by the column name(s) after the operator.
After being defined a filter might be attached to multiple entities and/or collections each with its own condition.
That can be tedious when the conditions are the same each time. Thus <filter-def/> allows defining a default
condition, either as an attribute or CDATA:
This default condition will then be used whenever the filter is attached to something without specifying a con-
dition. Note that this means you can give a specific condition as part of the attachment of the filter which over-
rides the default condition in that particular case.
Hibernate supports dom4j as API for manipulating XML trees. You can write queries that retrieve dom4j trees
from the database and have any modification you make to the tree automatically synchronized to the database.
You can even take an XML document, parse it using dom4j, and write it to the database with any of Hibernate's
basic operations: persist(), saveOrUpdate(), merge(), delete(), replicate() (merging is not yet sup-
ported).
This feature has many applications including data import/export, externalization of entity data via JMS or
SOAP and XSLT-based reporting.
A single mapping may be used to simultaneously map properties of a class and nodes of an XML document to
the database, or, if there is no class to map, it may be used to map just the XML.
<class name="Account"
table="ACCOUNTS"
node="account">
<id name="accountId"
column="ACCOUNT_ID"
node="@id"/>
<many-to-one name="customer"
column="CUSTOMER_ID"
node="customer/@id"
embed-xml="false"/>
<property name="balance"
column="BALANCE"
node="balance"/>
...
</class>
<class entity-name="Account"
table="ACCOUNTS"
node="account">
<id name="id"
column="ACCOUNT_ID"
node="@id"
type="string"/>
<many-to-one name="customerId"
column="CUSTOMER_ID"
node="customer/@id"
embed-xml="false"
entity-name="Customer"/>
<property name="balance"
column="BALANCE"
node="balance"
type="big_decimal"/>
...
</class>
This mapping allows you to access the data as a dom4j tree, or as a graph of property name/value pairs (java
Maps). The property names are purely logical constructs that may be referred to in HQL queries.
For collections and single valued associations, there is an additional embed-xml attribute. If embed-xml="true",
the default, the XML tree for the associated entity (or collection of value type) will be embedded directly in the
XML tree for the entity that owns the association. Otherwise, if embed-xml="false", then only the referenced
identifier value will appear in the XML for single point associations and collections will simply not appear at
all.
You should be careful not to leave embed-xml="true" for too many associations, since XML does not deal well
with circularity!
<class name="Customer"
table="CUSTOMER"
node="customer">
<id name="id"
column="CUST_ID"
node="@id"/>
<map name="accounts"
node="."
embed-xml="true">
<key column="CUSTOMER_ID"
not-null="true"/>
<map-key column="SHORT_DESC"
node="@short-desc"
type="string"/>
<one-to-many entity-name="Account"
embed-xml="false"
node="account"/>
</map>
<component name="name"
node="name">
<property name="firstName"
node="first-name"/>
<property name="initial"
node="initial"/>
<property name="lastName"
node="last-name"/>
</component>
...
</class>
in this case, we have decided to embed the collection of account ids, but not the actual account data. The fol-
lowing HQL query:
from Customer c left join fetch c.accounts where c.lastName like :lastName
<customer id="123456789">
<account short-desc="Savings">987632567</account>
<account short-desc="Credit Card">985612323</account>
<name>
<first-name>Gavin</first-name>
<initial>A</initial>
<last-name>King</last-name>
</name>
...
</customer>
If you set embed-xml="true" on the <one-to-many> mapping, the data might look more like this:
<customer id="123456789">
<account id="987632567" short-desc="Savings">
<customer id="123456789"/>
<balance>100.29</balance>
</account>
<account id="985612323" short-desc="Credit Card">
<customer id="123456789"/>
<balance>-2370.34</balance>
</account>
<name>
<first-name>Gavin</first-name>
<initial>A</initial>
<last-name>King</last-name>
</name>
...
</customer>
.list();
for ( int i=0; i<results.size(); i++ ) {
//add the customer data to the XML document
Element customer = (Element) results.get(i);
doc.add(customer);
}
tx.commit();
session.close();
tx.commit();
session.close();
It is extremely useful to combine this feature with Hibernate's replicate() operation to implement XML-
based data import/export.
• Join fetching - Hibernate retrieves the associated instance or collection in the same SELECT, using an OUTER
JOIN.
• Select fetching - a second SELECT is used to retrieve the associated entity or collection. Unless you explicitly
disable lazy fetching by specifying lazy="false", this second select will only be executed when you actu-
ally access the association.
• Subselect fetching - a second SELECT is used to retrieve the associated collections for all entities retrieved in
a previous query or fetch. Unless you explicitly disable lazy fetching by specifying lazy="false", this
second select will only be executed when you actually access the association.
• Batch fetching - an optimization strategy for select fetching - Hibernate retrieves a batch of entity instances
or collections in a single SELECT, by specifying a list of primary keys or foreign keys.
• Immediate fetching - an association, collection or attribute is fetched immediately, when the owner is
loaded.
• Lazy collection fetching - a collection is fetched when the application invokes an operation upon that collec-
tion. (This is the default for collections.)
• "Extra-lazy" collection fetching - individual elements of the collection are accessed from the database as
needed. Hibernate tries not to fetch the whole collection into memory unless absolutely needed (suitable for
very large collections)
• Proxy fetching - a single-valued association is fetched when a method other than the identifier getter is in-
voked upon the associated object.
• "No-proxy" fetching - a single-valued association is fetched when the instance variable is accessed. Com-
pared to proxy fetching, this approach is less lazy (the association is fetched even when only the identifier is
accessed) but more transparent, since no proxy is visible to the application. This approach requires build-
time bytecode instrumentation and is rarely necessary.
• Lazy attribute fetching - an attribute or single valued association is fetched when the instance variable is ac-
cessed. This approach requires buildtime bytecode instrumentation and is rarely necessary.
We have two orthogonal notions here: when is the association fetched, and how is it fetched (what SQL is
used). Don't confuse them! We use fetch to tune performance. We may use lazy to define a contract for what
data is always available in any detached instance of a particular class.
By default, Hibernate3 uses lazy select fetching for collections and lazy proxy fetching for single-valued asso-
ciations. These defaults make sense for almost all associations in almost all applications.
Note: if you set hibernate.default_batch_fetch_size, Hibernate will use the batch fetch optimization for
lazy fetching (this optimization may also be enabled at a more granular level).
However, lazy fetching poses one problem that you must be aware of. Access to a lazy association outside of
the context of an open Hibernate session will result in an exception. For example:
s = sessions.openSession();
Transaction tx = s.beginTransaction();
tx.commit();
s.close();
Since the permissions collection was not initialized when the Session was closed, the collection will not be
able to load its state. Hibernate does not support lazy initialization for detached objects. The fix is to move the
code that reads from the collection to just before the transaction is committed.
Alternatively, we could use a non-lazy collection or association, by specifying lazy="false" for the associ-
ation mapping. However, it is intended that lazy initialization be used for almost all collections and associ-
ations. If you define too many non-lazy associations in your object model, Hibernate will end up needing to
fetch the entire database into memory in every transaction!
On the other hand, we often want to choose join fetching (which is non-lazy by nature) instead of select fetch-
ing in a particular transaction. We'll now see how to customize the fetching strategy. In Hibernate3, the mech-
anisms for choosing a fetch strategy are identical for single-valued associations and collections.
Select fetching (the default) is extremely vulnerable to N+1 selects problems, so we might want to enable join
fetching in the mapping document:
<set name="permissions"
fetch="join">
<key column="userId"/>
<one-to-many class="Permission"/>
</set
• Criteria queries
No matter what fetching strategy you use, the defined non-lazy graph is guaranteed to be loaded into memory.
Note that this might result in several immediate selects being used to execute a particular HQL query.
Usually, we don't use the mapping document to customize fetching. Instead, we keep the default behavior, and
override it for a particular transaction, using left join fetch in HQL. This tells Hibernate to fetch the associ-
ation eagerly in the first select, using an outer join. In the Criteria query API, you would use setFetch-
Mode(FetchMode.JOIN).
If you ever feel like you wish you could change the fetching strategy used by get() or load(), simply use a
Criteria query, for example:
(This is Hibernate's equivalent of what some ORM solutions call a "fetch plan".)
A completely different way to avoid problems with N+1 selects is to use the second-level cache.
Lazy fetching for collections is implemented using Hibernate's own implementation of persistent collections.
However, a different mechanism is needed for lazy behavior in single-ended associations. The target entity of
the association must be proxied. Hibernate implements lazy initializing proxies for persistent objects using
runtime bytecode enhancement (via the excellent CGLIB library).
By default, Hibernate3 generates proxies (at startup) for all persistent classes and uses them to enable lazy
fetching of many-to-one and one-to-one associations.
The mapping file may declare an interface to use as the proxy interface for that class, with the proxy attribute.
By default, Hibernate uses a subclass of the class. Note that the proxied class must implement a default con-
structor with at least package visibility. We recommend this constructor for all persistent classes!
There are some gotchas to be aware of when extending this approach to polymorphic classes, eg.
Firstly, instances of Cat will never be castable to DomesticCat, even if the underlying instance is an instance of
DomesticCat:
Cat cat = (Cat) session.load(Cat.class, id); // instantiate a proxy (does not hit the db)
if ( cat.isDomesticCat() ) { // hit the db to initialize the proxy
DomesticCat dc = (DomesticCat) cat; // Error!
....
}
DomesticCat dc =
(DomesticCat) session.load(DomesticCat.class, id); // acquire new DomesticCat proxy!
System.out.println(cat==dc); // false
However, the situation is not quite as bad as it looks. Even though we now have two references to different
proxy objects, the underlying instance will still be the same object:
Third, you may not use a CGLIB proxy for a final class or a class with any final methods.
Finally, if your persistent object acquires any resources upon instantiation (eg. in initializers or default con-
structor), then those resources will also be acquired by the proxy. The proxy class is an actual subclass of the
persistent class.
These problems are all due to fundamental limitations in Java's single inheritance model. If you wish to avoid
these problems your persistent classes must each implement an interface that declares its business methods.
You should specify these interfaces in the mapping file. eg.
where CatImpl implements the interface Cat and DomesticCatImpl implements the interface DomesticCat.
Then proxies for instances of Cat and DomesticCat may be returned by load() or iterate(). (Note that
list() does not usually return proxies.)
Relationships are also lazily initialized. This means you must declare any properties to be of type Cat, not
CatImpl.
By choosing lazy="no-proxy" instead of the default lazy="proxy", we can avoid the problems associated with
typecasting. However, we will require buildtime bytecode instrumentation, and all operations will result in im-
mediate proxy initialization.
Sometimes we need to ensure that a proxy or collection is initialized before closing the Session. Of course, we
can alway force initialization by calling cat.getSex() or cat.getKittens().size(), for example. But that is
confusing to readers of the code and is not convenient for generic code.
The static methods Hibernate.initialize() and Hibernate.isInitialized() provide the application with a
convenient way of working with lazily initialized collections or proxies. Hibernate.initialize(cat) will
force the initialization of a proxy, cat, as long as its Session is still open. Hibernate.initialize(
cat.getKittens() ) has a similar effect for the collection of kittens.
Another option is to keep the Session open until all needed collections and proxies have been loaded. In some
application architectures, particularly where the code that accesses data using Hibernate, and the code that uses
it are in different application layers or different physical processes, it can be a problem to ensure that the Ses-
sion is open when a collection is initialized. There are two basic ways to deal with this issue:
• In a web-based application, a servlet filter can be used to close the Session only at the very end of a user
request, once the rendering of the view is complete (the Open Session in View pattern). Of course, this
places heavy demands on the correctness of the exception handling of your application infrastructure. It is
vitally important that the Session is closed and the transaction ended before returning to the user, even
when an exception occurs during rendering of the view. See the Hibernate Wiki for examples of this "Open
Session in View" pattern.
• In an application with a separate business tier, the business logic must "prepare" all collections that will be
needed by the web tier before returning. This means that the business tier should load all the data and return
all the data already initialized to the presentation/web tier that is required for a particular use case. Usually,
the application calls Hibernate.initialize() for each collection that will be needed in the web tier (this
call must occur before the session is closed) or retrieves the collection eagerly using a Hibernate query with
a FETCH clause or a FetchMode.JOIN in Criteria. This is usually easier if you adopt the Command pattern
instead of a Session Facade.
• You may also attach a previously loaded object to a new Session with merge() or lock() before accessing
uninitialized collections (or other proxies). No, Hibernate does not, and certainly should not do this auto-
matically, since it would introduce ad hoc transaction semantics!
Sometimes you don't want to initialize a large collection, but still need some information about it (like its size)
or a subset of the data.
You can use a collection filter to get the size of a collection without initializing it:
The createFilter() method is also used to efficiently retrieve subsets of a collection without needing to ini-
tialize the whole collection:
Hibernate can make efficient use of batch fetching, that is, Hibernate can load several uninitialized proxies if
one proxy is accessed (or collections. Batch fetching is an optimization of the lazy select fetching strategy.
There are two ways you can tune batch fetching: on the class and the collection level.
Batch fetching for classes/entities is easier to understand. Imagine you have the following situation at runtime:
You have 25 Cat instances loaded in a Session, each Cat has a reference to its owner, a Person. The Person
class is mapped with a proxy, lazy="true". If you now iterate through all cats and call getOwner() on each,
Hibernate will by default execute 25 SELECT statements, to retrieve the proxied owners. You can tune this beha-
vior by specifying a batch-size in the mapping of Person:
Hibernate will now execute only three queries, the pattern is 10, 10, 5.
You may also enable batch fetching of collections. For example, if each Person has a lazy collection of Cats,
and 10 persons are currently loaded in the Sesssion, iterating through all persons will generate 10 SELECTs, one
for every call to getCats(). If you enable batch fetching for the cats collection in the mapping of Person, Hi-
bernate can pre-fetch collections:
<class name="Person">
<set name="cats" batch-size="3">
...
</set>
</class>
With a batch-size of 8, Hibernate will load 3, 3, 3, 1 collections in four SELECTs. Again, the value of the at-
tribute depends on the expected number of uninitialized collections in a particular Session.
Batch fetching of collections is particularly useful if you have a nested tree of items, ie. the typical bill-
of-materials pattern. (Although a nested set or a materialized path might be a better option for read-mostly
trees.)
If one lazy collection or single-valued proxy has to be fetched, Hibernate loads all of them, re-running the ori-
ginal query in a subselect. This works in the same way as batch-fetching, without the piecemeal loading.
Hibernate3 supports the lazy fetching of individual properties. This optimization technique is also known as
fetch groups. Please note that this is mostly a marketing feature, as in practice, optimizing row reads is much
more important than optimization of column reads. However, only loading some properties of a class might be
useful in extreme cases, when legacy tables have hundreds of columns and the data model can not be improved.
To enable lazy property loading, set the lazy attribute on your particular property mappings:
<class name="Document">
<id name="id">
<generator class="native"/>
</id>
<property name="name" not-null="true" length="50"/>
<property name="summary" not-null="true" length="200" lazy="true"/>
<property name="text" not-null="true" length="2000" lazy="true"/>
</class>
Lazy property loading requires buildtime bytecode instrumentation! If your persistent classes are not enhanced,
Hibernate will silently ignore lazy property settings and fall back to immediate fetching.
<instrument verbose="true">
<fileset dir="${testclasses.dir}/org/hibernate/auction/model">
<include name="*.class"/>
</fileset>
</instrument>
</target>
A different (better?) way to avoid unnecessary column reads, at least for read-only transactions is to use the
projection features of HQL or Criteria queries. This avoids the need for buildtime bytecode processing and is
certainly a prefered solution.
You may force the usual eager fetching of properties using fetch all properties in HQL.
You have the option to tell Hibernate which caching implementation to use by specifying the name of a class
that implements org.hibernate.cache.CacheProvider using the property
hibernate.cache.provider_class. Hibernate comes bundled with a number of built-in integrations with
open-source cache providers (listed below); additionally, you could implement your own and plug it in as out-
lined above. Note that versions prior to 3.2 defaulted to use EhCache as the default cache provider; that is no
longer the case as of 3.2.
The <cache> element of a class or collection mapping has the following form:
<cache
usage="transactional|read-write|nonstrict-read-write|read-only" (1)
region="RegionName" (2)
include="all|non-lazy" (3)
/>
(1) usage (required) specifies the caching strategy: transactional, read-write, nonstrict-read-write or
read-only
(2) region (optional, defaults to the class or collection role name) specifies the name of the second level
cache region
(3) include (optional, defaults to all) non-lazy specifies that properties of the entity mapped with
lazy="true" may not be cached when attribute-level lazy fetching is enabled
Alternatively (preferrably?), you may specify <class-cache> and <collection-cache> elements in hibern-
ate.cfg.xml.
If your application needs to read but never modify instances of a persistent class, a read-only cache may be
used. This is the simplest and best performing strategy. It's even perfectly safe for use in a cluster.
If the application needs to update data, a read-write cache might be appropriate. This cache strategy should
never be used if serializable transaction isolation level is required. If the cache is used in a JTA environment,
you must specify the property hibernate.transaction.manager_lookup_class, naming a strategy for obtain-
ing the JTA TransactionManager. In other environments, you should ensure that the transaction is completed
when Session.close() or Session.disconnect() is called. If you wish to use this strategy in a cluster, you
should ensure that the underlying cache implementation supports locking. The built-in cache providers do not.
If the application only occasionally needs to update data (ie. if it is extremely unlikely that two transactions
would try to update the same item simultaneously) and strict transaction isolation is not required, a nonstrict-
read-write cache might be appropriate. If the cache is used in a JTA environment, you must specify hibern-
The transactional cache strategy provides support for fully transactional cache providers such as JBoss
TreeCache. Such a cache may only be used in a JTA environment and you must specify hibern-
ate.transaction.manager_lookup_class.
None of the cache providers support all of the cache concurrency strategies. The following table shows which
providers are compatible with which concurrency strategies.
When flush() is subsequently called, the state of that object will be synchronized with the database. If you do
not want this synchronization to occur or if you are processing a huge number of objects and need to manage
memory efficiently, the evict() method may be used to remove the object and its collections from the first-
level cache.
The Session also provides a contains() method to determine if an instance belongs to the session cache.
To completely evict all objects from the session cache, call Session.clear()
For the second-level cache, there are methods defined on SessionFactory for evicting the cached state of an
instance, entire class, collection instance or entire collection role.
The CacheMode controls how a particular session interacts with the second-level cache.
• CacheMode.NORMAL - read items from and write items to the second-level cache
• CacheMode.GET - read items from the second-level cache, but don't write to the second-level cache except
when updating data
• CacheMode.PUT - write items to the second-level cache, but don't read from the second-level cache
• CacheMode.REFRESH - write items to the second-level cache, but don't read from the second-level cache, by-
pass the effect of hibernate.cache.use_minimal_puts, forcing a refresh of the second-level cache for all
items read from the database
To browse the contents of a second-level or query cache region, use the Statistics API:
You'll need to enable statistics, and, optionally, force Hibernate to keep the cache entries in a more human-
understandable format:
hibernate.generate_statistics true
hibernate.cache.use_structured_entries true
hibernate.cache.use_query_cache true
This setting causes the creation of two new cache regions - one holding cached query result sets
(org.hibernate.cache.StandardQueryCache), the other holding timestamps of the most recent updates to
queryable tables (org.hibernate.cache.UpdateTimestampsCache). Note that the query cache does not cache
the state of the actual entities in the result set; it caches only identifier values and results of value type. So the
query cache should always be used in conjunction with the second-level cache.
Most queries do not benefit from caching, so by default queries are not cached. To enable caching, call
Query.setCacheable(true). This call allows the query to look for existing cache results or add its results to
the cache when it is executed.
If you require fine-grained control over query cache expiration policies, you may specify a named cache region
for a particular query by calling Query.setCacheRegion().
If the query should force a refresh of its query cache region, you should call
Query.setCacheMode(CacheMode.REFRESH). This is particularly useful in cases where underlying data may
have been updated via a separate process (i.e., not modified through Hibernate) and allows the application to
selectively refresh particular query result sets. This is a more efficient alternative to eviction of a query cache
region via SessionFactory.evictQueries().
19.5.1. Taxonomy
• collections of values
This classification distinguishes the various table and foreign key relationships but does not tell us quite
everything we need to know about the relational model. To fully understand the relational structure and per-
formance characteristics, we must also consider the structure of the primary key that is used by Hibernate to up-
date or delete collection rows. This suggests the following classification:
• indexed collections
• sets
• bags
All indexed collections (maps, lists, arrays) have a primary key consisting of the <key> and <index> columns.
In this case collection updates are usually extremely efficient - the primary key may be efficiently indexed and
a particular row may be efficiently located when Hibernate tries to update or delete it.
Sets have a primary key consisting of <key> and element columns. This may be less efficient for some types of
collection element, particularly composite elements or large text or binary fields; the database may not be able
to index a complex primary key as efficently. On the other hand, for one to many or many to many associ-
ations, particularly in the case of synthetic identifiers, it is likely to be just as efficient. (Side-note: if you want
SchemaExport to actually create the primary key of a <set> for you, you must declare all columns as not-
null="true".)
<idbag> mappings define a surrogate key, so they are always very efficient to update. In fact, they are the best
case.
Bags are the worst case. Since a bag permits duplicate element values and has no index column, no primary key
may be defined. Hibernate has no way of distinguishing between duplicate rows. Hibernate resolves this prob-
lem by completely removing (in a single DELETE) and recreating the collection whenever it changes. This might
be very inefficient.
Note that for a one-to-many association, the "primary key" may not be the physical primary key of the database
table - but even in this case, the above classification is still useful. (It still reflects how Hibernate "locates" indi-
vidual rows of the collection.)
19.5.2. Lists, maps, idbags and sets are the most efficient collections to up-
date
From the discussion above, it should be clear that indexed collections and (usually) sets allow the most efficient
operation in terms of adding, removing and updating elements.
There is, arguably, one more advantage that indexed collections have over sets for many to many associations
or collections of values. Because of the structure of a Set, Hibernate doesn't ever UPDATE a row when an ele-
ment is "changed". Changes to a Set always work via INSERT and DELETE (of individual rows). Once again, this
consideration does not apply to one to many associations.
After observing that arrays cannot be lazy, we would conclude that lists, maps and idbags are the most perform-
ant (non-inverse) collection types, with sets not far behind. Sets are expected to be the most common kind of
collection in Hibernate applications. This is because the "set" semantics are most natural in the relational mod-
el.
However, in well-designed Hibernate domain models, we usually see that most collections are in fact one-
to-many associations with inverse="true". For these associations, the update is handled by the many-to-one
end of the association, and so considerations of collection update performance simply do not apply.
19.5.3. Bags and lists are the most efficient inverse collections
Just before you ditch bags forever, there is a particular case in which bags (and also lists) are much more per-
formant than sets. For a collection with inverse="true" (the standard bidirectional one-to-many relationship
idiom, for example) we can add elements to a bag or list without needing to initialize (fetch) the bag elements!
This is because Collection.add() or Collection.addAll() must always return true for a bag or List (unlike
a Set). This can make the following common code much faster.
Occasionally, deleting collection elements one by one can be extremely inefficient. Hibernate isn't completely
stupid, so it knows not to do that in the case of an newly-empty collection (if you called list.clear(), for ex-
ample). In this case, Hibernate will issue a single DELETE and we are done!
Suppose we add a single element to a collection of size twenty and then remove two elements. Hibernate will
issue one INSERT statement and two DELETE statements (unless the collection is a bag). This is certainly desir-
able.
However, suppose that we remove eighteen elements, leaving two and then add thee new elements. There are
two possible ways to proceed
• delete eighteen rows one by one and then insert three rows
• remove the whole collection (in one SQL DELETE) and insert all five current elements (one by one)
Hibernate isn't smart enough to know that the second option is probably quicker in this case. (And it would
probably be undesirable for Hibernate to be that smart; such behaviour might confuse database triggers, etc.)
Fortunately, you can force this behaviour (ie. the second strategy) at any time by discarding (ie. dereferencing)
the original collection and returning a newly instantiated collection with all the current elements. This can be
very useful and powerful from time to time.
You can access SessionFactory metrics in two ways. Your first option is to call sessionFact-
ory.getStatistics() and read or display the Statistics yourself.
Hibernate can also use JMX to publish metrics if you enable the StatisticsService MBean. You may enable
a single MBean for all your SessionFactory or one per factory. See the following code for minimalistic con-
figuration examples:
TODO: This doesn't make sense: In the first case, we retrieve and use the MBean directly. In the second one,
we must give the JNDI name in which the session factory is held before using it. Use hibernateStats-
Bean.setSessionFactoryJNDIName("my/JNDI/Name")
Statistics can be reset programatically using the clear() method. A summary can be sent to a logger (info
level) using the logSummary() method.
19.6.2. Metrics
Hibernate provides a number of metrics, from very basic to the specialized information only relevant in certain
scenarios. All available counters are described in the Statistics interface API, in three categories:
• Metrics related to the general Session usage, such as number of open sessions, retrieved JDBC connec-
tions, etc.
• Metrics related to he entities, collections, queries, and caches as a whole (aka global metrics),
For exampl,e you can check the cache hit, miss, and put ratio of entities, collections and queries, and the aver-
age time a query needs. Beware that the number of milliseconds is subject to approximation in Java. Hibernate
is tied to the JVM precision, on some platforms this might even only be accurate to 10 seconds.
Simple getters are used to access the global metrics (i.e. not tied to a particular entity, collection, cache region,
etc.). You can access the metrics of a particular entity, collection or cache region through its name, and through
its HQL or SQL representation for queries. Please refer to the Statistics, EntityStatistics, CollectionS-
tatistics, SecondLevelCacheStatistics, and QueryStatistics API Javadoc for more information. The fol-
lowing code shows a simple example:
EntityStatistics entityStats =
stats.getEntityStatistics( Cat.class.getName() );
long changes =
entityStats.getInsertCount()
+ entityStats.getUpdateCount()
+ entityStats.getDeleteCount();
log.info(Cat.class.getName() + " changed " + changes + "times" );
To work on all entities, collections, queries and region caches, you can retrieve the list of names of entities, col-
lections, queries and region caches with the following methods: getQueries(), getEntityNames(), getCol-
lectionRoleNames(), and getSecondLevelCacheRegionNames().
The Hibernate Tools currently include plugins for the Eclipse IDE as well as Ant tasks for reverse engineering
of existing databases:
• Mapping Editor: An editor for Hibernate XML mapping files, supporting auto-completion and syntax high-
lighting. It also supports semantic auto-completion for class names and property/field names, making it
much more versatile than a normal XML editor.
• Console: The console is a new view in Eclipse. In addition to a tree overview of your console configura-
tions, you also get an interactive view of your persistent classes and their relationships. The console allows
you to execute HQL queries against your database and browse the result directly in Eclipse.
• Development Wizards: Several wizards are provided with the Hibernate Eclipse tools; you can use a wizard
to quickly generate Hibernate configuration (cfg.xml) files, or you may even completely reverse engineer
an existing database schema into POJO source files and Hibernate mapping files. The reverse engineering
wizard supports customizable templates.
• Ant Tasks:
Please refer to the Hibernate Tools package and it's documentation for more information.
However, the Hibernate main package comes bundled with an integrated tool (it can even be used from "inside"
Hibernate on-the-fly): SchemaExport aka hbm2ddl.
You must specify a SQL Dialect via the hibernate.dialect property when using this tool, as DDL is highly
vendor specific.
Many Hibernate mapping elements define optional attributes named length, precision and scale. You may
set the length, precision and scale of a column with this attribute.
Some tags also accept a not-null attribute (for generating a NOT NULL constraint on table columns) and a
unique attribute (for generating UNIQUE constraint on table columns).
A unique-key attribute may be used to group columns in a single unique key constraint. Currently, the spe-
cified value of the unique-key attribute is not used to name the constraint in the generated DDL, only to group
the columns in the mapping file.
An index attribute specifies the name of an index that will be created using the mapped column or columns.
Multiple columns may be grouped into the same index, simply by specifying the same index name.
A foreign-key attribute may be used to override the name of any generated foreign key constraint.
Many mapping elements also accept a child <column> element. This is particularly useful for mapping multi-
column types:
The default attribute lets you specify a default value for a column (you should assign the same value to the
mapped property before saving a new instance of the mapped class).
The sql-type attribute allows the user to override the default mapping of a Hibernate type to SQL datatype.
unique true|false specifies that the column should have a unique constraint
foreign-key foreign_key_name specifies the name of the foreign key constraint generated
for an association, for a <one-to-one>, <many-to-one>,
<key>, or <many-to-many> mapping element. Note that in-
verse="true" sides will not be considered by SchemaEx-
port.
sql-type SQL column type overrides the default column type (attribute of <column>
element only)
check SQL expression create an SQL check constraint on either column or table
The <comment> element allows you to specify comments for the generated schema.
<property name="balance">
<column name="bal">
<comment>Balance in USD</comment>
</column>
</property>
This results in a comment on table or comment on column statement in the generated DDL (where supported).
The SchemaExport tool writes a DDL script to standard out and/or executes the DDL statements.
Option Description
Option Description
20.1.3. Properties
hibernate.dialect dialect
<target name="schemaexport">
<taskdef name="schemaexport"
classname="org.hibernate.tool.hbm2ddl.SchemaExportTask"
classpathref="class.path"/>
<schemaexport
properties="hibernate.properties"
quiet="no"
text="no"
drop="no"
delimiter=";"
output="schema-export.sql">
<fileset dir="src">
<include name="**/*.hbm.xml"/>
</fileset>
</schemaexport>
</target>
The SchemaUpdate tool will update an existing schema with "incremental" changes. Note that SchemaUpdate
depends heavily upon the JDBC metadata API, so it will not work with all JDBC drivers.
Option Description
<target name="schemaupdate">
<taskdef name="schemaupdate"
classname="org.hibernate.tool.hbm2ddl.SchemaUpdateTask"
classpathref="class.path"/>
<schemaupdate
properties="hibernate.properties"
quiet="no">
<fileset dir="src">
<include name="**/*.hbm.xml"/>
</fileset>
</schemaupdate>
</target>
The SchemaValidator tool will validate that the existing database schema "matches" your mapping documents.
Note that SchemaValidator depends heavily upon the JDBC metadata API, so it will not work with all JDBC
Option Description
<target name="schemavalidate">
<taskdef name="schemavalidator"
classname="org.hibernate.tool.hbm2ddl.SchemaValidatorTask"
classpathref="class.path"/>
<schemavalidator
properties="hibernate.properties">
<fileset dir="src">
<include name="**/*.hbm.xml"/>
</fileset>
</schemaupdate>
</target>
• When we remove / add an object from / to a collection, the version number of the collection owner is incre-
mented.
• If an object that was removed from a collection is an instance of a value type (eg, a composite element), that
object will cease to be persistent and its state will be completely removed from the database. Likewise,
adding a value type instance to the collection will cause its state to be immediately persistent.
• On the other hand, if an entity is removed from a collection (a one-to-many or many-to-many association),
it will not be deleted, by default. This behaviour is completely consistent - a change to the internal state of
another entity should not cause the associated entity to vanish! Likewise, adding an entity to a collection
does not cause that entity to become persistent, by default.
Instead, the default behaviour is that adding an entity to a collection merely creates a link between the two en-
tities, while removing it removes the link. This is very appropriate for all sorts of cases. Where it is not appro-
priate at all is the case of a parent / child relationship, where the life of the child is bound to the lifecycle of the
parent.
<set name="children">
<key column="parent_id"/>
<one-to-many class="Child"/>
</set>
Parent p = .....;
Child c = new Child();
p.getChildren().add(c);
session.save(c);
session.flush();
This is not only inefficient, but also violates any NOT NULL constraint on the parent_id column. We can fix the
nullability constraint violation by specifying not-null="true" in the collection mapping:
<set name="children">
<key column="parent_id" not-null="true"/>
<one-to-many class="Child"/>
</set>
The underlying cause of this behaviour is that the link (the foreign key parent_id) from p to c is not considered
part of the state of the Child object and is therefore not created in the INSERT. So the solution is to make the
link part of the Child mapping.
(We also need to add the parent property to the Child class.)
Now that the Child entity is managing the state of the link, we tell the collection not to update the link. We use
the inverse attribute.
The explicit call to save() is still annoying. We will address this by using cascades.
Similarly, we don't need to iterate over the children when saving or deleting a Parent. The following removes p
and all its children from the database.
will not remove c from the database; it will ony remove the link to p (and cause a NOT NULL constraint viola-
tion, in this case). You need to explicitly delete() the Child.
Now, in our case, a Child can't really exist without its parent. So if we remove a Child from the collection, we
really do want it to be deleted. For this, we must use cascade="all-delete-orphan".
Note: even though the collection mapping specifies inverse="true", cascades are still processed by iterating
the collection elements. So if you require that an object be saved, deleted or updated by cascade, you must add
it to the collection. It is not enough to simply call setParent().
children are new. (See Section 10.7, “Automatic state detection”.) In Hibernate3, it is no longer necessary to
specify an unsaved-value explicitly.
The following code will update parent and child and insert newChild.
Well, that's all very well for the case of a generated identifier, but what about assigned identifiers and compos-
ite identifiers? This is more difficult, since Hibernate can't use the identifier property to distinguish between a
newly instantiated object (with an identifier assigned by the user) and an object loaded in a previous session. In
this case, Hibernate will either use the timestamp or version property, or will actually query the second-level
cache or, worst case, the database, to see if the row exists.
21.5. Conclusion
There is quite a bit to digest here and it might look confusing first time around. However, in practice, it all
works out very nicely. Most Hibernate applications use the parent / child pattern in many places.
We mentioned an alternative in the first paragraph. None of the above issues exist in the case of
<composite-element> mappings, which have exactly the semantics of a parent / child relationship. Unfortu-
nately, there are two big limitations to composite element classes: composite elements may not own collections,
and they should not be the child of any entity other than the unique parent.
package eg;
import java.util.List;
package eg;
import java.text.DateFormat;
import java.util.Calendar;
}
public void setDatetime(Calendar calendar) {
_datetime = calendar;
}
public void setId(Long long1) {
_id = long1;
}
public void setText(String string) {
_text = string;
}
public void setTitle(String string) {
_title = string;
}
}
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD 3.0//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd">
<hibernate-mapping package="eg">
<class
name="Blog"
table="BLOGS">
<id
name="id"
column="BLOG_ID">
<generator class="native"/>
</id>
<property
name="name"
column="NAME"
not-null="true"
unique="true"/>
<bag
name="items"
inverse="true"
order-by="DATE_TIME"
cascade="all">
<key column="BLOG_ID"/>
<one-to-many class="BlogItem"/>
</bag>
</class>
</hibernate-mapping>
<?xml version="1.0"?>
<!DOCTYPE hibernate-mapping PUBLIC
"-//Hibernate/Hibernate Mapping DTD 3.0//EN"
"http://hibernate.sourceforge.net/hibernate-mapping-3.0.dtd">
<hibernate-mapping package="eg">
<class
name="BlogItem"
table="BLOG_ITEMS"
dynamic-update="true">
<id
name="id"
column="BLOG_ITEM_ID">
<generator class="native"/>
</id>
<property
name="title"
column="TITLE"
not-null="true"/>
<property
name="text"
column="TEXT"
not-null="true"/>
<property
name="datetime"
column="DATE_TIME"
not-null="true"/>
<many-to-one
name="blog"
column="BLOG_ID"
not-null="true"/>
</class>
</hibernate-mapping>
package eg;
import java.util.ArrayList;
import java.util.Calendar;
import java.util.Iterator;
import java.util.List;
import org.hibernate.HibernateException;
import org.hibernate.Query;
import org.hibernate.Session;
import org.hibernate.SessionFactory;
import org.hibernate.Transaction;
import org.hibernate.cfg.Configuration;
import org.hibernate.tool.hbm2ddl.SchemaExport;
blog.getItems().add(item);
tx.commit();
}
catch (HibernateException he) {
if (tx!=null) tx.rollback();
throw he;
}
finally {
session.close();
}
return item;
}
item.setText(text);
tx.commit();
}
catch (HibernateException he) {
if (tx!=null) tx.rollback();
throw he;
}
finally {
session.close();
}
return result;
}
result = q.list();
tx.commit();
}
catch (HibernateException he) {
if (tx!=null) tx.rollback();
throw he;
}
finally {
session.close();
}
return result;
}
}
23.1. Employer/Employee
The following model of the relationship between Employer and Employee uses an actual entity class (Employ-
ment) to represent the association. This is done because there might be more than one period of employment for
the same two parties. Components are used to model monetary values and employee names.
<hibernate-mapping>
<id name="id">
<generator class="sequence">
<param name="sequence">employment_id_seq</param>
</generator>
</id>
<property name="startDate" column="start_date"/>
<property name="endDate" column="end_date"/>
</class>
<generator class="sequence">
<param name="sequence">employee_id_seq</param>
</generator>
</id>
<property name="taxfileNumber"/>
<component name="name" class="Name">
<property name="firstName"/>
<property name="initial"/>
<property name="lastName"/>
</component>
</class>
</hibernate-mapping>
23.2. Author/Work
Consider the following model of the relationships between Work, Author and Person. We represent the relation-
ship between Work and Author as a many-to-many association. We choose to represent the relationship between
Author and Person as one-to-one association. Another possibility would be to have Author extend Person.
<hibernate-mapping>
<property name="title"/>
<set name="authors" table="author_work">
<key column name="work_id"/>
<many-to-many class="Author" column name="author_id"/>
</set>
</class>
<property name="alias"/>
<one-to-one name="person" constrained="true"/>
</class>
</hibernate-mapping>
There are four tables in this mapping. works, authors and persons hold work, author and person data respect-
ively. author_work is an association table linking authors to works. Heres the table schema, as generated by
SchemaExport.
23.3. Customer/Order/Product
Now consider a model of the relationships between Customer, Order and LineItem and Product. There is a
one-to-many association between Customer and Order, but how should we represent Order / LineItem /
Product? I've chosen to map LineItem as an association class representing the many-to-many association
between Order and Product. In Hibernate, this is called a composite element.
<hibernate-mapping>
</hibernate-mapping>
customers, orders, line_items and products hold customer, order, order line item and product data respect-
ively. line_items also acts as an association table linking orders with products.
<class name="Person">
<id name="name"/>
<one-to-one name="address"
cascade="all">
<formula>name</formula>
<formula>'HOME'</formula>
</one-to-one>
<one-to-one name="mailingAddress"
cascade="all">
<formula>name</formula>
<formula>'MAILING'</formula>
</one-to-one>
</class>
<class name="Customer">
<id name="customerId"
length="10">
<generator class="assigned"/>
</id>
<list name="orders"
inverse="true"
cascade="save-update">
<key column="customerId"/>
<index column="orderNumber"/>
<one-to-many class="Order"/>
</list>
</class>
<composite-id name="id"
class="Order$Id">
<key-property name="customerId" length="10"/>
<key-property name="orderNumber"/>
</composite-id>
<property name="orderDate"
type="calendar_date"
not-null="true"/>
<property name="total">
<formula>
( select sum(li.quantity*p.price)
from LineItem li, Product p
where li.productId = p.productId
and li.customerId = customerId
and li.orderNumber = orderNumber )
</formula>
</property>
<many-to-one name="customer"
column="customerId"
insert="false"
update="false"
not-null="true"/>
<bag name="lineItems"
fetch="join"
inverse="true"
cascade="save-update">
<key>
<column name="customerId"/>
<column name="orderNumber"/>
</key>
<one-to-many class="LineItem"/>
</bag>
</class>
<class name="LineItem">
<composite-id name="id"
class="LineItem$Id">
<key-property name="customerId" length="10"/>
<key-property name="orderNumber"/>
<key-property name="productId" length="10"/>
</composite-id>
<property name="quantity"/>
<many-to-one name="order"
insert="false"
update="false"
not-null="true">
<column name="customerId"/>
<column name="orderNumber"/>
</many-to-one>
<many-to-one name="product"
insert="false"
update="false"
not-null="true"
column="productId"/>
</class>
<class name="Product">
<synchronize table="LineItem"/>
<id name="productId"
length="10">
<generator class="assigned"/>
</id>
<property name="description"
not-null="true"
length="200"/>
<property name="price" length="3"/>
<property name="numberAvailable"/>
<property name="numberOrdered">
<formula>
( select sum(li.quantity)
from LineItem li
where li.productId = productId )
</formula>
</property>
</class>
<many-to-many class="User">
<column name="userName"/>
<formula>org</formula>
</many-to-many>
</set>
</class>
<class name="Person"
discriminator-value="P">
<id name="id"
column="person_id"
unsaved-value="0">
<generator class="native"/>
</id>
<discriminator
type="character">
<formula>
case
when title is not null then 'E'
when salesperson is not null then 'C'
else 'P'
end
</formula>
</discriminator>
<property name="name"
not-null="true"
length="80"/>
<property name="sex"
not-null="true"
update="false"/>
<component name="address">
<property name="address"/>
<property name="zip"/>
<property name="country"/>
</component>
<subclass name="Employee"
discriminator-value="E">
<property name="title"
length="20"/>
<property name="salary"/>
<many-to-one name="manager"/>
</subclass>
<subclass name="Customer"
discriminator-value="C">
<property name="comments"/>
<many-to-one name="salesperson"/>
</subclass>
</class>
<class name="Person">
<id name="id">
<generator class="hilo"/>
</id>
<one-to-one name="address"
property-ref="person"
cascade="all"
fetch="join"/>
<set name="accounts"
inverse="true">
<key column="userId"
property-ref="userId"/>
<one-to-many class="Account"/>
</set>
</class>
<class name="Address">
<id name="id">
<generator class="hilo"/>
</id>
</class>
<class name="Account">
<id name="accountId" length="32">
<generator class="uuid"/>
</id>
<many-to-one name="user"
column="userId"
property-ref="userId"/>
</class>
bean to and from the servlet / JSP layer. Use a new session to service each request. Use Session.merge()
or Session.saveOrUpdate() to synchronize objects with the database.
Use the open session in view pattern, or a disciplined assembly phase to avoid problems with unfetched data.
Hibernate frees the developer from writing tedious Data Transfer Objects (DTO). In a traditional EJB ar-
chitecture, DTOs serve dual purposes: first, they work around the problem that entity beans are not serializ-
able; second, they implicitly define an assembly phase where all data to be used by the view is fetched and
marshalled into the DTOs before returning control to the presentation tier. Hibernate eliminates the first
purpose. However, you will still need an assembly phase (think of your business methods as having a strict
contract with the presentation tier about what data is available in the detached objects) unless you are pre-
pared to hold the persistence context (the session) open across the view rendering process. This is not a lim-
itation of Hibernate! It is a fundamental requirement of safe transactional data access.