Learning Cython Programming - Second Edition - Sample Chapter
Learning Cython Programming - Second Edition - Sample Chapter
ee
Second Edition
pl
C o m m u n i t y
E x p e r i e n c e
D i s t i l l e d
Second Edition
Philip Herron
P U B L I S H I N G
Sa
m
$ 34.99 US
22.99 UK
Philip Herron
Preface
Cython is a tool that makes writing native extensions to Python as easy as writing
them in Python. For those who are unaware, you can implement Python modules
as pure the C code, which will, for all intents and purposes, look and act like any
Python code. This is required when implementing modules in Python, such as the
built-in zip module which use native zlib under the hood. Doing this makes sense
for the standard library modules part of Python, though for most users of Python,
writing native modules should be the last course of action if possible.
Writing native modules is hard and requires prerequisite knowledge of how to use
the garbage collector calls correctly in order to avoid memory leaks. It also requires
an understanding of how the GIL is used, which changes if you are using CPython
or PyPy. It also requires knowledge of the module structures and argument passing
internally to the Python runtime. Therefore, it isn't a trivial process when the need
arises. Cython lets us write and work with the native code without having to know
anything about the Python runtime. We can write almost pure Python code that just
so happens to let us manipulate C/C++ types and classes. We can call back and forth
from the native code and into the Python code.
More importantly, Cython removes the complexity and intrinsicity and lets the
programmer focus on solving problems.
Preface
Chapter 3, Extending Applications, uses everything from the previous chapters to write
native Tmux commands using Python instead of C/C++.
Chapter 4, Debugging Cython, uses the cygdb wrapper over gdb to debug Cython code.
Chapter 5, Advanced Cython, introduces how well Cython can work with C++ classes
and templates. In general, it also covers caveats in Cython.
Chapter 6, Further Reading, briefly looks at the related projects and interesting sources
of new learning.
Installing Cython
Type conversion
Installing Cython
Since Cython is a programming language, we must install its respective compiler,
which just so happens to be the aptly named Cython.
There are many different ways to install Cython. The preferred one would be to
use pip:
$ pip install Cython
This should work on both Linux and Mac. Alternatively, you can use your Linux
distribution's package manager to install Cython:
$ yum install cython
# will work on Fedora and Centos
$ apt-get install cython # will work on Debian based systems.
Emacs mode
There is an emacs mode available for Cython. Although the syntax is nearly the same
as Python, there are differences that conflict in simply using Python-mode. You can
grab cython-mode.el from the Cython source code (inside the Tools directory.) The
preferred way of installing packages to emacs would be to use a package repository
like MELPA:
To add the package repository to emacs, open your ~/.emacs configuration file
and add:
(when (>= emacs-major-version 24)
(require 'package)
(add-to-list
'package-archives
'("melpa" . "http://melpa.org/packages/")
t)
(package-initialize))
[2]
Chapter 1
Once you add this and reload your configuration to install the Cython mode, you can
simply run:
'M-x package-install RET cython-mode'
Once this is installed, you can activate the mode by adding this into your emacs
config file:
(require 'cython-mode)
You should now have created helloworld.so! This is a Cython module of the same
name as the Cython source code file. While in the same directory of the shared object
module, you can invoke this code by running a respective Python import:
$ python
Python 2.7.3 (default, Aug 1 2012, 05:16:07)
[GCC 4.6.3] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import helloworld
Hello World from cython!
[3]
As you can see by opening helloworld.pyx, it looks just like a normal Python
Hello World application, but as previously stated, Cython generates modules.
These modules need a name so that they can be correctly imported by the Python
runtime. The Cython compiler simply uses the name of the source code file. It then
requires us to compile this to the same shared object name.
Overall, Cython source code files have the .pyx,.pxd, and .pxi extensions.
For now, all we care about are the .pyx files; the others are for cimports and
includes respectively within a .pyx module file.
The following screenshot depicts the compilation flow required to have a callable
native Python module:
I wrote a basic makefile so that you can simply run make to compile these examples.
Here's the code to do this manually:
$ cython helloworld.pyx
$ gcc/clang -g -O2 -fpic `python-config --cflags` -c helloworld.c -o
helloworld.o
$ gcc/clang -shared -o helloworld.so helloworld.o `python-config
libs`
[4]
Chapter 1
Using the cythonize function as part of the ext_modules section will build
any specified Cython source into an installable Python module. This will compile
helloworld.pyx into the same shared library. This provides the Python practice
to distribute native modules as part of distutils.
This is the C code that we will calljust a simple function to add two integers.
Now, let's get Python to call it. Open a file called AddFunction.h, wherein we
will declare our prototype:
#ifndef __ADDFUNCTION_H__
#define __ADDFUNCTION_H__
extern int AddFunction (int, int);
#endif //__ADDFUNCTION_H__
[5]
We need this so that Cython can see the prototype for the function we want to call.
In practice, you will already have your headers in your own project with your
prototypes and declarations already available.
Open a file called AddFunction.pyx, and insert the following code in it:
cdef extern from "AddFunction.h":
cdef int AddFunction(int, int)
Here, we have to declare which code we want to call. The cdef is a keyword
signifying that this is from the C code that will be linked in. Now, we need a
Python entry point:
def Add(a, b):
return AddFunction(a, b)
This Add function is a Python callable inside a PyAddFunction module this acts as
a wrapper for Python code to be able to call directly into the C code. Again, I have
provided a handy makefile to produce the module:
$ cd cython-book/chapter1/ownmodule
$ make
cython -2 PyAddFunction.pyx
gcc -g -O2 -fpic -c PyAddFunction.c -o PyAddFunction.o `python-config
--includes`
gcc -g -O2 -fpic -c AddFunction.c -o AddFunction.o
gcc -g -O2 -shared -o PyAddFunction.so AddFunction.o PyAddFunction.o
`python-config --libs`
Notice that the print statement inside the AddFunction and the final result are
printed correctly. Therefore, we know that the control hit the C code and did the
calculation in C, and not inside the Python runtime. This is a revelation of what is
possible. Python can be cited to be slow in some circumstances. Using this technique
makes it possible for Python code to bypass its own runtime and to run in an unsafe
context, which is unrestricted by the Python runtime which is much faster.
[6]
Chapter 1
It lets the compiler know that there is a function called AddFunction and it takes two
ints and returns an int. This is all the information the compiler needs to know beside
the host and target operating system's calling convention to call this function safely.
Then, we created the Python entry point, which is a Python callable that takes two
parameters:
def Add(a, b):
return AddFunction(a, b)
Inside this entry point, it simply returned the native AddFunction and passed the
two Python objects as parameters. This is what makes Cython so powerful. Here,
the Cython compiler must inspect the function call and generate code to safely try
and convert these Python objects to native C integers. This becomes difficult when
precision is taken into account as well as potential overflow, which just so happens
to be a major use case since it handles everything so well. Also, remember that this
function returns an integer, and Cython also generates code to convert the integer
return into a valid Python object.
Downloading the example code
You can download the example code files for all Packt books you have
purchased from your account at http://www.PacktPub.com. If you
purchased this book elsewhere, you can visit http://www.PacktPub.
com/support and register to have the files e-mailed directly to you.
Summary
Overall, we installed the Cython compiler, ran the Hello World example, and
took into consideration that we need to compile all code into native shared objects.
We also saw how to wrap native C code to make it callable from Python. We have
also seen the implicit type conversion which Cython does for us to make calling
C work. In the next chapter, we will delve deeper into Cython programming with
discussion on how to make Python code callable from C and manipulate native C
data structures from Cython.
[7]
www.PacktPub.com
Stay Connected: