Separation of Manga Line Drawings and Screentones: Conference Paper
Separation of Manga Line Drawings and Screentones: Conference Paper
net/publication/277653033
CITATIONS READS
7 2,299
4 authors, including:
Some of the authors of this publication are also working on these related projects:
Approximate Nearest Neighbor Search with Selective K-means Tree View project
All content following this page was uploaded by Kiyoharu Aizawa on 03 June 2015.
Abstract
Screentones are unique expressions of Japanese comics (manga), which enrich their visual expression. However,
such screentones have a very different visual nature from that of line drawing areas; this prevents us from applying
various kinds of image processing techniques to manga. We propose a method for extracting line drawings and
removing screentones. We employ Laplacians of Gaussian filters and flow-based differences of Gaussian filters,
one for removing screentones and the other for preserving lines, and make a binary mask for separating line
drawings from manga by merging the results of the two filters. We show that the proposed method successfully
separates line drawings and is better than existing methods in comparative studies.
Categories and Subject Descriptors (according to ACM CCS): I.4.6 [Image Processing and Computer Vision]:
Segmentation—Edge and feature detection
1. Introduction
Manga (Japanese comics) are popular all over the world. A
unique trait of manga is their use of screentones, which are
preprinted patterns that express visual effects such as tex-
tures and shadows (Fig. 1(a)). Manga authors draw manga by
drawing lines and filling areas with screentones (Fig. 1(b)).
Currently, comic books are increasingly being pub-
lished in electronic form as well as in print, and research
on new applications of manga image processing has ap-
peared, e.g., manga retrieval [MAJ14], interactive segmen-
tation [AMYA14], and element composition for drawing as-
(a) Examples of various (b) A manga image consists of a line
sistance [CLC14]. kinds of screentones. drawing layer and a screentone layer.
However, it is not clear how to handle screentones in dig-
ital images. The nature of screentones is very different from Figure 1: Manga components. Note that a black-filled area
that of line drawings, which might cause undesired effects is considered to be a screentone in this paper.
Junichiro
c
if we do not take the screentones into consideration when Akabi
manga images are processed. Therefore, we want to separate
screentones and line drawings in advance before the main
processing is applied, to make full use of the visual structure
(Fig. 1(a)), it is hard to extract all kinds of screentones with
of the manga image. Further, the separation enables us to ap-
a single filter. To handle them, we create two masks: (1) a
ply image processing methods developed for line drawings.
screentone removal mask, which deletes all screentone ar-
We propose a method for extracting line drawings and re- eas, and (2) a line preserving mask, which preserves lines
moving screentones. The method consists of applying sev- as much as possible. By appropriate merging of the results
eral filters and merging the results. Because screentones have from these two masks, we compute the final mask, which
a wide variety of statistical natures, including large/small, specifies line drawings. Our method does not require any
isotropic/anisotropic, and low/high frequencies of textures parameter tuning and is very robust to the kinds of screen-
(a)
2. Method
We show the flow of our method in Fig. 2. First, we create Bin(IˆLoGi ) is a binary of IˆLoGi by Otsu’s method [Ots75]. We
a screentone removal mask Mrm , which removes screentone omit a description of input x for readability.
areas as much as possible, and a line preserving mask Ml p ,
which retains as many lines as possible. Then, we combine The output of the LoG filter changes with i as shown in
these masks, taking account of connected components. Fig. 3(a). We must select an appropriate value for i to remove
screentones; i = 19 is the best in this case. To select i, we
introduce two criteria, the Connected Component Criteria
2.1. Screentone removal mask (CCC) and the Stop Criteria (STC), which are defined as
The purpose of a screentone removal mask is to delete all follows:
screentone areas while allowing the removal of some por- NCC(MLoGi )
CCCi = 1 − , (3)
tion of line drawings. To create a screentone removal mask, NCC(MLoGi−2 )
we employ a Laplacian of Gaussian (LoG) filter, which acts
as a bandpass filter in the Fourier domain, and works as a NCC(MLoGi )
line detector. Because screentones tend to have periodicity or STCi = × |(NCC(MLoGi−2 ) − NCC(MLoGi ))|,
NCC(MLoG1 )
high-frequency components, they can be removed by a LoG (4)
filter while preserving line drawings. We found two traits of where NCC(M) is the number of connected components of
outputs of LoG filtering: (1) the LoG value is relatively high black pixels in a mask M. CCCi is a ratio showing the de-
at line drawings, and low in other areas; (2) a pixel that has crease in the number of connected components when i is in-
a negative LoG value hardly ever belongs to a line drawing. cremented. If CCCi has its highest value at i∗ , it means the
By considering these observations, we create an LoG mask most screentones are removed at i∗ . We empirically found
MLoGi as shown below: that i values a little larger than i∗ are best for our purpose.
0 if ILoGi (x) < 0 If we increment i much more, the lines become too blurred.
IˆLoGi (x) = (1) Examples are shown in Fig. 3(a) and Fig. 3(b).
ILoGi (x) otherwise,
STCi is used for the stopping condition of the process.
MLoGi = Bin(IˆLoGi ), (2) If STCi is smaller than a predefined threshold, the process
stops. STCi denotes the amount of change of NCC(MLoGi )
where ILoGi is the output of a LoG filtering of an input image
(compared with the ratio between NCC(MLoG1 ) and
I, and i is the window size of a Gaussian filter. i must be odd.
NCC(MLoGi ), i.e., NCC(MLoGi )/NCC(MLoG1 )), as shown in
Note that i automatically determines a standard deviation pa-
Fig. 3(b).
rameter of the filter. IˆLoGi is an image in which each pixel
IˆLoGi (x) is ILoGi (x) if ILoGi (x) is positive and 0 otherwise. We show an algorithm for creating a removal mask Mrm in