Journal Pre-proof
LABiocin database: a new database designed specifically for Lactic
Acid Bacteria bacteriocins
Imad Al Kassaa , Rayane Rafei , Mirna Moukhtar , Mazen Zaylaa ,
Adem Gharsallaoui , Abdeslam Asehraou , Khaled El Omari ,
Ahmad Shahin , Monzer Hamze , Nour-Eddine Chihib
PII:
DOI:
Reference:
S0924-8579(19)30192-X
https://doi.org/10.1016/j.ijantimicag.2019.07.012
ANTAGE 5761
To appear in:
International Journal of Antimicrobial Agents
Received date:
Accepted date:
5 May 2019
10 July 2019
Please cite this article as: Imad Al Kassaa , Rayane Rafei , Mirna Moukhtar , Mazen Zaylaa ,
Adem Gharsallaoui , Abdeslam Asehraou , Khaled El Omari , Ahmad Shahin , Monzer Hamze ,
Nour-Eddine Chihib , LABiocin database: a new database designed specifically for Lactic Acid Bacteria bacteriocins, International Journal of Antimicrobial Agents (2019), doi:
https://doi.org/10.1016/j.ijantimicag.2019.07.012
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service
to our customers we are providing this early version of the manuscript. The manuscript will undergo
copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please
note that during the production process errors may be discovered which could affect the content, and
all legal disclaimers that apply to the journal pertain.
© 2019 Published by Elsevier B.V.
Highlights
LABioicin is a new repository for Lactic Acid Bacteria bacteriocins and their data
Bacteriocin properties, sequences, spectrum and extraction methods are compiled
LABioicin provides an interactive interface that integrates many tools and services
LABiocin underpins for the development of food safety applications and new drugs
LABiocin database: a new database designed specifically for Lactic Acid
Bacteria bacteriocins
Imad AL KASSAAa†, Rayane RAFEIa†, Mirna MOUKHTARc, Mazen ZAYLAAa,
Adem GHARSALLAOUIf, Abdeslam ASEHRAOUe, Khaled EL OMARIa,d, Ahmad SHAHINa,c,
Monzer HAMZEa, Nour-Eddine CHIHIBb*
a
Health and Environment Microbiology Laboratory, Doctoral School of Sciences and
Technology, Faculty of Public Health, Lebanese University, Tripoli, Lebanon
b
UMET CNRS Laboratory, INRA, UMR 8207-UMET-PIHM, Lille University, Villeneuve
d’Ascq, France
c
LIA Laboratory, Doctoral School in Science & Technology, Lebanese University, Tripoli –
Lebanon
d
Quality Control Center Laboratories at the Chamber of Commerce, Industry Agriculture of
Tripoli & North Lebanon
e
Laboratory of Biochemistry and Biotechnology Mohammed Premier University, Faculty of
Sciences Oujda, Morocco
University Lyon 1, ISARA Lyon, Laboratoire BioDyMIA, Equipe Mixte d’Accueil, n°3733, IUT
f
Lyon 1, technopole Alimentec, rue Henri de Boissieu, F-01000 Bourg en Bresse, France
*Corresponding author:
Nour-Eddine CHIHIB
Tel: +33 (0)3 59 63 22 46
Fax: +33 (0)3 59 63 21 62
E-mail : Nour-Eddine.chihib@univ-lille.fr
†
: equally contributed authors: Imad AL KASSAA; Rayane RAFEI
Abstract
Although bacteriocins from lactic acid bacteria (LAB) are successfully applied as natural
alternatives to food preservation as well as to antibiotics, they are scattered through literature and
unexhaustive databases. For these reasons, we have developed the LABiocin database, a
specialized database on LAB bacteriocins. The database was stored and compiled using MySQL
with NetBeans IDE as database’s platform. Important data are compiled such as the bacteriocin
name, the class to which it belongs, the amino acids and the nucleic acid sequences if available.
In addition, the target microorganisms, the origin, and status of the producing strains as well as
their culture conditions and their extraction and purification methods are also mentioned in this
new database. A phylogenetic tree for the mature peptide bacteriocin sequences has also been
created.
LABiocin is an interactive database gathering up to 517 LAB bacteriocins using a suitable userfriendly interface and integrates several tools and services. Besides data searching tools, a
BLAST tool was integrated into the database allowing the user to perform a homology search
against mature peptide sequences. Users can be linked to other databases that contain additional
information especially about predicted bacteriocin structure and mechanisms of action.
LABiocin database would allow better and more comprehensive functional analysis of this
special group of antimicrobial peptides. This would certainly be useful in food preservation or
food safety applications, but also has substantial implications for the development of new drugs
for medical use. LABiocin database is available at labiocin.net.
Keywords: Bacteriocins; databases; LAB; antimicrobial peptides; LABiocin
1. Introduction
Bacteriocins are antimicrobial peptides synthesized by some bacteria via the ribosomal pathway
and are active generally against closely related bacteria. Although, some bacteriocins have shown
recently a wide range inhibitory spectrum [1]. The bacteriocin discovery preceded that of
antibiotics. Colicin was the first bacteriocin identified in 1925 from an Escherichia coli strain [2].
Since this discovery, numerous bacteriocins were identified, leading to a diversified family of
proteins in terms of size, target, action mode, delivery, and immunity mechanisms.
Lactic acid bacteria (LAB) are able to produce antimicrobial molecules such as organic acids,
diacetyl, acetoin, hydrogen peroxide, antifungal peptides, and bacteriocins [3]. Most of the LAB
are Generally Recognized as Safe (GRAS), granted by the American Food and Drug Agency
(FDA). The European Food Safety Authority (EFSA) also granted the Qualified Presumption of
Safety (QPS) status to most of the LAB genera [4]. Bacteriocins, produced by the LAB have been
proposed and even successfully applied as natural antimicrobial alternative in food preservation
as well in antibiotic therapy [5]. Hence, a huge number of research studies have been conducted
to find out novel LAB bacteriocins and deeply investigate their properties and activities.
Unfortunately, these information are scattered in various databases hampering thus their
availability and use for the interested users. Therefore, gathering all LAB bacteriocins and their
information in a specific database seems to be a paramount task nowadays. In fact, this tool
represents a fundamental element to scientists, to food authorities as well as to food industries.
Several databases were created to hold antimicrobial peptides (AMPs) as well as bacteriocins.
Such databases could deliver information in a well-organized manner and facilitate comparison
between stored bacteriocins (peptide sequence, inhibitory spectrum, resistance to extreme
conditions, characterization methods...etc). Several databases hold AMPs including bacteriocins,
of which only two are specialized in bacteriocins: BACTIBASE and BAGEL. BACTIBASE
holds 177 bacteriocins in its last release, of which 156 bacteriocins belong to Gram positive
bacteria [6]. In another hand, BAGEL4 (the latest release of BAGEL) [7], is a mining web server
that identifies and visualizes gene clusters in bacteria and archeal genomes involved in
biosynthesis of Ribosomally synthesized and Post translationally modified Peptides (RiPPs). The
database used in BAGEL4 for gene clusters prediction contains 254 from LAB [7]. Nowadays,
more special purpose databases are increasingly available for the scientific community that
enables the users to query more homogenous and focalized databases on a particular theme. For
example, RegulonDB and Bacillus-RegNet are comprehensive resources for transcriptional
regulatory networks for E. coli and Bacillus spp. respectively [8,9].
Thus, there is a clear need to gather, filter and critically evaluate this mass of information and
store into smaller, more specialized, resources so that it can then be used in a way that enhances
efficiency. A new database designed specifically for LAB bacteriocins is therefore needed. The
microbial physicochemical and structural properties provided in such database would allow better
and more comprehensive structural and functional analysis of this special group of antimicrobial
peptides. This would certainly be useful in food preservation and food safety applications, but
also would have implications in the development of new drugs for medical use. For these reasons
the goal for the present work is to develop the LABiocin database, a specialized database on LAB
bacteriocins. Using a suitable user-friendly interface, it offers the users a validated repository of
LAB bacteriocins and allows them to extract, analyze and compare data.
2. Materials and Methods
2.1 Data collection
The bacteriocins of LAB presented here in the LABiocin database were extracted and collected
from Scopus, Pubmed/Medline, Science direct and other databases until August 2017. The
keywords used were LAB bacteriocins, antibacterial peptide. In addition, LAB bacteriocins from
Bactibase (http://bactibase.hammamilab.org/main.php) and BAGEL3 (http://bagel.molgenrug.nl/)
databases were also collected. These latter bacteriocins were verified from their original
references to add all missing information. After careful readings of literature review, the collected
information (if available) for bacteriocins are divided in three information groups: (i) General
bacteriocin characteristics: class, charge, molecular weight, isoelectric point (IP), solubility,
producer organisms, and target bacteriocin organisms; (ii) bacteriocin sequences: full pre-peptide
sequence, mature peptide sequence, leader sequence, the corresponding accession numbers to
UniProt database as well as to Protein and Nucleotide NCBI databases, and gene sequence (if
available or predicted); (iii) the in-vitro characterization of bacteriocins: Extraction,
characterization and purification methods as well as conditions of bacteriocin production (culture
medium, pH and temperature). Concerning gene sequences, besides the literature review, the
UniProt and Protein databases, NCBI tblastn software was used to retrieve gene sequences
coding for peptidic bacteriocin sequences available in the Nucleotide collection (nr/nt) database.
Genes were only considered if they had 100% query coverage and 100% identity between
peptidic bacteriocin query and database sequences. The NCBI blastp program was used to search
homologies in the Non-redundant protein sequences (nr) and UniProtKB/Swiss-Prot(swissprot)
databases. If a peptidic database sequence had a 100% identity with our query only characterized
to a partial level by researchers, the full pre-peptide sequence of the database sequence was
extracted and represented in the database. Of note, both sequences (partial and full pre-peptide
sequences) are mentioned in the bacteriocin entry in our LABiocin database with a note
explaining this issue. Multiple sequence alignment was done using MUSCLE available at
https://www.ebi.ac.uk/Tools/msa/muscle/.
2.2 Database architecture
The LABiocin database was stored and compiled using MySQL (v 6.3). The database’s platform
used NetBeans IDE 8.0.2. HTML and JSP were used to improve the web interface. This database
is available on the following domain: labiocin.net. Furthermore, the database is hosted at Lille
University, France.
The LABiocin database contains 13 tables. The bacteriocin table is the main one. The
relationships between these tables are represented in Figure 1 and can be summarized as follows.
Indeed, each bacteriocin has one producer strain designed by full species name and code. This
producer strain is isolated from one (or more) sources that can be human, animal, food or others.
In addition, for each producer strain a status such as probiotic or starter is given depending on the
bibliographic available information. Each producer strain requires some culture conditions to
produce bacteriocin such as the culture medium type, medium pH, incubation time, and
temperature. The extraction of bacteriocins could be conducted via several methods. Each
bacteriocin was extracted by one or two methods such as salt precipitation, solvent extraction,
solid phase extraction, and others. After extraction, bacteriocins should be purified. Each
bacteriocin was purified via one or two methods such as Reverse-LC technique. Bacteriocin
characterization is divided in several parts: (i) Each bacteriocin should be classified in a class
and/or subclass such as Class I, Class II, subclass IIa…etc. In addition, (ii) each bacteriocin has a
molecular weight (MW) as well as (iii) an isoelectric point (IP), (iv) peptide sequence (full
sequence, mature sequence, partial sequence as well as leader peptide sequence). Therefore,
many techniques and methods were used to characterize bacteriocins. Each bacteriocin has one or
more characterization methods such as Mass Spectrometry, Edman degradation, SDSPAGE…etc. Bacteriocin is an antimicrobial peptide, which exhibits an antimicrobial spectrum.
Thus, each bacteriocin has its own antimicrobial spectrum designed by the list of the indicator
strains used. These indicator strains are mentioned with a full name of species and strain codes.
In order to make the classification of indicator strains easier, indicator strains were classified in
several categories such as Gram stain, fastidious, Molds, Yeast…etc.
Each bacteriocin has a reference designed by a published article or other. Sometimes, one
bacteriocin can be mentioned in several references, and thus, all these references are mentioned
in LABiocin database.
2.3 LABiocin data analysis
GraphPad Prism was employed for graph preparation and data analaysis.
2.4 Phylogenetic tree
Two phylogenetic trees have been created: one for the 16S rDNA gene sequences of bacteriocinproducing LAB and another one for the mature peptide bacteriocin sequences. For the first tree,
16S rDNA gene sequences for each bacteriocin-producing LAB species available in our database
were extracted from the representative genomes in the Genome database at NCBI. The Software
Mega7 was used to construct the phylogenetic tree by Maximum likelihood method (Figure S1)
[10]. For the second tree, the mature peptide bacteriocin sequences were aligned using MUSCLE
v3.8.31 [11]. The obtained multiple sequence alignment was trimmed using the TrimAl v1.4.
method [12]. After that, Protest3 determined the best-fit model of evolution for this datatest [13].
Then, FastTree v2.1.8 inferred the maximum-likelihood phylogenetic tree with WAG (WhelanAnd-Goldman) as a model [14]. FastTree support values were calculated from the the
Shimodaira-Hasegawa test to provide an estimate of reliability of observed clusters. FigTree
v1.4.3 was used to display graphically the trees (http://tree.bio.ed.ac.uk/software/figtree/).
3. Results and discussion
3.1 Brief Description
LABiocin is a worldwide free access database. However, the user must register and login to the
database with his proper username and password. The database homepage allows access to the
following interfaces: Blast, Phylogeny, data analysis, About us, GeoMap, Contact, and Submit
new bacteriocin (Figure 2). Additionally, the user can query directly the database from the
homepage (discussed below). The results of a search query are given first in a simple table
(Figure 2) that provides the user a quick view of some characteristics of the searched bacteriocin.
The table shows the LABiocin accession number, bacteriocin’s class, the bacteriocin producer
strain, the molecular weight, gene names as well as the links to reference article in Pubmed,
UniProt and NCBI databases (Protein and GenBank). The user can download the whole
information or/and the Fasta format of bacteriocin sequences from a selected bacteriocin.
By clicking the name of a specific bacteriocin in the table, a new entry will show a detailed
information about this bacteriocin. This entry is divided into seven sections (Figure 2). The first
section lists the bacteriocin details such as: the bacterial status, the isoelectric point, the
molecular weight, the electrostatic charge, the solubility, the full pre-peptide sequence, the leader
sequences, the mature peptide sequence and the gene sequence as well as their corresponding
accession numbers to UniProt, NCBI databases and Bactibase if available. The second section
details the producer organism information such as the full name species, the food matrix, and the
environmental conditions from where the bacterium where isolated (source), and the used
conditions to induce bacteriocin production (the culture medium, temperature of incubation, pH).
The third section shows the indicator strains targeted by bacteriocins. The fourth, the fifth and the
sixth sections list respectively the extraction, the purification, and the characterization methods.
The last section of this entry corresponds to the literature review of this bacteriocin.
3.2 Available tools
Various tools have been implemented to query the database. For data searching, two options are
available: “type in a keyword” and “advanced search” tools. “Type in a keyword” tool allows the
user to quickly query the database using a simple keyword. The keyword may be a bacteriocin
name, a LABiocin accession number, antimicrobial spectrum or indicator strains, a producer
status (starter culture, probiotic…), an extraction or a purification method, the origin of the
bacteriocin producer such as the food matrix. “Advanced search tool” enables the user to perform
complex queries and combine up to 9 fields in a single search (Figure 2). The chosen term, within
the inhibitory spectrum list, filters the appeared terms in the indicator strains list. For example,
when the user chooses the inhibitory spectrum as Gram-positive, systematically the given list in
the indicator strain field is corresponding to Gram-positive genera. Besides data searching tools, a
BLAST tool was integrated into the database allowing the user to perform a BLAST search
against mature peptide sequences. The homologous sequences in LABiocin database having a
high similarity with the query peptide will appear. To assess the similarity between the query and
the aligned database sequences, many criteria were used such as the identity, the similarity, the
query coverage, the percentage of gaps, and the E-value.
3.3 Services for users
Several services for users have been integrated into LABiocin database. A “contact us interface”
allows users to communicate with the database curators for any further requests or suggestions.
To ensure a continuous update of our data, the user can directly submit the newly identified
bacteriocin by sending an email with his validated data using the “submit new bacteriocin”
interface. The user can also download all available sequences in the database either mature
peptide sequence, full pre-peptide sequences, partial sequences, and (or) bacteriocin-encoding
gene sequences. However, a previous contact with the curators for the commitment of terms and
conditions of use of LABiocin database is needed for download. Finally, a GeoMap interface
shows the user popularity by country.
3.4 Phylogenetic trees
Of 517 bacteriocins stored in the LABiocin database, 267 with full mature peptide sequence were
were selected and analyzed by Maximum likelihood phylogenetic tree based on their sequence
similarities (Figure S2). Of note, if more than one bacteriocins have the same sequence, one
bacteriocin sequence is selected to represent these identical bacteriocins in the tree (appendix
1).Our findings showed interesting results, several groups with high FastTree support values
(more than 70%) are identified in the tree with the potential to identify bacteriocin classes. For
class I bacteriocins, lantibiotics form well-defined groups dispersed in the tree, which are the
following: lacticin 481, nisin, streptin, salivaricin 9, ClyLS, bovicin HJ50, pldA1, salivaricin A,
mutacin 1140, and plantaricin w β groups. Moreover, class IIa bacteriocins define groups that are
close to each other. These groups are that of sakacin X, bavaricin MN, mesentericin Y105,
pediocin PA-1, and carnobacteriocin BM1. Bacteriocins of class IIb consist of ThmB, BlpMTIGR4, salivaricin Abp118 β, sakacin T α, BlpN-TIGR4, lafX, lafA, BlpM-23F, enterocin X α,
and latococcin G β groups. Bacteriocins of class IIc, IId, III, and IV as well as some divergent
members of class I, IIa, and IIb are distributed across the tree. Furthermore, several unclassified
or unknown bacteriocins that didn’t belong to known bacteriocin class are scattered across the
tree. Interestingly, the clustering of these bacteriocins with known ones with high FastTree
support values may be explained by either insufficient characterization of these bacteriocins or
either a poor sequencing of other novel bacteriocins to constitute distinct groups. For examples,
lactacin rm is clustered with enterocin AS-48 belonging to class IIc. Also, bacteriocin J46, a
posttranslationally unmodified bacteriocin, is clustered with nisin group belonging to class I that
begs the questions around their identities. In addition, some of poorly characterized class II
members grouped with well-defined ones as Hiracin JM79, which is a Sec-dependent class II
bacteriocin, clustered with the bavaricin MN group, a class IIa group. Finally, this sequence
similarity-based tree fits well with structure-based sequence fingerprints that identifies 12 groups
in 107 analyzed bacteriocins [15]. Although, some differences between these 2 trees are observed
and are mainly due to the massive number of newly described bacteriocins analyzed here. As
discussed, phylogenetic trees may provide new horizons and solutions to classify bacteriocins
that are insufficiently characterized under the current contradictory classification schemas.
3.5 Relevant results and LABiocin data analysis
LABiocin database holds 517 LAB bacteriocins, collected from scientific research articles until
august 2017. Each bacteriocin in our database is assigned a specific accession number starting
with a prefix LBN. The availability of such relevant number of stored LAB bacteriocins allows
to perform a deep data analysis that can be accessed by LABiocin users in a specific interface. In
addition, the LABiocin data analysis Figure 3 showed that until 1990, LAB bacteriocins were
less studied. Between 1990 and 2000 (19.7%), researchers have started looking for new
Generally Recognized As Safe (GRAS) bacteriocins such as LAB bacteriocins for both medical
and food application. Furthermore, the majority of bacteriocins researches (38.7%) were done in
10 years between 2000 and 2010. After 2010, bacteriocin research never stopped (37.4%).
3.5.1 Bacteriocinogenic potential in lactic acid bacteria
Our findings showed that Lactobacillus genus seems to be the most bacteriocinogenic genus
among LAB family accounting for 30.95% of total bacteriocinogenic LAB genera, followed by
Enterococcus genus with 24.18% of the total. Whereas Weissella and Bifidobacterium species
exhibit the lowest percentages with 2.13% and 0.58% respectively (Figure 4A). Interestingly,
based on the published bacteriocinogenic strain’s status, the percentage of bacteriocinogenic
probiotic strains, is very low (7.72 %) (Figure 4B). However, 58.49% of bacteriocinogenic strains
have not any published status and therefore LAB status was given to these strains. Thus, this
finding can be taken into consideration to exclude this criterion when researchers would select
probiotic strains (for screening of their ability to produce bacteriocins). It is worth to mention,
that the bacterial probiotic status of the LAB may change in the next future depending on the
research investigation.
3.5.2 LAB bacteriocins classification and antimicrobial spectrum
Among these 517 bacteriocins, 19.54% were classified in class IIa, whereas 17.6% have been
considered as bacteriocins with unknown class. The “unclassified bacteriocins” (15.28%) are
bacteriocins with sequences and/or structures that didn’t match to any known class, whereas the
“unknown bacteriocins” are bacteriocins with uncompleted information especially peptide
sequence. In other hand, class IIc, IV, III and circular bacteriocins are rare classes found in LAB
bacteriocins with the following percentages 1.35, 1.35, 0.58 and 0.39% respectively (Figure 5A).
Concerning characteristics of LAB bacteriocins, most of them (38.88%) have a molecular weight
(MW) ranging between 3 and 5 kDa. However, 17.4% of LAB bacteriocins have unidentified
MW (Figure 5B). The most important characteristics of bacteriocins is the activity spectrum. In
general, bacteriocins are known to be active against closely related bacteria. However, LAB
bacteriocins seem to have a wide antimicrobial spectrum. For example, 114 bacteriocins among
517 show an antimicrobial activity against both Gram positive and Gram-negative bacteria.
Moreover, 24 bacteriocins exhibit anti-molds activity. Furthermore, the following bacteriocins
“Bacteriocin VJ13B, Enterocin AS-48, Mutacin C67-1” and “Salivaricin A2, A3, A5 and
Salivaricin A4” can inhibit Mycobacterium species such as atypical mycobacteria (4) and
Mycobacterium tuberculosis (2) respectively (Figure 6). The list of indicator strains was collected
for each bacteriocins from the published articles. Rare bacteriocins had more than one published
article and therefore, all lists of indicator strains were compared and gathered in our database.
3.5.3 In silico predicted bacteriocins
It is worthy to mention that 7 bacteriocins have been detected with in silico methods such as the
whole genome sequencing of producer strain followed by the identification of bacteriocin gene
clusters and characterization of bacteriocins. The methods used for detection and characterization
of bacteriocins are critical to re-identify them and to discover new ones. Therefore, these methods
can help industries to characterize and produce bacteriocins in large scale for medical or food
applications.
3.5.4 Culture conditions, extraction/purification and characterization methods
LABiocin database summarizes the most important methods to be used in future bacteriocin’s
research. Starting with the medium, De Man Rogosa Sharp (MRS) medium (62.4%) is the most
medium used to produce a large amount of bacteriocins from LAB strains (Figure 7).
Concerning extraction methods, ammonium sulfate precipitation is the most method used to
extract LAB bacteriocins (42.17%). Furthermore, other famous methods “adsorption-desorption”
methods called also Yang method is placed in fourth place (6.38%). Remarkably, a large number
of bacteriocins (32.11%) were directly purified without extraction step (Figure 8A). The C18HPLC method was the most purification method used to purify LAB bacteriocins (263/358),
followed by the cation exchange method (112/358) (Figure 8B). Bacteriocin’s characterization
means methods used to identify bacteriocins properties such as MW determination as well as
peptide sequencing. In this context, SDS-PAGE is the most method used to determine
bacteriocin’s MW (211/384), followed by MALDI-TOF and ESI-MS methods (123/384). Edman
degradation method was the most sequencing method used for LAB bacteriocins (Figure 9).
Sequenced bacteriocin means a bacteriocin having a partial, mature or full prepeptide sequence
with/without leader sequence. Thus, the total number of sequenced LAB bacteriocins is 335
bacteriocins, among them 292 have full prepeptide sequences (Leader and mature peptide
sequences), whereas 174 have not any identified sequence (blocked sequence and/or nonsequenced bacteriocins). In addition, only 58 LAB bacteriocins have been partially sequenced
(Figure 10). Peptide and DNA sequences of 230 LAB bacteriocins are present in both NCBI and
Uniprot databases, whereas 75 are not present in NCBI neither Uniprot databases (Figure 10).
3.6 Comparison with other databases
In the last decade, over 10 thousands of antimicrobial peptides were studied. Several databases
were created to gather these AMPs and their information. Each AMP database holds specific
information as well as specific tools and statistics. Table 1 showed the comparison between
LABiocin and other AMP databases. APD3 [16], CAMPR3 [17], DBAASP [18], LAMP [19],
YADAMP [20], BACTIBASE 2 [6] and BAGEL4 [7] are the famous available AMP databases.
Only BACTIBASE 2 and BAGEL4 databases are specific to bacteriocins. The others hold AMP
from all origins such as: animal, plant, frog, insect, human as well as from microbes.
Bacteriocins, which are presented in AMP databases, are lacking information and need a deep
validation. As shown in Table 1, bacteriocins present in BACTIBASE have less information than
LABiocin. Furthermore, LABiocin gather GRAS bacteriocins, which are no cytotoxic and ready
to use in any clinical or food applications. LABiocin holds 517 bacteriocins, whereas
BACTIBASE 2 holds only 177 bacteriocins included non-LAB bacteriocins. However,
BACTIBASE 2 contains more information on bacteriocin’s structure as well as physico-chemical
properties.
BAGEL4 is a genome mining web tool for bacteriocin prediction. In addition, BAGEL4 holds
814 bacteriocins originated from all bacterial phyla. Moreover, the majority of BAGEL
bacteriocins are predicted from genomic analysis. LABiocin contains the majority of LAB
bacteriocins in the literatures including bacteriocins found in BACTIBASE 2 and BAGEL4.
Furthermore, the BACTIBASE and BAGEL accession numbers are mentioned as hyperlinks in
LABiocin database, which can lead users to aforementioned databases.
4. Conclusion and future prospects
LABiocin is a comprehensible and easy to access database, which compiles all LAB bacteriocins
collected until august 2017. Furthermore, LABiocin contains codes presented as hyperlinks to
several databases as Pubmed, Uniprot, NCBI, BAGEL and BACTIBASE. In addition, in
LABiocin database, users can be directed to other database sites (APD3, CAMP and BAGEL4),
which contain several prediction tools. Besides information contents, LABiocin integrates several
tools and services. This interactive database was built to continue growing up with new data
submitted by researchers and our team. As future prospects, many releases will be published
online with new information, data and new tools. The next release of LABiocin will contain
antiviral bacteriocins as well as antiviral LAB.
Author contribution
Authors
Involvement in the Labiocin database
Imad AL KASSAA
Fill in the different Excel files and update
them. The update of the bibliography. The
query of the LABiocin database and statistics
analysis. The writing and proofreading
Fill in the different Excel files and update
them. The update of the bibliography. The
query of the LABiocin database and statistics
analysis. The writing and proofreading
Bioinformatician in charge of programming of
the LABiocin database
Person in charge of gathering information from
different partners and help Al Kassaa I and
RAFEI R
The person in charge of checking of the
physiochemical characteristics of the
bacteriocins in the different files of the
database and the bibliography.
The person in charge of checking of the
bacteriocins targets and the potential
application of the bacteriocins of the lactic acid
bacteria in the different files of the database
and the bibliography.
The writing and proofreading participation of
the update of the bibliography.
The bioinformatician in charge of
programming of the LABiocin database and the
management of the software used
The management and monitoring of the work.
The establishment of the working group. The
writing and proofreading. Coordination of
exchanges between the different authors. The
Rayane RAFEI
Mirna MOUKHTAR
Mazen ZAYLAA
Adem GHARSALLAOUI
Abdeslam ASEHRAOU
Khaled EL OMARI
Ahmad SHAHINE
Monzer HAMZE
Nour-Eddine CHIHIB
selection and the update of the bibliogrpahy.
The design of the LABiocin database. The
selection of the parameters related to the lactic
acid bacteria bacteriocins. Selection of the
Bibliography. The creation of the Excel files.
The establishment of the working group. The
management and monitoring of the work. The
writing and proofreading.
Acknowledgement
Authors would thank Ms Samah Hussein and Ms Mirna Hayek for their technical support
Declarations
Funding: No funding
Competing Interests: No conflict of interest
Ethical Approval: Not required
References
[1] Line JE, Svetoch EA, Eruslanov BV, Perelygin VV, Mitsevich EV, Mitsevich IP, et al.
Isolation and purification of enterocin E-760 with broad antimicrobial activity against grampositive and gram-negative bacteria. Antimicrob Agents Chemother 2008;52:1094–100.
doi:10.1128/AAC.01569-06.
[2] Cotter PD, Hill C, Ross RP. Bacteriocins: developing innate immunity for food. Nat Rev
Microbiol 2005;3:777–88. doi:10.1038/nrmicro1273.
[3] Egan K, Field D, Rea MC, Ross RP, Hill C, Cotter PD. Bacteriocins: Novel Solutions to
Age Old Spore-Related Problems? Front Microbiol 2016;7:461.
doi:10.3389/fmicb.2016.00461.
[4] EFSA. Introduction of a Qualified Presumption of Safety (QPS) approach for assessment of
selected microorganisms referred to EFSA - Opinion of the Scientific Committee. The
EFSA journal 2007;587:1–16.
[5] Ołdak A, Zielińska D. Bacteriocins from lactic acid bacteria as an alternative to antibiotics.
Postepy Hig Med Doswiadczalnej Online 2017;71:328–38.
[6] Hammami R, Zouhir A, Le Lay C, Ben Hamida J, Fliss I. BACTIBASE second release: a
database and tool platform for bacteriocin characterization. BMC Microbiol 2010;10:22.
doi:10.1186/1471-2180-10-22.
[7] van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. BAGEL4: a user-friendly
web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res 2018;46:W278–
81. doi:10.1093/nar/gky383.
[8] Gama-Castro S, Salgado H, Santos-Zavaleta A, Ledezma-Tejeida D, Muñiz-Rascado L,
García-Sotelo JS, et al. RegulonDB version 9.0: high-level integration of gene regulation,
coexpression, motif clustering and beyond. Nucleic Acids Res 2016;44:D133-143.
doi:10.1093/nar/gkv1156.
[9] Misirli G, Hallinan J, Röttger R, Baumbach J, Wipat A. BacillusRegNet: a transcriptional
regulation database and analysis platform for Bacillus species. J Integr Bioinforma
2014;11:244. doi:10.2390/biecoll-jib-2014-244.
[10] Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis
Version 7.0 for Bigger Datasets. Mol Biol Evol 2016;33:1870–4.
doi:10.1093/molbev/msw054.
[11] Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput.
Nucleic Acids Res 2004;32:1792–7. doi:10.1093/nar/gkh340.
[12] Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment
trimming in large-scale phylogenetic analyses. Bioinformatics 2009;25:1972–3.
doi:10.1093/bioinformatics/btp348.
[13] Darriba D, Taboada GL, Doallo R, Posada D. ProtTest 3: fast selection of best-fit models of
protein evolution. Bioinformatics 2011;27:1164–5. doi:10.1093/bioinformatics/btr088.
[14] Price MN, Dehal PS, Arkin AP. FastTree 2 – Approximately Maximum-Likelihood Trees
for Large Alignments. PLOS ONE 2010;5:e9490. doi:10.1371/journal.pone.0009490.
[15] Zouhir A, Hammami R, Fliss I, Hamida JB. A new structure-based classification of grampositive bacteriocins. Protein J 2010;29:432–9. doi:10.1007/s10930-010-9270-4.
[16] Wang G, Li X, Wang Z. APD3: the antimicrobial peptide database as a tool for research and
education. Nucleic Acids Res 2016;44:D1087-1093. doi:10.1093/nar/gkv1278.
[17] Waghu FH, Barai RS, Gurung P, Idicula-Thomas S. CAMPR3: a database on sequences,
structures and signatures of antimicrobial peptides. Nucleic Acids Res 2016;44:D10941097. doi:10.1093/nar/gkv1051.
[18] Pirtskhalava M, Gabrielian A, Cruz P, Griggs HL, Squires RB, Hurt DE, et al. DBAASP
v.2: an enhanced database of structure and antimicrobial/cytotoxic activity of natural and
synthetic peptides. Nucleic Acids Res 2016;44:6503. doi:10.1093/nar/gkw243.
[19] Zhao X, Wu H, Lu H, Li G, Huang Q. LAMP: A Database Linking Antimicrobial Peptides.
PloS One 2013;8:e66557. doi:10.1371/journal.pone.0066557.
[20] Piotto SP, Sessa L, Concilio S, Iannelli P. YADAMP: yet another database of antimicrobial
peptides. Int J Antimicrob Agents 2012;39:346–51. doi:10.1016/j.ijantimicag.2011.12.003.
Figure Legends
Figure 1. LABioicin architecture.
This figure shows the tables of the database, the stored data of each table as well as the
relationships between these tables. The bacteriocin table was the main table.
Figure 2. LABioicin interfaces
At the up-left, Figure 2 presents the database homepage that allows access to the following
interfaces: Blast, Phylogeny, Statistics, About us, GeoMap, Contact, and Submit new bacteriocin.
At the up-middle of Figure 2, a simple table provides the user a quick view of some
characteristics of the searched bacteriocin. At the right, an interface shows detailed information
about a specific bacteriocin. At the down-middle, an advanced tool search enables the user to
perform complex queries and combine up to 9 fields in a single search.
Figure 3: Published period of different bacteriocins
Figure 4. A: The percentage of genus variability in bacteriocinogenic strains; B: Percentage of
bacteriocinogenic strain’s status.
Figure 5. A: Percentage of bacteriocin classes; B: Percentage of bacteriocin’s molecular weight.
Figure 6. Antimicrobial spectrum of bacteriocins.
Figure 7. Growth medium for bacteriocin production.
Figure 8. A: Methods of bacteriocin’s extraction, B: Methods of bacteriocin’s purification.
Figure 9. Methods of bacteriocin’s characterization.
Figure 10. A: Availability of bacteriocin’s sequence, B: presence of DNA/peptide sequences in
NCBI/Uniprot database.