Article
Estimating Equivalent Alkane Carbon Number Using Abraham
Solute Parameters
William E. Acree, Jr. 1 , Wei-Khiong Chong 2 , Andrew S.I.D. Lang 3, *
1
2
3
*
and Hamed Mozafari 3
Department of Chemistry, University of North Texas, Denton, TX 76203, USA
Advent Polytech Co., Ltd., Taipao City 61249, Taiwan
Department of Computing & Mathematics, Oral Roberts University, Tulsa, OK 74136, USA
Correspondence: alang@oru.edu
Abstract: The use of equivalent alkane carbon numbers (EACN) to characterize oils is important
in surfactant-oil-water (SOW) systems. However, the measurement of EACN values is non-trivial
and thus it becomes desirable to predict EACN values from structure. In this work, we present a
simple linear model that can be used to estimate the EACN value of oils with known Abraham solute
parameters. We used linear regression with leave-one-out cross validation on a dataset of N = 80 oils
with known Abraham solute parameters to derive a general model that can reliably estimate EACN
values based upon the Abraham solute parameters: E (the measured liquid or gas molar refraction
at 20 ◦ C minus that of a hypothetical alkane of identical volume), S (dipolarity/polarizability), A
(hydrogen bond acidity), B (hydrogen bond basicity), and V (McGowan characteristic volume) with
good accuracy within the chemical space studied (N = 80, R2 = 0.92, RMSE = 1.16, MAE = 0.90,
p < 2.2 × 10−16 ). These parameters are consistent with those in other models found in the literature
and are available for a wide range of compounds.
Keywords: equivalent alkane carbon number (EACN); Abraham solute parameters; hydrophobicity;
oils
Citation: Acree, W.E., Jr.;
Chong, W.-K.; Lang, A.S.I.D.;
Mozafari, H. Estimating Equivalent
Alkane Carbon Number Using
Abraham Solute Parameters. Liquids
2022, 2, 318–326. https://doi.org/
10.3390/liquids2040019
Academic Editor: Slobodan Žumer
Received: 19 August 2022
Accepted: 26 September 2022
Published: 2 October 2022
Publisher’s Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affiliations.
Copyright: © 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
1. Introduction
Considerable attention has been given in recent years to development of better
surfactant-based microemulsions and foam systems for enhanced oil recovery in petroleum
processes, for removal of oil from contaminated soil and industrial machinery surfaces, and
for the solubilization of fragrances in water-based formulations. Many factors including the
temperature, electrolyte concentration, and the hydrophobicities of both the surfactant and
oil contribute to the overall efficiency of the extraction system. Experimental determination
of the optimum set of conditions for a given surfactant-oil-water system is both expensive
and very time-consuming. Fortunately, empirical equations have been proposed to describe
how the various factors affect microemulsion formation. One such expression is based on
the hydrophilic-lipophilic difference (HLD) framework [1,2].
Ionic surfactant: HLD = ln(S) − k · EACN + Cc − α · (T − Tref )
(1)
Nonionic surfactant: HLD = b · S − K · EACN + Cc + CT · (T − Tref )
(2)
where S is the salinity (not to be confused with Abraham’s S parameter) and the terms ln(S)
and b · S take into account the electrolyte concentration (usually in grams per 100 mL) of
the system, b is electrolyte and surfactant specific, EACN is the equivalent alkane carbon
number of the oil phase, Cc represents the hydrophilicity of the surfactant, and the last
two terms, α · (T − Tref ) and CT · (T − Tref ) are related to the temperature effect. The
application of HLD in predicting the type and microemulsion phase behavior is described
in greater detail elsewhere [1–3].
4.0/).
Liquids 2022, 2, 318–326. https://doi.org/10.3390/liquids2040019
https://www.mdpi.com/journal/liquids
Liquids 2022, 2
319
Our interest in the current study is in developing a predictive method for the equivalent alkane carbon number (EACN), which for simple alkanes is numerically equal to
the number of carbons (ACN), and for other liquids it is equal to the number of carbons
of the n-alkane exhibiting a similar phase behavior in a reference surfactant-oil-water
(SOW) system. EACNs can be determined by comparing the oil’s fish-tail-temperature
(T*) in a reference SOW to standard calibration curves for n-alkanes [4]. The experimental
determination of EACN values for novel oils may be time consuming and the ability to
predict EACN values is advantageous. Bouton et al. [5] have developed a two-descriptor
model based upon experimental data for 43 oils using the proprietary Molecular Operating
Environment (MOE) software:
EACN = −19.84 + 2.88 · average negative softness + 0.88 · KierA3
(3)
where KierA3 is the third alpha modified shape index and “average negative softness”—which
is related to polarizability. Lukowicz et al. [6] have developed a three-descriptor model
based upon 70 oils using COSMO-RS σ-moments:
EACN = −4.85 − 0.23 · M2 − 0.33 · Macc + 0.06 · M0 ,
(4)
where M2 is molecular polarity, Macc is hydrogen bond basicity, and M0 is total molecular
surface area. This extends previous work in this area [7]. Most recently, Delforce et al. [8]
have developed both a graph machine model using SMILES codes and a neural network model using COSMO-RS-computed σ-moments based on reliable EACN values for
111 molecules.
This work develops a model for EACN based upon experimental EACN values for
80 liquids using the five Abraham solute parameters E, S, A, B, and V which encode
physicochemical properties related to those already found to be important-namely size and
shape, polarizability, and hydrogen bond basicity, i.e., we propose the following model:
EACN = c + e · E + s · S + a · A + b · B + v · V
(5)
where E is the solute excess molar refractivity-the measured liquid or gas molar refraction
at 20 ◦ C minus that of a hypothetical alkane of identical volume-in units of (cm3 /mol)/10, S
is the solute dipolarity/polarizability, A and B are the overall or summation hydrogen bond
acidity and basicity, and V is the McGowan characteristic volume in units of (cm3 /mol)/100.
2. Materials and Methods
Experimentally measured EACN values, collected by Aubry et al. [4,8], were combined with their experimentally determined Abraham solute parameters, primarily from
the UFZ-LSER database [9], with the values for decylcyclohexane taken from a paper by
Chung et al. [10], and the values for dodecylcyclohexane and bis(2-ethylhexyl) adipate
new to this work (from an unpublished database of measured Abraham parameters original to Professor Abraham dated December 2020 shared with one of the authors before
Professor Abraham’s passing on the 19 January 2021), see Table 1. A modeling dataset
was created from these data by: 1. Using median EACN values for compounds with
multiple experimental measurements. 2. Only keeping compounds with all 5 Abraham
parameters available (measured, not predicted) [9]. This dataset of N = 86 compounds with
EACN values and Abraham parameters is available under a CC0 license from figshare [11].
Modeling was performed using R v4.2.0 (R Core Team, Vienna, Austria) [12].
Liquids 2022, 2
320
Table 1. Measured EACN values with available Abraham Solute Parameters.
Compound
EACN
E
S
A
B
V
EACN Ref.
2.4
1.7
1.8
2.5
3.5
2.8
4.6
4.5
4.5
4.5
3.7
4.1
3.9
2.6
5.8
6.3
5.5
5.6
5.7
4.5
7.2
7.9
6.9
5.6
5.3
10.0
11.2
10.1
4.3
3.9
6.3
6.7
4.6
14.4
17.5
11.7
14.8
24.5
13.8
0.305
0.305
0.305
0.305
0.244
0.244
0.191
0.191
0.263
0.263
0.263
0.409
0.320
0.320
0.257
0.257
0.257
0.283
0.283
0.283
0.255
0.255
0.255
0.474
0.544
0.000
0.000
0.000
0.421
0.421
0.270
0.270
0.270
0.243
0.300
0.000
0.000
-
0.10
0.10
0.10
0.10
0.06
0.06
0.17
0.17
0.10
0.10
0.10
0.10
0.23
0.23
0.14
0.14
0.14
0.07
0.07
0.07
0.14
0.14
0.14
0.10
0.25
0.00
0.00
0.00
0.12
0.12
0.07
0.07
0.07
0.23
0.23
0.00
0.00
-
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
-
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.13
0.13
0.00
0.00
0.00
0.00
0.00
0.00
0.00
-
0.8454
0.8454
0.8454
0.8454
0.9863
0.9863
1.1272
1.1272
1.1272
1.1272
1.1272
1.1272
1.1272
1.1272
1.2681
1.2681
1.2681
1.2681
1.2681
1.2681
1.4090
1.4090
1.4090
1.4090
1.3004
1.5176
1.5176
1.5176
1.3004
1.3004
1.4090
1.4090
1.4090
2.2544
2.5362
1.7994
2.2221
-
[13]
[13]
[14]
[15]
[13]
[13]
[13]
[13]
[13]
[13]
[16]
[6]
[16]
[13]
[13]
[13]
[16]
[13]
[13]
[13]
[13]
[17]
[16]
[18]
[18]
[5,19]
[6,20]
[13]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[21]
[16]
[7]
[7]
[21]
[22]
−3.4
1.0
3.5
6.3
5.6
5.8
8.0
7.3
9.8
9.0
0.340
0.191
0.185
0.366
0.181
0.181
0.176
0.176
0.173
0.173
0.37
0.40
0.40
0.40
0.40
0.41
0.41
0.42
0.42
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.12
0.09
0.09
0.10
0.10
0.10
0.10
0.10
0.10
0.8472
1.3582
1.6400
1.7624
1.9218
1.9218
2.2036
2.2036
2.4854
2.4854
[18]
[23]
[18]
[24]
[18]
[23]
[18]
[23]
[18]
[23]
Branched and cyclic alkanes
Cyclohexane
Cyclohexane
Cyclohexane
Cyclohexane
Methylcyclohexane
Methylcyclohexane
1,4-Dimethylcyclohexane
1,4-Dimethylcyclohexane
Ethylcyclohexane
Ethylcyclohexane
Ethylcyclohexane
Cyclooctane
1,2-Dimethylcyclohexane
1,2-Dimethylcyclohexane
Propylcyclohexane
Propylcyclohexane
Propylcyclohexane
Isopropylcyclohexane
Isopropylcyclohexane
Isopropylcyclohexane
Butylcyclohexane
Butylcyclohexane
Butylcyclohexane
Cyclodecane
cis-Decalin
Myrcane
Myrcane
Myrcane
Pinane
Pinane
p-Menthane
p-Menthane
p-Menthane
Decylcyclohexane
Dodecylcyclohexane
Isododecane
Hemisqualene
Squalane
Squalene
Halogenated alkanes
1-Bromo-2-methylpropane
1-Chlorooctane
1-Chlorodecane
1,10-Dichlorodecane
1-Chlorododecane
1-Chlorododecane
1-Chlorotetradecane
1-Chlorotetradecane
1-Chlorohexadecane
1-Chlorohexadecane
Liquids 2022, 2
321
Table 1. Cont.
Compound
EACN
E
S
A
B
V
EACN Ref.
−1.2
−3.1
−4.1
0.8
−0.8
0.6
−0.5
0.4
−1.4
−3.2
3.9
1.5
−1.8
−2.4
5.5
0.1
0.4
−3.3
3.6
3.4
3.1
3.6
2.9
2.0
2.3
2.0
2.0
1.6
1.9
1.4
1.5
0.8
1.3
0.1
−0.3
−1.3
8.1
2.0
3.9
4.0
10.3
6.6
7.3
5.7
6.2
6.0
14.4
7.8
0.395
0.515
0.501
0.391
0.391
0.347
0.347
0.360
0.360
0.495
0.094
0.460
0.155
0.613
0.093
0.143
0.600
0.438
0.438
0.350
0.350
0.492
0.492
0.515
0.515
0.501
0.501
0.522
0.522
0.526
0.526
0.590
0.590
0.607
0.607
0.089
0.133
0.579
0.757
0.757
0.720
0.720
0.579
0.079
0.571
0.28
0.38
0.46
0.18
0.18
0.22
0.22
0.20
0.20
0.32
0.08
0.24
0.22
0.52
0.08
0.22
0.51
0.20
0.20
0.12
0.12
0.22
0.22
0.19
0.19
0.31
0.31
0.29
0.29
0.25
0.25
0.31
0.31
0.49
0.49
0.08
0.22
0.48
0.20
0.20
0.15
0.15
0.47
0.08
0.47
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.09
0.00
0.00
0.09
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.09
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.09
0.12
0.16
0.10
0.10
0.10
0.10
0.10
0.10
0.11
0.07
0.10
0.10
0.16
0.07
0.10
0.15
0.14
0.14
0.07
0.07
0.14
0.14
0.15
0.15
0.23
0.23
0.22
0.22
0.23
0.23
0.20
0.20
0.19
0.19
0.07
0.10
0.15
0.22
0.22
0.25
0.25
0.15
0.07
0.15
0.8024
0.7594
0.7594
0.9433
0.9433
0.9433
0.9433
0.9433
0.9433
0.7919
1.1928
1.0842
1.1498
0.9982
1.4746
1.4316
1.2800
1.2574
1.2574
1.3660
1.3660
1.2574
1.2574
1.2574
1.2574
1.3230
1.3230
1.3230
1.3230
1.3230
1.3230
1.3230
1.3230
1.2800
1.2800
1.7564
1.7134
1.8436
1.8533
1.8533
1.9189
1.9189
2.1254
2.6018
2.4072
[13]
[13]
[13]
[13]
[13]
[13]
[13]
[13]
[13]
[13]
[18]
[18]
[18]
[18]
[18]
[18]
[18]
[18]
[6,20]
[6,20]
[19]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[6,20]
[18]
[18]
[18]
[16]
[6,20]
[20]
[6,20]
[6,20]
[6,20]
[25]
[18]
[25]
Alkenes, terpenes, alkynes and aromatics
Cyclohexene
1,3-Cyclohexadiene
1,4-Cyclohexadiene
1-Methyl-1-cyclohexene
1-Methyl-1-cyclohexene
4-Methyl-1-cyclohexene
4-Methyl-1-cyclohexene
3-Methyl-1-cyclohexene
3-Methyl-1-cyclohexene
2,5-Norbornadiene
1-Octene
cis-Cycloctene
1-Octyne
p-Xylene
1-Decene
1-Decyne
Butylbenzene
Phenyl-1-butyne
alpha-Pinene
alpha-Pinene
p-Menth-2-ene
p-Menth-2-ene
Delta-3-carene
Delta-3-carene
beta-Pinene
beta-Pinene
Limonene
Limonene
gamma-Terpinene
gamma-Terpinene
alpha-Terpinene
alpha-Terpinene
Terpinolene
Terpinolene
p-Cymene
p-Cymene
1-Dodecene
1-Dodecyne
1-Tetradecyne
Octylbenzene
2,6,10-Trimethylundecane-2,6-diene
Longifolene
Longifolene
Caryophyllene
Caryophyllene
Decylbenzene
1-Octadecene
Dodecylbenzene
Liquids 2022, 2
322
Table 1. Cont.
Compound
EACN
E
S
A
B
V
EACN Ref.
2.2
2.4
3.3
3.2
–3.4
−1.7
4.2
1.9
1.7
−2.1
−0.6
−1.3
1.8
2.3
2.2
6.2
−0.6
0.3
3.8
5.0
6.2
8.0
5.2
7.2
8.1
10.3
6.8
7.2
7.3
6.8
9.3
7.3
7.1
9.7
12.2
13.8
13.0
15.7
18.5
21.2
23.9
21.2
−0.063
0.000
0.000
0.000
0.108
0.162
0.000
0.108
0.156
0.101
0.013
0.013
0.013
0.000
0.103
0.132
0.002
0.002
0.000
−0.010
0.000
−0.062
−0.062
0.000
−0.010
−0.040
-
0.17
0.25
0.25
0.25
0.68
0.90
0.25
0.51
0.68
0.90
0.68
0.58
0.58
0.58
0.25
0.68
0.90
0.58
0.56
0.58
0.06
0.25
0.53
0.53
0.58
1.10
1.25
-
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
-
0.57
0.45
0.45
0.45
0.51
0.36
0.45
0.51
0.36
0.51
0.45
0.45
0.45
0.45
0.51
0.36
0.45
0.45
0.45
0.45
0.45
0.45
0.45
0.45
1.13
1.28
-
1.0127
1.2945
1.2945
1.2945
1.2515
1.2500
1.5763
1.6350
1.5333
1.5320
1.6742
1.8738
1.8738
1.8738
1.8581
1.8151
1.8132
2.1556
2.1556
2.4374
2.4374
2.4217
2.5783
2.5783
2.7192
3.3572
8.3631
-
[26]
[22]
[25]
[27]
[22]
[22]
[22]
[27]
[27]
[22]
[22]
[22]
[17]
[20]
[18]
[22]
[22]
[22]
[6]
[6]
[6]
[22]
[6]
[6]
[6]
[22]
[6]
[6]
[25]
[6]
[20]
[6]
[28]
[6]
[6]
[6]
[25]
[5]
[5]
[5]
[5]
[5]
−1.5
−1.6
−1.7
−3.1
0.4
−0.2
−0.1
0.8
0.322
0.380
0.674
0.154
0.154
0.154
0.154
0.61
0.33
0.86
0.49
0.49
0.49
0.49
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.62
0.76
0.57
0.45
0.45
0.45
0.45
1.4247
1.3591
1.3390
1.5490
1.5490
1.5490
1.5490
[21]
[21]
[21]
[21]
[29]
[29]
[29]
[29]
Ethers, esters, nitriles and ketones
Diisopropyl ether
Dibutyl ether
Dibutyl ether
Dibutyl ether
2-Octanone
Octanenitrile
Dipentyl ether
C3-O-C4-O-C3
C4-O-C2-O-C4
2-Decanone
Decanenitrile
2-Undecanone
Ethyl decanoate
Ethyl decanoate
Ethyl decanoate
Dihexyl ether
2-Dodecanone
Dodecanenitrile
Ethyl dodecanoate
Decyl butyrate
Hexyl octanoate
Diheptyl ether
Ethyl myristate
Butyl dodecanoate
Octyl octanoate
Dioctyl ether
Myristyl propionate
Isopropyl myristate
Isopropyl myristate
Ethyl palmitate
Hexyl dodecanoate
Ethyl oleate
Ethyl oleate
Bis(2-ethylhexyl) adipate
Tricaprilin
Tricaprin
Tricaprin
Trilaurin
Trimyristin
Tripalmitin
Tristearin
Triolein
Fragrances, acrylates and miscellaneous
Menthone
Eucalyptol
Rose oxide
D-Carvone
Hexyl methacrylate
Hexyl methacrylate
Hexyl methacrylate
Hexyl methacrylate
Liquids 2022, 2
323
Table 1. Cont.
Compound
EACN
E
S
A
B
V
EACN Ref.
0.7
−0.2
1.5
0.2
−0.4
−0.1
−0.2
−0.6
−0.9
−1.3
−1.7
−1.9
−1.1
3.5
1.0
0.154
0.154
0.154
0.154
0.154
0.243
0.198
0.368
0.331
0.340
0.892
-
0.49
0.49
0.49
0.49
0.49
0.65
0.59
0.65
0.65
1.53
0.78
-
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
-
0.45
0.45
0.45
0.45
0.45
0.54
0.64
0.68
0.65
0.97
0.76
-
1.5490
1.5490
1.5490
1.5490
1.5490
1.7652
1.8308
1.7878
1.7878
1.9218
1.7614
-
[29]
[29]
[29]
[29]
[29]
[6,20]
[21,22]
[21,22]
[21,22]
[21]
[21]
[17]
[21]
[21]
[21]
Fragrances, acrylates and miscellaneous
Hexyl methacrylate
Hexyl methacrylate
Hexyl methacrylate
Hexyl methacrylate
Hexyl methacrylate
Menthyl acetate
Citronellyl acetate
Geranyl acetate
Linalyl acetate
alpha-Damascone
Methyl dihydrojasmonate
beta-Ionone
Ethylene brassylate
Methyl cedryl ether
Ambrettolide
3. Results
The standard Abraham solute descriptor-based model is represented by Equation (6),
where c is the intercept, E, S, A, B, and V are the Abraham solute descriptors, and e, s, a, b,
v are their coefficients obtained by linear regression:
EACN = c + e · E + s · S + a · A + b · B + v · V
(6)
When using all the data, we found a moderate correlation (R2 = 0.61) between S and
B. Removing S-parameter outliers bis(2-ethylhexyl) adipate, tristearin, and methyl dihydrojasmonate (S > 1) resulted in a dataset where all pairwise correlations had coefficients
of determination lower than 0.50 and the coefficients of determination of each parameter
against all others was lower than 0.80. Removing S-parameter outliers provides a greater
reliability to the model, but may limit its application to a smaller chemical space.
Performing linear regression with leave-out-out (LOO) cross-validation showed that
dodecylcyclohexane, decylcyclohexane, and octyl octanoate were clear outliers. Removing
the three outliers, and again using LOO linear regression, we found that EACN can be
estimated with similar accuracy (LOO measures) using Abraham solute parameters (N = 80,
R2 = 0.92, RMSE = 1.16, MAE = 0.90, p < 2.2 × 10−16 ) as compared to previous models [5–8],
at least within the chemical space represented in the study:
EACN = −2.16 − 2.08 · E − 9.51 · S − 50.91 · A − 5.41 · B + 6.83 · V
(7)
Analyzing the EACN estimates for each compound type, we see that the model performs the best for alkenes, terpenes, alkynes, and aromatics (N = 31, ME = 0.37, MAE = 0.57,
RMSE = 0.76). Good performance is seen for the four other types: branched and cyclic
alkanes (N = 16, ME = −0.81, MAE = 0.85, RMSE = 1.11); halogenated alkanes (N = 6,
ME = 0.99, MAE = 0.99, RMSE = 1.16); fragrances, acrylates, and miscellaneous (N = 9,
ME = −0.16, MAE = 0.94, RMSE = 1.18); and ethers, esters, nitriles, and ketones (N = 18,
ME = −0.17, MAE = 1.16, RMSE = 1.32)—with consistent over-prediction for halogenated
alkanes and consistent under-prediction for branched and cyclic alkanes, see Figure 1.
Liquids
Liquids 2022,
2022, 22, FOR PEER REVIEW
324
7
Figure 1.
Measured EACN
EACN values
values colored
colored by
Figure
1. Estimated
Estimated vs.
vs. Measured
by type:
type: alkenes,
alkenes, terpenes,
terpenes, alkynes,
alkynes, and
and
aromatics (dark
cyclic
alkanes
(orange);
ethers,
esters,
nitriles,
and ketones
(red);
aromatics
(darkblue);
blue);branched
branchedand
and
cyclic
alkanes
(orange);
ethers,
esters,
nitriles,
and ketones
(red);
fragrances,
acrylates,
and miscellaneous
(light blue);
and halogenated
alkanes
(green).
fragrances,
acrylates,
and miscellaneous
(light blue);
and halogenated
alkanes
(green).
4. Discussion
Discussion
4.
We have
We
have demonstrated
demonstrated that
that EACN
EACN can
can be
be estimated
estimated using
using the
the standard
standard Abraham
Abraham
solute
parameter
model,
see
Equation
(7).
The
first
four
parameters
all have
solute parameter model, see Equation (7). The first four parameters all
have negative
negative
coefficients where
EEisisthethe
solute
excess
molar
refractivity-the
measured
liquidliquid
or gasor
molar
coefficients
where
solute
excess
molar
refractivity-the
measured
gas
◦ C minus that of a hypothetical alkane of identical volume-in units of
refraction
at
20
molar refraction at 20 °C minus that of a hypothetical alkane of identical volume-in units
(cm3 /mol)/10,
solute dipolarity/polarizability, A and B are the overall or summaof (cm3/mol)/10,S is
S the
is the
solute dipolarity/polarizability, A and B are the overall or
tion hydrogen bond acidity and basicity, and V is the McGowan characteristic volume in
summation 3hydrogen bond acidity and basicity, and V is the McGowan characteristic
units of (cm /mol)/100. These results align with previous results [5–8], using different
volume in units of (cm3/mol)/100. These results align with previous results [5–8], using
parameter systems, but showing similar accuracy and that EACN has contributions from
different parameter systems, but showing similar accuracy and that EACN has
shape (size and branching) [5–8], polarity/polarizability [5–8], and hydrogen bond basiccontributions from shape (size and branching) [5–8], polarity/polarizability [5–8], and
ity [6–8]. Our addition of hydrogen bond acidity, represented by the A descriptor, leads to
hydrogen bond basicity [6–8]. Our addition of hydrogen bond acidity, represented by the
superior estimation of EACN values for alkynes something not seen in previous models.
A descriptor, leads to superior estimation of EACN values for alkynes something not seen
We began with a dataset of N = 86 oils with both measured EACN values and meain previous models.
sured Abraham solute descriptors. During modeling we removed six compounds: bis(2We began with a dataset of N = 86 oils with both measured EACN values and
ethylhexyl) adipate, tristearin, methyl dihydrojasmonate, decylcyclohexane, dodecylcyclomeasured Abraham solute descriptors. During modeling we removed six compounds:
hexane, and octyl octanoate.
bis(2-ethylhexyl)
tristearin,
methyl because
dihydrojasmonate,
decylcyclohexane,
The first threeadipate,
compounds
were removed
they had large
S-values which
dodecylcyclohexane,
and
octyl
octanoate.
resulted in an artificially high collinearity with the B parameter. The second set of three
The firstwere
three
compounds
werefrom
removed
they had largeanalysis.
S-valuesEven
which
compounds
removed
as outliers
a firstbecause
LOO cross-validation
so,
resulted
in
an
artificially
high
collinearity
with
the
B
parameter.
The
second
set
of
three
Equation (7)-predicted EACN values for these compounds are generally of the right order,
compounds
werevalues
removed
as outliers
from[11].
a first
LOO
cross-validation
analysis.
so,
see the predicted
in the
open dataset
The
utility
of Equation (7)
can alsoEven
be seen
Equation
(7)-predicted
EACN
values
for
these
compounds
are
generally
of
the
right
order,
by using it to predict EACN values of compounds that have measured EACN values but
see the predicted values in the open dataset [11]. The utility of Equation (7) can also be
Liquids 2022, 2
325
that do not have measured Abraham solute parameters. Using predicted Abraham solute
parameters [9], we predicted the EACN values of several of these compounds without
measured Abraham solute parameters from Table 1, see Table 2. For the compounds listed
in Table 2, Equation (7) performs relatively well, with statistics similar to those found for the
estimated EACN results above, specifically: N = 11, ME = 0.17, MAE = 1.13, RMSE = 1.61.
Table 2. Equation (7)-predicted EACN values using predicted Abraham solute parameters.
Compound
1-Tetradecyne
2,6,10-Trimethylundecane-2,6-diene
Decyl butyrate
Butyl dodecanoate
Myristyl propionate
Diheptyl ether
Hexyl dodecanoate
Ethyl oleate
Ethyl oleate
Methyl cedryl ether
Ambrettolide
alpha-Damascone
EACN
E
S
A
B
V
Predicted EACN
3.9
10.3
5.0
7.2
6.8
8.0
9.3
7.1
7.3
3.5
1.0
−1.3
0.150
0.350
0.000
0.010
0.050
0.000
0.010
0.130
0.130
0.650
0.540
0.680
0.24
0.23
0.56
0.56
0.56
0.18
0.56
0.72
0.72
0.23
0.68
0.71
0.05
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.12
0.34
0.55
0.52
0.53
0.46
0.52
0.68
0.68
0.22
0.76
0.54
1.9952
2.1361
2.1556
2.4374
2.5783
2.1399
2.7192
2.9580
2.9580
2.0959
2.2858
1.7614
5.7
7.7
4.3
6.3
7.2
8.3
8.3
7.2
7.2
7.4
1.8
−1.2
The most recent paper by Delforce et al. [8] notes that the measured EACN of 2.2 for
diisopropyl ether reported previously [26] is an outier for their model. Our model estimates
the EACN value of diisopropyl ether to be 0.2 which is in line with their newly measured
EACN value of 0.6.
Our approach provides a useful tool for estimating equivalent alkane carbon numbers
as Abraham solute parameters are available for a significant number of compounds [9,10].
While a general model is presented, models for specific families of compounds can be
easily created using our open dataset [11]. We also note that the estimated EACN values
of individual hydrocarbons from Equation (7) will allow estimation of EACN values of
heavy hydrocarbon mixtures, EACNmix , using the mathematical expression proposed by
Cayias et al. [30] and Cash et al. [31]:
EACNmix =
N
∑i=1 xi EACNi
(8)
where xi and EACNi denote the mole fraction and numerical EACN value of the individual
hydrocarbon component i, respectively.
Future research directions include measuring the EACN values of more diverse compounds, especially those with known non-zero A-parameter values.
Author Contributions: Conceptualization, W.-K.C.; methodology, W.E.A.J., A.S.I.D.L. and H.M.; data
collection and curation, A.S.I.D.L. and H.M.; writing—original draft preparation, W.E.A.J., W.-K.C.,
A.S.I.D.L. and H.M.; writing—review and editing, W.E.A.J., W.-K.C., A.S.I.D.L. and H.M. All authors
have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Data Availability Statement: Data used in this study is available from figshare under a CC0 license [8].
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
2.
Salager, J.L.; Morgan, J.C.; Schechter, R.S.; Wade, W.H.; Spe-Aime, M.; Vasquez, E. Optimum formulation of surfactant/water/oil
systems for minimum interfacial tension or phase behavior. Soc. Pet. Eng. AIME J. 1979, 19, 107–115. [CrossRef]
Ghayour, A.; Acosta, E. Characterizing the oil-like and surfactant-like behavior of polar oils. Langmuir 2019, 35, 15038–15050.
[CrossRef] [PubMed]
Liquids 2022, 2
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
326
Virin Kittithammavong, V.; Charoensaeng, A.; Khaodhiar, S. A Normalized HLD (HLDN ) tool for optimal salt-concentration
prediction of microemulsions. Appl. Sci. 2021, 11, 9151. [CrossRef]
Aubry, J.-M.; Ontiveros, J.F.; Salager, J.L.; Nardello-Rataj, V. Use of the normalized hydrophilic-lipophilic-deviation (HLDN)
equation for determining the equivalent alkane carbon number (EACN) of oils and the preferred alkane carbon number (PACN)
of nonionic surfactants by the fish-tail method (FTM). Adv. Colloid Interface Sci. 2020, 276, 102099. [CrossRef]
Bouton, F.; Durand, M.; Nardello-Rataj, V.; Borosy, A.P.; Quellet, C.; Aubry, J.M. A QSPR model for the prediction of the “fish-tail”
temperature of C i E4/water/polar hydrocarbon oil systems. Langmuir 2010, 26, 7962–7970. [CrossRef]
Lukowicz, T.; Illous, E.; Nardello-Rataj, V.; Aubry, J.M. Prediction of the equivalent alkane carbon number (EACN) of aprotic
polar oils with COSMO-RS sigma-moments. Colloids Surf. A Physicochem. Eng. Asp. 2018, 536, 53–59. [CrossRef]
Lukowicz, T.; Benazzouz, A.; Nardello-Rataj, V.; Aubry, J.-M. Rationalization and Prediction of the Equivalent Alkane Carbon
Number (EACN) of Polar Hydrocarbon Oils with COSMO-RS σ-Moments. Langmuir 2015, 31, 11220–11226. [CrossRef]
Delforce, L.; Duprat, F.; Ploix, J.-L.; Ontiveros, J.F.; Goussard, V.; Nardello-Rataj, V.; Aubry, J.-M. Fast Prediction of the Equivalent
Alkane Carbon Number Using Graph Machines and Neural Networks. ACS Omega 2022, in press.
Ulrich, N.; Endo, S.; Brown, T.N.; Watanabe, N.; Bronner, G.; Abraham, M.H.; Goss, K.-U. UFZ-LSER Database v 3.2.1 [Internet],
Leipzig, Germany, Helmholtz Centre for Environmental Research-UFZ. 2017. Available online: http://www.ufz.de/lserd
(accessed on 31 December 2021).
Chung, Y.; Vermeire, F.H.; Wu, H.; Walker, P.J.; Abraham, M.H.; Green, W.H. Group Contribution and Machine Learning
Approaches to Predict Abraham Solute Parameters, Solvation Free Energy, and Solvation Enthalpy. J. Chem. Inf. Modeling 2022,
62, 433–446. [CrossRef]
Acree, W.E.; Chong, W.; Lang, A.S.I.D.; Mozafari, H. Dataset: EACN and Abraham Parameter Values. Figshare. Dataset 2022.
[CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna,
Austria. 2022. Available online: https://www.R-project.org/ (accessed on 31 December 2021).
Bouton, F. Influence of Terpenes and Terpenoids on the Phase Behavior of Micro-and Macro-Emulsions. Ph.D. Thesis, University
of Lille, Lille, France, 2010.
Kahlweit, M.; Strey, R.; Busse, G. Effect of alcohols on the phase behavior of microemulsions. J. Phys. Chem. 1991, 95, 5344–5352.
[CrossRef]
Burauer, S.; Sottmann, T.; Strey, R. Nonionic microemulsions with cyclic oils-Oil penetration, efficiency and monomeric solubility.
Tenside Surfactants Deterg. 2000, 37, 8–16.
Queste, S.; Salager, J.L.; Strey, R.; Aubry, J.M. The EACN scale for oil classification revisited thanks to fish diagrams. J. Colloid
Interface Sci. 2007, 312, 98–107. [CrossRef] [PubMed]
Lukowicz, T. Synergistic Solubilisation of Fragrances in Binary Surfactant Systems and Prediction of Their EACN Value with
COSMO-RS. Ph.D. Thesis, Université Lille, Lille, France, 2015.
Tchakalova, V.; Testard, F.; Wong, K.; Parker, A.; Benczédi, D.; Zemb, T. Solubilization and interfacial curvature in microemulsions:
I. Interfacial expansion and co-extraction of oil. Colloids Surf. A Physicochem. Eng. Asp. 2008, 331, 31–39. [CrossRef]
Bouton, F.; Durand, M.; Nardello-Rataj, V.; Serry, M.; Aubry, J.M. Classification of terpene oils using the fish diagrams and the
Equivalent Alkane Carbon (EACN) scale. Colloids Surf. A Physicochem. Eng. Asp. 2009, 338, 142–147. [CrossRef]
Ontiveros, J.F.; Pierlot, C.; Catté, M.; Molinier, V.; Pizzino, A.; Salager, J.L.; Aubry, J.M. Classification of ester oils according to
their Equivalent Alkane Carbon Number (EACN) and asymmetry of fish diagrams of C10E4/ester oil/water systems. J. Colloid
Interface Sci. 2013, 403, 67–76. [CrossRef]
Gradzielski, M.; Langevin, D.; Sottmann, T.; Strey, R. Droplet microemulsions at the emulsification boundary: The influence of
the surfactant structure on the elastic constants of the amphiphilic film. J. Chem. Phys. 1997, 106, 8232–8238. [CrossRef]
Tchakalova, V.; Testard, F.; Wong, K.; Parker, A.; Benczédi, D.; Zemb, T. Solubilization and interfacial curvature in microemulsions:
II. Surfactant efficiency and PIT. Colloids Surf. A Physicochem. Eng. Asp. 2008, 331, 40–47. [CrossRef]
Israelachvili, J.N.; Mitchell, D.J.; Ninham, B.W. Theory of self-assembly of hydrocarbon amphiphiles into micelles and bilayers.
J. Chem. Soc. Faraday Trans. 2 Mol. Chem. Phys. 1976, 72, 1525–1568. [CrossRef]
Deen, G.R.; Pedersen, J.S. Phase behavior and microstructure of C12E5 nonionic microemulsions with chlorinated oils. Langmuir
2008, 24, 3111–3117. [CrossRef]
Engelskirchen, S.; Elsner, N.; Sottmann, T.; Strey, R. Triacylglycerol microemulsions stabilized by alkyl ethoxylate surfactants—A
basic study: Phase behavior, interfacial tension and microstructure. J. Colloid Interface Sci. 2007, 312, 114–121. [CrossRef] [PubMed]
Wormuth, K.R.; Kaler, E.W. Microemulsifying polar oils. J. Phys. Chem. 1989, 93, 4855–4861. [CrossRef]
Tanford, C. The Hydrophobic Effect: Formation of Micelles and Biological Membranes, 2nd ed.; J. Wiley: Hoboken, NJ, USA, 1980.
Kahlweit, M.; Busse, G.; Faulhaber, B.; Eibl, H. Preparing nontoxic microemulsions. Langmuir 1995, 11, 4185–4187. [CrossRef]
van Os, N.M.; Haak, J.R.; Rupert, L.A.M. Physico-Chemical Properties of Selected Anionic, Cationic and Nonionic Surfactants; Elsevier:
Amsterdam, The Netherlands, 2012.
Cayias, J.L.; Schechter, R.S.; Wade, W.H. Modeling crude oils for low interfacial tension. Soc. Pet. Eng. J. 1976, 16, 351–357.
[CrossRef]
Cash, L.; Cayias, J.L.; Fournier, G.; Macallister, D.; Schares, T.; Schechter, R.S.; Wade, W.H. The application of low interfacial
tension scaling rules to binary hydrocarbon mixtures. J. Colloid Interface Sci. 1976, 59, 39–44. [CrossRef]