-
XFLEX HYDRO demonstrators grid services assessment and Ancillary Services Matrix elaboration
Authors:
Christophe Nicolet,
Matthieu Dreyer,
Christian Landry,
Sébastien Alligné,
Antoine Béguin,
Yves Vaillant,
Stefan Tobler,
Goekhan Sari,
Grégory Païs,
Matteo Bianciotto,
Steve Sawyer,
Richard Taylor,
Manuel Vaz Castro,
Maria Helena Vasconcelos,
Carlos Moreira
Abstract:
This paper presents the methodology and key results used to establish the so-called Ancillary Service Matrix (ASM), which presents the ability of each of the 6 demonstrators of the XFLEX HYDRO research project, combined with the applicable technologies studied in this analysis, to deliver the different ancillary services. These technologies include i) variable speed technology with a Doubly Fed Induction Machine (DFIM) or Full Size Frequency Converters (FSFC), ii) the Smart Power Plant Supervisor (SPPS), which extends the operating range of the hydraulic units in turbine mode based on better knowledge of hydro unit wear and tear and the associated costs over the full unit operating range, iii) hydraulic short circuit (HSC) operation, in which the pumps and turbines of Pumped Storage Power Plants (PSPP) operate simultaneously, and iv) the Hydro-Battery-Hybrid (HBH) applied at the Run-of-River demonstrator.
The demonstrators considered for this study include 4 pumped storage power plants, 1 conventional hydro storage plant and 1 run-of-the-river plant. For each demonstrator, a 1D simulation model was developed and validated, then enhanced with a model of the control system so that the various ancillary services could be addressed. Systematic 1D numerical simulation of the ancillary service contribution of each demonstrator and its related technologies made it possible to quantify the magnitude of the active power response available for the different grid services. The results were scored between 0 and 5 for each ancillary service and used to populate the Ancillary Service Matrix, which summarizes the results in a graphical and synthetic way. Analysis of the Ancillary Service Matrix scores enables the reader to draw several key conclusions, summarized in the paper, about the benefits unlocked by implementing these technologies.
Submitted 9 April, 2024;
originally announced April 2024.
-
Deep Learning for Plant Identification and Disease Classification from Leaf Images: Multi-prediction Approaches
Authors:
Jianping Yao,
Son N. Tran,
Saurabh Garg,
Samantha Sawyer
Abstract:
Deep learning plays an important role in modern agriculture, especially in plant pathology using leaf images where convolutional neural networks (CNN) are attracting a lot of attention. While numerous reviews have explored the applications of deep learning within this research domain, there remains a notable absence of an empirical study to offer insightful comparisons due to the employment of varied datasets in the evaluation. Furthermore, a majority of these approaches tend to address the problem as a singular prediction task, overlooking the multifaceted nature of predicting various aspects of plant species and disease types. Lastly, there is an evident need for a more profound consideration of the semantic relationships that underlie plant species and disease types. In this paper, we start our study by surveying current deep learning approaches for plant identification and disease classification. We categorise the approaches into multi-model, multi-label, multi-output, and multi-task, in which different backbone CNNs can be employed. Furthermore, based on the survey of existing approaches in plant pathology and the study of available approaches in machine learning, we propose a new model named Generalised Stacking Multi-output CNN (GSMo-CNN). To investigate the effectiveness of different backbone CNNs and learning approaches, we conduct an intensive experiment on three benchmark datasets: Plant Village, Plant Leaves, and PlantDoc. The experimental results demonstrate that InceptionV3 can be a good choice for a backbone CNN as its performance is better than that of AlexNet, VGG16, ResNet101, EfficientNet, MobileNet, and a custom CNN developed by us. Interestingly, empirical results support the hypothesis that using a single model can be comparable to or better than using two models. Finally, we show that the proposed GSMo-CNN achieves state-of-the-art performance on the three benchmark datasets.
Submitted 24 October, 2023;
originally announced October 2023.
-
Machine Learning for Leaf Disease Classification: Data, Techniques and Applications
Authors:
Jianping Yao,
Son N. Tran,
Samantha Sawyer,
Saurabh Garg
Abstract:
The growing demand for sustainable development has brought a range of information technologies to bear on agricultural production. In particular, machine learning, a branch of artificial intelligence, has produced multiple breakthroughs that can enhance and revolutionize plant pathology approaches. In recent years, machine learning has been adopted for leaf disease classification in both academic research and industrial applications. It is therefore highly beneficial for researchers, engineers, managers, and entrepreneurs to have a comprehensive view of recent developments in machine learning technologies and applications for leaf disease detection. This study surveys different aspects of the topic, including data, techniques, and applications. The paper starts with publicly available datasets. After that, we summarize common machine learning techniques, including traditional (shallow) learning, deep learning, and augmented learning. Finally, we discuss related applications. This paper provides useful resources for future study and application of machine learning for smart agriculture in general and leaf disease classification in particular.
Submitted 19 October, 2023;
originally announced October 2023.
-
Platformization of Inequality: Gender and Race in Digital Labor Platforms
Authors:
Isabel Munoz,
Pyeonghwa Kim,
Clea O'Neil,
Michael Dunn,
Steve Sawyer
Abstract:
We contribute empirical and conceptual insights regarding the roles of digital labor platforms in online freelancing, focusing attention on social identities such as gender, race, ethnicity, and occupation. Findings highlight how digital labor platforms reinforce and exacerbate identity-based stereotypes, bias and expectations in online freelance work. We focus on online freelancing as this form of working arrangement is becoming more prevalent. Online freelancing also relies on the market-making power of digital platforms to create an online labor market. Many see this as one likely future of work with less bias. Others worry that labor platforms' market power allows them to embed known biases into new working arrangements: a platformization of inequality. Drawing on data from 108 online freelancers, we discuss six findings: 1) female freelance work is undervalued; 2) gendered occupational expectations; 3) gendered treatment; 4) shared expectations of differential values; 5) racial stereotypes and expectations; and 6) race and ethnicity as an asset. We discuss the role of design in the platformization and visibility of social identity dimensions and the implications of the reinforced identity perceptions and marginalization in digital labor platforms.
Submitted 28 September, 2023;
originally announced September 2023.
-
Evaluating Accumulo Performance for a Scalable Cyber Data Processing Pipeline
Authors:
Scott M. Sawyer,
B. David O'Gwynn
Abstract:
Streaming, big data applications face challenges in creating scalable data flow pipelines, in which multiple data streams must be collected, stored, queried, and analyzed. These data sources are characterized by their volume (in terms of dataset size), velocity (in terms of data rates), and variety (in terms of fields and types). For many applications, distributed NoSQL databases are effective alternatives to traditional relational database management systems. This paper considers a cyber situational awareness system that uses the Apache Accumulo database to provide scalable data warehousing, real-time data ingest, and responsive querying for human users and analytic algorithms. We evaluate Accumulo's ingestion scalability as a function of number of client processes and servers. We also describe a flexible data model with effective techniques for query planning and query batching to deliver responsive results. Query performance is evaluated in terms of latency of the client receiving initial result sets. Accumulo performance is measured on a database of up to 8 nodes using real cyber data.
Submitted 21 July, 2014;
originally announced July 2014.
-
The Interface Region Imaging Spectrograph (IRIS)
Authors:
B. De Pontieu,
A. M. Title,
J. Lemen,
G. D. Kushner,
D. J. Akin,
B. Allard,
T. Berger,
P. Boerner,
M. Cheung,
C. Chou,
J. F. Drake,
D. W. Duncan,
S. Freeland,
G. F. Heyman,
C. Hoffman,
N. E. Hurlburt,
R. W. Lindgren,
D. Mathur,
R. Rehse,
D. Sabolish,
R. Seguin,
C. J. Schrijver,
T. D. Tarbell,
J. -P. Wuelser,
C. J. Wolfson
, et al. (63 additional authors not shown)
Abstract:
The Interface Region Imaging Spectrograph (IRIS) small explorer spacecraft provides simultaneous spectra and images of the photosphere, chromosphere, transition region, and corona with 0.33-0.4 arcsec spatial resolution, 2 s temporal resolution and 1 km/s velocity resolution over a field-of-view of up to 175 arcsec x 175 arcsec. IRIS was launched into a Sun-synchronous orbit on 27 June 2013 using a Pegasus-XL rocket and consists of a 19-cm UV telescope that feeds a slit-based dual-bandpass imaging spectrograph. IRIS obtains spectra in passbands from 1332-1358, 1389-1407 and 2783-2834 Angstrom including bright spectral lines formed in the chromosphere (Mg II h 2803 Angstrom and Mg II k 2796 Angstrom) and transition region (C II 1334/1335 Angstrom and Si IV 1394/1403 Angstrom). Slit-jaw images in four different passbands (C II 1330, Si IV 1400, Mg II k 2796 and Mg II wing 2830 Angstrom) can be taken simultaneously with spectral rasters that sample regions up to 130 arcsec x 175 arcsec at a variety of spatial samplings (from 0.33 arcsec and up). IRIS is sensitive to emission from plasma at temperatures between 5000 K and 10 MK and will advance our understanding of the flow of mass and energy through an interface region, formed by the chromosphere and transition region, between the photosphere and corona. This highly structured and dynamic region not only acts as the conduit of all mass and energy feeding into the corona and solar wind, it also requires an order of magnitude more energy to heat than the corona and solar wind combined. The IRIS investigation includes a strong numerical modeling component based on advanced radiative-MHD codes to facilitate interpretation of observations of this complex region. Approximately eight Gbytes of data (after compression) are acquired by IRIS each day and made available for unrestricted use within a few days of the observation.
Submitted 10 January, 2014;
originally announced January 2014.
-
Ancient human genomes suggest three ancestral populations for present-day Europeans
Authors:
Iosif Lazaridis,
Nick Patterson,
Alissa Mittnik,
Gabriel Renaud,
Swapan Mallick,
Karola Kirsanow,
Peter H. Sudmant,
Joshua G. Schraiber,
Sergi Castellano,
Mark Lipson,
Bonnie Berger,
Christos Economou,
Ruth Bollongino,
Qiaomei Fu,
Kirsten I. Bos,
Susanne Nordenfelt,
Heng Li,
Cesare de Filippo,
Kay Prüfer,
Susanna Sawyer,
Cosimo Posth,
Wolfgang Haak,
Fredrik Hallgren,
Elin Fornander,
Nadin Rohland
, et al. (95 additional authors not shown)
Abstract:
We sequenced genomes from a $\sim$7,000 year old early farmer from Stuttgart in Germany, an $\sim$8,000 year old hunter-gatherer from Luxembourg, and seven $\sim$8,000 year old hunter-gatherers from southern Sweden. We analyzed these data together with other ancient genomes and 2,345 contemporary humans to show that the great majority of present-day Europeans derive from at least three highly differentiated populations: West European Hunter-Gatherers (WHG), who contributed ancestry to all Europeans but not to Near Easterners; Ancient North Eurasians (ANE), who were most closely related to Upper Paleolithic Siberians and contributed to both Europeans and Near Easterners; and Early European Farmers (EEF), who were mainly of Near Eastern origin but also harbored WHG-related ancestry. We model these populations' deep relationships and show that EEF had $\sim$44% ancestry from a "Basal Eurasian" lineage that split prior to the diversification of all other non-African lineages.
Submitted 1 April, 2014; v1 submitted 23 December, 2013;
originally announced December 2013.
-
Analyzing Machupo virus-receptor binding by molecular dynamics simulations
Authors:
Austin G. Meyer,
Sara L. Sawyer,
Andrew D. Ellington,
Claus O. Wilke
Abstract:
In many biological applications, we would like to be able to computationally predict mutational effects on affinity in protein-protein interactions. However, many commonly used methods to predict these effects perform poorly in important test cases. In particular, the effects of multiple mutations, non-alanine substitutions, and flexible loops are difficult to predict with available tools and protocols. We present here an existing method applied in a novel way to a new test case; we interrogate affinity differences resulting from mutations in a host-virus protein-protein interface. We use steered molecular dynamics (SMD) to computationally pull the Machupo virus (MACV) spike glycoprotein (GP1) away from the human transferrin receptor (hTfR1). We then approximate affinity using the maximum applied force of separation and the area under the force-versus-distance curve. We find, even without the rigor and planning required for free energy calculations, that these quantities can provide novel biophysical insight into the GP1/hTfR1 interaction. First, with no prior knowledge of the system we can differentiate among wild type and mutant complexes. Moreover, we show that this simple SMD scheme correlates well with relative free energy differences computed via free energy perturbation. Second, although the static co-crystal structure shows two large hydrogen-bonding networks in the GP1/hTfR1 interface, our simulations indicate that one of them may not be important for tight binding. Third, one viral site known to be critical for infection may mark an important evolutionary suppressor site for infection-resistant hTfR1 mutants. Finally, our approach provides a framework to compare the effects of multiple mutations, individually and jointly, on protein-protein interactions.
Submitted 13 January, 2014; v1 submitted 27 February, 2013;
originally announced February 2013.
-
A time-dependent Poisson random field model for polymorphism within and between two related biological species
Authors:
Amei Amei,
Stanley Sawyer
Abstract:
We derive a Poisson random field model for population site polymorphism differences within and between two species that share a relatively recent common ancestor. The model can be either equilibrium or time inhomogeneous. We first consider a random field of Markov chains that describes the fate of a set of individual mutations. This field is approximated by a Poisson random field from which we can make inferences about the amounts of mutation and selection that have occurred in the history of observed aligned DNA sequences.
Submitted 8 November, 2010;
originally announced November 2010.