Search | arXiv e-print repository

SHS: Scorpion Hunting Strategy Swarm Algorithm

Authors: Abhilash Singh, Seyed Muhammad Hossein Mousavi, Kumar Gaurav

Abstract: We introduced the Scorpion Hunting Strategy (SHS), a novel population-based, nature-inspired optimisation algorithm. This algorithm draws inspiration from the hunting strategy of scorpions, which identify, locate, and capture their prey using the alpha and beta vibration operators. These operators control the SHS algorithm's exploitation and exploration abilities. To formulate an optimisation meth… ▽ More We introduced the Scorpion Hunting Strategy (SHS), a novel population-based, nature-inspired optimisation algorithm. This algorithm draws inspiration from the hunting strategy of scorpions, which identify, locate, and capture their prey using the alpha and beta vibration operators. These operators control the SHS algorithm's exploitation and exploration abilities. To formulate an optimisation method, we mathematically simulate these dynamic events and behaviors. We evaluate the effectiveness of the SHS algorithm by employing 20 benchmark functions (including 10 conventional and 10 CEC2020 functions), using both qualitative and quantitative analyses. Through a comparative analysis with 12 state-of-the-art meta-heuristic algorithms, we demonstrate that the proposed SHS algorithm yields exceptionally promising results. These findings are further supported by statistically significant results obtained through the Wilcoxon rank sum test. Additionally, the ranking of SHS, as determined by the average rank derived from the Friedman test, positions it at the forefront when compared to other algorithms. Going beyond theoretical validation, we showcase the practical utility of the SHS algorithm by applying it to six distinct real-world optimisation tasks. These applications illustrate the algorithm's potential in addressing complex optimisation challenges. In summary, this work not only introduces the innovative SHS algorithm but also substantiates its effectiveness and versatility through rigorous benchmarking and real-world problem-solving scenarios. △ Less

Submitted 19 July, 2024; originally announced July 2024.

arXiv:2407.09950 [pdf]

PSO Fuzzy XGBoost Classifier Boosted with Neural Gas Features on EEG Signals in Emotion Recognition

Authors: Seyed Muhammad Hossein Mousavi

Abstract: Emotion recognition is the technology-driven process of identifying and categorizing human emotions from various data sources, such as facial expressions, voice patterns, body motion, and physiological signals, such as EEG. These physiological indicators, though rich in data, present challenges due to their complexity and variability, necessitating sophisticated feature selection and extraction me… ▽ More Emotion recognition is the technology-driven process of identifying and categorizing human emotions from various data sources, such as facial expressions, voice patterns, body motion, and physiological signals, such as EEG. These physiological indicators, though rich in data, present challenges due to their complexity and variability, necessitating sophisticated feature selection and extraction methods. NGN, an unsupervised learning algorithm, effectively adapts to input spaces without predefined grid structures, improving feature extraction from physiological data. Furthermore, the incorporation of fuzzy logic enables the handling of fuzzy data by introducing reasoning that mimics human decision-making. The combination of PSO with XGBoost aids in optimizing model performance through efficient hyperparameter tuning and decision process optimization. This study explores the integration of Neural-Gas Network (NGN), XGBoost, Particle Swarm Optimization (PSO), and fuzzy logic to enhance emotion recognition using physiological signals. Our research addresses three critical questions concerning the improvement of XGBoost with PSO and fuzzy logic, NGN's effectiveness in feature selection, and the performance comparison of the PSO-fuzzy XGBoost classifier with standard benchmarks. Acquired results indicate that our methodologies enhance the accuracy of emotion recognition systems and outperform other feature selection techniques using the majority of classifiers, offering significant implications for both theoretical advancement and practical application in emotion recognition technology. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: PSO, Fuzzy, XGBoost, Neural Gas Network (NGN), Feature Selection, EEG Signals, Emotion Recognition

arXiv:2407.09110 [pdf]

doi 10.1145/3565066.3608247

The Magic XRoom: A Flexible VR Platform for Controlled Emotion Elicitation and Recognition

Authors: S. M. Hossein Mousavi, Matteo Besenzoni, Davide Andreoletti, Achille Peternier, Silvia Giordano

Abstract: Affective computing has recently gained popularity, especially in the field of human-computer interaction systems, where effectively evoking and detecting emotions is of paramount importance to enhance users experience. However, several issues are hindering progress in the field. In fact, the complexity of emotions makes it difficult to understand their triggers and control their elicitation. Addi… ▽ More Affective computing has recently gained popularity, especially in the field of human-computer interaction systems, where effectively evoking and detecting emotions is of paramount importance to enhance users experience. However, several issues are hindering progress in the field. In fact, the complexity of emotions makes it difficult to understand their triggers and control their elicitation. Additionally, effective emotion recognition requires analyzing multiple sensor data, such as facial expressions and physiological signals. These factors combined make it hard to collect high-quality datasets that can be used for research purposes (e.g., development of emotion recognition algorithms). Despite these challenges, Virtual Reality (VR) holds promise as a solution. By providing a controlled and immersive environment, VR enables the replication of real-world emotional experiences and facilitates the tracking of signals indicative of emotional states. However, controlling emotion elicitation remains a challenging task also within VR. This research paper introduces the Magic Xroom, a VR platform designed to enhance control over emotion elicitation by leveraging the theory of flow. This theory establishes a mapping between an individuals skill levels, task difficulty, and perceived emotions. In the Magic Xroom, the users skill level is continuously assessed, and task difficulty is adjusted accordingly to evoke specific emotions. Furthermore, user signals are collected using sensors, and virtual panels are utilized to determine the ground truth emotional states, making the Magic Xroom an ideal platform for collecting extensive datasets. The paper provides detailed implementation information, highlights the main properties of the Magic Xroom, and presents examples of virtual scenarios to illustrate its abilities and capabilities. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Proceedings of the 25th International Conference on Mobile Human-Computer Interaction

arXiv:2407.05189 [pdf]

Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus

Authors: Jalil Nourmohammadi Khiarak, Ammar Ahmadi, Taher Ak-bari Saeed, Meysam Asgari-Chenaghlu, Toğrul Atabay, Mohammad Reza Baghban Karimi, Ismail Ceferli, Farzad Hasanvand, Seyed Mahboub Mousavi, Morteza Noshad

Abstract: This paper introduces a pioneering English-Azerbaijani (Arabic Script) parallel corpus, designed to bridge the technological gap in language learning and machine translation (MT) for under-resourced languages. Consisting of 548,000 parallel sentences and approximately 9 million words per language, this dataset is derived from diverse sources such as news articles and holy texts, aiming to enhance… ▽ More This paper introduces a pioneering English-Azerbaijani (Arabic Script) parallel corpus, designed to bridge the technological gap in language learning and machine translation (MT) for under-resourced languages. Consisting of 548,000 parallel sentences and approximately 9 million words per language, this dataset is derived from diverse sources such as news articles and holy texts, aiming to enhance natural language processing (NLP) applications and language education technology. This corpus marks a significant step forward in the realm of linguistic resources, particularly for Turkic languages, which have lagged in the neural machine translation (NMT) revolution. By presenting the first comprehensive case study for the English-Azerbaijani (Arabic Script) language pair, this work underscores the transformative potential of NMT in low-resource contexts. The development and utilization of this corpus not only facilitate the advancement of machine translation systems tailored for specific linguistic needs but also promote inclusive language learning through technology. The findings demonstrate the corpus's effectiveness in training deep learning MT systems and underscore its role as an essential asset for researchers and educators aiming to foster bilingual education and multilingual communication. This research covers the way for future explorations into NMT applications for languages lacking substantial digital resources, thereby enhancing global language education frameworks. The Python package of our code is available at https://pypi.org/project/chevir-kartalol/, and we also have a website accessible at https://translate.kartalol.com/. △ Less

Submitted 6 July, 2024; originally announced July 2024.

Comments: This paper is accepted and published at NeTTT 2024 Conf

arXiv:2407.00463 [pdf, other]

Open-Source Conversational AI with SpeechBrain 1.0

Authors: Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Pierre Champion, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu , et al. (7 additional authors not shown)

Abstract: SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more. It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper prese… ▽ More SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more. It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper presents SpeechBrain 1.0, a significant milestone in the evolution of the toolkit, which now has over 200 recipes for speech, audio, and language processing tasks, and more than 100 models available on Hugging Face. SpeechBrain 1.0 introduces new technologies to support diverse learning modalities, Large Language Model (LLM) integration, and advanced decoding strategies, along with novel models, tasks, and modalities. It also includes a new benchmark repository, offering researchers a unified platform for evaluating models across diverse tasks. △ Less

Submitted 18 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

Comments: Submitted to JMLR (Machine Learning Open Source Software)

arXiv:2406.06399 [pdf, other]

Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue

Authors: Simone Alghisi, Massimo Rizzoli, Gabriel Roccabruna, Seyed Mahed Mousavi, Giuseppe Riccardi

Abstract: We study the limitations of Large Language Models (LLMs) for the task of response generation in human-machine dialogue. Several techniques have been proposed in the literature for different dialogue types (e.g., Open-Domain). However, the evaluations of these techniques have been limited in terms of base LLMs, dialogue types and evaluation metrics. In this work, we extensively analyze different LL… ▽ More We study the limitations of Large Language Models (LLMs) for the task of response generation in human-machine dialogue. Several techniques have been proposed in the literature for different dialogue types (e.g., Open-Domain). However, the evaluations of these techniques have been limited in terms of base LLMs, dialogue types and evaluation metrics. In this work, we extensively analyze different LLM adaptation techniques when applied to different dialogue types. We have selected two base LLMs, Llama-2 and Mistral, and four dialogue types Open-Domain, Knowledge-Grounded, Task-Oriented, and Question Answering. We evaluate the performance of in-context learning and fine-tuning techniques across datasets selected for each dialogue type. We assess the impact of incorporating external knowledge to ground the generation in both scenarios of Retrieval-Augmented Generation (RAG) and gold knowledge. We adopt consistent evaluation and explainability criteria for automatic metrics and human evaluation protocols. Our analysis shows that there is no universal best-technique for adapting large language models as the efficacy of each technique depends on both the base LLM and the specific type of dialogue. Last but not least, the assessment of the best adaptation technique should include human evaluation to avoid false expectations and outcomes derived from automatic metrics. △ Less

Submitted 5 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

arXiv:2405.18732 [pdf, other]

Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts

Authors: S. Mostafa Mousavi, Marc Stogaitis, Tajinder Gadh, Richard M Allen, Alexei Barski, Robert Bosch, Patrick Robertson, Nivetha Thiruverahan, Youngmin Cho, Aman Raj

Abstract: This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified… ▽ More This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified Mercalli Intensity (MMI) values, aligns well with independent observational data. Furthermore, our results suggest that LLMs, trained on vast internet data, may have developed a unique understanding of physical phenomena. Specifically, Google's Gemini models demonstrate a simplified understanding of the general relationship between earthquake magnitude, distance, and MMI intensity, accurately describing observational data even though it's not identical to established models. These findings raise intriguing questions about the extent to which Gemini's training has led to a broader understanding of the physical world and its phenomena. The ability of Generative AI models like Gemini to generate results consistent with established scientific knowledge highlights their potential to augment our understanding of complex physical phenomena like earthquakes. The flexible and effective approach proposed in this study holds immense potential for enriching our understanding of the impact of physical phenomena and improving resilience during natural disasters. This research is a significant step toward harnessing the power of social media and AI for natural disaster mitigation, opening new avenues for understanding the emerging capabilities of Generative AI and LLMs for scientific applications. △ Less

Submitted 14 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

arXiv:2404.08700 [pdf, other]

DyKnow:Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs

Authors: Seyed Mahed Mousavi, Simone Alghisi, Giuseppe Riccardi

Abstract: LLMs acquire knowledge from massive data snapshots collected at different timestamps. Their knowledge is then commonly evaluated using static benchmarks. However, factual knowledge is generally subject to time-sensitive changes, and static benchmarks cannot address those cases. We present an approach to dynamically evaluate the knowledge in LLMs and their time-sensitiveness against Wikidata, a pub… ▽ More LLMs acquire knowledge from massive data snapshots collected at different timestamps. Their knowledge is then commonly evaluated using static benchmarks. However, factual knowledge is generally subject to time-sensitive changes, and static benchmarks cannot address those cases. We present an approach to dynamically evaluate the knowledge in LLMs and their time-sensitiveness against Wikidata, a publicly available up-to-date knowledge graph. We evaluate the time-sensitive knowledge in twenty-four private and open-source LLMs, as well as the effectiveness of four editing methods in updating the outdated facts. Our results show that 1) outdatedness is a critical problem across state-of-the-art LLMs; 2) LLMs output inconsistent answers when prompted with slight variations of the question prompt; and 3) the performance of the state-of-the-art knowledge editing algorithms is very limited, as they can not reduce the cases of outdatedness and output inconsistency. △ Less

Submitted 12 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

arXiv:2401.02297 [pdf, other]

Are LLMs Robust for Spoken Dialogues?

Authors: Seyed Mahed Mousavi, Gabriel Roccabruna, Simone Alghisi, Massimo Rizzoli, Mirco Ravanelli, Giuseppe Riccardi

Abstract: Large Pre-Trained Language Models have demonstrated state-of-the-art performance in different downstream tasks, including dialogue state tracking and end-to-end response generation. Nevertheless, most of the publicly available datasets and benchmarks on task-oriented dialogues focus on written conversations. Consequently, the robustness of the developed models to spoken interactions is unknown. In… ▽ More Large Pre-Trained Language Models have demonstrated state-of-the-art performance in different downstream tasks, including dialogue state tracking and end-to-end response generation. Nevertheless, most of the publicly available datasets and benchmarks on task-oriented dialogues focus on written conversations. Consequently, the robustness of the developed models to spoken interactions is unknown. In this work, we have evaluated the performance of LLMs for spoken task-oriented dialogues on the DSTC11 test sets. Due to the lack of proper spoken dialogue datasets, we have automatically transcribed a development set of spoken dialogues with a state-of-the-art ASR engine. We have characterized the ASR-error types and their distributions and simulated these errors in a large dataset of dialogues. We report the intrinsic (perplexity) and extrinsic (human evaluation) performance of fine-tuned GPT-2 and T5 models in two subtasks of response generation and dialogue state tracking, respectively. The results show that LLMs are not robust to spoken noise by default, however, fine-tuning/training such models on a proper dataset of spoken TODs can result in a more robust performance. △ Less

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2310.10963 [pdf]

MRI brain tumor segmentation using informative feature vectors and kernel dictionary learning

Authors: Seyedeh Mahya Mousavi, Mohammad Mostafavi

Abstract: This paper presents a method based on a kernel dictionary learning algorithm for segmenting brain tumor regions in magnetic resonance images (MRI). A set of first-order and second-order statistical feature vectors are extracted from patches of size 3 * 3 around pixels in the brain MRI scans. These feature vectors are utilized to train two kernel dictionaries separately for healthy and tumorous tis… ▽ More This paper presents a method based on a kernel dictionary learning algorithm for segmenting brain tumor regions in magnetic resonance images (MRI). A set of first-order and second-order statistical feature vectors are extracted from patches of size 3 * 3 around pixels in the brain MRI scans. These feature vectors are utilized to train two kernel dictionaries separately for healthy and tumorous tissues. To enhance the efficiency of the dictionaries and reduce training time, a correlation-based sample selection technique is developed to identify the most informative and discriminative subset of feature vectors. This technique aims to improve the performance of the dictionaries by selecting a subset of feature vectors that provide valuable information for the segmentation task. Subsequently, a linear classifier is utilized to distinguish between healthy and unhealthy pixels based on the learned dictionaries. The results demonstrate that the proposed method outperforms other existing methods in terms of segmentation accuracy and significantly reduces both the time and memory required, resulting in a remarkably fast training process. △ Less

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2308.01700 [pdf]

Bees Local Phase Quantization Feature Selection for RGB-D Facial Expressions Recognition

Authors: Seyed Muhammad Hossein Mousavi, Atiye Ilanloo

Abstract: Feature selection could be defined as an optimization problem and solved by bio-inspired algorithms. Bees Algorithm (BA) shows decent performance in feature selection optimization tasks. On the other hand, Local Phase Quantization (LPQ) is a frequency domain feature which has excellent performance on Depth images. Here, after extracting LPQ features out of RGB (colour) and Depth images from the Ir… ▽ More Feature selection could be defined as an optimization problem and solved by bio-inspired algorithms. Bees Algorithm (BA) shows decent performance in feature selection optimization tasks. On the other hand, Local Phase Quantization (LPQ) is a frequency domain feature which has excellent performance on Depth images. Here, after extracting LPQ features out of RGB (colour) and Depth images from the Iranian Kinect Face Database (IKFDB), the Bees feature selection algorithm applies to select the desired number of features for final classification tasks. IKFDB is recorded with Kinect sensor V.2 and contains colour and depth images for facial and facial micro-expressions recognition purposes. Here five facial expressions of Anger, Joy, Surprise, Disgust and Fear are used for final validation. The proposed Bees LPQ method is compared with Particle Swarm Optimization (PSO) LPQ, PCA LPQ, Lasso LPQ, and just LPQ features for classification tasks with Support Vector Machines (SVM), K-Nearest Neighbourhood (KNN), Shallow Neural Network and Ensemble Subspace KNN. Returned results, show a decent performance of the proposed algorithm (99 % accuracy) in comparison with others. △ Less

Submitted 3 August, 2023; originally announced August 2023.

Comments: The International Workshop on the Bees Algorithm and its Applications, Birmingham, UK (https://sites.google.com/view/baaworkshop/baa-past-events/2022)

arXiv:2307.06396 [pdf]

doi 10.6084/m9.figshare.14396195

Introduction to Facial Micro Expressions Analysis Using Color and Depth Images: A Matlab Coding Approach (Second Edition, 2023)

Authors: Seyed Muhammad Hossein Mousavi

Abstract: The book attempts to introduce a gentle introduction to the field of Facial Micro Expressions Recognition (FMER) using Color and Depth images, with the aid of MATLAB programming environment. FMER is a subset of image processing and it is a multidisciplinary topic to analysis. So, it requires familiarity with other topics of Artifactual Intelligence (AI) such as machine learning, digital image proc… ▽ More The book attempts to introduce a gentle introduction to the field of Facial Micro Expressions Recognition (FMER) using Color and Depth images, with the aid of MATLAB programming environment. FMER is a subset of image processing and it is a multidisciplinary topic to analysis. So, it requires familiarity with other topics of Artifactual Intelligence (AI) such as machine learning, digital image processing, psychology and more. So, it is a great opportunity to write a book which covers all of these topics for beginner to professional readers in the field of AI and even without having background of AI. Our goal is to provide a standalone introduction in the field of MFER analysis in the form of theorical descriptions for readers with no background in image processing with reproducible Matlab practical examples. Also, we describe any basic definitions for FMER analysis and MATLAB library which is used in the text, that helps final reader to apply the experiments in the real-world applications. We believe that this book is suitable for students, researchers, and professionals alike, who need to develop practical skills, along with a basic understanding of the field. We expect that, after reading this book, the reader feels comfortable with different key stages such as color and depth image processing, color and depth image representation, classification, machine learning, facial micro-expressions recognition, feature extraction and dimensionality reduction. The book attempts to introduce a gentle introduction to the field of Facial Micro Expressions Recognition (FMER) using Color and Depth images, with the aid of MATLAB programming environment. △ Less

Submitted 19 June, 2023; originally announced July 2023.

Comments: This is the second edition of the book

arXiv:2305.17422 [pdf, other]

doi 10.18653/v1/2023.wassa-1.9

Understanding Emotion Valence is a Joint Deep Learning Task

Authors: Gabriel Roccabruna, Seyed Mahed Mousavi, Giuseppe Riccardi

Abstract: The valence analysis of speakers' utterances or written posts helps to understand the activation and variations of the emotional state throughout the conversation. More recently, the concept of Emotion Carriers (EC) has been introduced to explain the emotion felt by the speaker and its manifestations. In this work, we investigate the natural inter-dependency of valence and ECs via a multi-task lea… ▽ More The valence analysis of speakers' utterances or written posts helps to understand the activation and variations of the emotional state throughout the conversation. More recently, the concept of Emotion Carriers (EC) has been introduced to explain the emotion felt by the speaker and its manifestations. In this work, we investigate the natural inter-dependency of valence and ECs via a multi-task learning approach. We experiment with Pre-trained Language Models (PLM) for single-task, two-step, and joint settings for the valence and EC prediction tasks. We compare and evaluate the performance of generative (GPT-2) and discriminative (BERT) architectures in each setting. We observed that providing the ground truth label of one task improves the prediction performance of the models in the other task. We further observed that the discriminative model achieves the best trade-off of valence and EC prediction tasks in the joint prediction setting. As a result, we attain a single model that performs both tasks, thus, saving computation resources at training and inference times. △ Less

Submitted 31 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

arXiv:2305.15908 [pdf, other]

doi 10.18653/v1/2023.nlp4convai-1.1

Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?

Authors: Seyed Mahed Mousavi, Simone Caldarella, Giuseppe Riccardi

Abstract: Longitudinal Dialogues (LD) are the most challenging type of conversation for human-machine dialogue systems. LDs include the recollections of events, personal thoughts, and emotions specific to each individual in a sparse sequence of dialogue sessions. Dialogue systems designed for LDs should uniquely interact with the users over multiple sessions and long periods of time (e.g. weeks), and engage… ▽ More Longitudinal Dialogues (LD) are the most challenging type of conversation for human-machine dialogue systems. LDs include the recollections of events, personal thoughts, and emotions specific to each individual in a sparse sequence of dialogue sessions. Dialogue systems designed for LDs should uniquely interact with the users over multiple sessions and long periods of time (e.g. weeks), and engage them in personal dialogues to elaborate on their feelings, thoughts, and real-life events. In this paper, we study the task of response generation in LDs. We evaluate whether general-purpose Pre-trained Language Models (PLM) are appropriate for this purpose. We fine-tune two PLMs, GePpeTto (GPT-2) and iT5, using a dataset of LDs. We experiment with different representations of the personal knowledge extracted from LDs for grounded response generation, including the graph representation of the mentioned events and participants. We evaluate the performance of the models via automatic metrics and the contribution of the knowledge via the Integrated Gradients technique. We categorize the natural language generation errors via human evaluations of contextualization, appropriateness and engagement of the user. △ Less

Submitted 25 May, 2023; originally announced May 2023.

arXiv:2303.08070 [pdf]

Victoria Amazonica Optimization (VAO): An Algorithm Inspired by the Giant Water Lily Plant

Authors: Seyed Muhammad Hossein Mousavi

Abstract: The Victoria Amazonica plant, often known as the Giant Water Lily, has the largest floating spherical leaf in the world, with a maximum leaf diameter of 3 meters. It spreads its leaves by the force of its spines and creates a large shadow underneath, killing any plants that require sunlight. These water tyrants use their formidable spines to compel each other to the surface and increase their stre… ▽ More The Victoria Amazonica plant, often known as the Giant Water Lily, has the largest floating spherical leaf in the world, with a maximum leaf diameter of 3 meters. It spreads its leaves by the force of its spines and creates a large shadow underneath, killing any plants that require sunlight. These water tyrants use their formidable spines to compel each other to the surface and increase their strength to grab more space from the surface. As they spread throughout the pond or basin, with the earliest-growing leaves having more room to grow, each leaf gains a unique size. Its flowers are transsexual and when they bloom, Cyclocephala beetles are responsible for the pollination process, being attracted to the scent of the female flower. After entering the flower, the beetle becomes covered with pollen and transfers it to another flower for fertilization. After the beetle leaves, the flower turns into a male and changes color from white to pink. The male flower dies and sinks into the water, releasing its seed to help create a new generation. In this paper, the mathematical life cycle of this magnificent plant is introduced, and each leaf and blossom are treated as a single entity. The proposed bio-inspired algorithm is tested with 24 benchmark optimization test functions, such as Ackley, and compared to ten other famous algorithms, including the Genetic Algorithm. The proposed algorithm is tested on 10 optimization problems: Minimum Spanning Tree, Hub Location Allocation, Quadratic Assignment, Clustering, Feature Selection, Regression, Economic Dispatching, Parallel Machine Scheduling, Color Quantization, and Image Segmentation and compared to traditional and bio-inspired algorithms. Overall, the performance of the algorithm in all tasks is satisfactory. △ Less

Submitted 22 January, 2023; originally announced March 2023.

Comments: 45 pages

arXiv:2302.07748 [pdf, other]

doi 10.18653/v1/2023.wnu-1.1

Whats New? Identifying the Unfolding of New Events in Narratives

Authors: Seyed Mahed Mousavi, Shohei Tanaka, Gabriel Roccabruna, Koichiro Yoshino, Satoshi Nakamura, Giuseppe Riccardi

Abstract: Narratives include a rich source of events unfolding over time and context. Automatic understanding of these events provides a summarised comprehension of the narrative for further computation (such as reasoning). In this paper, we study the Information Status (IS) of the events and propose a novel challenging task: the automatic identification of new events in a narrative. We define an event as a… ▽ More Narratives include a rich source of events unfolding over time and context. Automatic understanding of these events provides a summarised comprehension of the narrative for further computation (such as reasoning). In this paper, we study the Information Status (IS) of the events and propose a novel challenging task: the automatic identification of new events in a narrative. We define an event as a triplet of subject, predicate, and object. The event is categorized as new with respect to the discourse context and whether it can be inferred through commonsense reasoning. We annotated a publicly available corpus of narratives with the new events at sentence level using human annotators. We present the annotation protocol and study the quality of the annotation and the difficulty of the task. We publish the annotated dataset, annotation materials, and machine learning baseline models for the task of new event extraction for narrative understanding. △ Less

Submitted 8 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

arXiv:2301.12176 [pdf]

Neural Gas Network Image Features and Segmentation for Brain Tumor Detection Using Magnetic Resonance Imaging Data

Authors: S. Muhammad Hossein Mousavi

Abstract: Accurate detection of brain tumors could save lots of lives and increasing the accuracy of this binary classification even as much as a few percent has high importance. Neural Gas Networks (NGN) is a fast, unsupervised algorithm that could be used in data clustering, image pattern recognition, and image segmentation. In this research, we used the metaheuristic Firefly Algorithm (FA) for image cont… ▽ More Accurate detection of brain tumors could save lots of lives and increasing the accuracy of this binary classification even as much as a few percent has high importance. Neural Gas Networks (NGN) is a fast, unsupervised algorithm that could be used in data clustering, image pattern recognition, and image segmentation. In this research, we used the metaheuristic Firefly Algorithm (FA) for image contrast enhancement as pre-processing and NGN weights for feature extraction and segmentation of Magnetic Resonance Imaging (MRI) data on two brain tumor datasets from the Kaggle platform. Also, tumor classification is conducted by Support Vector Machine (SVM) classification algorithms and compared with a deep learning technique plus other features in train and test phases. Additionally, NGN tumor segmentation is evaluated by famous performance metrics such as Accuracy, F-measure, Jaccard, and more versus ground truth data and compared with traditional segmentation techniques. The proposed method is fast and precise in both tasks of tumor classification and segmentation compared with other methods. A classification accuracy of 95.14 % and segmentation accuracy of 0.977 is achieved by the proposed method. △ Less

Submitted 28 January, 2023; originally announced January 2023.

Comments: 7 pages

arXiv:2208.14564 [pdf, other]

doi 10.1093/gji/ggac355

QuakeFlow: A Scalable Machine-learning-based Earthquake Monitoring Workflow with Cloud Computing

Authors: Weiqiang Zhu, Alvin Brian Hou, Robert Yang, Avoy Datta, S. Mostafa Mousavi, William L. Ellsworth, Gregory C. Beroza

Abstract: Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To… ▽ More Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To facilitate the application of machine-learning algorithms to large-volume seismic records, we developed a cloud-based earthquake monitoring workflow, QuakeFlow, that applies multiple processing steps to generate earthquake catalogs from raw seismic data. QuakeFlow uses a deep learning model, PhaseNet, for picking P/S phases and a machine learning model, GaMMA, for phase association with approximate earthquake location and magnitude. Each component in QuakeFlow is containerized, allowing straightforward updates to the pipeline with new deep learning/machine learning models, as well as the ability to add new components, such as earthquake relocation algorithms. We built QuakeFlow in Kubernetes to make it auto-scale for large datasets and to make it easy to deploy on cloud platforms, which enables large-scale parallel processing. We used QuakeFlow to process three years of continuous archived data from Puerto Rico, and found more than a factor of ten more events that occurred on much the same structures as previously known seismicity. We applied Quakeflow to monitoring frequent earthquakes in Hawaii and found over an order of magnitude more events than are in the standard catalog, including many events that illuminate the deep structure of the magmatic system. We also added Kafka and Spark streaming to deliver real-time earthquake monitoring results. QuakeFlow is an effective and efficient approach both for improving realtime earthquake monitoring and for mining archived seismic data sets. △ Less

Submitted 30 August, 2022; originally announced August 2022.

arXiv:2109.09911 [pdf, other]

doi 10.1029/2021JB023283

An End-to-End Earthquake Detection Method for Joint Phase Picking and Association using Deep Learning

Authors: Weiqiang Zhu, Kai Sheng Tai, S. Mostafa Mousavi, Peter Bailis, Gregory C. Beroza

Abstract: Earthquake monitoring by seismic networks typically involves a workflow consisting of phase detection/picking, association, and location tasks. In recent years, the accuracy of these individual stages has been improved through the use of machine learning techniques. In this study, we introduce a new, end-to-end approach that improves overall earthquake detection accuracy by jointly optimizing each… ▽ More Earthquake monitoring by seismic networks typically involves a workflow consisting of phase detection/picking, association, and location tasks. In recent years, the accuracy of these individual stages has been improved through the use of machine learning techniques. In this study, we introduce a new, end-to-end approach that improves overall earthquake detection accuracy by jointly optimizing each stage of the detection pipeline. We propose a neural network architecture for the task of multi-station processing of seismic waveforms recorded over a seismic network. This end-to-end architecture consists of three sub-networks: a backbone network that extracts features from raw waveforms, a phase picking sub-network that picks P- and S-wave arrivals based on these features, and an event detection sub-network that aggregates the features from multiple stations and detects earthquakes. We use these sub-networks in conjunction with a shift-and-stack module based on back-projection that introduces kinematic constraints on arrival times, allowing the model to generalize to different velocity models and to variable station geometry in seismic networks. We evaluate our proposed method on the STanford EArthquake Dataset (STEAD) and on the 2019 Ridgecrest, CA earthquake sequence. The results demonstrate that our end-to-end approach can effectively pick P- and S-wave arrivals and achieve earthquake detection accuracy rivaling that of other state-of-the-art approaches. △ Less

Submitted 20 September, 2021; originally announced September 2021.

arXiv:2109.09008 [pdf, other]

doi 10.1029/2021JB023249

Earthquake Phase Association using a Bayesian Gaussian Mixture Model

Authors: Weiqiang Zhu, Ian W. McBrearty, S. Mostafa Mousavi, William L. Ellsworth, Gregory C. Beroza

Abstract: Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a… ▽ More Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a challenging problem. We present a new association method, the Gaussian Mixture Model Association (GaMMA), that combines the Gaussian mixture model for phase measurements (both time and amplitude), with earthquake location, origin time, and magnitude estimation. We treat earthquake phase association as an unsupervised clustering problem in a probabilistic framework, where each earthquake corresponds to a cluster of P and S phases with hyperbolic moveout of arrival times and a decay of amplitude with distance. We use a multivariate Gaussian distribution to model the collection of phase picks for an event, the mean of which is given by the predicted arrival time and amplitude from the causative event. We carry out the pick assignment for each earthquake and determine earthquake parameters (i.e., earthquake location, origin time, and magnitude) under the maximum likelihood criterion using the Expectation-Maximization (EM) algorithm. The GaMMA method does not require the typical association steps of other algorithms, such as grid-search or supervised training. The results on both synthetic test and the 2019 Ridgecrest earthquake sequence show that GaMMA effectively associates phases from a temporally and spatially dense earthquake sequence while producing useful estimates of earthquake location and magnitude. △ Less

Submitted 18 September, 2021; originally announced September 2021.

arXiv:2106.03398 [pdf, other]

Dark Matter Effects on Stellar Populations in Globular Clusters

Authors: Ebrahim Hassani, Seyyed Milad Ghaffarpour Mousavi

Abstract: According to the classical view of globular clusters, stars inside globular clusters are evolved from the same giant molecular cloud. Then their stars' chemical compositions must be the same. But recent photometric and spectroscopic studies of globular clusters reveal the presence of more-than-one stellar populations inside globular clusters. This finding challenges our classical view of globular… ▽ More According to the classical view of globular clusters, stars inside globular clusters are evolved from the same giant molecular cloud. Then their stars' chemical compositions must be the same. But recent photometric and spectroscopic studies of globular clusters reveal the presence of more-than-one stellar populations inside globular clusters. This finding challenges our classical view of globular clusters. In this work, we investigated the possibility of solving multiple stellar populations problem in globular clusters using dark matter assumptions. We showed that the presence of dark matter inside globular clusters changes the physical parameters (e.g. chemical composition, luminosity, temperature, age, etc.) of stars inside them. We supposed that dark matter distributed non-uniformly inside globular clusters. It means stars in high dark matter density environments (like the central region of globular clusters) are more affected by the presence of dark matter. Using this assumption, we showed that stars in different locations of globular clusters (corresponding to different dark matter densities) follow different evolutionary paths (e.g. on Hertzsprung-Russell diagram). We used this note to infer that the presence of dark matter inside globular clusters can be the reason for the multiple stellar populations. △ Less

Submitted 28 April, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: 10 Pages, 3 Figures, 1 Table

arXiv:2012.01486 [pdf]

doi 10.1134/S1063773721090024

Refined Ephemeris for Four Hot Jupiters using Ground-Based and TESS Observations

Authors: Fatemeh Davoudi, PegahSadat MirshafieKhozani, Ehsan Paki, Mojtaba Roshana, Fatemeh HashemiNasab, Ahmad MazidabadiFarahani, Farzaneh Ahangarani Farahani, Tayebeh Farjadnia, Farshid Nasrollahzadeh, Shiva Rezvanpanah, S. Mohammad Mahdi Mousavi, Rahimeh Foroughi, Atila Poro, Amir Ghalee

Abstract: WASP-12 b, WASP-33 b, WASP-36 b, and WASP-46 b are four transiting planetary systems which we have studied. These systems' light curves were derived from observations made by the Transiting Light Exoplanet Survey Satellite (TESS) and some ground-based telescopes. We used Exofast-v1 to model these light curves and calculate mid-transit times. Also, we plotted TTV diagrams for them using derived mid… ▽ More WASP-12 b, WASP-33 b, WASP-36 b, and WASP-46 b are four transiting planetary systems which we have studied. These systems' light curves were derived from observations made by the Transiting Light Exoplanet Survey Satellite (TESS) and some ground-based telescopes. We used Exofast-v1 to model these light curves and calculate mid-transit times. Also, we plotted TTV diagrams for them using derived mid-transit times and those available within the literature. O-C analysis of these timings enables us to refine the linear ephemeris of four systems. We measured WASP-12's tidal quality factor based on adding TESS data as Q*'=(2.13+-0.29)*10^5. According to the analysis, the orbital period of the WASP-46 b system is increasing. The WASP-36 b and WASP-33 b systems have not shown any obvious quadratic trend in their TTV diagrams. The increase in their period is most likely due to inaccurate liner ephemeris that has increased over time. So, more observations are needed to evaluate whether or not there is an orbital decay in the WASP-36 b and WASP-33 b systems. △ Less

Submitted 14 August, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

Comments: 14 figures, accepted at the Astronomy Letters journal

arXiv:2007.01109 [pdf, other]

doi 10.1039/D0SM00571A

Wall entrapment of peritrichous bacteria: A mesoscale hydrodynamics simulation study

Authors: S. Mahdiyeh Mousavi, Gerhard Gompper, Roland G. Winkler

Abstract: Microswimmers such as E. Coli bacteria accumulate and exhibit an intriguing dynamics near walls, governed by hydrodynamic and steric interactions. Insight into the underlying mechanisms and predominant interactions demand a detailed characterization of the entrapment process. We employ a mesoscale hydrodynamics simulation approach to study entrapment of a E. coli-type cell at a no-slip wall. The c… ▽ More Microswimmers such as E. Coli bacteria accumulate and exhibit an intriguing dynamics near walls, governed by hydrodynamic and steric interactions. Insight into the underlying mechanisms and predominant interactions demand a detailed characterization of the entrapment process. We employ a mesoscale hydrodynamics simulation approach to study entrapment of a E. coli-type cell at a no-slip wall. The cell is modeled by a spherocylindrical body with several explicit helical flagella. Three stages of the entrapment process can be distinguished: the approaching regime, where a cell swims toward the wall on a nearly straight trajectory; a scattering regime, where the cell touches the wall, with an reorientation; and a surface-swimming regime. Our simulations show that steric interactions may dominate the entrapment process, yet, hydrodynamic interactions slow down the adsorption dynamics close to the boundary and imply a circular motion on the wall. The locomotion of the cell is characterized by a strong wobbling dynamics, with cells preferentially pointing toward the wall. △ Less

Submitted 2 July, 2020; originally announced July 2020.

Journal ref: Soft Matter 16, 4866 (2020)

arXiv:1912.01144 [pdf, other]

doi 10.1109/TGRS.2020.2988770

Bayesian-Deep-Learning Estimation of Earthquake Location from Single-Station Observations

Authors: S. Mostafa Mousavi, Gregory C. Beroza

Abstract: We present a deep learning method for single-station earthquake location, which we approach as a regression problem using two separate Bayesian neural networks. We use a multi-task temporal-convolutional neural network to learn epicentral distance and P travel time from 1-minute seismograms. The network estimates epicentral distance and P travel time with absolute mean errors of 0.23 km and 0.03 s… ▽ More We present a deep learning method for single-station earthquake location, which we approach as a regression problem using two separate Bayesian neural networks. We use a multi-task temporal-convolutional neural network to learn epicentral distance and P travel time from 1-minute seismograms. The network estimates epicentral distance and P travel time with absolute mean errors of 0.23 km and 0.03 s respectively, along with their epistemic and aleatory uncertainties. We design a separate multi-input network using standard convolutional layers to estimate the back-azimuth angle, and its epistemic uncertainty. This network estimates the direction from which seismic waves arrive to the station with a mean error of 1 degree. Using this information, we estimate the epicenter, origin time, and depth along with their confidence intervals. We use a global dataset of earthquake signals recorded within 1 degree (~112 km) from the event to build the model and to demonstrate its performance. Our model can predict epicenter, origin time, and depth with mean errors of 7.3 km, 0.4 second, and 6.7 km respectively, at different locations around the world. Our approach can be used for fast earthquake source characterization with a limited number of observations, and also for estimating location of earthquakes that are sparsely recorded -- either because they are small or because stations are widely separated. △ Less

Submitted 2 December, 2019; originally announced December 2019.

arXiv:1911.05975 [pdf, other]

doi 10.1029/2019GL085976

A Machine-Learning Approach for Earthquake Magnitude Estimation

Authors: S. Mostafa Mousavi, Gregory C. Beroza

Abstract: In this study we develop a single-station deep-learning approach for fast and reliable estimation of earthquake magnitude directly from raw waveforms. We design a regressor composed of convolutional and recurrent neural networks that is not sensitive to the data normalization, hence waveform amplitude information can be utilized during the training. Our network can predict earthquake magnitudes wi… ▽ More In this study we develop a single-station deep-learning approach for fast and reliable estimation of earthquake magnitude directly from raw waveforms. We design a regressor composed of convolutional and recurrent neural networks that is not sensitive to the data normalization, hence waveform amplitude information can be utilized during the training. Our network can predict earthquake magnitudes with an average error close to zero and standard deviation of ~0.2 based on single-station waveforms without instrument response correction. We test the network for both local and duration magnitude scales and show a station-based learning can be an effective approach for improving the performance. The proposed approach has a variety of potential applications from routine earthquake monitoring to early warning systems. △ Less

Submitted 14 November, 2019; originally announced November 2019.

arXiv:1811.02695 [pdf, other]

doi 10.1109/TGRS.2019.2926772

Seismic Signal Denoising and Decomposition Using Deep Neural Networks

Authors: Weiqiang Zhu, S. Mostafa Mousavi, Gregory C. Beroza

Abstract: Denoising and filtering are widely used in routine seismic-data-processing to improve the signal-to-noise ratio (SNR) of recorded signals and by doing so to improve subsequent analyses. In this paper we develop a new denoising/decomposition method, DeepDenoiser, based on a deep neural network. This network is able to learn simultaneously a sparse representation of data in the time-frequency domain… ▽ More Denoising and filtering are widely used in routine seismic-data-processing to improve the signal-to-noise ratio (SNR) of recorded signals and by doing so to improve subsequent analyses. In this paper we develop a new denoising/decomposition method, DeepDenoiser, based on a deep neural network. This network is able to learn simultaneously a sparse representation of data in the time-frequency domain and a non-linear function that maps this representation into masks that decompose input data into a signal of interest and noise (defined as any non-seismic signal). We show that DeepDenoiser achieves impressive denoising of seismic signals even when the signal and noise share a common frequency band. Our method properly handles a variety of colored noise and non-earthquake signals. DeepDenoiser can significantly improve the SNR with minimal changes in the waveform shape of interest, even in presence of high noise levels. We demonstrate the effect of our method on improving earthquake detection. There are clear applications of DeepDenoiser to seismic imaging, micro-seismic monitoring, and preprocessing of ambient noise data. We also note that potential applications of our approach are not limited to these applications or even to earthquake data, and that our approach can be adapted to diverse signals and applications in other settings. △ Less

Submitted 6 November, 2018; originally announced November 2018.

arXiv:1811.01989 [pdf, other]

Clustering of Janus Particles in Optical Potential Driven by Hydrodynamic Fluxes

Authors: S. Masoumeh Mousavi, Sabareesh K. P. Velu, Agnese Callegari, Luca Biancofiore, Giovanni Volpe

Abstract: Self-organisation is driven by the interactions between the individual components of a system mediated by the environment, and is one of the most important strategies used by many biological systems to develop complex and functional structures. Furthermore, biologically-inspired self-organisation offers opportunities to develop the next generation of materials and devices for electronics, photonic… ▽ More Self-organisation is driven by the interactions between the individual components of a system mediated by the environment, and is one of the most important strategies used by many biological systems to develop complex and functional structures. Furthermore, biologically-inspired self-organisation offers opportunities to develop the next generation of materials and devices for electronics, photonics and nanotechnology. In this work, we demonstrate experimentally that a system of Janus particles (silica microspheres half-coated with gold) aggregates into clusters in the presence of a Gaussian optical potential and disaggregates when the optical potential is switched off. We show that the underlying mechanism is the existence of a hydrodynamic flow induced by a temperature gradient generated by the light absorption at the metallic patches on the Janus particles. We also perform simulations, which agree well with the experiments and whose results permit us to clarify the underlying mechanism. The possibility of hydrodynamic-flux-induced reversible clustering may have applications in the fields of drug delivery, cargo transport, bioremediation and biopatterning. △ Less

Submitted 5 November, 2018; originally announced November 2018.

Comments: 15 pages, 6 figures

arXiv:1810.01965 [pdf]

CRED: A Deep Residual Network of Convolutional and Recurrent Units for Earthquake Signal Detection

Authors: S. Mostafa Mousavi, Weiqiang Zhu, Yixiao Sheng, Gregory C. Beroza

Abstract: Earthquake signal detection is at the core of observational seismology. A good detection algorithm should be sensitive to small and weak events with a variety of waveform shapes, robust to background noise and non-earthquake signals, and efficient for processing large data volumes. Here, we introduce the Cnn-Rnn Earthquake Detector (CRED), a detector based on deep neural networks. The network uses… ▽ More Earthquake signal detection is at the core of observational seismology. A good detection algorithm should be sensitive to small and weak events with a variety of waveform shapes, robust to background noise and non-earthquake signals, and efficient for processing large data volumes. Here, we introduce the Cnn-Rnn Earthquake Detector (CRED), a detector based on deep neural networks. The network uses a combination of convolutional layers and bi-directional long-short-term memory units in a residual structure. It learns the time-frequency characteristics of the dominant phases in an earthquake signal from three component data recorded on a single station. We train the network using 500,000 seismograms (250k associated with tectonic earthquakes and 250k identified as noise) recorded in Northern California and tested it with an F-score of 99.95. The robustness of the trained model with respect to the noise level and non-earthquake signals is shown by applying it to a set of semi-synthetic signals. The model is applied to one month of continuous data recorded at Central Arkansas to demonstrate its efficiency, generalization, and sensitivity. Our model is able to detect more than 700 microearthquakes as small as -1.3 ML induced during hydraulic fracturing far away than the training region. The performance of the model is compared with STA/LTA, template matching, and FAST algorithms. Our results indicate an efficient and reliable performance of CRED. This framework holds great promise in lowering the detection threshold while minimizing false positive detection rates. △ Less

Submitted 3 October, 2018; originally announced October 2018.

arXiv:1708.09134 [pdf]

A New Super-Twisting Algorithm-Based Sliding Mode Observer Design for Fault Estimation in a Class of Nonlinear Fractional Order Systems

Authors: Seyed Mohammad Moein Mousavi, Amin Ramezani, HamidReza Momeni

Abstract: This paper is concerned with fault estimation in a class of nonlinear fractional order systems using a new super twisting algorithm based second order step by step sliding mode observer. Since the existing sliding mode observers are troubled with the chattering phenomenon, here a new observer structure is proposed and finite time convergence of error dynamics is proved using fractional order super… ▽ More This paper is concerned with fault estimation in a class of nonlinear fractional order systems using a new super twisting algorithm based second order step by step sliding mode observer. Since the existing sliding mode observers are troubled with the chattering phenomenon, here a new observer structure is proposed and finite time convergence of error dynamics is proved using fractional order super twisting algorithm (FSTA). Two numerical examples of chaotic fractional order systems and a comparison with respect to a similar observer justify the effectiveness of the proposed observer △ Less

Submitted 26 June, 2018; v1 submitted 30 August, 2017; originally announced August 2017.

arXiv:1707.01600 [pdf, ps, other]

Option Pricing with Delayed Information

Authors: Tomoyuki Ichiba, Seyyed Mostafa Mousavi

Abstract: We propose a model to study the effects of delayed information on option pricing. We first talk about the absence of arbitrage in our model, and then discuss super replication with delayed information in a binomial model, notably, we present a closed form formula for the price of convex contingent claims. Also, we address the convergence problem as the time-step and delay length tend to zero and i… ▽ More We propose a model to study the effects of delayed information on option pricing. We first talk about the absence of arbitrage in our model, and then discuss super replication with delayed information in a binomial model, notably, we present a closed form formula for the price of convex contingent claims. Also, we address the convergence problem as the time-step and delay length tend to zero and introduce analogous results in the continuous time framework. Finally, we explore how delayed information exaggerates the volatility smile. △ Less

Submitted 5 July, 2017; originally announced July 2017.

arXiv:1706.01737 [pdf]

Second Order Step by Step Sliding mode Observer for Fault Estimation in a Class of Nonlinear Fractional Order Systems

Authors: Seyed Mohammad Moein Mousavi, Amin Ramezani

Abstract: This paper considers fault estimation in nonlinear fractional order systems in observer form. For this aim, a step by step second order sliding mode observer is used. By means of a fractional inequality, the stability of the observer estimation errors is analyzed and some conditions are introduced to guarantee finite time convergence of estimation errors. Finally, in a numerical example, effective… ▽ More This paper considers fault estimation in nonlinear fractional order systems in observer form. For this aim, a step by step second order sliding mode observer is used. By means of a fractional inequality, the stability of the observer estimation errors is analyzed and some conditions are introduced to guarantee finite time convergence of estimation errors. Finally, in a numerical example, effectiveness of this observer is demonstrated. △ Less

Submitted 9 June, 2017; v1 submitted 6 June, 2017; originally announced June 2017.

arXiv:1706.01736 [pdf]

Lyapunov-based Model Reference Adaptive Controller Design for a Class of Nonlinear Fractional Order Systems

Authors: Seyed Mohammad Moein Mousavi, Mohammad T. H. Beheshti, Amin Ramezani

Abstract: This paper is concerned with model reference adaptive controller design for a class of nonlinear fractional order systems. Recent works on this topic rarely include direct methods and they are mostly based on indirect methods where the frequency distributed model is used to prove the stability of the closed loop system. Since the chain rule cannot be applied in fractional derivations, in order to… ▽ More This paper is concerned with model reference adaptive controller design for a class of nonlinear fractional order systems. Recent works on this topic rarely include direct methods and they are mostly based on indirect methods where the frequency distributed model is used to prove the stability of the closed loop system. Since the chain rule cannot be applied in fractional derivations, in order to prove the lyapunov stability here fractional inequalities are used. Finally, by means of a numerical example, the controller performance is demonstrated. △ Less

Submitted 16 October, 2017; v1 submitted 6 June, 2017; originally announced June 2017.

Comments: submitted to International Journal of Dynamics and Control

arXiv:1607.06373 [pdf, ps, other]

Systemic Risk and Stochastic Games with Delay

Authors: Rene Carmona, Jean-Pierre Fouque, Seyyed Mostafa Mousavi, Li-Hsien Sun

Abstract: We propose a model of inter-bank lending and borrowing which takes into account clearing debt obligations. The evolution of log-monetary reserves of $N$ banks is described by coupled diffusions driven by controls with delay in their drifts. Banks are minimizing their finite-horizon objective functions which take into account a quadratic cost for lending or borrowing and a linear incentive to borro… ▽ More We propose a model of inter-bank lending and borrowing which takes into account clearing debt obligations. The evolution of log-monetary reserves of $N$ banks is described by coupled diffusions driven by controls with delay in their drifts. Banks are minimizing their finite-horizon objective functions which take into account a quadratic cost for lending or borrowing and a linear incentive to borrow if the reserve is low or lend if the reserve is high relative to the average capitalization of the system. As such, our problem is an $N$-player linear-quadratic stochastic differential game with delay. An open-loop Nash equilibrium is obtained using a system of fully coupled forward and advanced backward stochastic differential equations. We then describe how the delay affects liquidity and systemic risk characterized by a large number of defaults. We also derive a close-loop Nash equilibrium using an HJB approach. △ Less

Submitted 21 July, 2016; originally announced July 2016.

Comments: 1 figure

arXiv:1603.04110 [pdf, other]

Geometry of Interest (GOI): Spatio-Temporal Destination Extraction and Partitioning in GPS Trajectory Data

Authors: Seyed Morteza Mousavi, Aaron Harwood, Shanika Karunasekera, Mojtaba Maghrebi

Abstract: Nowadays large amounts of GPS trajectory data is being continuously collected by GPS-enabled devices such as vehicles navigation systems and mobile phones. GPS trajectory data is useful for applications such as traffic management, location forecasting, and itinerary planning. Such applications often need to extract the time-stamped Sequence of Visited Locations (SVLs) of the mobile objects. The ne… ▽ More Nowadays large amounts of GPS trajectory data is being continuously collected by GPS-enabled devices such as vehicles navigation systems and mobile phones. GPS trajectory data is useful for applications such as traffic management, location forecasting, and itinerary planning. Such applications often need to extract the time-stamped Sequence of Visited Locations (SVLs) of the mobile objects. The nearest neighbor query (NNQ) is the most applied method for labeling the visited locations based on the IDs of the POIs in the process of SVL generation. NNQ in some scenarios is not accurate enough. To improve the quality of the extracted SVLs, instead of using NNQ, we label the visited locations as the IDs of the POIs which geometrically intersect with the GPS observations. Intersection operator requires the accurate geometry of the points of interest which we refer to them as the Geometries of Interest (GOIs). In some application domains (e.g. movement trajectories of animals), adequate information about the POIs and their GOIs may not be available a priori, or they may not be publicly accessible and, therefore, they need to be derived from GPS trajectory data. In this paper we propose a novel method for estimating the POIs and their GOIs, which consists of three phases: (i) extracting the geometries of the stay regions; (ii) constructing the geometry of destination regions based on the extracted stay regions; and (iii) constructing the GOIs based on the geometries of the destination regions. Using the geometric similarity to known GOIs as the major evaluation criterion, the experiments we performed using long-term GPS trajectory data show that our method outperforms the existing approaches. △ Less

Submitted 16 May, 2016; v1 submitted 13 March, 2016; originally announced March 2016.

Comments: A version of this technical report has been submitted to the Springer Journal of Ambient Intelligence and Humanized Computing and it is under review

arXiv:1603.04099 [pdf, other]

Contagion and Stability in Financial Networks

Authors: Seyyed Mostafa Mousavi, Robert Mackay, Alistair Tucker

Abstract: This paper investigates two mechanisms of financial contagion that are, firstly, the correlated exposure of banks to the same source of risk, and secondly the direct exposure of banks in the interbank market. It will consider a random network of banks which are connected through the inter-bank market and will discuss the desirable level of banks exposure to the same sources of risk, that is invest… ▽ More This paper investigates two mechanisms of financial contagion that are, firstly, the correlated exposure of banks to the same source of risk, and secondly the direct exposure of banks in the interbank market. It will consider a random network of banks which are connected through the inter-bank market and will discuss the desirable level of banks exposure to the same sources of risk, that is investment in similar portfolios, for different levels of network connectivity when peering through the lens of the systemic cost incurred to the economy from the banks simultaneous failure. It demonstrates that for all levels of network connectivity, certain levels of diversifying individual banks diversifications are not optimum under any condition. So, given an acceptable level of systemic cost, the regulator could let banks decrease their capital buffers by moving away from the non-optimum area. △ Less

Submitted 13 March, 2016; originally announced March 2016.

Comments: Wealth IJMBF, Volume 3, Issue 1, Jan-June 2014

arXiv:1204.6275 [pdf, ps]

doi 10.1088/0953-4075/43/16/165501

Effect of quantum interference on the optical properties of a three-level V-type atomic system beyond the two-photon resonance condition

Authors: S. M. Mousavi, L. Safari, M. Mahmoudi, M. Sahrai

Abstract: The effect of quantum interference on the optical properties of a pumped-probe three-level V-type atomic system is investigated. The probe absorption, dispersion, group index and optical bistability beyond the two-photon resonance condition are discussed. It is found that the optical properties of a medium in the frequency of the probe field, in general, are phase independent. The phase dependence… ▽ More The effect of quantum interference on the optical properties of a pumped-probe three-level V-type atomic system is investigated. The probe absorption, dispersion, group index and optical bistability beyond the two-photon resonance condition are discussed. It is found that the optical properties of a medium in the frequency of the probe field, in general, are phase independent. The phase dependence arises from a scattering of the coupling field into the probe field at a frequency which in general differs from the probe field frequency. It is demonstrated that beyond the two-photon resonance condition the phase sensitivity of the medium will disappear. △ Less

Submitted 27 April, 2012; originally announced April 2012.

Comments: 8 pages, 8 figures

Journal ref: J. Phys. B 43, 165501 (2010)

arXiv:1102.0089 [pdf, ps, other]

doi 10.1007/s10714-011-1307-2

Some exact solutions of F(R) gravity with charged (a)dS black hole interpretation

Authors: S. H. Hendi, B. Eslam Panah, S. M. Mousavi

Abstract: In this paper we obtain topological static solutions of some kind of pure $F(R)$ gravity. The present solutions are two kind: first type is uncharged solution which corresponds with the topological (a)dS Schwarzschild solution and second type has electric charge and is equivalent to the Einstein-$Λ$-conformally invariant Maxwell solution. In other word, starting from pure gravity leads to (charged… ▽ More In this paper we obtain topological static solutions of some kind of pure $F(R)$ gravity. The present solutions are two kind: first type is uncharged solution which corresponds with the topological (a)dS Schwarzschild solution and second type has electric charge and is equivalent to the Einstein-$Λ$-conformally invariant Maxwell solution. In other word, starting from pure gravity leads to (charged) Einstein-$Λ$ solutions which we interpreted them as (charged) (a)dS black hole solutions of pure $F(R)$ gravity. Calculating the Ricci and Kreschmann scalars show that there is a curvature singularity at $r=0$. We should note that the Kreschmann scalar of charged solutions goes to infinity as $r \rightarrow 0$, but with a rate slower than that of uncharged solutions. △ Less

Submitted 17 October, 2011; v1 submitted 1 February, 2011; originally announced February 2011.

Comments: 21 pages, 4 figures, generalization to higher dimensions, references added

Journal ref: Gen Relativ Gravit (2012) 44:835-853

Showing 1–37 of 37 results for author: Mousavi, S M