-
A New Method for Cross-Lingual-based Semantic Role Labeling
Authors:
Mohammad Ebrahimi,
Behrouz Minaei Bidgoli,
Nasim Khozouei
Abstract:
Semantic role labeling is a crucial task in natural language processing, enabling better comprehension of natural language. However, the lack of annotated data in multiple languages has posed a challenge for researchers. To address this, a deep learning algorithm based on model transfer has been proposed. The algorithm utilizes a dataset consisting of the English portion of CoNLL2009 and a corpus…
▽ More
Semantic role labeling is a crucial task in natural language processing, enabling better comprehension of natural language. However, the lack of annotated data in multiple languages has posed a challenge for researchers. To address this, a deep learning algorithm based on model transfer has been proposed. The algorithm utilizes a dataset consisting of the English portion of CoNLL2009 and a corpus of semantic roles in Persian. To optimize the efficiency of training, only ten percent of the educational data from each language is used. The results of the proposed model demonstrate significant improvements compared to Niksirt et al.'s model. In monolingual mode, the proposed model achieved a 2.05 percent improvement on F1-score, while in cross-lingual mode, the improvement was even more substantial, reaching 6.23 percent. Worth noting is that the compared model only trained two of the four stages of semantic role labeling and employed golden data for the remaining two stages. This suggests that the actual superiority of the proposed model surpasses the reported numbers by a significant margin. The development of cross-lingual methods for semantic role labeling holds promise, particularly in addressing the scarcity of annotated data for various languages. These advancements pave the way for further research in understanding and processing natural language across different linguistic contexts.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Comprehensive Forecasting of California's Energy Consumption: A Multi-Source and Sectoral Analysis Using ARIMA and ARIMAX Models
Authors:
Zahra Moslemi,
Logan Clark,
Sarah Kernal,
Samantha Rehome,
Scott Sprengel,
Ahoora Tamizifar,
Shawna Tuli,
Vish Chokshi,
Mo Nomeli,
Ella Liang,
Moury Bidgoli,
Jeff Lu,
Manish Dasaur,
Marty Hodgett
Abstract:
California's significant role as the second-largest consumer of energy in the United States underscores the importance of accurate energy consumption predictions. With a thriving industrial sector, a burgeoning population, and ambitious environmental goals, the state's energy landscape is dynamic and complex. This paper presents a comprehensive analysis of California's energy consumption trends an…
▽ More
California's significant role as the second-largest consumer of energy in the United States underscores the importance of accurate energy consumption predictions. With a thriving industrial sector, a burgeoning population, and ambitious environmental goals, the state's energy landscape is dynamic and complex. This paper presents a comprehensive analysis of California's energy consumption trends and provides detailed forecasting models for different energy sources and sectors. The study leverages ARIMA and ARIMAX models, considering both historical consumption data and exogenous variables. We address the unique challenges posed by the COVID-19 pandemic and the limited data for 2022, highlighting the resilience of these models in the face of uncertainty. Our analysis reveals that while fossil fuels continue to dominate California's energy landscape, renewable energy sources, particularly solar and biomass, are experiencing substantial growth. Hydroelectric power, while sensitive to precipitation, remains a significant contributor to renewable energy consumption. Furthermore, we anticipate ongoing efforts to reduce fossil fuel consumption. The forecasts for energy consumption by sector suggest continued growth in the commercial and residential sectors, reflecting California's expanding economy and population. In contrast, the industrial sector is expected to experience more moderate changes, while the transportation sector remains the largest energy consumer.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
A Strategy for Implementing description Temporal Dynamic Algorithms in Dynamic Knowledge Graphs by SPIN
Authors:
Alireza Shahbazi,
Seyyed Ahmad Mirsanei,
Malikeh Haj Khan Mirzaye Sarraf,
Behrouz Minaei Bidgoli
Abstract:
Planning and reasoning about actions and processes, in addition to reasoning about propositions, are important issues in recent logical and computer science studies. The widespread use of actions in everyday life such as IoT, semantic web services, etc., and the limitations and issues in the action formalisms are two factors that lead us to study how actions are represented.
Since 2007, there ha…
▽ More
Planning and reasoning about actions and processes, in addition to reasoning about propositions, are important issues in recent logical and computer science studies. The widespread use of actions in everyday life such as IoT, semantic web services, etc., and the limitations and issues in the action formalisms are two factors that lead us to study how actions are represented.
Since 2007, there have been some ideas to integrate Description Logic (DL) and action formalisms for representing both static and dynamic knowledge. Meanwhile, time is an important factor in dynamic situations, and actions change states over time. In this study, on the one hand, we examined related logical structures such as extensions of description logics (DLs), temporal formalisms, and action formalisms. On the other hand, we analyzed possible tools for designing and developing the Knowledge and Action Base (KAB).
For representation and reasoning about actions, we embedded actions into DLs (such as Dynamic-ALC and its extensions). We propose a terminable algorithm for action projection, planning, checking the satisfiability, consistency, realizability, and executability, and also querying from KAB. Actions in this framework were modeled with SPIN and added to state space. This framework has also been implemented as a plugin for the Protégé ontology editor.
During the last two decades, various algorithms have been presented, but due to the high computational complexity, we face many problems in implementing dynamic ontologies. In addition, an algorithm to detect the inconsistency of actions' effects was not explicitly stated. In the proposed strategy, the interactions of actions with other parts of modeled knowledge, and a method to check consistency between the effects of actions are presented. With this framework, the ramification problem can be well handled in future works.
△ Less
Submitted 20 January, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
PersianLLaMA: Towards Building First Persian Large Language Model
Authors:
Mohammad Amin Abbasi,
Arash Ghafouri,
Mahdi Firouzmandi,
Hassan Naderi,
Behrouz Minaei Bidgoli
Abstract:
Despite the widespread use of the Persian language by millions globally, limited efforts have been made in natural language processing for this language. The use of large language models as effective tools in various natural language processing tasks typically requires extensive textual data and robust hardware resources. Consequently, the scarcity of Persian textual data and the unavailability of…
▽ More
Despite the widespread use of the Persian language by millions globally, limited efforts have been made in natural language processing for this language. The use of large language models as effective tools in various natural language processing tasks typically requires extensive textual data and robust hardware resources. Consequently, the scarcity of Persian textual data and the unavailability of powerful hardware resources have hindered the development of large language models for Persian. This paper introduces the first large Persian language model, named PersianLLaMA, trained on a collection of Persian texts and datasets. This foundational model comes in two versions, with 7 and 13 billion parameters, trained on formal and colloquial Persian texts using two different approaches. PersianLLaMA has been evaluated for natural language generation tasks based on the latest evaluation methods, namely using larger language models, and for natural language understanding tasks based on automated machine metrics. The results indicate that PersianLLaMA significantly outperforms its competitors in both understanding and generating Persian text. PersianLLaMA marks an important step in the development of Persian natural language processing and can be a valuable resource for the Persian-speaking community. This large language model can be used for various natural language processing tasks, especially text generation like chatbots, question-answering, machine translation, and text summarization
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Spin- and valley-dependent transports through ferromagnetic 8-pmmn borophene monolayer
Authors:
Fatemeh Imanian Mofrad Bidgoli,
Hossein Nikoofard,
Narges Nikoofard,
Mahdi Esmaeilzadeh
Abstract:
We study spin and valley-dependent transport properties in an n-p-n junction of 8-pmmn borophene monolayer. An external gate voltage and exchange magnetic field, induced by the proximity effect of a ferromagnetic insulator, are applied to this junction as electric and magnetic potential barriers. We show that the exchange magnetic field generates spin polarization in the system and applying a gate…
▽ More
We study spin and valley-dependent transport properties in an n-p-n junction of 8-pmmn borophene monolayer. An external gate voltage and exchange magnetic field, induced by the proximity effect of a ferromagnetic insulator, are applied to this junction as electric and magnetic potential barriers. We show that the exchange magnetic field generates spin polarization in the system and applying a gate voltage, as a simple method, causes valley polarization. This property (valley polarization) is due to the anisotropic and tilted Dirac cones of the borophene structure. It is an advantage of borophene monolayer over graphene monolayer because in graphene it is necessary to apply strain to have valley polarization. We also show that the proposed device (borophene-based n-p-n junction) can work as perfect spin and perfect valley filters. The spin and valley filters can be controlled by changing two factors, i.e. gate voltage and Fermi energy. Moreover, it is shown that for full spin and valley polarizations and thus perfect spin and valley filters, the length of the barriers must be larger than a specific value (60nm). These results show that the borophene monolayer has a suitable potential to be used in spintronic and valleytronic devices.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
Persian Semantic Role Labeling Using Transfer Learning and BERT-Based Models
Authors:
Saeideh Niksirat Aghdam,
Sayyed Ali Hossayni,
Erfan Khedersolh Sadeh,
Nasim Khozouei,
Behrouz Minaei Bidgoli
Abstract:
Semantic role labeling (SRL) is the process of detecting the predicate-argument structure of each predicate in a sentence. SRL plays a crucial role as a pre-processing step in many NLP applications such as topic and concept extraction, question answering, summarization, machine translation, sentiment analysis, and text mining. Recently, in many languages, unified SRL dragged lots of attention due…
▽ More
Semantic role labeling (SRL) is the process of detecting the predicate-argument structure of each predicate in a sentence. SRL plays a crucial role as a pre-processing step in many NLP applications such as topic and concept extraction, question answering, summarization, machine translation, sentiment analysis, and text mining. Recently, in many languages, unified SRL dragged lots of attention due to its outstanding performance, which is the result of overcoming the error propagation problem. However, regarding the Persian language, all previous works have focused on traditional methods of SRL leading to a drop in accuracy and imposing expensive feature extraction steps in terms of financial resources, time and energy consumption. In this work, we present an end-to-end SRL method that not only eliminates the need for feature extraction but also outperforms existing methods in facing new samples in practical situations. The proposed method does not employ any auxiliary features and shows more than 16 (83.16) percent improvement in accuracy against previous methods in similar circumstances.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity
Authors:
Mohammad Abdous,
Poorya Piroozfar,
Behrouz Minaei Bidgoli
Abstract:
One of the components of natural language processing that has received a lot of investigation recently is semantic textual similarity. In computational linguistics and natural language processing, assessing the semantic similarity of words, phrases, paragraphs, and texts is crucial. Calculating the degree of semantic resemblance between two textual pieces, paragraphs, or phrases provided in both m…
▽ More
One of the components of natural language processing that has received a lot of investigation recently is semantic textual similarity. In computational linguistics and natural language processing, assessing the semantic similarity of words, phrases, paragraphs, and texts is crucial. Calculating the degree of semantic resemblance between two textual pieces, paragraphs, or phrases provided in both monolingual and cross-lingual versions is known as semantic similarity. Cross lingual semantic similarity requires corpora in which there are sentence pairs in both the source and target languages with a degree of semantic similarity between them. Many existing cross lingual semantic similarity models use a machine translation due to the unavailability of cross lingual semantic similarity dataset, which the propagation of the machine translation error reduces the accuracy of the model. On the other hand, when we want to use semantic similarity features for machine translation the same machine translations should not be used for semantic similarity. For Persian, which is one of the low resource languages, no effort has been made in this regard and the need for a model that can understand the context of two languages is felt more than ever. In this article, the corpus of semantic textual similarity between sentences in Persian and English languages has been produced for the first time by using linguistic experts. We named this dataset PESTS (Persian English Semantic Textual Similarity). This corpus contains 5375 sentence pairs. Also, different models based on transformers have been fine-tuned using this dataset. The results show that using the PESTS dataset, the Pearson correlation of the XLM ROBERTa model increases from 85.87% to 95.62%.
△ Less
Submitted 5 September, 2024; v1 submitted 13 May, 2023;
originally announced May 2023.
-
OSLO: On-the-Sphere Learning for Omnidirectional images and its application to 360-degree image compression
Authors:
Navid Mahmoudian Bidgoli,
Roberto G. de A. Azevedo,
Thomas Maugey,
Aline Roumy,
Pascal Frossard
Abstract:
State-of-the-art 2D image compression schemes rely on the power of convolutional neural networks (CNNs). Although CNNs offer promising perspectives for 2D image compression, extending such models to omnidirectional images is not straightforward. First, omnidirectional images have specific spatial and statistical properties that can not be fully captured by current CNN models. Second, basic mathema…
▽ More
State-of-the-art 2D image compression schemes rely on the power of convolutional neural networks (CNNs). Although CNNs offer promising perspectives for 2D image compression, extending such models to omnidirectional images is not straightforward. First, omnidirectional images have specific spatial and statistical properties that can not be fully captured by current CNN models. Second, basic mathematical operations composing a CNN architecture, e.g., translation and sampling, are not well-defined on the sphere. In this paper, we study the learning of representation models for omnidirectional images and propose to use the properties of HEALPix uniform sampling of the sphere to redefine the mathematical tools used in deep learning models for omnidirectional images. In particular, we: i) propose the definition of a new convolution operation on the sphere that keeps the high expressiveness and the low complexity of a classical 2D convolution; ii) adapt standard CNN techniques such as stride, iterative aggregation, and pixel shuffling to the spherical domain; and then iii) apply our new framework to the task of omnidirectional image compression. Our experiments show that our proposed on-the-sphere solution leads to a better compression gain that can save 13.7% of the bit rate compared to similar learned models applied to equirectangular images. Also, compared to learning models based on graph convolutional networks, our solution supports more expressive filters that can preserve high frequencies and provide a better perceptual quality of the compressed images. Such results demonstrate the efficiency of the proposed framework, which opens new research venues for other omnidirectional vision tasks to be effectively implemented on the sphere manifold.
△ Less
Submitted 21 August, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Fine granularity access in interactive compression of 360-degree images based on rate-adaptive channel codes
Authors:
Navid Mahmoudian Bidgoli,
Thomas Maugey,
Aline Roumy
Abstract:
In this paper, we propose a new interactive compression scheme for omnidirectional images. This requires two characteristics: efficient compression of data, to lower the storage cost, and random access ability to extract part of the compressed stream requested by the user (for reducing the transmission rate). For efficient compression, data needs to be predicted by a series of references that have…
▽ More
In this paper, we propose a new interactive compression scheme for omnidirectional images. This requires two characteristics: efficient compression of data, to lower the storage cost, and random access ability to extract part of the compressed stream requested by the user (for reducing the transmission rate). For efficient compression, data needs to be predicted by a series of references that have been pre-defined and compressed. This contrasts with the spirit of random accessibility. We propose a solution for this problem based on incremental codes implemented by rate-adaptive channel codes. This scheme encodes the image while adapting to any user request and leads to an efficient coding that is flexible in extracting data depending on the available information at the decoder. Therefore, only the information that is needed to be displayed at the user's side is transmitted during the user's request, as if the request was already known at the encoder. The experimental results demonstrate that our coder obtains a better transmission rate than the state-of-the-art tile-based methods at a small cost in storage. Moreover, the transmission rate grows gradually with the size of the request and avoids a staircase effect, which shows the perfect suitability of our coder for interactive transmission.
△ Less
Submitted 21 August, 2020; v1 submitted 25 June, 2020;
originally announced June 2020.
-
Threshold for weak saturation stability
Authors:
M. Bidgoli,
A. Mohammadian,
B. Tayfeh-Rezaie,
M. Zhukovskii
Abstract:
We study the weak $K_s$-saturation number of the Erdős--Rényi random graph $\mathbbmsl{G}(n, p)$, denoted by $\mathrm{wsat}(\mathbbmsl{G}(n, p), K_s)$, where $K_s$ is the complete graph on $s$ vertices. Korándi and Sudakov in 2017 proved that the weak $K_s$-saturation number of $K_n$ is stable, in the sense that it remains the same after removing edges with constant probability. In this paper, we…
▽ More
We study the weak $K_s$-saturation number of the Erdős--Rényi random graph $\mathbbmsl{G}(n, p)$, denoted by $\mathrm{wsat}(\mathbbmsl{G}(n, p), K_s)$, where $K_s$ is the complete graph on $s$ vertices. Korándi and Sudakov in 2017 proved that the weak $K_s$-saturation number of $K_n$ is stable, in the sense that it remains the same after removing edges with constant probability. In this paper, we prove that there exists a threshold for this stability property and give upper and lower bounds on the threshold. This generalizes the result of Korándi and Sudakov. A general upper bound for $\mathrm{wsat}(\mathbbmsl{G}(n, p), K_s)$ is also provided.
△ Less
Submitted 12 November, 2021; v1 submitted 11 June, 2020;
originally announced June 2020.
-
Percolating sets in bootstrap percolation on the Hamming graphs
Authors:
M. R. Bidgoli,
A. Mohammadian,
B. Tayfeh-Rezaie
Abstract:
For any integer $r\geqslant0$, the $r$-neighbor bootstrap percolation on a graph is an activation process of the vertices. The process starts with some initially activated vertices and then, in each round, any inactive vertex with at least $r$ active neighbors becomes activated. A set of initially activated vertices leading to the activation of all vertices is said to be a percolating set. Denote…
▽ More
For any integer $r\geqslant0$, the $r$-neighbor bootstrap percolation on a graph is an activation process of the vertices. The process starts with some initially activated vertices and then, in each round, any inactive vertex with at least $r$ active neighbors becomes activated. A set of initially activated vertices leading to the activation of all vertices is said to be a percolating set. Denote the minimum size of a percolating set in the $r$-neighbor bootstrap percolation process on a graph $G$ by $m(G, r)$. In this paper, we present upper and lower bounds on $m(K_n^d, r)$, where $K_n^d$ is the Cartesian product of $d$ copies of the complete graph $K_n$ which is referred as the Hamming graph. Among other results, we show that $m(K_n^d, r)=\frac{1+o(1)}{(d+1)!}r^d$ when both $r$ and $d$ go to infinity with $r<n$ and $d=o(\!\sqrt{r})$.
△ Less
Submitted 6 May, 2019;
originally announced May 2019.
-
On $K_{2,t}$-bootstrap percolation
Authors:
M. R. Bidgoli,
A. Mohammadian,
B. Tayfeh-Rezaie
Abstract:
Given two graphs $G$ and $H$, it is said that $G$ percolates in $H$-bootstrap process if one could join all the nonadjacent pairs of vertices of $G$ in some order such that a new copy of $H$ is created at each step. Balogh, Bollobás and Morris in 2012 investigated the threshold of $H$-bootstrap percolation in the Erdős-Rényi model for the complete graph $H$ and proposed the similar problem for…
▽ More
Given two graphs $G$ and $H$, it is said that $G$ percolates in $H$-bootstrap process if one could join all the nonadjacent pairs of vertices of $G$ in some order such that a new copy of $H$ is created at each step. Balogh, Bollobás and Morris in 2012 investigated the threshold of $H$-bootstrap percolation in the Erdős-Rényi model for the complete graph $H$ and proposed the similar problem for $H=K_{s,t}$, the complete bipartite graph. In this paper, we provide lower and upper bounds on the threshold of $K_{2, t}$-bootstrap percolation. In addition, a threshold function is derived for $K_{2, 4}$-bootstrap percolation.
△ Less
Submitted 27 June, 2018;
originally announced June 2018.
-
A location-aware embedding technique for accurate landmark recognition
Authors:
Federico Magliani,
Navid Mahmoudian Bidgoli,
Andrea Prati
Abstract:
The current state of the research in landmark recognition highlights the good accuracy which can be achieved by embedding techniques, such as Fisher vector and VLAD. All these techniques do not exploit spatial information, i.e. consider all the features and the corresponding descriptors without embedding their location in the image. This paper presents a new variant of the well-known VLAD (Vector…
▽ More
The current state of the research in landmark recognition highlights the good accuracy which can be achieved by embedding techniques, such as Fisher vector and VLAD. All these techniques do not exploit spatial information, i.e. consider all the features and the corresponding descriptors without embedding their location in the image. This paper presents a new variant of the well-known VLAD (Vector of Locally Aggregated Descriptors) embedding technique which accounts, at a certain degree, for the location of features. The driving motivation comes from the observation that, usually, the most interesting part of an image (e.g., the landmark to be recognized) is almost at the center of the image, while the features at the borders are irrelevant features which do no depend on the landmark. The proposed variant, called locVLAD (location-aware VLAD), computes the mean of the two global descriptors: the VLAD executed on the entire original image, and the one computed on a cropped image which removes a certain percentage of the image borders. This simple variant shows an accuracy greater than the existing state-of-the-art approach. Experiments are conducted on two public datasets (ZuBuD and Holidays) which are used both for training and testing. Morever a more balanced version of ZuBuD is proposed.
△ Less
Submitted 19 April, 2017;
originally announced April 2017.
-
Wisdom of Crowds cluster ensemble
Authors:
Hosein Alizadeh,
Muhammad Yousefnezhad,
Behrouz Minaei Bidgoli
Abstract:
The Wisdom of Crowds is a phenomenon described in social science that suggests four criteria applicable to groups of people. It is claimed that, if these criteria are satisfied, then the aggregate decisions made by a group will often be better than those of its individual members. Inspired by this concept, we present a novel feedback framework for the cluster ensemble problem, which we call Wisdom…
▽ More
The Wisdom of Crowds is a phenomenon described in social science that suggests four criteria applicable to groups of people. It is claimed that, if these criteria are satisfied, then the aggregate decisions made by a group will often be better than those of its individual members. Inspired by this concept, we present a novel feedback framework for the cluster ensemble problem, which we call Wisdom of Crowds Cluster Ensemble (WOCCE). Although many conventional cluster ensemble methods focusing on diversity have recently been proposed, WOCCE analyzes the conditions necessary for a crowd to exhibit this collective wisdom. These include decentralization criteria for generating primary results, independence criteria for the base algorithms, and diversity criteria for the ensemble members. We suggest appropriate procedures for evaluating these measures, and propose a new measure to assess the diversity. We evaluate the performance of WOCCE against some other traditional base algorithms as well as state-of-the-art ensemble methods. The results demonstrate the efficiency of WOCCE's aggregate decision-making compared to other algorithms.
△ Less
Submitted 13 May, 2016;
originally announced May 2016.
-
Case study: Data Mining of Associate Degree Accepted Candidates by Modular Method
Authors:
Behrouz Minaei Bidgoli,
Maryam Nazaridoust
Abstract:
Since about 10 years ago, University of Applied Science and Technology (UAST) in Iran has admitted students in discontinuous associate degree by modular method, so that almost 100,000 students are accepted every year. Although the first aim of holding such courses was to improve scientific and skill level of employees, over time a considerable group of unemployed people have been interested to par…
▽ More
Since about 10 years ago, University of Applied Science and Technology (UAST) in Iran has admitted students in discontinuous associate degree by modular method, so that almost 100,000 students are accepted every year. Although the first aim of holding such courses was to improve scientific and skill level of employees, over time a considerable group of unemployed people have been interested to participate in these courses. According to this fact, in this paper, we mine and analyze a sample data of accepted candidates in modular 2008 and 2009 courses by using unsupervised and supervised learning paradigms. In the first step, by using unsupervised paradigm, we grouped (clustered) set of modular accepted candidates based on their student status and labeled data sets by three classes so that each class somehow shows educational and student status of modular accepted candidates. In the second step, by using supervised and unsupervised algorithms, we generated predicting models in 2008 data sets. Then, by making a comparison between performances of generated models, we selected predicting model of association rules through which some rules were extracted. Finally, this model is executed for Test set which includes accepted candidates of next course then by evaluation of results, the percentage of correctness and confidentiality of obtained results can be viewed.
△ Less
Submitted 16 April, 2014;
originally announced April 2014.