The volume of activity in the Nigerian e-commerce sector has been increasing recently, almost at a geometric rate. A good percentage of Nigerians now look online to purchase items across categories ranging from apparel, books, and stationery to home goods and electronics. E-commerce services have been available in Nigeria for less than a decade, but increased patronage from Nigerians appears to have coincided with the introduction of the ability to pay for orders upon delivery and inspection. This paper investigates the relationship between this mode of payment and the sudden surge in patronage of Nigerian e-commerce outlets by examining different online merchants, their payment options, the trust factor in the market, other possible factors influencing the growth of the e-commerce sector, and online traffic information and statistics over three months.
We present a model-based verification technique built on selective and pragmatic use of formal methods, using simplified model checking tools that focus on error detection rather than formalized proofs. The framework checks that the trajectories and events of DEVS-Driven Modeling Language (DDML) simulation models agree with those of the real system, in order to achieve replicative, predictive and structural validity through the lightweight application of formal methods. The goal is to reduce and ease simulation model verification effort while increasing the coverage of the process, in order to verify the transformational accuracy of the model development process, increase confidence in the simulation model, and allow for performance evaluation of simulation models. The framework provides an iterative model refinement procedure that helps to enhance a DEVS simulation model, correct errors, or adapt to changing contextual requirements. This refinement procedure is applicable to evolutionary software development and to systems requiring rapid prototyping, where iterative refinement helps keep pace with changing requirements. Furthermore, we present a case study of a GSM telecommunication system to demonstrate the framework's ability not only to formally verify systems but also to refine their models.
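As a rough illustration of the replicative-validity check at the heart of this framework, here is a minimal sketch that simulates a toy two-state model and compares its trajectory against an observed system trace. The channel model, event names, and the simulate/conforms helpers are invented for illustration and are not part of DDML or its tooling.

def simulate(events, state="idle", clock=0.0):
    """Run a toy atomic model and record its (time, state) trajectory."""
    trajectory = [(clock, state)]
    for time, event in events:
        clock = time
        if state == "idle" and event == "call_request":
            state = "busy"
        elif state == "busy" and event == "hang_up":
            state = "idle"
        else:
            state = "error"  # unexpected event in the current state
        trajectory.append((clock, state))
    return trajectory

def conforms(model_trace, system_trace):
    """Replicative validity check: do the traces agree event-for-event?"""
    return model_trace == system_trace

events = [(1.0, "call_request"), (4.5, "hang_up")]
observed = [(0.0, "idle"), (1.0, "busy"), (4.5, "idle")]
print(conforms(simulate(events), observed))  # True -> traces agree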
Program failures are often caused by invalid inputs, for instance due to input corruption. To obtain the passing input, one needs to debug the data. In this paper we present a generic technique called ddmax that (1) identifies which parts of the input data prevent processing, and (2) recovers as much of the (valuable) input data as possible. To the best of our knowledge, ddmax is the first approach that fixes faults in the input data without requiring program analysis. In our evaluation, ddmax repaired about 69% of input files and recovered about 78% of data within one minute per input.
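As a rough, hedged illustration of the maximization idea (not the paper's actual ddmax algorithm), the following sketch greedily keeps as many lines of a corrupted CSV file as the program under test will still accept; the processes predicate and the line-based scenario are assumptions made for the example.

import csv, io

def processes(text):
    """Stand-in for the program under test: accept 3-column CSV only."""
    rows = list(csv.reader(io.StringIO(text)))
    return all(len(row) == 3 for row in rows)

def ddmax_greedy(lines):
    kept = []
    for line in lines:
        candidate = kept + [line]
        if processes("\n".join(candidate)):  # keep the line if input still processes
            kept = candidate
    return kept

corrupted = ["a,b,c", "d,e", "f,g,h"]       # middle line is corrupted
print(ddmax_greedy(corrupted))              # ['a,b,c', 'f,g,h']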
What is the best technique for fault localization? In a study of 37 real bugs (and 37 injected faults) in more than a dozen open source C programs, we compare the effectiveness of statistical debugging against dynamic slicing—the first study ever to compare the techniques. On average, dynamic slicing is more effective than statistical debugging, requiring programmers to examine only 14% (42 lines) of the code before finding the defect, less than half the effort required by statistical debugging (30% or 157 lines). Best results are obtained by a hybrid approach: If programmers first examine the top five most suspicious locations from statistical debugging, and then switch to dynamic slices, they will need to examine only 11% (35 lines) of the code.
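To make the statistical side concrete, here is a minimal sketch of how statistical debugging ranks code locations from pass/fail coverage, using the Ochiai formula as one representative metric; the formula choice and the toy coverage counts are illustrative, not the study's exact setup.

from math import sqrt

def ochiai(failed_cov, passed_cov, total_failed):
    """failed_cov/passed_cov: line -> number of failing/passing runs covering it."""
    scores = {}
    for line in set(failed_cov) | set(passed_cov):
        ef = failed_cov.get(line, 0)        # failing runs covering this line
        ep = passed_cov.get(line, 0)        # passing runs covering this line
        denom = sqrt(total_failed * (ef + ep))
        scores[line] = ef / denom if denom else 0.0
    return sorted(scores.items(), key=lambda kv: -kv[1])

# two failing runs covered line 42; five passing runs covered line 7
print(ochiai({42: 2, 7: 0}, {7: 5, 42: 0}, total_failed=2))
# [(42, 1.0), (7, 0.0)] -> inspect line 42 first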
Context: When software evolves, opportunities for introducing faults appear. Therefore, it is important to test the evolved program behaviors during each evolution cycle. However, while software evolves, its complexity is also evolving, introducing challenges to the testing process. To deal with this issue, testing techniques should be adapted to target the effect of the program changes instead of the entire program functionality. To this end, commit-aware mutation testing, a powerful testing technique, has been proposed. Unfortunately, commit-aware mutation testing is challenging due to the complex program semantics involved. Hence, it is pertinent to understand the characteristics, predictability, and potential of the technique. Objective: We conduct an exploratory study to investigate the properties of commit-relevant mutants, i.e., the test elements of commit-aware mutation testing, by proposing a general definition and an experimental approach to identify them. We thus aim a...
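As a first intuition only: a purely syntactic approximation of commit-relevance would keep just the mutants located on lines the commit touched, as in the sketch below. The paper's definition is semantic (based on test behavior), so the changed_lines diff walker and this filter are illustrative assumptions.

def changed_lines(diff_text):
    """Collect added-line numbers from a unified diff (simplified)."""
    lines, current = set(), 0
    for line in diff_text.splitlines():
        if line.startswith("@@"):           # hunk header, e.g. @@ -3,2 +3,3 @@
            current = int(line.split("+")[1].split(",")[0]) - 1
        elif line.startswith("+") and not line.startswith("+++"):
            current += 1
            lines.add(current)              # an added line in the new version
        elif not line.startswith("-"):
            current += 1                    # context line
    return lines

def candidate_mutants(mutants, diff_text):
    touched = changed_lines(diff_text)
    return [m for m in mutants if m["line"] in touched]

diff = "@@ -3,2 +3,3 @@\n context\n+new_statement()\n context"
mutants = [{"id": 1, "line": 4}, {"id": 2, "line": 9}]
print(candidate_mutants(mutants, diff))     # [{'id': 1, 'line': 4}]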
Code embedding is a keystone in the application of machine learning on several Software Engineering (SE) tasks. To effectively support a plethora of SE tasks, the embedding needs to capture program syntax and semantics in a way that is generic. To this end, we propose the first self-supervised pre-training approach (called GraphCode2Vec) which produces task-agnostic embedding of lexical and program dependence features. GraphCode2Vec achieves this via a synergistic combination of code analysis and Graph Neural Networks. GraphCode2Vec is generic, it allows pre-training, and it is applicable to several SE downstream tasks. We evaluate the effectiveness of GraphCode2Vec on four (4) tasks (method name prediction, solution classification, mutation testing and overfitted patch classification), and compare it with four (4) similarly generic code embedding baselines (Code2Seq, Code2Vec, CodeBERT, GraphCodeBERT) and seven (7) task-specific, learning-based methods. In particular, GraphCo...
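A minimal, untrained sketch of the underlying data flow, assuming a toy dependence graph: neighbor features are aggregated over a few message-passing rounds and mean-pooled into a single graph embedding. Real GraphCode2Vec uses learned Graph Neural Network layers; nothing here reflects its actual architecture.

def message_pass(node_feats, edges, rounds=2):
    """node_feats: id -> feature vector; edges: list of (src, dst) pairs."""
    feats = dict(node_feats)
    for _ in range(rounds):
        nxt = {}
        for n, vec in feats.items():
            neighbors = [feats[s] for s, d in edges if d == n]
            agg = [sum(vals) / len(neighbors) for vals in zip(*neighbors)] \
                  if neighbors else [0.0] * len(vec)
            nxt[n] = [x + a for x, a in zip(vec, agg)]  # combine self + neighbors
        feats = nxt
    return feats

def graph_embedding(node_feats, edges):
    feats = message_pass(node_feats, edges)
    return [sum(vals) / len(feats) for vals in zip(*feats.values())]  # mean pooling

# toy dependence graph: statement 1 feeds statements 2 and 3
nodes = {1: [1.0, 0.0], 2: [0.0, 1.0], 3: [0.5, 0.5]}
print(graph_embedding(nodes, [(1, 2), (1, 3)]))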
The introduction of robust optimisation has pushed the state-of-the-art in defending against adversarial attacks. However, the behaviour of such optimisation has not been studied in the light of a fundamentally different class of attacks called backdoors. In this paper, we demonstrate that adversarially robust models are susceptible to backdoor attacks. Subsequently, we observe that backdoors are reflected in the feature representation of such models. We then leverage this observation to detect backdoor-infected models via a detection technique called AEGIS. Specifically, AEGIS uses feature clustering to effectively detect backdoor-infected robust Deep Neural Networks (DNNs). In our evaluation of major classification tasks using the CIFAR-10, MNIST and FMNIST datasets, AEGIS effectively detects robust DNNs infected with backdoors. Overall, AEGIS achieves 97% (70/72) detection accuracy and a 0.3% (2/648) false positive rate across all configurations. Our investigation reveals that salient featu...
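To illustrate the clustering intuition (not AEGIS's actual criterion): in a backdoor-infected model, the features of a target class tend to split into two well-separated clusters, one for clean inputs and one for triggered inputs. The 2-D toy features, silhouette test, and threshold below are assumptions for the example.

import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

def looks_backdoored(features, threshold=0.6):
    """Flag a class whose feature space splits into two tight clusters."""
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(features)
    return silhouette_score(features, labels) > threshold

clean = np.random.normal([0, 0], 0.1, size=(50, 2))            # one blob
infected = np.vstack([clean, np.random.normal([5, 5], 0.1, size=(50, 2))])

print(looks_backdoored(clean))      # False: a single cluster
print(looks_backdoored(infected))   # True: two well-separated clusters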
How do practitioners debug computer programs? In a retrospective study with 180 respondents and an observational study with 12 practitioners, we collect and discuss data on how developers spend their time diagnosing and fixing bugs, with key findings on the tools and strategies used, and we highlight the need for automated assistance. To facilitate and guide future research, we provide a highly usable debugging benchmark with fault locations, patches, and explanations for common bugs, as provided by the practitioners.
Generating structured input files to test programs can be performed by techniques that produce them from a grammar serving as the specification for syntactically correct input files. Two interesting scenarios then arise for effective testing. In the first scenario, software engineers would like to generate inputs that are as similar as possible to the inputs in common usage of the program, to test the reliability of the program. More interesting is the second scenario, where inputs should be as dissimilar as possible from normal usage. This is useful for robustness testing and exploring yet uncovered behavior. To provide test cases for both scenarios, we leverage a context-free grammar to parse a set of sample input files that represent the program's common usage, and determine probabilities for individual grammar productions as they occur during parsing of the inputs. Replicating these probabilities during grammar-based test input generation, we obtain inputs that are close to t...
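A minimal sketch of the idea, assuming a toy grammar over digit strings: production weights are taken from counts observed while parsing samples, and inverting those weights steers generation toward uncommon inputs. The inversion rule shown is one simple choice, not necessarily the paper's.

import random

GRAMMAR = {"<digits>": [["<digit>"], ["<digit>", "<digits>"]],
           "<digit>": [["0"], ["1"]]}

# production counts observed while parsing sample files
COUNTS = {"<digits>": [80, 20], "<digit>": [90, 10]}

def weights(symbol, invert=False):
    counts = COUNTS[symbol]
    w = [1 / (c + 1) for c in counts] if invert else counts  # invert = favor rare
    total = sum(w)
    return [x / total for x in w]

def produce(symbol="<digits>", invert=False):
    if symbol not in GRAMMAR:
        return symbol                       # terminal symbol
    expansion = random.choices(GRAMMAR[symbol], weights(symbol, invert))[0]
    return "".join(produce(s, invert) for s in expansion)

print(produce())             # likely short strings dominated by "0"
print(produce(invert=True))  # likely longer strings dominated by "1"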
Statistical fault localization is an easily deployed technique for quickly determining candidates for faulty code locations. If a human programmer has to search for the fault beyond the top candidate locations, though, more traditional techniques of following dependencies along dynamic slices may be better suited. In a large study of 457 bugs (369 single faults and 88 multiple faults) in 46 open source C programs, we compare the effectiveness of statistical fault localization against dynamic slicing. For single faults, we find that dynamic slicing was eight percentage points more effective than the best performing statistical debugging formula; for 66% of the bugs, dynamic slicing finds the fault earlier than the best performing statistical debugging formula. In our evaluation, dynamic slicing is more effective for programs with a single fault, but statistical debugging performs better on multiple faults. Best results, however, are obtained by a hybrid approach: If programmers first exam...
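The hybrid strategy itself is simple enough to sketch: inspect the top-k statistically suspicious lines, then continue along the dynamic slice, skipping duplicates. The rankings below are made-up placeholders; producing them is the job of the underlying tools.

def hybrid_inspection_order(statistical_ranking, dynamic_slice, k=5):
    order, seen = [], set()
    for line in statistical_ranking[:k] + dynamic_slice:
        if line not in seen:                # skip lines already inspected
            seen.add(line)
            order.append(line)
    return order

stat_rank = [88, 12, 42, 7, 3, 99, 54]      # most suspicious first
slice_rank = [61, 42, 17, 12, 5]            # slice from failure backwards
print(hybrid_inspection_order(stat_rank, slice_rank, k=5))
# [88, 12, 42, 7, 3, 61, 17, 5]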
A program fails. Under which circumstances does the failure occur? Our Alhazen approach starts with a run that exhibits a particular behavior and automatically determines the input features associated with the behavior in question: (1) We use a grammar to parse the input into individual elements. (2) We use a decision tree learner to observe and learn which input elements are associated with the behavior in question. (3) We use the grammar to generate additional inputs to further strengthen or refute the learned associations. (4) By repeating steps 2 and 3, we obtain a theory that explains and predicts the given behavior. In our evaluation using inputs for find, grep, NetHack, and a JavaScript transpiler, the theories produced by Alhazen predict and produce failures with high accuracy and allow developers to focus on a small set of input features: “grep fails whenever the --fixed-strings option is used in conjunction with an empty search string.”
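A minimal sketch of step (2), assuming a made-up two-feature encoding of grep-like invocations: a decision tree learned over pass/fail labels recovers the failure circumstances as feature boundaries. The feature names and data are illustrative, not Alhazen's actual encoding.

from sklearn.tree import DecisionTreeClassifier, export_text

# features from parsed invocations: [uses_fixed_strings, pattern_len]
X = [[1, 0], [1, 3], [0, 0], [0, 5], [1, 0], [0, 2]]
y = ["FAIL", "PASS", "PASS", "PASS", "FAIL", "PASS"]

tree = DecisionTreeClassifier(max_depth=2).fit(X, y)
print(export_text(tree, feature_names=["uses_fixed_strings", "pattern_len"]))
# The tree encodes the hypothesis "fails when --fixed-strings is used
# with an empty pattern"; step (3) would then generate new inputs near
# these feature boundaries to confirm or refute it.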
Software often produces biased outputs. In particular, machine learning (ML) based software is known to produce erroneous predictions when processing discriminatory inputs. Such unfair program behavior can be caused by societal bias. In the last few years, Amazon, Microsoft and Google have provided software services that produced unfair outputs, mostly due to societal bias (e.g. gender or race). In such events, developers are saddled with the task of conducting fairness testing. Fairness testing is challenging; developers are tasked with generating discriminatory inputs that reveal and explain biases. We propose a grammar-based fairness testing approach (called ASTRAEA) which leverages context-free grammars to generate discriminatory inputs that reveal fairness violations in software systems. Using probabilistic grammars, ASTRAEA also provides fault diagnosis by isolating the cause of observed software bias. ASTRAEA's diagnoses facilitate the improvement of ML fairness. ASTRAEA ...
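To illustrate the core idea only: generate input pairs that differ solely in a protected attribute and flag the system when its outputs diverge. The toy template grammar, name lists, and deliberately biased classify stand-in below are assumptions; ASTRAEA's grammars and oracle are considerably richer.

import random

TEMPLATE = "{name} is a {occupation}"
NAMES = {"male": ["John", "Tom"], "female": ["Mary", "Jane"]}
OCCUPATIONS = ["doctor", "nurse", "pilot"]

def classify(sentence):
    """Stand-in for the ML system under test (deliberately biased)."""
    return "negative" if "Mary" in sentence and "pilot" in sentence else "positive"

def fairness_violations(trials=100):
    violations = []
    for _ in range(trials):
        occupation = random.choice(OCCUPATIONS)
        pair = [TEMPLATE.format(name=random.choice(NAMES[g]), occupation=occupation)
                for g in ("male", "female")]
        if classify(pair[0]) != classify(pair[1]):  # same input modulo gender
            violations.append(pair)
    return violations

print(fairness_violations()[:1])  # e.g. [['John is a pilot', 'Mary is a pilot']]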
How do professional software engineers debug computer programs? In an experiment with 27 real bugs that existed in several widely used programs, we invited 12 professional software engineers, who together spent one month on localizing, explaining, and fixing these bugs. This not only allowed us to study the various tools and strategies used to debug the same set of errors; we could also determine exactly which statements a developer would localize as faults, how a developer would diagnose and explain an error, and how a developer would fix an error – all of which software engineering researchers seek to automate. Until now, it has been difficult to evaluate the effectiveness and utility of automated debugging techniques without a user study. We publish the collected data, called DBGBENCH, to facilitate the effective evaluation of automated fault localization, diagnosis, and repair techniques w.r.t. the judgement of human experts. Keywords: Debugging in Practice, Fault Localization,...
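As a hypothetical illustration of how such a benchmark might be consumed (the field names and sample entry below are invented, not DBGBENCH's actual schema): an automated localization tool is scored by whether any human-confirmed fault location appears in its top-N suspects.

benchmark_entry = {
    "bug_id": "find.66c536bb",                 # hypothetical identifier
    "fault_locations": ["pred.c:1200", "parser.c:305"],
    "diagnosis": "option parser accepts an out-of-range day count",
    "patch": "--- a/pred.c\n+++ b/pred.c\n...",
}

def localization_hit(tool_ranking, entry, top_n=10):
    """Did the tool place any human-confirmed fault location in its top-N?"""
    return any(loc in tool_ranking[:top_n] for loc in entry["fault_locations"])

tool_output = ["parser.c:305", "main.c:40"]    # tool's ranked suspects
print(localization_hit(tool_output, benchmark_entry))  # True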
Debugging is a search process to find, understand and fix the root cause of software defects. Can debugging benefit from probabilistic information? We hypothesize that debugging activities can benefit from probabilistic information that captures the statistical dependence of program features and the minor variations of program behavior. This probabilistic information helps to guide the search for the root cause of the bug and provides detailed diagnostic information (such as failure-inducing inputs and method calls leading to the fault). To realize our hypothesis, we propose to improve debugging activities by guiding bug diagnosis using both probabilistic reasoning and program analysis. The main idea is to mine probabilistic information from program executions, then apply this information to construct probabilistic event structures (e.g. probabilistic call graphs) that guide debugging activities such as fault localization and comprehension. The resulting probabilistic model will guide bu...
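One of the probabilistic event structures mentioned, a probabilistic call graph, can be sketched directly: edge probabilities are estimated from call sequences mined from executions, and rare edges become candidates for inspection. The traces and the anomaly reading below are illustrative assumptions.

from collections import Counter, defaultdict

def build_call_graph(traces):
    """traces: list of call sequences, e.g. ['main', 'parse', 'eval']."""
    counts = defaultdict(Counter)
    for trace in traces:
        for caller, callee in zip(trace, trace[1:]):
            counts[caller][callee] += 1     # observed caller -> callee edge
    return {caller: {callee: n / sum(callees.values())
                     for callee, n in callees.items()}
            for caller, callees in counts.items()}

passing = [["main", "parse", "eval"]] * 9
failing = [["main", "parse", "cleanup"]]
graph = build_call_graph(passing + failing)
print(graph["parse"])   # {'eval': 0.9, 'cleanup': 0.1}
# the rare 'parse -> cleanup' edge is a candidate anomaly to inspect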