Rakesh Verma

Followers

Following

Co-author

Mentions

Public Views

Interests

Uploads

Papers by Rakesh Verma

Clustering for Security Challenges

Proceedings of the ACM International Workshop on Security and Privacy Analytics

Download

An In-Depth Benchmarking and Evaluation of Phishing Detection Research for Security Needs

IEEE Access, 2020

Download

Vulnerability Detection via Multimodal Learning: Datasets and Analysis

Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security

Download

Automated email Generation for Targeted Attacks using Natural Language

ArXiv, 2019

With an increasing number of malicious attacks, the number of people and organizations falling pr... more With an increasing number of malicious attacks, the number of people and organizations falling prey to social engineering attacks is proliferating. Despite considerable research in mitigation systems, attackers continually improve their modus operandi by using sophisticated machine learning, natural language processing techniques with an intent to launch successful targeted attacks aimed at deceiving detection mechanisms as well as the victims. We propose a system for advanced email masquerading attacks using Natural Language Generation (NLG) techniques. Using legitimate as well as an influx of varying malicious content, the proposed deep learning system generates \textit{fake} emails with malicious content, customized depending on the attacker's intent. The system leverages Recurrent Neural Networks (RNNs) for automated text generation. We also focus on the performance of the generated emails in defeating statistical detectors, and compare and analyze the emails using a propose...

Download

Can Machines Tell Stories? A Comparative Study of Deep Neural Language Models and Metrics

IEEE Access

Download

Phishing Sites Detection from a Web Developer’s Perspective Using Machine Learning

Proceedings of the 53rd Hawaii International Conference on System Sciences, 2020

Download

Statistics

Cybersecurity Analytics, 2019

Claim Verification using a Multi-GAN based Model

Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, 2021

Download

An efficient multiversion access structure

IEEE Transactions on Knowledge and Data Engineering, 1997

Download

Real Cryptographic . . . : Improving Techniques for Proving Undecidability of Checking Security Goals

Existing undecidability proofs of checking secrecy of cryptographic protocols have the limitation... more Existing undecidability proofs of checking secrecy of cryptographic protocols have the limitations of not considering protocols common in literature, which are in the form of communication sequences, since only protocols as non-matching roles are considered, and not considering an attacker who is an insider since only an outsider attacker is considered. Therefore the complexity of checking the realistic attacks, such as the attack to the public key Needham-Schroeder protocol, is unknown. The limitations have been observed independently and described similarly by Froschle in a recently published paper [1], where two open problems are posted. This paper investigates these limitations, and we present a generally applicable approach by reductions with novel features from the reachability problem of 2-counter machines, and we solve the two open problems. We also prove the undecidability of checking authentication which is the first detailed proof to the best of our knowledge. A unique fe...

Tight Bounds for Prefetching and Buffer Management Algorithms for Parallel I/O Systems

AbstractÐThe I/O performance of applications in multiple-disk systems can be improved by overlapp... more AbstractÐThe I/O performance of applications in multiple-disk systems can be improved by overlapping disk accesses. This requires the use of appropriate prefetching and buffer management algorithms that ensure the most useful blocks are accessed and retained in the buffer. In this paper, we answer several fundamental questions on prefetching and buffer management for distributed-buffer parallel I/O systems. First, we derive and prove the optimality of an algorithm, P-min, that minimizes the number of parallel I/Os. Second, we analyze P-con, an algorithm that always matches its replacement decisions with those of the well-known demand-paged MIN algorithm. We show that P-con can become fully sequential in the worst case. Third, we investigate the behavior of on-line algorithms for multiple-disk prefetching and buffer management. We define and analyze P-lru, a parallel version of the traditional LRU buffer management algorithm. Unexpectedly, we find that the competitive ratio of P-lru ...

Download

Comprehensive Method for Detecting Phishing EmailsUsing Correlation-based Analysis and User Participation

Proceedings of the Seventh ACM on Conference on Data and Application Security and Privacy

Download

Text Mining

Cybersecurity Analytics

Sixteen Years of Phishing User Studies: What Have We Learned?

ArXiv, 2021

Several previous studies have investigated user susceptibility to phishing attacks. A thorough me... more Several previous studies have investigated user susceptibility to phishing attacks. A thorough meta-analysis or systematic review is required to gain a better understanding of these findings and to assess the strength of evidence for phishing susceptibility of a subpopulation, e.g., older users. We aim to determine whether an effect exists; another aim is to determine whether the effect is positive or negative and to obtain a single summary estimate of the effect. OBJECTIVES: We systematically review the results of previous user studies on phishing susceptibility and conduct a meta-analysis. METHOD: We searched four online databases for English studies on phishing. We included all user studies in phishing detection and prevention, whether they proposed new training techniques or analyzed users’ vulnerability. FINDINGS: A careful analysis reveals some discrepancies between the findings. More than half of the studies that analyzed the effect of age reported no statistically significan...

Download

Unique Normal Forms and Confluence of Rewrite Systems: Persistence

Programming language interpreters, proving theorems of the form A = 2?, abstract data types, and ... more Programming language interpreters, proving theorems of the form A = 2?, abstract data types, and program optimization can all be represented by a finite set of rules called a rewrite system. In this paper, we study two fundamental concepts, uniqueness of normal forms and confluence, for nonlinear systems in the absence of termination. This is a difficult topic with only a few results so far. Through a novel approach, we show that every persistent system has unique normal forms. This result is tight and a substantial generalization of previous work. In the process we derive a necessary and sufficient condition for persistence for the first time and give new classes of persistent systems. We also prove the confluence of the union (function symbols can be shared) of a nonlinear system with a left-linear system under fairly general conditions. Again persistence plays a key role in this proof. We are not aware of any confluence result that allows the same level of function symbol sharing.

Download

Data Mining – Unsupervised Learning

Cybersecurity Analytics

The Laboratory for Rapid Rewriting Version 3 . 01

In this paper, we present the design and performance of the Laboratory for Rapid Rewriting system... more In this paper, we present the design and performance of the Laboratory for Rapid Rewriting system, LRR. Given a convergent or orthogonal rewrite system, R, and a term t, LRR computes the normal form of t whenever it exists. LRR consists of two interpreters: Smaran and TGR, which stands for Term Graph Rewriter. Both Smaran and TGR use term graphs with varying amounts of sharing. Smaran also stores the history of all rule applications in a very efficient data structure. A number of optimizations have been initiated in the implementation of LRR including a preprocessor for rules and the DS-list data structure. We give an overview of how to use the system, its core algorithms, data structures, optimizations and features. The performance of LRR on some benchmarks both favorable and unfavorable is presented and compared with two other interpreters Maude and Elan. 1 Research supported in part by NSF grant CCF 0306475 and DUE 1062954 The Laboratory for Rapid Rewriting Version 3.0 Rakesh M. ...

Download

Less is More: Exploiting Social Trust to Increase the Effectiveness of a Deception Attack

ArXiv, 2020

Cyber attacks such as phishing, IRS scams, etc., still are successful in fooling Internet users. ... more Cyber attacks such as phishing, IRS scams, etc., still are successful in fooling Internet users. Users are the last line of defense against these attacks since attackers seem to always find a way to bypass security systems. Understanding users' reason about the scams and frauds can help security providers to improve users security hygiene practices. In this work, we study the users' reasoning and the effectiveness of several variables within the context of the company representative fraud. Some of the variables that we study are: 1) the effect of using LinkedIn as a medium for delivering the phishing message instead of using email, 2) the effectiveness of natural language generation techniques in generating phishing emails, and 3) how some simple customizations, e.g., adding sender's contact info to the email, affect participants perception. The results obtained from the within-subject study show that participants are not prepared even for a well-known attack - company r...

Download

The CATS Hackathon: Creating and Refining Test Items for Cybersecurity Concept Inventories

ArXiv, 2019

For two days in February 2018, 17 cybersecurity educators and professionals from government and i... more For two days in February 2018, 17 cybersecurity educators and professionals from government and industry met in a "hackathon" to refine existing draft multiple-choice test items, and to create new ones, for a Cybersecurity Concept Inventory (CCI) and Cybersecurity Curriculum Assessment (CCA) being developed as part of the Cybersecurity Assessment Tools (CATS) Project. We report on the results of the CATS Hackathon, discussing the methods we used to develop test items, highlighting the evolution of a sample test item through this process, and offering suggestions to others who may wish to organize similar hackathons. Each test item embodies a scenario, question stem, and five answer choices. During the Hackathon, participants organized into teams to (1) Generate new scenarios and question stems, (2) Extend CCI items into CCA items, and generate new answer choices for new scenarios and stems, and (3) Review and refine draft CCA test items. The CATS Project provides rigorous ...

Download

Experiments in Extractive Summarization: Integer Linear Programming, Term/Sentence Scoring, and Title-driven Models

ArXiv, 2020

In this paper, we revisit the challenging problem of unsupervised single-document summarization a... more In this paper, we revisit the challenging problem of unsupervised single-document summarization and study the following aspects: Integer linear programming (ILP) based algorithms, Parameterized normalization of term and sentence scores, and Title-driven approaches for summarization. We describe a new framework, NewsSumm, that includes many existing and new approaches for summarization including ILP and title-driven approaches. NewsSumm's flexibility allows to combine different algorithms and sentence scoring schemes seamlessly. Our results combining sentence scoring with ILP and normalization are in contrast to previous work on this topic, showing the importance of a broader search for optimal parameters. We also show that the new title-driven reduction idea leads to improvement in performance for both unsupervised and supervised approaches considered.

Download

Clustering for Security Challenges

Proceedings of the ACM International Workshop on Security and Privacy Analytics

Download

An In-Depth Benchmarking and Evaluation of Phishing Detection Research for Security Needs

IEEE Access, 2020

Download

Vulnerability Detection via Multimodal Learning: Datasets and Analysis

Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security

Download

Automated email Generation for Targeted Attacks using Natural Language

ArXiv, 2019

Download

Can Machines Tell Stories? A Comparative Study of Deep Neural Language Models and Metrics

IEEE Access

Download

Phishing Sites Detection from a Web Developer’s Perspective Using Machine Learning

Proceedings of the 53rd Hawaii International Conference on System Sciences, 2020

Download

Statistics

Cybersecurity Analytics, 2019

Claim Verification using a Multi-GAN based Model

Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, 2021

Download

An efficient multiversion access structure

IEEE Transactions on Knowledge and Data Engineering, 1997

Download

Real Cryptographic . . . : Improving Techniques for Proving Undecidability of Checking Security Goals