0% found this document useful (0 votes)

6 views

Software Defect Prediction

Uploaded by

sahilverma20652

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Software Defect Prediction

Uploaded by

sahilverma20652

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 14

Software Defect Prediction

Submitted To : Submitted By :

Priya Singh Sahil Verma

Department of Software Department of Software
Engineering, Delhi Technological Engineering, Delhi Technological
University, University,
Delhi, India Delhi, India
priya.singh.academia@gmail.com Sahilverma.1802@gmail.com
INTRODUCTION
 Software Defect Prediction (SDP) is a critical research area in software
engineering focused on identifying software defects before deployment.

 The primary objectives of SDP include improving software quality,

enhancing reliability, and reducing maintenance costs by early detection
of fault components.

 Traditional manual testing and debugging processes have become

challenging due to the growing complexity of software systems

 These models analyze data patterns to classify software components into

defective and non-defective categories.
OBJECTIVES
 SDP is a specialized domain in software engineering focused on
identifying defects in software systems before production.

 The primary goal is to detect code sections more likely to contain

errors.

 Utilizes advanced techniques including: Machine learning,

Statistical analysis.

 Enables more software development practices.

 Provides insights for selecting the best suitable embedding

techniques for SDP tasks.
 Helps create more robust, stable, and high-performing software
applications.
SOFTWARE DEFECT PREDICTION
Software Defect Prediction is essential
for the quality of the software,
reliability, and efficiency of software
systems. It allows identifying early
defects in a software system, which is
becoming increasingly important due
to the increasing complexity of Figure 1. Flowchart of SDP

software applications and demand on

speed for delivery.
DATASET USED
METHODOLOGY
The methodology involves the use
of a large-scale dataset :

 PyTraceBugs Dataset

 Unique Code Study

 Plagiarism Detection in
Documents

 Duplication and Near-Duplication

Detection

The following figures illustrates the

Figure 2. Flowchart for
process followed during this study.
predicting bug
PyTraceBugs Dataset

 Collected the PyTraceBugs dataset at a large-scale, which

consists of over 24,000 buggy snippets and 5.7 million
repaired ones.

 Fine-tune Python-specific BERT embeddings while

classifying buggy and correct code in a binary defect
prediction setup.

 Evaluate model performance based on precision, recall,

and F1-score metrics.
Unique Code Study
 Analyzed Syntactic Redundancy using token-based
measures and Hamming distance.

 Analyzing around 420 million lines of code belonging to

about 6,000 software projects.

 Searched through statistical redundancy metrics to

explore the patterns and similarities.
Plagiarism Detection in
 Reviewed systems such as Turnitin, SafeAssign, and EVE.
Documents
 String matching, tokenization, and heuristic-based
comparisons against proprietary databases have been
used.

 Plagiarism is further detected through similarity scores

and percentage matches.
Duplication and Near-Duplication
Detection
 Used a watermark-based system to identify plagiarized
code in programming assignments.

 Classified students into copier or supplier categories using

binary processing.

 Benchmarked against systems like MOSS to highlight

advantages of direct plagiarism detection.
RESULT
The following pie chart and graph
shows the Distribution of
publications by year and Publications
by different sources.
CONCLUSION
 Machine learning and natural language processing are driving
significant role in software engineering processes.

 A breakthrough in defect prediction due to its large-size, diverse

corpora that address limitations of earlier datasets.

 Innovations in plagiarism detection and text summarization between

NLP and software engineering.

 Research on detecting redundancy and ensuring uniqueness improves

software quality and maintainability.
FUTURE WORK
 Enhanced Defect Detection

 Develop models that better understand code semantics for enhanced

accuracy in plagiarism detection and defect prediction.

 Design computationally efficient algorithms and cloud-based

solutions for large-scale data processing in defect prediction and
redundancy detection.

 Advanced techniques, diverse datasets can significantly improve SDP

models.
THANK-YOU

A novel approach to enhancing software quality assurance through early detection and prevention of software faults
No ratings yet
A novel approach to enhancing software quality assurance through early detection and prevention of software faults
13 pages
Improving Software Development Process Through Data Mining Techniques of Unsupervised Algorithms IJERTV10IS110002
No ratings yet
Improving Software Development Process Through Data Mining Techniques of Unsupervised Algorithms IJERTV10IS110002
4 pages
29thJunePresentation
No ratings yet
29thJunePresentation
15 pages
Software Metrics For Fault Prediction Using Machine Learning Approaches
No ratings yet
Software Metrics For Fault Prediction Using Machine Learning Approaches
5 pages
A Systematic Literature Review On Fault Prediction Performance in Software Engineering
100% (2)
A Systematic Literature Review On Fault Prediction Performance in Software Engineering
7 pages
Software Defect Prediction: A Survey With Machine Learning Approach
No ratings yet
Software Defect Prediction: A Survey With Machine Learning Approach
6 pages
Ahts04 Sandia National Laboratories: Multimodal Deep Learning For Flaw Detection in Software Programs
No ratings yet
Ahts04 Sandia National Laboratories: Multimodal Deep Learning For Flaw Detection in Software Programs
13 pages
A Case Study On Machine Learning Model For Code Re
No ratings yet
A Case Study On Machine Learning Model For Code Re
8 pages
Comprehensive Study On Machine Learning
No ratings yet
Comprehensive Study On Machine Learning
10 pages
DMML
100% (2)
DMML
5 pages
Deep Learning Software Defect Prediction Methods F
No ratings yet
Deep Learning Software Defect Prediction Methods F
11 pages
Defect Prediction-Survey
No ratings yet
Defect Prediction-Survey
14 pages
Study of Predicting Fault Prone Software Modules
No ratings yet
Study of Predicting Fault Prone Software Modules
3 pages
When A Patch Goes Bad Exploring The Properties of
No ratings yet
When A Patch Goes Bad Exploring The Properties of
10 pages
Software Engineering Notes
No ratings yet
Software Engineering Notes
203 pages
Python Code Smells Detection Using Conventional Machine Learning Models
No ratings yet
Python Code Smells Detection Using Conventional Machine Learning Models
21 pages
32 AI Exp2
No ratings yet
32 AI Exp2
5 pages
Applying Machine Learning To Software Fault Prediction: Bartłomiej Wójcicki, Robert Dąbrowski
No ratings yet
Applying Machine Learning To Software Fault Prediction: Bartłomiej Wójcicki, Robert Dąbrowski
18 pages
Open Source Quality Report
No ratings yet
Open Source Quality Report
6 pages
Literature Survey
No ratings yet
Literature Survey
3 pages
SSRN Id4632664
No ratings yet
SSRN Id4632664
39 pages
Researchdemo 1
No ratings yet
Researchdemo 1
11 pages
An Empirical Study On Application of Word Embedding Techniques For Prediction of Software Defect Severity Level
No ratings yet
An Empirical Study On Application of Word Embedding Techniques For Prediction of Software Defect Severity Level
8 pages
A Survey of Different Machine Learning M
No ratings yet
A Survey of Different Machine Learning M
13 pages
Efficient Software Cost Estimation Using Machine Learning Techniques
No ratings yet
Efficient Software Cost Estimation Using Machine Learning Techniques
20 pages
Deep Learning Based Software Defect Prediction
No ratings yet
Deep Learning Based Software Defect Prediction
11 pages
Privacy Preserving Mining in Code Profiling Data: ISSN (ONLINE) : 2250-0758, ISSN (PRINT) : 2394-6962
No ratings yet
Privacy Preserving Mining in Code Profiling Data: ISSN (ONLINE) : 2250-0758, ISSN (PRINT) : 2394-6962
5 pages
1 s2.0 S095058491500052X Main
No ratings yet
1 s2.0 S095058491500052X Main
14 pages
Buffer Overflow
No ratings yet
Buffer Overflow
12 pages
Problem Statement
No ratings yet
Problem Statement
5 pages
Software Development Analytics for Xen Why and How
No ratings yet
Software Development Analytics for Xen Why and How
10 pages
Assessing Personalized Software Defect Predictors
No ratings yet
Assessing Personalized Software Defect Predictors
4 pages
Empirical Study On Bug Prediction
No ratings yet
Empirical Study On Bug Prediction
6 pages
Software Quality Assessment of A Web Application For Biomedical Data Analysis
No ratings yet
Software Quality Assessment of A Web Application For Biomedical Data Analysis
10 pages
Java File Security System (JFSS) Evaluation Using Software Engineering Approaches
No ratings yet
Java File Security System (JFSS) Evaluation Using Software Engineering Approaches
6 pages
Code Metrics
No ratings yet
Code Metrics
10 pages
An Effect of Particle Swarm Optimization On SDLC: Shrishti Tamrakar (M.Tech Scholar), Anubhav Sharma (Asst - Prof.)
No ratings yet
An Effect of Particle Swarm Optimization On SDLC: Shrishti Tamrakar (M.Tech Scholar), Anubhav Sharma (Asst - Prof.)
7 pages
M.Phil Computer Science Knowledge and Data Engineering Projects
No ratings yet
M.Phil Computer Science Knowledge and Data Engineering Projects
2 pages
Fuzzy C Means Method For Cross - Project Software Defect Prediction
No ratings yet
Fuzzy C Means Method For Cross - Project Software Defect Prediction
10 pages
The Automation Revolution in Software Development
No ratings yet
The Automation Revolution in Software Development
9 pages
Deep Learning for Software Defect Prediction- A Survey
No ratings yet
Deep Learning for Software Defect Prediction- A Survey
6 pages
Ieee 2012 Projects Software Engineering at Seabirds (Cochin, Thiruvananthapuram, Mysore, Mangalore, Hubli, Chennai, Trichy)
No ratings yet
Ieee 2012 Projects Software Engineering at Seabirds (Cochin, Thiruvananthapuram, Mysore, Mangalore, Hubli, Chennai, Trichy)
7 pages
Software Engineering What It Is, Definition, Tut
No ratings yet
Software Engineering What It Is, Definition, Tut
2 pages
Database Performance Assessment Using MDA: Abstract
No ratings yet
Database Performance Assessment Using MDA: Abstract
6 pages
A Buffer Overflow Prediction Approach Based On Sof
No ratings yet
A Buffer Overflow Prediction Approach Based On Sof
13 pages
Dead Code Detection
No ratings yet
Dead Code Detection
15 pages
Bug IR
No ratings yet
Bug IR
24 pages
On The Effectiveness of Developer Features in Code Smell Prioritization - A Replication Study
No ratings yet
On The Effectiveness of Developer Features in Code Smell Prioritization - A Replication Study
23 pages
1 s2.0 S2214212623002740 Main
No ratings yet
1 s2.0 S2214212623002740 Main
12 pages
Noddy Nerd Ppt Minor 1 Edited
No ratings yet
Noddy Nerd Ppt Minor 1 Edited
24 pages
Exploring Metaheuristic Optimized Machine Learning
No ratings yet
Exploring Metaheuristic Optimized Machine Learning
45 pages
Comparative Analysis of Software Reliability Prediction Using Machine Learning and Deep Learning
No ratings yet
Comparative Analysis of Software Reliability Prediction Using Machine Learning and Deep Learning
6 pages
Software Defect Prediction Using ML
No ratings yet
Software Defect Prediction Using ML
6 pages
Paper 1
No ratings yet
Paper 1
13 pages
Using Fuzzy Clustering and Software Metrics To Predict Faults in Large Industrial Software Systems
No ratings yet
Using Fuzzy Clustering and Software Metrics To Predict Faults in Large Industrial Software Systems
5 pages
Cop Tse Accepted
No ratings yet
Cop Tse Accepted
21 pages
Introduction To Software Engineering
No ratings yet
Introduction To Software Engineering
31 pages
Malware-Analysis-and-Detection-Using-Machine-Learning-Algorithm
No ratings yet
Malware-Analysis-and-Detection-Using-Machine-Learning-Algorithm
4 pages
Ali Kone
No ratings yet
Ali Kone
6 pages
Defect Prediction in Software Development & Maintainence
From Everand
Defect Prediction in Software Development & Maintainence
Rudra Kumar
No ratings yet
994-0089 D400 Substation Gateway Hardware User Manual v1.30 R12
No ratings yet
994-0089 D400 Substation Gateway Hardware User Manual v1.30 R12
124 pages
Gis Practical No 4 Bscit
No ratings yet
Gis Practical No 4 Bscit
22 pages
Nutanix-NCP-EUC
No ratings yet
Nutanix-NCP-EUC
11 pages
Radio Engineering - Design Exercise 2016 v1.0
No ratings yet
Radio Engineering - Design Exercise 2016 v1.0
22 pages
XY-MBZ55A-YC1155-Bluetooth-5-BR-EDR-BLE-module-Datasheet-20211101
No ratings yet
XY-MBZ55A-YC1155-Bluetooth-5-BR-EDR-BLE-module-Datasheet-20211101
27 pages
Stardom Fcn-Rtu: Low Power Autonomous Controller
No ratings yet
Stardom Fcn-Rtu: Low Power Autonomous Controller
1 page
Chapter 8 User Defined Function
No ratings yet
Chapter 8 User Defined Function
6 pages
White Paper On V Band Final
No ratings yet
White Paper On V Band Final
24 pages
SQL
No ratings yet
SQL
501 pages
Lumion 7 Crack Cgpersia
No ratings yet
Lumion 7 Crack Cgpersia
1 page
Tbs Mambo Radio: Compact All-In-One Remote Control Radio With TBS Tracer System
No ratings yet
Tbs Mambo Radio: Compact All-In-One Remote Control Radio With TBS Tracer System
49 pages
Developing With Angular
100% (2)
Developing With Angular
402 pages
SSRF PPT (1)
No ratings yet
SSRF PPT (1)
8 pages
Bilanciai: D400 Terminal
No ratings yet
Bilanciai: D400 Terminal
72 pages
Ecomtotally Account Setup Steps
No ratings yet
Ecomtotally Account Setup Steps
4 pages
Wavence Ubt Manual Information
No ratings yet
Wavence Ubt Manual Information
8 pages
Edwards, Jonathan - Charity and Its Fruits (New York, 1852)
100% (1)
Edwards, Jonathan - Charity and Its Fruits (New York, 1852)
566 pages
5G&EMF Explained - AMTA - 23aug - 2019 - 20
No ratings yet
5G&EMF Explained - AMTA - 23aug - 2019 - 20
12 pages
WWW Csselectronics Com Pages Lin Bus Protocol Intro Basics
No ratings yet
WWW Csselectronics Com Pages Lin Bus Protocol Intro Basics
18 pages
VSAM Interview Questions and Answers 214
No ratings yet
VSAM Interview Questions and Answers 214
13 pages
Config Guide Trim Op Tim Ization Apo
No ratings yet
Config Guide Trim Op Tim Ization Apo
13 pages
Electrical Safety For Ships, Mobile and Fixed Offshore Platforms
No ratings yet
Electrical Safety For Ships, Mobile and Fixed Offshore Platforms
20 pages
Srx345 Sys JB Datasheet
No ratings yet
Srx345 Sys JB Datasheet
4 pages
[doc] functional
No ratings yet
[doc] functional
162 pages
Ups Software Application en
No ratings yet
Ups Software Application en
30 pages
ADB - Unit - III (Chapter-2) - Query Processing and Decomposition
No ratings yet
ADB - Unit - III (Chapter-2) - Query Processing and Decomposition
42 pages
FCT software list
No ratings yet
FCT software list
29 pages
Form CG-1 Help
No ratings yet
Form CG-1 Help
6 pages
Oracle 11g Business Intelligence Enterprise Edition (OBIEE)
No ratings yet
Oracle 11g Business Intelligence Enterprise Edition (OBIEE)
6 pages
Ironscales - PPT (Not Daily)
No ratings yet
Ironscales - PPT (Not Daily)
16 pages