Welcome to Scribd!

Skip carousel

Pattern Discovery 11.3

Uploaded by

Rossana Cunha

0% found this document useful (0 votes)

30 views6 pages

Session 3. Pattern Discovery for Software Bug Mining

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Session 3. Pattern Discovery for Software Bug Mining

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

30 views6 pages

Pattern Discovery 11.3

Uploaded by

Rossana Cunha

Session 3. Pattern Discovery for Software Bug Mining

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 6

Search inside document

Session 3.

Pattern Discovery
for Software Bug Mining

Pattern Discovery for Software Bug Mining

Software is complex, and its runtime data is larger and more complex!
Finding bugs is challenging: Often no clear specifications or properties; need
substantial human efforts in analyzing data
Software reliability analysis
Static bug detection: Check the code
Dynamic bug detection or testing: Run the code
Debugging: Given symptoms or failures, pinpoint the bug locations in the code
Why pattern mining?Code or running sequences contain hidden patterns
Common patterns likely specification or property
Violations (anomalies comparing to patterns) likely bugs
Mining patterns to narrow down the scope of inspection
Code locations or predicates that happen more in failing runs but less in
passing runs are suspicious bug locations

Typical Software Bug Detection Methods

Mining rules from source code

Bugs as deviant behavior (e.g., by statistical analysis)

Mining programming rules (e.g., by frequent itemset mining)

Mining function precedence protocols (e.g., by frequent subsequence mining)

Revealing neglected conditions (e.g., by frequent itemset/subgraph mining)

Mining rules from revision histories

By frequent itemset mining

Mining copy-paste patterns from source code

Find copy-paste bugs (e.g., CP-Miner [Li et al., OSDI04]) (to be discussed here)

Reference: Z. Li, S. Lu, S. Myagmar, Y. Zhou, CP-Miner: A Tool for Finding

Copy-paste and Related Bugs in Operating System Code, OSDI04

Mining Copy-and-Paste Bugs

void __init prom_meminit(void)
Copy-pasting is common
{
12% in Linux file system

19% in X Window system

for (i=0; i<n; i++) {
total[i].adr = list[i].addr;
Copy-pasted code is error-prone
total[i].bytes = list[i].size;
Mine forget-to-change bugs by
total[i].more = &total[i+1];
sequential pattern mining
}

Build a sequence database from source

Code copy-andpasted but forget
code
for (i=0; i<n; i++) {
to change id!
taken[i].adr = list[i].addr;
Mining sequential patterns
taken[i].bytes = list[i].size;
Finding mismatched identifier names &
taken[i].more = &total[i+1];
bugs

Courtesy of Yuanyuan Zhou@UCSD

(Simplified example from linux2.6.6/arch/sparc/prom/memory.c)

Building Sequence Database from Source Code

(mapped to)

Statement number
Tokenize each component
Different operators, constants, key words
different tokens
Same type of identifiers same token
Program A long sequence
Cut the long sequence by blocks

old = 3;
new = 3;
Map a statement
Tokenize
5 61 20
5 61 20
to a number
16
5

Hash

Courtesy of Yuanyuan Zhou@UCSD

Hash values
65
16
16
71

65
16
16
71

for (i=0; i<n; i++) {

total[i].adr = list[i].addr;
total[i].bytes = list[i].size;
total[i].more = &total[i+1];
}

for (i=0; i<n; i++) {

taken[i].adr = list[i].addr;
taken[i].bytes = list[i].size;
taken[i].more = &total[i+1];
}

Final sequence DB:

(65)
(16, 16, 71)

Sequential Pattern Mining & Detecting

Forget-to-Change Bugs
Modification to the sequence pattern mining algorithm
(16, 16, 71)
Constrain the max gap

(16, 16, 10, 71)

Composing Larger Copy-Pasted Segments

Combine the neighboring copy-pasted segments
repeatedly
Find conflicts: Identify names that cannot be mapped to the
corresponding ones
E.g., 1 out of 4 total is unchanged, unchanged ratio =
0.25
If 0 < unchanged ratio < threshold, then report it as a bug
CP-Miner reported many C-P bugs in Linux, Apache, out of
millions of LOC (lines of code)

Allow a maximal gap:

inserting statements
in copy-and-paste

f (a1);
f (a2);
f (a3);

Courtesy of Yuanyuan Zhou@UCSD

conflict

f1 (b1);
f1 (b2);
f2 (b3);

VVV Project
Document16 pages
VVV Project
Rossana Cunha
No ratings yet
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
Document3 pages
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
Rossana Cunha
No ratings yet
cs229 Linalg
Document26 pages
cs229 Linalg
Awais Ali
No ratings yet
cs229 Prob PDF
Document12 pages
cs229 Prob PDF
indoexchange
No ratings yet
Ergonomic Criteria for Evaluating Human-Computer Interfaces
Document83 pages
Ergonomic Criteria for Evaluating Human-Computer Interfaces
Rossana Cunha
No ratings yet
The Impact of Translation Technologies
Document23 pages
The Impact of Translation Technologies
Rossana Cunha
No ratings yet
(Garcia-2009) Beyond Translation Memory Computers and The Profes
Document17 pages
(Garcia-2009) Beyond Translation Memory Computers and The Profes
Rossana Cunha
No ratings yet
Pattern Discovery 11.6
Document2 pages
Pattern Discovery 11.6
Rossana Cunha
No ratings yet
Session 5. A Comparative Study of Three Strategies
Document7 pages
Session 5. A Comparative Study of Three Strategies
Rossana Cunha
No ratings yet
Pattern Discovery 11.5
Document5 pages
Pattern Discovery 11.5
Rossana Cunha
No ratings yet
Pattern Discovery 11.4
Document5 pages
Pattern Discovery 11.4
Rossana Cunha
No ratings yet
Pattern Discovery 11.2
Document7 pages
Pattern Discovery 11.2
Rossana Cunha
No ratings yet
Pattern Discovery 11.1
Document10 pages
Pattern Discovery 11.1
Rossana Cunha
No ratings yet
Pattern Discovery 10.4
Document10 pages
Pattern Discovery 10.4
Rossana Cunha
No ratings yet
Pattern Discovery 10.3
Document6 pages
Pattern Discovery 10.3
Rossana Cunha
No ratings yet
Simultaneously Infer Phrases and Topics
Document4 pages
Simultaneously Infer Phrases and Topics
Rossana Cunha
No ratings yet
Preparing Curriculum For Translator Training Courses
Document4 pages
Preparing Curriculum For Translator Training Courses
Rossana Cunha
No ratings yet
Pattern Discovery 10.1
Document5 pages
Pattern Discovery 10.1
Rossana Cunha
No ratings yet
Coursera - Pattern Discovery in Data Mining - Week 1 Quiz
Document5 pages
Coursera - Pattern Discovery in Data Mining - Week 1 Quiz
Rossana Cunha
No ratings yet
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (838)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Rating: 4.5 out of 5 stars
4.5/5 (537)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5794)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Rating: 4 out of 5 stars
4/5 (98)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Rating: 4 out of 5 stars
4/5 (894)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Rating: 3.5 out of 5 stars
3.5/5 (399)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
Rating: 4 out of 5 stars
4/5 (599)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Rating: 4.5 out of 5 stars
4.5/5 (474)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Rating: 3.5 out of 5 stars
3.5/5 (231)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Rating: 4 out of 5 stars
4/5 (587)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Rating: 4.5 out of 5 stars
4.5/5 (265)
Yes Please
From Everand
Yes Please
Amy Poehler
Rating: 4 out of 5 stars
4/5 (1891)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Rating: 4 out of 5 stars
4/5 (73)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Rating: 4.5 out of 5 stars
4.5/5 (271)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Rating: 4.5 out of 5 stars
4.5/5 (344)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
Rating: 4 out of 5 stars
4/5 (45)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Rating: 4.5 out of 5 stars
4.5/5 (234)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Rating: 3.5 out of 5 stars
3.5/5 (2219)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Rating: 3.5 out of 5 stars
3.5/5 (137)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
Rating: 3.5 out of 5 stars
3.5/5 (738)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
Rating: 4.5 out of 5 stars
4.5/5 (440)
John Adams
From Everand
John Adams
David McCullough
Rating: 4.5 out of 5 stars
4.5/5 (2409)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
Rating: 4.5 out of 5 stars
4.5/5 (806)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Rating: 4 out of 5 stars
4/5 (1090)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
Rating: 4 out of 5 stars
4/5 (1015)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
Rating: 4.5 out of 5 stars
4.5/5 (1712)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
Rating: 4 out of 5 stars
4/5 (1839)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
Rating: 4.5 out of 5 stars
4.5/5 (789)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Rating: 4.5 out of 5 stars
4.5/5 (119)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
Rating: 3.5 out of 5 stars
3.5/5 (792)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
Rating: 3.5 out of 5 stars
3.5/5 (2322)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibin
Rating: 3.5 out of 5 stars
3.5/5 (1937)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
Rating: 4.5 out of 5 stars
4.5/5 (4609)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
Rating: 4 out of 5 stars
4/5 (3811)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
Rating: 4.5 out of 5 stars
4.5/5 (2099)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Rating: 4 out of 5 stars
4/5 (4200)
Little Women
From Everand
Little Women
Louisa May Alcott
Rating: 4 out of 5 stars
4/5 (104)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
Rating: 4.5 out of 5 stars
4.5/5 (1929)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Rating: 4 out of 5 stars
4/5 (1103)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carre
Rating: 3.5 out of 5 stars
3.5/5 (104)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Rating: 4 out of 5 stars
4/5 (821)
Super Market Billing System
Document14 pages
Super Market Billing System
Muzammil Iqbal
No ratings yet
RAC Database From 12.1 To 12.2 - Workshop
Document27 pages
RAC Database From 12.1 To 12.2 - Workshop
zuggo848592
No ratings yet
C in Arduino
Document15 pages
C in Arduino
CYRENE ESPIRITU
No ratings yet
Acknowledgement
Document10 pages
Acknowledgement
Aryan Mehtele
No ratings yet
5 Software Design
Document34 pages
5 Software Design
Sahil Gupta
No ratings yet
DBMS Cep 6
Document11 pages
DBMS Cep 6
GEM FOR GAMES
No ratings yet
ICT - Programming
Document37 pages
ICT - Programming
Sekolah Portal
100% (4)
Introduction To Regular Expressions: Maria Eugenia Inzaugarat
Document52 pages
Introduction To Regular Expressions: Maria Eugenia Inzaugarat
Bé Xíu
No ratings yet
Grover's Algorithm: Quantum Algorithms
Document5 pages
Grover's Algorithm: Quantum Algorithms
Saif Hassan
No ratings yet
Computer Graphics Subject Practicals: Complete Projects in BSIT
Document45 pages
Computer Graphics Subject Practicals: Complete Projects in BSIT
Sayyed Salman Mehdi Mosvi
100% (7)
Magik
Document5 pages
Magik
jgonzales2011
No ratings yet
Resume Indrajeet Singh
Document5 pages
Resume Indrajeet Singh
Nismi Narayanan Kj
No ratings yet
DPR Webinar Resources
Document7 pages
DPR Webinar Resources
Akshaya M -108
No ratings yet
OPEN SOURCE WORKFLOW
Document4 pages
OPEN SOURCE WORKFLOW
Vidya Sathish
No ratings yet
Teste PL SQL
Document12 pages
Teste PL SQL
Sandru Oana
No ratings yet
University of Central Punjab: Objective
Document14 pages
University of Central Punjab: Objective
Infinite Loop
No ratings yet
Function Pointers in VBScript
Document14 pages
Function Pointers in VBScript
api-3817447
50% (2)
Working With Engines and Connections - SQLAlchemy 1.4 Documentation
Document96 pages
Working With Engines and Connections - SQLAlchemy 1.4 Documentation
Leandro Gabriel Kategora
No ratings yet
Assignment 3:: Due 8am On Mon, Oct 28, 2019
Document10 pages
Assignment 3:: Due 8am On Mon, Oct 28, 2019
Nak Parr
No ratings yet
All-Coding-Challenges 20-26
Document7 pages
All-Coding-Challenges 20-26
Graficki Radovi
No ratings yet
Google Web Toolkit
Document297 pages
Google Web Toolkit
Jatin Kacha
No ratings yet
Revision Point - Series
Document5 pages
Revision Point - Series
zahir khan
No ratings yet
TESSY UserManual 40
Document467 pages
TESSY UserManual 40
Chandan B U
No ratings yet
WPF Applications
Document35 pages
WPF Applications
victorkrull
No ratings yet
How Amazon Web Services Uses Formal Methods
Document8 pages
How Amazon Web Services Uses Formal Methods
Cristian Georgiu
No ratings yet
IT4104 Section2
Document51 pages
IT4104 Section2
Chamila Rasangika Seneviratne
No ratings yet
Friend Fuction in C++
Document9 pages
Friend Fuction in C++
usmanahmadawan
No ratings yet
ec1c5223410e9dfcccaa4b90c2efaa8e
Document163 pages
ec1c5223410e9dfcccaa4b90c2efaa8e
sissword
100% (1)
Algorithms and Programming Flowcharts
Document30 pages
Algorithms and Programming Flowcharts
Darius Vishkurti
No ratings yet
11.1 Python Introduction - Course Notes PDF
Document56 pages
11.1 Python Introduction - Course Notes PDF
Zohair Kharal
No ratings yet