Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A scalable learning algorithm for data-driven program analysis

Authors
Cha, SooyoungJeong, SehunOh, Hakjoo
Issue Date
12월-2018
Publisher
ELSEVIER SCIENCE BV
Keywords
Data-driven program analysis; Learning algorithm
Citation
INFORMATION AND SOFTWARE TECHNOLOGY, v.104, pp.1 - 13
Indexed
SCIE
SCOPUS
Journal Title
INFORMATION AND SOFTWARE TECHNOLOGY
Volume
104
Start Page
1
End Page
13
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/71411
DOI
10.1016/j.infsof.2018.07.002
ISSN
0950-5849
Abstract
Context: Recently data-driven program analysis has emerged as a promising approach for building cost-effective static analyzers. The ideal static analyzer should apply accurate but costly techniques only when they benefit. However, designing such a strategy for real-world programs is highly nontrivial and requires labor-intensive work. The goal of data-driven program analysis is to automate this process by learning the strategy from data through a learning algorithm. Objective: Current learning algorithms for data-driven program analysis are not scalable enough to be used with large codebases. The objective of this paper is to overcome this shortcoming and present a new algorithm that is able to efficiently learn a strategy from large codebases. Method: The key idea is to use an oracle and transform the existing blackbox learning problem into a whitebox one that is much easier to solve. The oracle quantifies the relative importance of each part of the program with respect to the analysis precision. The oracle can be obtained by running the most and least precise analyses only once over the codebase. Results: Our learning algorithm is much faster than the existing algorithms while producing high quality strategies. The evaluation is done with 140 open-source C programs, comprising of 2.1 MLoC in total. Learning at this large scale was previously impractical. Conclusion: Our work advances the state-of-the-art of data-driven program analysis by addressing the scalability issue of the existing learning algorithm. Our technique will make the data-driven approach more practical in the real-world.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE