Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A scalable learning algorithm for data-driven program analysis

Full metadata record
DC Field Value Language
dc.contributor.authorCha, Sooyoung-
dc.contributor.authorJeong, Sehun-
dc.contributor.authorOh, Hakjoo-
dc.date.accessioned2021-09-02T02:35:23Z-
dc.date.available2021-09-02T02:35:23Z-
dc.date.created2021-06-19-
dc.date.issued2018-12-
dc.identifier.issn0950-5849-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/71411-
dc.description.abstractContext: Recently data-driven program analysis has emerged as a promising approach for building cost-effective static analyzers. The ideal static analyzer should apply accurate but costly techniques only when they benefit. However, designing such a strategy for real-world programs is highly nontrivial and requires labor-intensive work. The goal of data-driven program analysis is to automate this process by learning the strategy from data through a learning algorithm. Objective: Current learning algorithms for data-driven program analysis are not scalable enough to be used with large codebases. The objective of this paper is to overcome this shortcoming and present a new algorithm that is able to efficiently learn a strategy from large codebases. Method: The key idea is to use an oracle and transform the existing blackbox learning problem into a whitebox one that is much easier to solve. The oracle quantifies the relative importance of each part of the program with respect to the analysis precision. The oracle can be obtained by running the most and least precise analyses only once over the codebase. Results: Our learning algorithm is much faster than the existing algorithms while producing high quality strategies. The evaluation is done with 140 open-source C programs, comprising of 2.1 MLoC in total. Learning at this large scale was previously impractical. Conclusion: Our work advances the state-of-the-art of data-driven program analysis by addressing the scalability issue of the existing learning algorithm. Our technique will make the data-driven approach more practical in the real-world.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherELSEVIER SCIENCE BV-
dc.subjectSTRATEGY-
dc.titleA scalable learning algorithm for data-driven program analysis-
dc.typeArticle-
dc.contributor.affiliatedAuthorOh, Hakjoo-
dc.identifier.doi10.1016/j.infsof.2018.07.002-
dc.identifier.scopusid2-s2.0-85050690366-
dc.identifier.wosid000449138900001-
dc.identifier.bibliographicCitationINFORMATION AND SOFTWARE TECHNOLOGY, v.104, pp.1 - 13-
dc.relation.isPartOfINFORMATION AND SOFTWARE TECHNOLOGY-
dc.citation.titleINFORMATION AND SOFTWARE TECHNOLOGY-
dc.citation.volume104-
dc.citation.startPage1-
dc.citation.endPage13-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.subject.keywordPlusSTRATEGY-
dc.subject.keywordAuthorData-driven program analysis-
dc.subject.keywordAuthorLearning algorithm-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE