Number of papers: 11
- Authors: Xia, Chunqiu Steven and Wei, Yuxiang and Zhang, Lingming
- Abstract: Automated Program Repair (APR) aims to help developers automatically patch software bugs. However, current state-of-the-art traditional and learning-based APR techniques face the problem of limited patch variety, failing to fix complicated bugs. This is mainly due to the reliance on bug-fixing datasets to craft fix templates (traditional) or directly predict potential patches (learning-based). Large Pre-Trained Language Models (LLMs), trained using billions of text/code tokens, can potentially h...
- Link: Read Paper
- Labels: code generation, program repair
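A minimal sketch of the cloze-style idea behind applying LLMs to repair, assuming a fill-in-the-middle code model: mask the suspicious line and sample candidate patches. `build_repair_prompt` and `query_llm` below are hypothetical stand-ins for illustration, not this paper's implementation.

```python
# Cloze-style repair sketch: replace the suspicious line with a mask token
# and ask a pre-trained code model to infill candidate patches.

def build_repair_prompt(source_lines, buggy_line_idx, mask="<infill>"):
    """Return the buggy function with the suspicious line masked out."""
    masked = list(source_lines)
    masked[buggy_line_idx] = mask
    return "\n".join(masked)

def query_llm(prompt, num_samples=5):
    # Placeholder: a real system would sample infillings from a code LLM here.
    return ["    if divisor == 0:"] * num_samples

buggy = [
    "def safe_div(x, divisor):",
    "    if divisor == 1:",  # bug: wrong guard condition
    "        return None",
    "    return x / divisor",
]
candidates = query_llm(build_repair_prompt(buggy, 1))
# Each candidate patch is then re-inserted and validated against failing tests.
print(candidates[0])
```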
- Authors: Fan, Zhiyu and Gao, Xiang and Mirchev, Martin and Roychoudhury, Abhik and Tan, Shin Hwei
- Abstract: Large language models, such as Codex, have shown the capability to produce code for many programming tasks. However, the success rate of existing models is low, especially for complex programming tasks. One of the reasons is that language models lack awareness of program semantics, resulting in incorrect programs, or even programs that do not compile. In this paper, we systematically study whether automated program repair (APR) techniques can fix the incorrect solutions produced by language mode...
- Link: Read Paper
- Labels: code generation, program repair
- Authors: Tufano, Rosalia and Pascarella, Luca and Bavota, Gabriele
- Abstract: Transformers have gained popularity in the software engineering (SE) literature. These deep learning models are usually pre-trained through a self-supervised objective, meant to provide the model with basic knowledge about a language of interest (e.g., Java). A classic pre-training objective is the masked language model (MLM), in which a percentage of tokens from the input (e.g., a Java method) is masked, with the model in charge of predicting them. Once pre-trained, the model is then fine-tuned...
- Link: Read Paper
- Labels: general coding task, code model, code model training, source code model, empirical study
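For readers unfamiliar with the masked language model (MLM) objective described in the abstract above, here is a minimal sketch of how training pairs are built from a token stream; the 15% masking rate is the conventional default, not necessarily this paper's setting.

```python
import random

def make_mlm_example(tokens, mask_rate=0.15, mask_token="<mask>", seed=0):
    """Mask a fraction of tokens; the model must predict the originals."""
    rng = random.Random(seed)
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_rate:
            inputs.append(mask_token)
            labels.append(tok)      # prediction target
        else:
            inputs.append(tok)
            labels.append(None)     # no loss on unmasked positions
    return inputs, labels

java_method = "public int add ( int a , int b ) { return a + b ; }".split()
inputs, labels = make_mlm_example(java_method)
print(inputs)
print(labels)
```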
- Authors: Li, Zongjie and Wang, Chaozheng and Liu, Zhibo and Wang, Haoxuan and Chen, Dong and Wang, Shuai and Gao, Cuiyun
- Abstract: Code completion, a highly valuable topic in the software development domain, has been increasingly promoted for use by recent advances in large language models (LLMs). To date, prominent LLM-based code completion frameworks such as GitHub Copilot and GPT are trained using deep learning over vast quantities of unstructured text and open source code. As a paramount component and cornerstone of daily programming tasks, code completion has greatly boosted professionals' efficiency in building re...
- Link: Read Paper
- Labels: code generation, code completion
- Authors: Lemieux, Caroline and Inala, Jeevana Priya and Lahiri, Shuvendu K. and Sen, Siddhartha
- Abstract: Search-based software testing (SBST) generates high-coverage test cases for programs under test with a combination of test case generation and mutation. SBST's performance relies on there being a reasonable probability of generating test cases that exercise the core logic of the program under test. Given such test cases, SBST can then explore the space around them to exercise various parts of the program. This paper explores whether Large Language Models (LLMs) of code, such as OpenAI's Codex, c...
- Link: Read Paper
- Labels: program testing, fuzzing
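The abstract above hinges on seeding search-based testing with inputs that already reach the core logic, then mutating around them. The toy sketch below illustrates that dynamic; the coverage measure and mutation operator are deliberately simplistic stand-ins, and the LLM-suggested seed (105) is hypothetical.

```python
import random

def program_under_test(x):
    """Toy target: returns the set of branches the input exercises."""
    covered = set()
    if x > 100:
        covered.add("big")
        if x % 7 == 0:
            covered.add("big_multiple_of_7")
    else:
        covered.add("small")
    return covered

def sbst(seeds, budget=1000, rng=random.Random(0)):
    """Mutate seed inputs, keeping any mutant that adds branch coverage."""
    corpus, coverage = list(seeds), set()
    for s in corpus:
        coverage |= program_under_test(s)
    for _ in range(budget):
        mutant = rng.choice(corpus) + rng.randint(-10, 10)
        new_cov = program_under_test(mutant)
        if not new_cov <= coverage:
            corpus.append(mutant)
            coverage |= new_cov
    return coverage

# Random seeds rarely reach the rare branch; an LLM-suggested seed near the
# interesting region (here, 105) makes the search around it much easier.
print(sbst([0]))        # stuck on the "small" branch
print(sbst([0, 105]))   # full coverage
```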
- Authors: Hong, Jaemin and Ryu, Sukyoung
- Abstract: Concurrent programs suffer from data races. To prevent data races, programmers use locks. However, programs can eliminate data races only when they acquire and release the correct locks at the correct times. The lock API of C, the language in which a large portion of legacy system programs has been developed, does not validate the correct use of locks. On the other hand, Rust, a recently developed system programming language, provides a lock API that guarantees the correct use of locks via type checking. This ma...
- Link: Read Paper
- Labels: code generation, program transformation
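To make the lock-discipline problem concrete, here is a minimal Python illustration of the hazard the abstract describes: nothing in a C-style lock API (or Python's threading module) stops code from touching shared state without holding the lock, whereas Rust's Mutex only exposes the data through a guard obtained by locking.

```python
import threading

counter = 0
lock = threading.Lock()

def unsafe_increment(n):
    # Nothing forces us to take the lock: this read-modify-write can race.
    global counter
    for _ in range(n):
        counter += 1

def safe_increment(n):
    # Disciplined use: the critical section runs only while the lock is held,
    # and the `with` block guarantees release even on exceptions.
    global counter
    for _ in range(n):
        with lock:
            counter += 1

threads = [threading.Thread(target=safe_increment, args=(10_000,))
           for _ in range(4)]
for t in threads: t.start()
for t in threads: t.join()
print(counter)  # 40000 here; with unsafe_increment, updates may be lost
```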
- Authors: Mahbub, Parvez and Shuvo, Ohiduzzaman and Rahman, Mohammad Masudur
- Abstract: Software bugs claim ≈ 50% of development time and cost the global economy billions of dollars. Once a bug is reported, the assigned developer attempts to identify and understand the source code responsible for the bug and then corrects the code. Over the last five decades, there has been significant research on automatically finding or correcting software bugs. However, there has been little research on automatically explaining the bugs to the developers, which is essential but a highly challen...
- Link: Read Paper
- Labels: static analysis, bug detection
- Authors: Kang, Sungmin and Yoon, Juyeon and Yoo, Shin
- Abstract: Many automated test generation techniques have been developed to aid developers with writing tests. To facilitate full automation, most existing techniques aim to either increase coverage, or generate exploratory inputs. However, existing test generation techniques largely fall short of achieving more semantic objectives, such as generating tests to reproduce a given bug report. Reproducing bugs is nonetheless important, as our empirical study shows that the number of tests added in open source ...
- Link: Read Paper
- Labels: program testing, bug reproduction
- Authors: Griebl, Elisabeth and Fein, Benedikt and Obermüller, Florian and Fraser, Gordon and Just, René
- Abstract: Block-based programming languages like Scratch are increasingly popular for programming education and end-user programming. Recent program analyses build on the insight that source code can be modelled using techniques from natural language processing. Many of the regularities of source code that support this approach are due to the syntactic overhead imposed by textual programming languages. This syntactic overhead, however, is precisely what block-based languages remove in order to simplify pr...
- Link: Read Paper
- Labels: code generation, code completion
- Authors: Nashid, Noor and Sintaha, Mifta and Mesbah, Ali
- Abstract: Large language models trained on massive code corpora can generalize to new tasks without the need for task-specific fine-tuning. In few-shot learning, these models take as input a prompt composed of natural language instructions, a few instances of task demonstration, and a query, and generate an output. However, the creation of an effective prompt for code-related tasks in few-shot learning has received little attention. We present a technique for prompt creation that automatically retrieves c...
- Link: Read Paper
- Labels: general coding task, code model, code model training, prompt strategy, retrieval-augmented generation
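The retrieval step described in the abstract above can be sketched simply: score stored demonstrations against the query, keep the top-k, and splice them into a few-shot prompt. The token-overlap similarity below is a crude stand-in for whatever embedding-based retriever a real system would use, and the demonstration data is invented.

```python
def similarity(a, b):
    """Crude token-overlap score; a real retriever would use embeddings."""
    ta, tb = set(a.split()), set(b.split())
    return len(ta & tb) / max(len(ta | tb), 1)

def build_prompt(instruction, demos, query, k=2):
    """Retrieve the k demonstrations most similar to the query."""
    ranked = sorted(demos, key=lambda d: similarity(d["input"], query),
                    reverse=True)
    parts = [instruction]
    for d in ranked[:k]:
        parts.append(f"Input: {d['input']}\nOutput: {d['output']}")
    parts.append(f"Input: {query}\nOutput:")
    return "\n\n".join(parts)

demos = [
    {"input": "assert add(2, 2) == 5", "output": "assert add(2, 2) == 4"},
    {"input": "assert mul(3, 3) == 6", "output": "assert mul(3, 3) == 9"},
    {"input": "print('hello')", "output": "print('hello world')"},
]
print(build_prompt("Repair the failing assertion.", demos,
                   "assert add(1, 2) == 4"))
```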
- Authors: Nong, Yu and Ou, Yuzhe and Pradel, Michael and Chen, Feng and Cai, Haipeng
- Abstract: Building new, powerful data-driven defenses against prevalent software vulnerabilities requires sizable, quality vulnerability datasets, as does large-scale benchmarking of existing defense solutions. Automatic data generation is a promising way to meet this need, yet little work has aimed to generate the much-needed quality vulnerable samples. Meanwhile, existing similar and adaptable techniques suffer critical limitations for that purpose. In this paper, we present VULGEN, the first injection-based v...
- Link: Read Paper
- Labels: static analysis, bug detection, benchmark