Selene: Pioneering Automated Proof in Software Verification

Lichen Zhang; Shuai Lu; Nan Duan

doi:10.18653/v1/2024.acl-long.98

Selene: Pioneering Automated Proof in Software Verification

Abstract

Ensuring correctness is a pivotal aspect of software engineering. Among the various strategies available, software verification offers a definitive assurance of correctness. Nevertheless, writing verification proofs is resource-intensive and manpower-consuming, and there is a great need to automate this process. We introduce Selene in this paper, which is the first project-level automated proof benchmark constructed based on the real-world industrial-level operating system microkernel, seL4. Selene provides a comprehensive framework for end-to-end proof generation and a lightweight verification environment. Our experimental results with advanced large language models (LLMs), such as GPT-3.5-turbo and GPT-4, highlight the capabilities of LLMs in the domain of automated proof generation. Additionally, our further proposed augmentations indicate that the challenges presented by Selene can be mitigated in future research endeavors.

Anthology ID:: 2024.luhme-long.98
Volume:: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1776–1789
Language:
URL:: https://aclanthology.org/2024.luhme-long.98/
DOI:: 10.18653/v1/2024.acl-long.98
Bibkey:
Cite (ACL):: Lichen Zhang, Shuai Lu, and Nan Duan. 2024. Selene: Pioneering Automated Proof in Software Verification. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1776–1789, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Selene: Pioneering Automated Proof in Software Verification (Zhang et al., ACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.acl-long.98.pdf

PDF Cite Search Fix data