Trust but Verify: Benchmarks for Hallucination, Vulnerability, and Style Drift in AI-Generated Code Reviews
Keywords:
AI code reviews, hallucination, vulnerability, style drift, AI verification, benchmarks, software development, coding standards, AI reliability, system security
Abstract
The growing adoption of AI-based code review in software development calls for a thorough understanding of its shortcomings and risks. This paper addresses three key problems that can jeopardize the quality and security of AI-generated code reviews: hallucination, vulnerability, and style drift. Hallucination refers to instances where the AI produces incorrect or irrelevant recommendations; vulnerability concerns the risk of misuse of, or attacks on, the AI system; and style drift denotes a shift away from the coding standards the AI is expected to follow. The primary objective of this study is to establish clear benchmarks for identifying and verifying these issues, thereby improving the accuracy and reliability of AI-mediated code assessments. The central finding is that, in the absence of proper verification, AI-produced code reviews can introduce significant variance in quality. The paper also offers recommendations for improving the reliability of AI systems so that they meet industry requirements.
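To make the style-drift notion concrete, below is a minimal illustrative sketch in Python (not taken from the paper): it compares an AI-suggested snippet against a hypothetical project baseline using crude style signals such as indent width, line length, and naming convention. The BASELINE values, helper names, and thresholds are all assumptions introduced for illustration.

```python
# Minimal sketch (illustrative only): a toy style-drift check comparing
# an AI-suggested snippet against assumed project conventions.
import re

# Hypothetical project baseline; real benchmarks would derive this
# from the existing codebase rather than hard-coding it.
BASELINE = {"indent": 4, "max_line_length": 88, "naming": "snake_case"}

def measure_style(code: str) -> dict:
    """Extract crude style signals from a code snippet."""
    lines = code.splitlines()
    indents = [len(l) - len(l.lstrip(" ")) for l in lines if l.startswith(" ")]
    names = re.findall(r"def\s+(\w+)", code)
    camel = [n for n in names if re.search(r"[a-z][A-Z]", n)]
    return {
        "indent": min(indents) if indents else BASELINE["indent"],
        "max_line_length": max((len(l) for l in lines), default=0),
        "naming": "camelCase" if camel else "snake_case",
    }

def style_drift(snippet: str) -> list[str]:
    """Return the baseline conventions the snippet drifts from."""
    observed = measure_style(snippet)
    drifts = []
    if observed["indent"] != BASELINE["indent"]:
        drifts.append(f"indent {observed['indent']} != {BASELINE['indent']}")
    if observed["max_line_length"] > BASELINE["max_line_length"]:
        drifts.append("line length exceeds project maximum")
    if observed["naming"] != BASELINE["naming"]:
        drifts.append(f"naming {observed['naming']} != {BASELINE['naming']}")
    return drifts

if __name__ == "__main__":
    suggestion = "def computeTotal(x):\n  return x + 1\n"
    # Flags both the 2-space indent and the camelCase function name.
    print(style_drift(suggestion))
```

A production benchmark would replace these heuristics with a linter or formatter diff against the repository's configuration, but the sketch shows the shape of the verification step: measure the suggestion, compare to the project baseline, and report divergences.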
License
Copyright (c) 2023 Well Testing Journal

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
This license requires that re-users give credit to the creator. It allows re-users to distribute, remix, adapt, and build upon the material in any medium or format, for noncommercial purposes only.