Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values
| cic.institucionOrigen | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.isFulltext | YES | |
| cic.isPeerReviewed | NO | |
| cic.lugarDesarrollo | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.parentType | Conference object | |
| cic.version | Accepted | |
| dc.date.accessioned | 2026-03-20T12:04:08Z | |
| dc.date.available | 2026-03-20T12:04:08Z | |
| dc.identifier.uri | https://digital.cic.gba.gob.ar/handle/11746/12673 | |
| dc.title | Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values | en |
| dc.type | Conference paper | |
| dcterms.abstract | The design and implementation of unit tests is a complex task that many programmers neglect. This research evaluates the potential of Large Language Models (LLMs) in automatically generating test cases, comparing them with manual tests. An optimized prompt was developed that integrates code and requirements, covering critical cases such as equivalence partitions and boundary values. The strengths and weaknesses of LLMs versus trained programmers were compared through quantitative metrics and manual qualitative analysis. The results show that the effectiveness of LLMs depends on well-designed prompts, robust implementation, and precise requirements. Although flexible and promising, LLMs still require human supervision. This work highlights the importance of manual qualitative analysis as an essential complement to automation in unit test evaluation. | en |
| dcterms.creator.author | Rodríguez, Martín | |
| dcterms.creator.author | Rossi, Gustavo Héctor | |
| dcterms.creator.author | Fernández, Alejandro | |
| dcterms.identifier.other | arXiv:2505.09830 | |
| dcterms.identifier.url | https://arxiv.org/abs/2505.09830 | |
| dcterms.isPartOf.series | 13th Conference on Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET 2025) (La Plata, June 24-26, 2025) | |
| dcterms.issued | 2025 | |
| dcterms.language | English | |
| dcterms.license | Attribution-NonCommercial-NoDerivatives 4.0 International (BY-NC-ND 4.0) | |
| dcterms.subject | Evaluation | en |
| dcterms.subject | Unit Testing | en |
| dcterms.subject | LLM | es |
| dcterms.subject.materia | Computer and Information Sciences |
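The abstract above refers to test cases derived from equivalence partitions and boundary values. As an illustrative sketch only (the `classify_age` function, its 0-120 valid range, and the 18-year threshold are hypothetical assumptions, not taken from the paper), a unit test suite covering those cases might look like this in Python:

```python
import unittest


def classify_age(age: int) -> str:
    """Hypothetical function under test: ages 0-120 are valid;
    under 18 is "minor", 18 and over is "adult"."""
    if not 0 <= age <= 120:
        raise ValueError("age out of range")
    return "minor" if age < 18 else "adult"


class ClassifyAgeTests(unittest.TestCase):
    # Equivalence partitions: one representative value per partition.
    def test_minor_partition(self):
        self.assertEqual(classify_age(10), "minor")

    def test_adult_partition(self):
        self.assertEqual(classify_age(40), "adult")

    def test_invalid_partition(self):
        with self.assertRaises(ValueError):
            classify_age(-5)

    # Boundary values: the edges of each partition, where
    # off-by-one defects tend to cluster.
    def test_boundaries(self):
        self.assertEqual(classify_age(0), "minor")    # lower valid bound
        self.assertEqual(classify_age(17), "minor")   # just below threshold
        self.assertEqual(classify_age(18), "adult")   # threshold itself
        self.assertEqual(classify_age(120), "adult")  # upper valid bound
        with self.assertRaises(ValueError):
            classify_age(121)                         # just above upper bound


if __name__ == "__main__":
    unittest.main()
```

Each partition (minor, adult, invalid) is exercised by one representative value, while the boundary tests probe the exact edges of each range.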
Files
Original bundle
- Name: Evaluating Large Language Models.pdf-PDFA.pdf
- Size: 277.24 KB
- Format: Adobe Portable Document Format
- Description: Full document
License bundle
- Name: license.txt
- Size: 3.46 KB
- Description: Item-specific license agreed upon to submission