Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values
| cic.institucionOrigen | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.isFulltext | YES | |
| cic.isPeerReviewed | NO | |
| cic.lugarDesarrollo | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.parentType | Conference object | |
| cic.version | Accepted | |
| dc.date.accessioned | 2026-03-20T12:04:08Z | |
| dc.date.available | 2026-03-20T12:04:08Z | |
| dc.identifier.uri | https://digital.cic.gba.gob.ar/handle/11746/12673 | |
| dc.title | Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values | en |
| dc.type | Conference paper | |
| dcterms.abstract | The design and implementation of unit tests is a complex task that many programmers neglect. This research evaluates the potential of Large Language Models (LLMs) in automatically generating test cases, comparing them with manual tests. An optimized prompt was developed that integrates code and requirements, covering critical cases such as equivalence partitions and boundary values. The strengths and weaknesses of LLMs versus trained programmers were compared through quantitative metrics and manual qualitative analysis. The results show that the effectiveness of LLMs depends on well-designed prompts, robust implementation, and precise requirements. Although flexible and promising, LLMs still require human supervision. This work highlights the importance of manual qualitative analysis as an essential complement to automation in unit test evaluation. | en |
| dcterms.creator.author | Rodríguez, Martín | |
| dcterms.creator.author | Rossi, Gustavo Héctor | |
| dcterms.creator.author | Fernández, Alejandro | |
| dcterms.identifier.other | arXiv:2505.09830 | |
| dcterms.identifier.url | https://arxiv.org/abs/2505.09830 | |
| dcterms.isPartOf.series | 13th Conference on Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET 2025) (La Plata, June 24-26, 2025) | |
| dcterms.issued | 2025 | |
| dcterms.language | English | |
| dcterms.license | Attribution-NonCommercial-NoDerivatives 4.0 International (BY-NC-ND 4.0) | |
| dcterms.subject | Evaluation | en |
| dcterms.subject | Unit Testing | en |
| dcterms.subject | LLM | es |
| dcterms.subject.materia | Computer and Information Sciences |
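The abstract above refers to test cases derived from equivalence partitions and boundary values. As an illustrative sketch only (the `classify_age` function, its 0-120 valid range, and the 18-year threshold are hypothetical assumptions, not taken from the paper), a unit test suite covering those cases might look like this in Python:

```python
import unittest


def classify_age(age: int) -> str:
    """Hypothetical function under test: ages 0-120 are valid;
    under 18 is "minor", 18 and over is "adult"."""
    if not 0 <= age <= 120:
        raise ValueError("age out of range")
    return "minor" if age < 18 else "adult"


class ClassifyAgeTests(unittest.TestCase):
    # Equivalence partitions: one representative value per partition.
    def test_minor_partition(self):
        self.assertEqual(classify_age(10), "minor")

    def test_adult_partition(self):
        self.assertEqual(classify_age(40), "adult")

    def test_invalid_partition(self):
        with self.assertRaises(ValueError):
            classify_age(-5)

    # Boundary values: the edges of each partition, where
    # off-by-one defects tend to cluster.
    def test_boundaries(self):
        self.assertEqual(classify_age(0), "minor")    # lower valid bound
        self.assertEqual(classify_age(17), "minor")   # just below threshold
        self.assertEqual(classify_age(18), "adult")   # threshold itself
        self.assertEqual(classify_age(120), "adult")  # upper valid bound
        with self.assertRaises(ValueError):
            classify_age(121)                         # just above upper bound


if __name__ == "__main__":
    unittest.main()
```

Each partition (minor, adult, invalid) is exercised by one representative value, while the boundary tests probe the exact edges of each range.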
Files
Original bundle
- Name: Evaluating Large Language Models.pdf-PDFA.pdf
- Size: 277.24 KB
- Format: Adobe Portable Document Format
- Description: Full document
License bundle
- Name: license.txt
- Size: 3.46 KB
- Description: Item-specific license agreed upon to submission