Rule-Based Matching for Real Estate Features Detection
| cic.institucionOrigen | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.isFulltext | SI | |
| cic.isPeerReviewed | SI | |
| cic.lugarDesarrollo | Laboratorio de Investigación y Formación en Informática Avanzada (LIFIA) | |
| cic.parentType | Objeto de conferencia | |
| cic.version | Publicada | |
| dc.date.accessioned | 2026-03-06T14:23:36Z | |
| dc.date.available | 2026-03-06T14:23:36Z | |
| dc.identifier.uri | https://digital.cic.gba.gob.ar/handle/11746/12661 | |
| dc.title | Rule-Based Matching for Real Estate Features Detection | en |
| dc.type | Documento de conferencia | |
| dcterms.abstract | Most of the information about real estate for sale in the Buenos Aires province, Argentina is unstructured, which means that it does not always follow the same format, making extraction a challenging process. Variability in wording, human errors, noise, and incomplete data further complicate the task. Given the large volume of information available, automated techniques are required to transform unstructured text into structured data. This article presents an approach to extract attribute-value pairs from the information contained in the property listings for the province of Buenos Aires, in order to incorporate this data into a knowledge graph. The approach uses pattern-based information extraction for 17 features with an exhaustive evaluation over two datasets: a ground truth labeled by experts and a dataset containing a real-world use case. The results demonstrates accurate values. | en |
| dcterms.creator.author | Ibañez Gutkin, Mateo Agustín | |
| dcterms.creator.author | Pagano, Álvaro A. | |
| dcterms.creator.author | Bazzana Tanevitch, Luciana | |
| dcterms.creator.author | Torres, Diego | |
| dcterms.identifier.other | ISBN: 978-950-34-2583-1 | |
| dcterms.isPartOf.series | XIII Jornadas de Cloud Computing, Big Data & Emerging Topics (La Plata, 24 al 26 de junio de 2025) | |
| dcterms.issued | 2025-06 | |
| dcterms.language | Inglés | |
| dcterms.license | Attribution-NonCommercial-NoDerivatives 4.0 International (BY-NC-ND 4.0) | |
| dcterms.subject | Information Extraction | en |
| dcterms.subject | Rule-based matching | en |
| dcterms.subject | Natural Language Processing | en |
| dcterms.subject | Knowledge Graph Completion | en |
| dcterms.subject.materia | Ciencias de la Computación e Información |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- Documento_completo.pdf-PDFA.pdf (5).pdf
- Tamaño:
- 4.82 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Documento completo
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 3.46 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: