Rule-Based Matching for Real Estate Features Detection

cic.institucionOrigenLaboratorio de Investigación y Formación en Informática Avanzada (LIFIA)
cic.isFulltextSI
cic.isPeerReviewedSI
cic.lugarDesarrolloLaboratorio de Investigación y Formación en Informática Avanzada (LIFIA)
cic.parentTypeObjeto de conferencia
cic.versionPublicada
dc.date.accessioned2026-03-06T14:23:36Z
dc.date.available2026-03-06T14:23:36Z
dc.identifier.urihttps://digital.cic.gba.gob.ar/handle/11746/12661
dc.titleRule-Based Matching for Real Estate Features Detectionen
dc.typeDocumento de conferencia
dcterms.abstractMost of the information about real estate for sale in the Buenos Aires province, Argentina is unstructured, which means that it does not always follow the same format, making extraction a challenging process. Variability in wording, human errors, noise, and incomplete data further complicate the task. Given the large volume of information available, automated techniques are required to transform unstructured text into structured data. This article presents an approach to extract attribute-value pairs from the information contained in the property listings for the province of Buenos Aires, in order to incorporate this data into a knowledge graph. The approach uses pattern-based information extraction for 17 features with an exhaustive evaluation over two datasets: a ground truth labeled by experts and a dataset containing a real-world use case. The results demonstrates accurate values.en
dcterms.creator.authorIbañez Gutkin, Mateo Agustín
dcterms.creator.authorPagano, Álvaro A.
dcterms.creator.authorBazzana Tanevitch, Luciana
dcterms.creator.authorTorres, Diego
dcterms.identifier.otherISBN: 978-950-34-2583-1
dcterms.isPartOf.seriesXIII Jornadas de Cloud Computing, Big Data & Emerging Topics (La Plata, 24 al 26 de junio de 2025)
dcterms.issued2025-06
dcterms.languageInglés
dcterms.licenseAttribution-NonCommercial-NoDerivatives 4.0 International (BY-NC-ND 4.0)
dcterms.subjectInformation Extractionen
dcterms.subjectRule-based matchingen
dcterms.subjectNatural Language Processingen
dcterms.subjectKnowledge Graph Completionen
dcterms.subject.materiaCiencias de la Computación e Información

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
Documento_completo.pdf-PDFA.pdf (5).pdf
Tamaño:
4.82 MB
Formato:
Adobe Portable Document Format
Descripción:
Documento completo

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
3.46 KB
Formato:
Item-specific license agreed upon to submission
Descripción: