Documento de conferencia
Acceso Abierto

Evaluating Information Extraction Approaches in the Construction of a Real Estate Observatory

Enlace externo
Resumen

A real estate observatory plays a significant role in the aggregation and analysis of real estate market data. The information that lies in real estate advertisements can be leveraged to populate such an observatory. However, this data can present itself in both a structured and an unstructured manner. Unstructured data represents a problem to automatically process and extract information since it lacks a predefined structure. Thus, there’s a need for techniques to give structure to unstructured data. Information Extraction (IE) is the process of structuring data from unstructured data. Natural Language Processing techniques enable machines to understand texts, making them particularly significant in the context of IE. This work evaluates both rule-based and machine-learning based IE approaches to extract features from real estate descriptions within advertisements. Those features are relevant in the context of real estate observatory construction. The performance of each approach is measured using precision, recall and f1-score metrics.

Palabras clave
Information Extraction
Natural Language Processing
Real Estate Observatory
http://creativecommons.org/licenses/by-nc-nd/4.0/

Esta obra se publica con la licencia Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (BY-NC-ND 4.0)

item.page.license
Cargando...
Miniatura