32nd International Conference on Scientific and Statistical Database Management. July 2020 Article No.: 21. Pages 1–4
Abstract. An important task on land registration processes is to be able to determine the prevalent data provenance for a finalized polygon that represents a cadastral parcel, since the finalized polygon is derived by the examination of a set of initial polygons, drawn from several individual registers (databases). These registers might contain different, partially similar or conflicting information regarding the ownership, usage and polygon geometry of a cadastral parcel. In such cases, the cadastration expert either select one of of the initial geometries, or (in cases none of the initial accurately represents the finalized land parcel) creates a new geometry. Maintaining this provenance information is of high importance for further cadastration and validation/quality assessment processes; however, due to the gradual and long lasting nature of cadastration procedures, this information is absent from large parts of cadastral databases. In this paper, we present an approach for effectively classifying such land parcel polygons with respect to their provenance information. We propose a method that can produce highly accurate provenance recommendations based only on attributes derived from the geometry of a land parcel. In particular, we implement a set of spatial training features, capturing polygon properties and relations. These features are fed into several classification algorithms and are evaluated on a proprietary dataset of a cadastration company.