Training data selection for record linkage classification

This paper presents a new two-step approach for record linkage, focusing on the creation of high-quality training data in the first step. The approach employs the unsupervised random forest model as a similarity measure to produce a similarity score vector for record matching. Three constructions we...

Full description

Saved in:
Bibliographic Details
Main Authors: Zaturrawiah Ali Omar, Zamira Hasanah Zamzuri, Noratiqah Mohd Ariff, Mohd Aftar Abu Bakar
Format: Article
Language:English
English
Published: MDPI AG 2023
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/42203/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/42203/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/42203/
https://doi.org/10.3390/sym15051060
Tags: Add Tag
No Tags, Be the first to tag this record!