Training data selection for record linkage classification
This paper presents a new two-step approach for record linkage, focusing on the creation of high-quality training data in the first step. The approach employs the unsupervised random forest model as a similarity measure to produce a similarity score vector for record matching. Three constructions we...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2023 |
Online Access: | https://eprints.ums.edu.my/id/eprint/42203/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/42203/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/42203/ https://doi.org/10.3390/sym15051060 |