Training data selection for record linkage classification
This paper presents a new two-step approach for record linkage, focusing on the creation of high-quality training data in the first step. The approach employs the unsupervised random forest model as a similarity measure to produce a similarity score vector for record matching. Three constructions we...
Main Authors: | Zaturrawiah Ali Omar, Zamira Hasanah Zamzuri, Noratiqah Mohd Ariff, Mohd Aftar Abu Bakar |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2023 |
Online Access: | https://eprints.ums.edu.my/id/eprint/42203/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/42203/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/42203/ https://doi.org/10.3390/sym15051060 |
Similar Items
- Short Text Classification Using An Enhanced Term Weighting Scheme And Filter-Wrapper Feature Selection
  by: Alsmadi, Issa Mohammad Ibrahim
  Published: (2018)
- The Application of Facial Expression Recognition in Reducing Inaccuracy in Pain Scale Intensity Identification
  by: Mohd Ariff bin Sulaiman, et al.
  Published: (2021)
- A survey of mat-heuristics for combinatorial optimisation problems: Variants, trends and opportunities
  by: Chong, Man Ngoo, et al.
  Published: (2024)
- ETC training management system / the needs and expectations for training management system in Thailand
  by: Adam, Metinee
  Published: (2009)
- Caputo’s implicit solution of time-fractional diffusion equation using half-sweep AOR iteration
  by: Andang Sunarto, et al.
  Published: (2016)