Title: Record De-duplication Using Genetic and Hash Algorithm
_____________________________________________________________________________
Author Name: |
Miss. Jagruti Waykole, Prof. Sharmila Shinde |
Abstract: |
In today's world, the increasing volume of information available in digital libraries and e-commerce has become a challenging problem for data administrators. Many systems may be affected by the existence of duplicate entries in their repositories. In this work, a genetic programming approach is used for record deduplication. According to the literature survey, this approach performs better than other approaches. Here, deduplication is done with a hash-based similarity function, i.e., with the MD5 and SHA-1 algorithms. This approach removes duplicate data samples from the system and finds an optimized solution for the deduplication of records or data samples. |
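The abstract does not say how the hash functions are applied to records or how they are combined with genetic programming. As a rough illustration of hash-based duplicate detection only, the sketch below digests a normalized form of selected fields with MD5 or SHA-1 and keeps the first record seen for each digest. The names `record_key` and `deduplicate`, the field names, and the normalization rule (lowercasing and collapsing whitespace) are assumptions made for this example, not details taken from the paper.

```python
import hashlib


def record_key(record, fields, algorithm="md5"):
    """Return a hex digest for the chosen fields of a record.

    Assumption: values are compared after lowercasing and collapsing
    whitespace; the paper does not specify a normalization rule.
    """
    parts = [" ".join(str(record.get(f, "")).lower().split()) for f in fields]
    # Join with a separator that is unlikely to occur in the data, so that
    # ("ab", "c") and ("a", "bc") do not produce the same canonical string.
    canonical = "\x1f".join(parts)
    return hashlib.new(algorithm, canonical.encode("utf-8")).hexdigest()


def deduplicate(records, fields, algorithm="md5"):
    """Keep the first record for each distinct digest, preserving order."""
    seen = set()
    unique = []
    for record in records:
        key = record_key(record, fields, algorithm)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


if __name__ == "__main__":
    # Hypothetical sample data for illustration only.
    sample = [
        {"title": "Data Mining", "author": "J. Han"},
        {"title": " data  mining ", "author": "j. han"},
        {"title": "Databases", "author": "C. Date"},
    ]
    for algo in ("md5", "sha1"):
        print(algo, len(deduplicate(sample, ["title", "author"], algo)))
```

A digest comparison of this kind only catches records that are identical after normalization. Near-duplicates such as misspellings or abbreviated names would need a similarity measure on top, which is presumably where the genetic programming component comes in.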