Call of Papers for Current Volume **************** OnLine Submission of Paper

Title: Record De-duplication Using Genetic and Hash Algorithm

_____________________________________________________________________________

Title:
Record De-duplication Using Genetic and Hash Algorithm
Author Name:
Miss. Jagruti Waykole, Prof. Sharmila Shinde
Abstract:
In todays world, the increasing volume of information available in digital libraries and e-commerce has become a challenging problem for data administrators. Most of the systems may be affected by the existence of duplicates entries in their repositories. In this, a genetic programming approach is used to record deduplication. This approach performs better than other approaches as found in the literature survey. Here the deduplication is done by hash-based similarity function i.e, with MD5 and SHA-1 algorihtm. This approach removes the duplicate dataset samples in the system and find the optimization solution to deduplication of records or data samples.
Back