Rishit Garkhel
http://doi.org/10.37648/ijrst.v10i04.005
Detecting and eliminating duplicate records is one of the major problems in the broad area of data cleaning and data quality. Often, the same logical real-world entity has multiple representations in the data warehouse. Duplicate elimination is hard because duplicates arise from several kinds of errors, such as typographical mistakes and different representations of the same logical value. The main aim of this study is to detect exact and inexact duplicates by using duplicate detection and elimination rules, an approach intended to improve the efficiency of the data. The importance of data accuracy and quality has grown with the explosion in data size. In the duplicate elimination step, only one copy of each set of duplicated records is retained and the other copies are eliminated. The elimination process is essential for producing clean data. Before elimination, similarity threshold values are calculated for all the records in the data set, and these values drive the elimination process.
Keywords: Duplicate record detection; Duplication; data linkage
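The abstract does not specify how similarity is measured or which threshold is used, so the following is only a minimal sketch of threshold-based duplicate elimination under stated assumptions. It treats each record as a single string, uses Python's standard-library difflib.SequenceMatcher as a stand-in similarity measure, and picks 0.85 as an arbitrary threshold. The function names similarity and eliminate_duplicates and the sample records are hypothetical and are not taken from the paper.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; 1.0 means the normalised strings are identical."""
    # Assumption: lower-casing and collapsing whitespace is enough normalisation.
    norm_a = " ".join(a.lower().split())
    norm_b = " ".join(b.lower().split())
    return SequenceMatcher(None, norm_a, norm_b).ratio()


def eliminate_duplicates(records, threshold=0.85):
    """Keep the first record of each group of near-identical records.

    A record is dropped when its similarity to an already kept record
    reaches the threshold (an assumed value, not one from the paper).
    """
    kept = []
    for record in records:
        if all(similarity(record, k) < threshold for k in kept):
            kept.append(record)
    return kept


# Hypothetical sample data: one exact duplicate (after normalisation),
# one typographical variant, and one distinct record.
records = [
    "John Smith, 12 Main Street",
    "Jon Smith, 12 Main Street",
    "JOHN SMITH,  12 Main Street",
    "Alice Wong, 4 Park Avenue",
]
unique = eliminate_duplicates(records)
print(unique)
```

This sketch compares every record against every kept record, which is quadratic in the worst case. It is meant to illustrate the retain-one-copy rule, not to scale to a full data warehouse.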
Disclaimer: Indexing of published papers is subject to the evaluation and acceptance criteria of the respective indexing agencies. While we strive to maintain high academic and editorial standards, International Journal of Research in Science and Technology does not guarantee the indexing of any published paper. Acceptance and inclusion in indexing databases are determined by the quality, originality, and relevance of the paper, and are at the sole discretion of the indexing bodies.