Managing Inconsistencies in Data Grid Environments: A Practical Approach

Ejaz Ahmed*, Nik Bessis, Peter Norrington, Yong Yue

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

Much work has been done in the area of data access and integration using various data mapping, matching, and loading techniques. One of the main concerns when integrating data from heterogeneous data sources is data redundancy. The concern arises mainly from the different business contexts and purposes for which the data systems were originally built. A common process for accessing data from integrated databases involves the use of each data source's own catalogue or metadata schema. In this article, the authors take the view that there is a greater chance of data inconsistencies, such as data redundancies, when integrating data sources within a grid environment than within traditional distributed paradigms. The importance of improving the data search and matching process is briefly discussed, and a partial, service-oriented generic strategy is adopted to consolidate the distinct catalogue schemas of federated databases so that information can be accessed seamlessly. To this end, a strategy is proposed for matching structure objects and data values across federated databases in a grid environment.

Original language: English
Pages (from-to): 51-64
Number of pages: 14
Journal: International Journal of Grid and High Performance Computing
Volume: 2
Issue number: 4
Publication status: Published - Oct 2010
Externally published: Yes

Keywords

  • Data integration
  • Grid
  • Metadata
  • Pattern matching
  • Plug-in relations
  • Staging DBMS
