Abstract
Much work has been done in the area of data access and integration using various data mapping, matching, and loading techniques. One of the main concerns when integrating data from heterogeneous data sources is data redundancy. The concern is mainly due to the different business contexts and purposes from which the data systems were originally built. A common process for accessing data from integrated databases involves the use of each data source's own catalogue or metadata schema. In this article, the authors take the view that there is a greater chance of data inconsistencies, such as data redundancies when integrating them within a grid environment as compared to traditional distributed paradigms. The importance of improving the data search and matching process is briefly discussed, and a partial service oriented generic strategy is adopted to consolidate distinct catalogue schemas of federated databases to access information seamlessly. To this end, a proposed matching strategy between structure objects and data values across federated databases in a grid environment is presented.
Original language | English |
---|---|
Pages (from-to) | 51-64 |
Number of pages | 14 |
Journal | International Journal of Grid and High Performance Computing |
Volume | 2 |
Issue number | 4 |
DOIs | |
Publication status | Published - Oct 2010 |
Externally published | Yes |
Keywords
- Data integration
- Grid
- Metadata
- Pattern matching
- Plug-in relations
- Staging dbms