skip to main content
article

Multiplex, Fusionplex and Autoplex: three generations of information integration

Published:01 December 2004Publication History
Skip Abstract Section

Abstract

We describe three generations of information integration systems developed at George Mason University. All three systems adopt a virtual database design: a global integration schema, a mapping between this schema and the schemas of the participating information sources, and automatic interpretation of global queries. The focus of Multiplex is rapid integration of very large, evolving, and heterogeneous collections of information sources. Fusionplex strengthens these capabilities with powerful tools for resolving data inconsistencies. Finally, Autoplex takes a more proactive approach to integration, by "recruiting" contributions to the global integration schema from available information sources. Using machine learning techniques it confronts a major cost of integration, that of mapping new sources into the global schema.

References

  1. P. Anokhin and A. Motro. Fusionplex: Resolution of data inconsistencies in the integration of heterogeneous information sources. Tech. Rep. ISETR-03-06. Dept. of Information and Software Engineering, George Mason Univ., 2003.]]Google ScholarGoogle Scholar
  2. J. Berlin and A. Motro. Autoplex: Automated discovery of contents for virtual databases. In Proc. of CoopIS 01, Sixth IFCIS Int. Conf. on Cooperative Information Systems. Lecture Notes in Computer Science No. 2172, pp. 108--122, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Berlin and A. Motro. TupleRank: Ranking discovered content in virtual databases. Tech. Rep. ISE-TR-02-03. Dept. of Information and Software Engineering, George Mason Univ., 2002.]]Google ScholarGoogle Scholar
  4. J. Berlin and A. Motro. Database schema matching using machine learning with feature selection. In Proc. of CAiSE 02, the 14th Int. Conf. on Advanced Information Systems Engineering. Lecture Notes in Computer Science No. 2348, pp. 452--466, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. U. Dayal and H. Hwang. View definition and generalization for database integration in a mutli-database system. IEEE Transactions on Software Engineering SE-10(6), pp. 628--644, 1984.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Doan, P. Domingos, and A. Halevy. Reconciling schemas of disparate data sources: a machine-learning approach. In Proc. of ACM SIGMOD Conf. on Management of Data, pp. 509--520, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. D. Ullman, V. Vassalos, J. Widom. The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems, 8(2), pp. 117--132, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. R. Genesereth, A. M. Keller, and O. M. Duschka. Infomaster: An information integration system. In Proc. of ACM SIGMOD Conf. on Management of Data, pp. 539--542, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Y. Halevy. Answering queries using views: A survey. VLDB Journal, 10(4):270--294, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. A. Knoblock, S. Minton, J. L. Ambite, N. Ashish, I. Muslea, A. Philpot, and S. Tejada. The Ariadne approach to Web-based information integration. Int. Journal of Cooperative Information Systems, 10(1-2):145--169, 2001.]]Google ScholarGoogle ScholarCross RefCross Ref
  11. A. Y. Levy, A. Rajaraman, and J. J. Ordille. Querying heterogeneous information sources using source descriptions. In Proc. of the 22nd Int. Conf. on Very Large Databases, pp. 251--262, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Levy, C. Knoblock, S. Minton, and W. Cohen. Information integration. IEEE Intelligent Systems, 13(5):12--24, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Madhavan, P. A. Bernstein, and E. Rahm. Generic schema matching with Cupid. In Proc. of the 27th Int. Conf. on Very Large Databases, pp. 49--58, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Miller, L. Haas, and M. Hernández. Schema mapping as query discovery. In Proc. of the 26th Int. Conf. on Very Large Databases, pp. 77--88, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Motro. Superviews: Virtual integration of multiple databases. IEEE Transactions on Software Engineering, SE-13(7):785--798, 1987.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Motro. Multiplex: a formal model for multidatabases and its implementation. In Proc. of Fourth Int. Workshop on Next Generation Information Technologies and Systems, Lecture Notes in Computer Science No. 1649, pp. 138--158, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Motro, P. Anokhin and A. C. Acar. Utility-based resolution of data inconsistencies. In Proc. of Int. Workshop on Information Quality in Information Systems (in conjunction with SIGMOD 2004), pp. 35--43, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. N. F. Noy and M. A. Musen. The PROMPT suite: Interactive tools for ontology merging and mapping. International Journal of Human-Computer Studies, 59(6):983--1024, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. E. Rahm and P. Bernstein. On matching schemas automatically. Technical Report MSR-TR-2001-17, Microsoft, 2001.]]Google ScholarGoogle Scholar
  20. E. A. Rundensteiner, A. Koeller, and X. Zhang. Maintaining data warehouses over changing information sources. Communications of the ACM, 43(6):57--62, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Templeton, D. Brill, S. K. Dao, E. Lund, P. Ward, A. L. P. Chen, and R. McGregor. Mermaid - A front-end to distributed heterogeneous databases. In Proc. of the IEEE, 75(5):695--708, 1987.]]Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image ACM SIGMOD Record
    ACM SIGMOD Record  Volume 33, Issue 4
    December 2004
    92 pages
    ISSN:0163-5808
    DOI:10.1145/1041410
    Issue’s Table of Contents

    Copyright © 2004 Authors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 1 December 2004

    Check for updates

    Qualifiers

    • article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader