Information Extraction to Form Schemas:
- Michael J. Cafarella, Dan Suciu, Oren Etzioni: Navigating Extracted Data
with Schema Discovery. WebDB 2007
http://gemo.futurs.inria.fr/events/WebDB2007/Papers/p47.pdf
- E. Agichtein and L. Gravano. Snowball: Extracting relations from large
plain-text collections. In Procs. of the Fifth ACM
International Conference on Digital Libraries, 2000.
- C. Yu and H. V. Jagadish. Schema summarization. In VLDB, 2006.
- M. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim. Xtract:
A system for extracting document type descriptors from xml documents.
In Proceedings of the 2000 ACM SIGMOD
- J. V. den Bercken, B. Blohsfeld, J.-P. Dittrich, J. Kramer, T. Schafer,
M. Schneider, and B. Seeger.
XXL - a library approach to supporting efficient implementations of advanced
database queries. In Proc. of VLDB, pages 39-48, 2001.
- Ulf Leser, Felix Naumann: (Almost) Hands-Off Information Integration for
the Life Sciences. CIDR 2005: 131-143
http://www.cidrdb.org/cidr2005/papers/P11.pdf