DATA WAREHOUSING LAB
Professor: Minsoo Lee
What's Data Warehousing?
Data warehousing is a research area of growing importance in today's IT infrastructure, where enormous amounts of data exist. A data warehouse integrates data from distributed sources by cleansing it (i.e., removing errors), transforming it, and loading it into a repository that supports long-running analytical queries.
Data Warehouse research therefore focuses on (1) the ETL (extraction, transformation, loading) of data into the data warehouse, (2) configuration of the data warehouse, and (3) query processing for the data warehouse.
Data Warehouses can provide huge benefits to companies and organizations that can effectively analyze and make use of the information.
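The ETL flow described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the lab's implementation: the two in-memory source lists, the `sales` table, and the cleansing rules (drop rows with an empty name or a non-numeric amount) are all assumptions made for the example.

```python
import sqlite3

# Hypothetical rows from two source systems; stray whitespace, "n/a",
# and an empty name stand in for the kind of errors cleansing removes.
SOURCE_A = [("  Alice ", "120.50"), ("Bob", "n/a")]
SOURCE_B = [("carol", "75"), ("", "10")]


def extract():
    """Extraction: yield raw (name, amount) rows from every source."""
    for source in (SOURCE_A, SOURCE_B):
        yield from source


def transform(rows):
    """Transformation: normalize names, parse amounts, drop invalid rows."""
    for name, amount in rows:
        name = name.strip().title()
        try:
            value = float(amount)
        except ValueError:
            continue  # non-numeric amount: treat as an error and skip
        if name:
            yield name, value


def load(conn, rows):
    """Loading: write the cleansed rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()


def run_etl():
    """Run the three steps against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract()))
    return conn
```

After `run_etl()`, analytical queries such as `SELECT SUM(amount) FROM sales` run against the integrated repository rather than against the individual sources.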
Research in the DW LAB
Data Warehousing
Materialized Views | To process data warehouse queries efficiently, query results need to be cached as materialized views. The biggest issues are which views to select for materialization and how to refresh them when the underlying data changes. Methods for rewriting queries so that they use the materialized views also need to be investigated. |
Semistructured Data Warehouses | As an increasing amount of data becomes available in the form of XML, data warehouses with special capabilities for handling this type of semistructured data need to be designed and developed. Semistructured data warehouses are the future trend. |
Bioinformatic Data Warehouses | As the field of bioinformatics rapidly gains interest in the research community, there is a need to construct data warehouses that are suited to bioinformatic data. |
Business Intelligence Tools | Business intelligence depends largely on the analytical capabilities of the tools provided to analyze data in a data warehouse. In today's competitive environment, business intelligence tools also need to be fast, easy to use, and Web-based. |
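The materialized-view issues listed above (caching a query result, refreshing it, and rewriting queries to use it) can be illustrated with a small sketch. SQLite has no native materialized views, so the example emulates one with an ordinary summary table; the table names `sales` and `mv_sales_by_region` and the full-recompute refresh strategy are assumptions chosen for illustration, not a description of the lab's methods.

```python
import sqlite3


def build_demo():
    """Create a small base table that stands in for warehouse fact data."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE sales (region TEXT, amount REAL);
        INSERT INTO sales VALUES ('east', 10), ('east', 5), ('west', 7);
    """)
    return conn


def refresh_view(conn):
    """Full refresh: recompute the cached aggregate from the base table."""
    conn.executescript("""
        DROP TABLE IF EXISTS mv_sales_by_region;
        CREATE TABLE mv_sales_by_region AS
            SELECT region, SUM(amount) AS total FROM sales GROUP BY region;
    """)


def total_for(conn, region):
    """Rewritten query: read the cached aggregate instead of scanning sales."""
    row = conn.execute(
        "SELECT total FROM mv_sales_by_region WHERE region = ?", (region,)
    ).fetchone()
    return row[0] if row else 0.0
```

Between refreshes the cached totals can be stale: a row inserted into `sales` is not visible through `total_for` until `refresh_view` runs again. That trade-off between query speed and freshness is what makes view selection and refresh policy research questions.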
Web Infrastructures (Web-Based Information Systems)
Knowledge Networks | Knowledge cannot be easily embedded in today's Internet infrastructure. However, by extending current Web servers with a standard module that processes and connects knowledge, we can make the Internet more intelligent. Rules and events can be used to add knowledge to the Internet. |
Web Information Systems | With the emergence of the Internet, information systems are moving to the Web and treating it as a new platform. 3-tier architectures are becoming more common, so the technologies needed to realize this architecture are essential for moving information systems to the Web. Relevant areas include caching of data in the middle tier, security issues, and data size issues. |
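As one illustration of the middle-tier caching mentioned above, the sketch below puts a small time-to-live (TTL) cache between the Web tier and the data tier. The class name `MiddleTierCache`, the `fetch` callback, and the 30-second default TTL are hypothetical choices for the example; the source does not specify a particular caching scheme.

```python
import time


class MiddleTierCache:
    """Cache data-tier results in the middle tier for a limited time."""

    def __init__(self, fetch, ttl_seconds=30.0, clock=time.monotonic):
        self._fetch = fetch      # function that queries the data tier
        self._ttl = ttl_seconds  # how long a cached value stays valid
        self._clock = clock      # injectable so the cache can be tested
        self._entries = {}       # key -> (time stored, value)

    def get(self, key):
        """Return a fresh cached value, or fetch and cache a new one."""
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]
        value = self._fetch(key)
        self._entries[key] = (now, value)
        return value
```

A request handler would call `cache.get(key)` instead of querying the database directly, so repeated requests within the TTL never reach the data tier. A real deployment would also have to bound the cache size and decide how to invalidate entries when the underlying data changes.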
E-commerce
E-commerce | E-commerce is the next wave of interaction between buyers and sellers. The area that would benefit most is business-to-business (B2B) e-commerce. For this to be realized, platforms for business exchanges, workflow, and CRM need to be developed. |
Mobile Data Warehousing
Mobile Transactions | Mobile users, especially business decision makers, need to connect to and query databases and data warehouses. Connection costs are very high, and mobile users need very fast response times. Special techniques are therefore needed to make queries run faster. |