Delivery of data mining as e-services in the World Wide Web

Date

2005-05

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The field of knowledge discovery and data mining emerged in the recent past as a result of the dramatic evolution of the technology for information storage, access, and analysis. Distributed data mining (DDM) is a result of further evolution of the data mining technology. DDM embraces the growing trend of merging computation with communication.

At the same time, the Internet and the World Wide Web (WWW) have provided a platform where organizations conduct commercial transactions. This was the transformation that led to the onset of electronic commerce (e-commerce). It was the new medium that brought with it the opportunity and ability for doing businesses in a global marketplace. Recently, the rapid strides in research and development in e-commerce are triggering the emergence of the next evolutionary phase, namely, e-service. This model envisages the Internet and WWW evolving from a global arena for selling goods to a virtual marketplace of services, particularly where businesses and organizations conduct their transactions via the Internet.

Today, the goods that are able to be transacted in an e-services modeled (B2B) virtual marketplace are not restricted in real entities such as electronics, furniture, or tickets. They can also be resources such as software, computation abilities, or useful datasets. These resources are potentially able to be sold or rented to clients as e-services. Data mining in conjunction with other business intelligence applications is emerging as intuitively suitable for being delivered as such an e-service, mainly because of the fact that several small to medium range businesses are constrained by the high cost of setting up and maintaining infrastructure of support technologies and software required for business intelligence.

To efficiently and effectively deliver data mining as an e-service in the World Wide Web, Web service technologies are introduced to provide a layer of abstraction above existing software systems. Unlike existing distributed computing systems, Web services are adapted to the Web. The default network protocol is HTTP. With Web services, the communication protocol among e-service agents and service providers is already there, World Wide Web. Web services work at a level of abstraction that is capable of bridging any operating system, hardware platform, or programming language, just as the Web is.

Data mining e-service model requires interactions between clients and e-service agents, as well as between e-service agents and e-service providers. These interactions need to be implemented in a reliable, stable, and scalable way. Java 2 Enterprise Edition (J2EE) provides APIs and design patterns to design sound architectures and result in quality web-based applications, which makes the Internet-based data mining e-service feasible.

This thesis investigates the delivery of Internet based data mining e-services and proposes an architecture supported by Web services and J2EE technologies to address the specific infrastructure requirements imposed on data mining systems by the e-services domain.

Description

Keywords

Java 2 enterprise edition (J2EE), Distribute, Web service, E-services, Data mining

Citation