• Nenhum resultado encontrado

Intelligent Decision System Based on PA-AKD Approach of D3M

N/A
N/A
Protected

Academic year: 2017

Share "Intelligent Decision System Based on PA-AKD Approach of D3M "

Copied!
5
0
0

Texto

(1)

Intelligent Decision System Based on

PA-AKD Approach of D3M

A. TEJASWI* J.N.V.V.S. PRAKASH** A. MANASWI*** J.N.V.R. SWARUP KUMAR* G. SRINIVAS****

*

Dept of Information Technology, GITAM University, Visakhapatnam. **

ICWA India. ***

School of Computing Sciences, VIT University, Vellore. ****

Asst. Prof., Dept of Information Technology, GITAM University, Visakhapatnam.

ABSTRACT

Due to the rapid advancement of electronic commerce and web technologies in recent years, the concepts and applications of decision support systems have been significantly extended. The demand placed on data that is loaded into the data warehouse and queried from the data warehouse has forced entrepreneurs and business users to manage the data. But the large commercial transactions have lots of useful knowledge for business decisions in the data warehouse. Through Intelligent Decisions, customers / entrepreneurs are able to make effective decisions concerning the management and direction of their organization. This paper introduces business intelligent decisions that take place from the data-warehouse through the Actionable-Knowledge Discovery (AKD) in Domain Driven Data Mining (D3M for short). The general architecture of D3M for enterprise decisions was proposed and the model storage was presented, and its characteristics would be analyzed. The deliverables of D3M were extracted from the result of data mining and knowledge base. The enterprise decisions and reports could be analyzed through D3M deliverables.

Keywords: Business Intelligence, Intelligent Decision, D3M, PA-AKD, DSS.

1. Introduction

The concept of Business Intelligence (BI) was put forward by Gartner Group in 1996. At that time, business intelligence would be defined a technology and its applications; it was made up of data warehouse(or data mart), inquiring report forms, data analysis, data mining, data backup and recovery components, and its purpose was to help enterprise decision-making[1]. With the exponential growth in the amount of data being collected, improvements in technology, and research in machine learning, retailers are now able to reduce the ever growing difficult and complex decision making process by recruiting the efforts of data mining. Data mining is a computerized technology that uses complicated algorithms to find relationships and trends in large databases, real or perceived, previously unknown to the retailer, to promote decision support. Decision making frames are perspectives or maps used by the decision makers to guide the process. As the name suggests, a frame establishes the boundaries and constraints of the process. Decision making frames greatly influence the decision making process. They can be used to delineate the current situation, to establish what the options are and to improve efficiency by determining which information and issues can be omitted.

The Intelligent Miner also includes an extensive pre-processing library of tools to prepare that data for mining and verification. To improve the data analyst's productivity, these tools can be invoked dynamically, without coding, during the iterative process of preparing, mining and verification. They include data selection, transformation and cleansing. Additionally, a set of interactive visualization tools can be used to bring out unusual features that might otherwise be "drowned out." Business intelligence is a broad category of applications and technologies for gathering, providing access to, and analyzing data for the purpose of helping enterprise users make better business decisions. The term implies having a comprehensive knowledge of all factors that affect a business, such as customers, competitors, business partners, economic environment, and internal operations, therefore enabling optimal decisions to be made. Business Intelligence provides readers with an introduction and practical guide to the mathematical models and analysis methodologies vital to business intelligence.

(2)

Vol.2 (3), 2010, 109-113

System is the DSS Analysis Tool. Among many early views of Decision Support System (DSS), it identifies that DSS is an interactive computer-based system that assists managerial decision makers utilizing data and models to solve semi-structured and unstructured organizational problems from the organizational data-warehouse.

Data Warehouse consists of OLAP query analysis tool, data mining tool, and DSS analysis tool. The Data Warehouse System with these tools can process the huge precious information hidden in Data Warehouse. One quickly emerging research topic is the intelligent decision system that provides functional supports to consumers/entrepreneurs for efficiently and effectively making personalized decisions. Most decision-makers in finance and business commonly use linguistic categories (e.g. low, high, and large) to describe complex relationships in their domain. We therefore ideally need a mechanism to convert ‘raw’ data from a domain (e.g. price, volume and open interest data) into such linguistic symbolic descriptions. We use a relatively simple method based on the use of a clustering algorithm to convert such data into linguistic descriptions. It is on these linguistic descriptions that the D3M will operate on [3].The financial services industry is a marketplace with a significant demand of advanced consumer decision support.

2. Existing Data Mining System Analysis

The designers in the organization should fully consider the data characteristic of specific domain and the particularity of demand of mining when designing the D3M algorithm. The common application system of D3M, mostly use information sources like traditional database , files, email and so on, Data warehouse or various other application sources or large scale relational database. It’s the core part of support system of enterprise decision. The data mining methodologies in D3M had met the entrepreneur needs to some extent. In fact the efficiency of data mining can be improved by several ways.

1) Preprocessing of data sets, remove the noise data, follow the requirements of mining to clean and transfer the data, and reduce the data quantity of mining.

2) According to the requirements of data mining methodologies and data analysis, we need to research and design algorithm with higher efficiency of various data mining.

According to the actual situation and needs of the general application field of business intelligence, with the aim of improve and efficiency of data mining application. This paper will modify and improve the existing data mining system mainly in the areas of Domain Driven Data Mining and architecture [1] .

A) ROLE OF AKD IN D3M

In the real world, data mining is a problem-solving process (R) from business problems Ψ (with problem status τ) to problem-solving solutions Φ [6].

R: Ψ(τ1) →Φ(τ2)

Gradually, data miners realize that the actionability of a discovered pattern must be assessed by and satisfies domain user needs. To achieve business expectations, business interestingness measures to what degree a pattern is of interest to a business person from social, economic, personal and psychoanalytic factors. Recently business objective interest is recognized by some researchers, say profit mining and domain-driven data mining, involving business interests. Moreover, business subjective interest also plays essential roles in assessing business interests. This leads to a comprehensive cognition of actionability. There are two sets of interest measures needed to be calculated when a pattern is extracted. For instance, we say a mined association trading rule is (technically) interesting because it satisfies requests on support and confidence.

In the real-world mining, business interests may differ or conflict technical significance. Clearly, actionable knowledge mining targets patterns confirming the relationship that the pattern satisfies both business expectations as well as technical significance. However, it is a kind of artwork to tune thresholds and balance significance and difference between technical and business interestingness. There are two steps in technical interest evolution. The original focus basically was on technical objective interest, which aims to capture the complexities of pattern structure and statistical significance. Technical subjective measures, which also recognize to what extent a pattern is of interest to a particular user.

(3)

suppose it is identified pattern set Pmn={P1mn, P2mn, … PUmn } includes all patterns discovered in DB, where Pumn

(u=1,…,U) is a pattern discovered by mn.

AKD e,τ,mЄM :DB e,τ,mn Pmn

O e,τ,mЄM Int(P) pЄP

AKD is critical in promoting the productivity of data mining and knowledge discovery for smart business operations and decision-making rules. With regard to AKD approach, the existing work mainly focuses on developing post-analysis techniques to filter/prune rules, reduce redundancy and summarize learned rules. Real-world data mining is a complex problem-solving system. From the view of systems and micro economy, the endogenous character of AKD determines that it is an optimization problem with certain objectives under a particular environment [6].

3. Architecture of Intelligent Decision System based on PA-AKD Approach

PA-AKD is a two-step pattern extraction and refinement exercise. First, generally interesting patterns (which we call ‘general patterns’) are mined from data sets in terms of technical interestingness (to(), ts()) are used. Further, the mined general patterns are pruned, distilled and summarized into operable business rules (embedding actions) (which we call ‘deliverables’) in terms of domain specific business interestingness (bo(), bs()) and involving domain (Ωd ) and meta (Ωm) knowledge. PA-AKD is a two-step optimization problem that can be expressed as follows

PA-AKD: DB e,ti(),m1 P e,bi(),m2,Ωd,Ωm P,R

In Post Analysis, a recent highlight is to extract actions from learned rules. A typical effort on learning action rules is to split attributes into ‘hard/soft’ or ‘stable/ flexible’ to extract actions that may improve the loyalty or profitability of customers. The existing post-analysis and post-mining focuses on association rules or their combination with some specific methods. This limits the actionability of learned actions and the generalization of proposed approaches for AKD. For a pattern p, Int(p) can be further measured in terms of technical interestingness (ti(p)) and business interestingness (bi(p)) .

Int(p) = I(ti(p), bi(p))

The interestingness system, which combines technical interestingness (ti()) with business expectations (bi()) into a Post Analysis AKD interestingness system (i()). Domain knowledge (Ωd) and Environment (e) must be considered in the data mining process. Finally the outputs are P and R. Correspondingly, the actionability of a pattern p is measured by act(p):

act(p)=O pЄ P (Int(p)) →O(αt0 (p))+O( ts (p))+ O( b0(p))+O(δbs(p)) →t0act+tsact+b0act+bsact →tiact + biact

O(.) is the optimization function to extract function to extract those pЄ P , where Int(˜p) can beat a given benchmark. Whereas t0

act , ts

act , bo

act and bs

act

measure the respective actionable performance in terms of each interestingness element. Due to the inconsistency often existing in different aspects, we often find identified patterns only fitting in one of the following sub-sets:

Int(p)→{{ti act, bi act},{¬ti act, bi act}, {ti act, ¬bi act},{¬ti act, ¬bi act}} Where ’¬’ indicates satisfactory [5,6]

(4)

Vol.2 (3), 2010, 109-113

Figure1: Architecture of Intelligent Decision System based on PA-AKD. Facing the business intelligence application is composed of 3 parts as the fig.1 shows.

Data source: Data warehouse and other data marts are the basis of data mining, business intelligence

application systems should have multiple sources of data processing capabilities, such as ordinary documents (E-mail, etc.), relational database, data warehouse, data market and so on.

Analysis and Data layer: Data mining is the core of business intelligence, it’s important for parallel data

mining algorithms to improve the efficiency. In order to adapt the requests of business intelligence applications, it’s far from enough to provide multi-dimensional analysis tools and tools that support various modes as such as possible, involved in associated business intelligence, classification, clustering, time series, D3M tools can support these models, and this is one of the criteria for D3M to measure quality.

Application layer: The deliverables of D3M were extracted from the result of data mining and knowledge base.

The enterprise decisions and reports were analyzed through D3M deliverables.

Domain driven Data Mining has significance and practical value for the improvement of the efficiently of data mining. And it lays a solid basic for the improvement of the efficiently of data mining. What’s more it’s also an important role in the application of business intelligence.

4. D3M Attainability to individual service

D3M for service in individuation is a kind of service modes that supplies different services depending on different customers. It is the best choice when compared to traditional service modes. Individual service on Decision Support System is to supply different services depending on different customers and new application and development of service in individuation. With development of e-commerce, customers need more time and energy to find something interesting and the great quantity of information on internet rather than they do in traditional commerce. In this case, customers/ entrepreneurs need service modes that automatically organize and adjust information according to their demands.

(5)

problem-Domain Intelligence: It assists in understanding and problem-solving of the problem. Domain intelligence consists of qualitative and quantitative intelligence. Both types of intelligence are instantiated in terms of aspects such as domain knowledge, background information and organizational factors.

Network Intelligence: It includes information retrieval, text mining, and web mining. By the method, customer

characteristics visiting a commerce web site can be found according to statistic information and visiting mode. So latent customers can be found and effective commerce measures can be taken.

Human Intelligence: refers to (1) explicit or direct involvement of humans such as empirical knowledge (2)

implicit or indirect involvement of human intelligence such as imaginary thinking, emotional intelligence. We can get a similar customer group according to information in servers. That is to say, it can produce items into a same set explicitly. By the method, marketing strategies can be improved in E-commerce. Automatically send sales mails to special customers implicitly, and when any customer from different groups visit a web site, the web can change its pages for him or her, for example. By the measures, demands of customers can be met in some way to reach marketing targets.

Social Intelligence: Social Intelligence includes collective intelligence, social network analysis, and social

cognition interaction. It not only gives a theory frame, but also helps to manage goods and improve Decision support system services in E-commerce.

Intelligence Metasynthesis The above ubiquitous intelligence has to be combined for the problem solving. The

methodology for combining such intelligence is called metasynthesis, which provides a human-centered and human machine co-operated problem-solving process by involving, synthesizing and using ubiquitous intelligence surrounding AKD as need for problem-solving. By the technology, on the one hand, association between pages files visited by customers in a session can be found by mining logs in a web site. On the other hand, goods can be also found by mining trade transaction databases. These will be helpful to any e-commerce web site to organize page structures and make strategies of marketing.

Conclusion

In this paper we present a system architecture, design for developing consumer/Entrepreneur-oriented intelligent decision support systems and discuss how this approach can be applied to support personalized decision making process in various e-services application domains. In fact, the Domain Driven Data Mining is closely linked to the large potentiality of E-commerce with the future of intelligent decision system. PA-AKD system is then used to extract truly interesting patterns from the database. By using both technical interestingness and business expectations patterns of PA-AKD system, the efficiency of accessing the data from their data warehouse can be efficiently improved. And it will inevitably bring intelligent decision system more extensive application prospect and market value. The value must make enterprise in different level compete and win totally with now, and play a vital role to the core competitiveness, which strengthen enterprises.

References

[1] He YueShun, Ding QiuLin, “Application Research on Data mining Architecture for Intelligent Decision”, IEEE International Conference on Information Processing-2009.

[2] Wenchuan Yang Wujie Zhu Yang Liu Yan De,“The Prototupe Researchof a Web-based DSS Intelligent Agent Overdata Warehouse”, IEEE International Conference on Web Intelligence-2004.

[3] Swan Goonatilake, “Intelligent Hybrid Systems for Financial Decision Making”, ACM-1995.

[4] Chien-Chih Yu, “A Web-Based Consumer-Oriented Intelligent Decision Support System for Personalized E-Services”, International Conference on Electronic Commerce-2004.

[5] Longbing Cao, “Domain Driven Data Mining (D3M)”, IEEE International Conference on Data Mining Workshops-2008.

[6] Longbing Cao, Yanchang Zhao, Huaifeng Zhang, Dan Luo, Chengqi Zhang, E.K. Park, “Flexible Frameworks for Actionable Knowledge Discovery”, Journal of Latex class files, vol. 1, no. 1, Jan- 2008.

[7] Xiong Zhongyan, Research on parallel Data Mining and Application for Business Intelligence, Chongqing university,2004.

[8] Mohammed J.Zaki, Yi Pan. Introduction: Recent Developments in Parallel and Distributed Data Mining. Kluwer Academic Publishers. 2006.

[9] Efrem G. Mallach. Decision support and data warehouse systems, Boston: McGraw-Hill, 2005. 12, 79-80.

[10] Barry Wilkinson, Michael Allen, Parallel Programming: Techniques and Applications Using Networked Worksations and Parallel Computers. 2002.

Referências

Documentos relacionados

The probability of attending school four our group of interest in this region increased by 6.5 percentage points after the expansion of the Bolsa Família program in 2007 and

Na hepatite B, as enzimas hepáticas têm valores menores tanto para quem toma quanto para os que não tomam café comparados ao vírus C, porém os dados foram estatisticamente

didático e resolva as ​listas de exercícios (disponíveis no ​Classroom​) referentes às obras de Carlos Drummond de Andrade, João Guimarães Rosa, Machado de Assis,

Remelted zone of the probes after treatment with current intensity of arc plasma – 100 A, lower bainite, retained austenite and secondary cementite.. Because the secondary

H„ autores que preferem excluir dos estudos de prevalˆncia lesŽes associadas a dentes restaurados para evitar confus‚o de diagn€stico com lesŽes de

Ousasse apontar algumas hipóteses para a solução desse problema público a partir do exposto dos autores usados como base para fundamentação teórica, da análise dos dados

During the discussion of this item at the meeting of the Executive Committee the suggestion was made that the Director, taking into account the comments in any replies he

I have pleasure in replying to your letter concerning the structure and organization of the Pan American Sanitary Conference, which matter was examined by the Executive