Data Mining Professional

Introduction

The set of information produced by contemporary society is growing ever more rapidly. A large part of this information has the potential to play a strategic role in the business field and to be used for commercial purposes by companies. However, traditional data storage and processing techniques seem incapable of successfully managing a large amount of data that is too large. Data mining is a discipline that is emerging in recent years and that is able to offer adequate tools for analysis based on large amounts of data. A feature of this discipline is that, as the number of available information increases, it is able to discover new knowledge and new information, with rapid and innovative methods.Data mining cannot be considered as a separate science, but a branch that uses instruments of disciplines already existing such as statistical sciences, computer science, marketing.The difference lies in the quantity of data to be processed in the analyzes, and by the tendency to discover, starting from them, relations, regularities and characteristics not known a priori. Finally, all data mining analyzes have a very specific purpose, namely that of producing information of strict economic utility for the management of the company that commissioned them. The world wide web is one of the places where the information produced by our company has the fastest development, in terms of size. The companies immediately realized the economic potential of this information and tried to use it for their own commercial purposes. An example is that of companies that sell products directly online, that can maximize their profit knowing as much information as possible about their customers. In addition to data collected directly, for example by telephone interview or online questionnaires, new information about customers can also be collected indirectly, for example by observing the pages visited by them or the products purchased in a reference period. There are all the prerequisites for using web-oriented data mining techniques, or rather those subject to web-mining study. Once the characteristics of this discipline are known, the methods described in a real case study will be applied. A large amount of data will be analyzed, produced by the conduct of users browsing a website,

 

Data Mining Overview

The society which we live is often called the information society. A large amount of data, growing exponentially year after year, is available to every individual and every organization: these data constitute a potential factor of development in all fields, from economics to science and engineering . In the business world, company and customer data are an essential source of financial strategies: for this reason enormous resources have been devoted to collecting and storing information in recent years. In reality, until now, the full potential of these data has not been exploited for several reasons: for example, they have often been archived without taking into account the purposes for which they were collected. Currently, however, the development of technology in information and methodological research is able to cope with these needs: the most advanced hardware and software tools allow data to be collected and organized so that they are more directly usable, while developments in the sectors are of the IT that of statistics make it possible to have flexible and scalable procedures that are able to analyze large databases and obtain effective summaries and relevant information from them. Some authors define data mining as a discipline that is able to process and extract information from this large mass of data (Kantardzic, 2003). Others take the data mining business objectives into greater consideration (Giudici, 2001). In this view, the aim is to use all the statistical, IT, marketing, etc. resources, alongside the decision-making processes, to derive from the results data to be used for the support of business decisions. We propose a definition of data mining that takes into account both the characteristics of its procedures and their purpose.

 

Data mining is a process of description, selection, synthesis of a large mass of data, to discover in them regularities or relationships not evident a priori, with the aim of obtaining a relevant result for business or business purposes.

 

Data mining is a process of description, selection, synthesis of a large mass of data, to discover in them regularities or relationships not evident a priori, with the aim of obtaining a relevant result for business or business purposes.

Leave a Reply

Your email address will not be published. Required fields are marked *