Clustering is a data mining technique that is used in the following areas:-
A cluster is therefore a collection of objects that are similar to one another and
dissimilar to the objects belonging to other clusters.
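The definition of a cluster above can be illustrated with a small sketch. The text does not name a clustering algorithm; k-means is used here only as one common choice, and the sample points, the starting centroids, and the function name k_means are all assumptions made for the example.

```python
from math import dist  # Euclidean distance, available from Python 3.8

def k_means(points, centroids, iterations=10):
    """Assign each point to its nearest centroid, then move each
    centroid to the mean of the points assigned to it."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        centroids = [
            # Mean of each coordinate; keep the old centroid if a cluster is empty.
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return clusters, centroids

# Hypothetical 2-D sample data: two visibly separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]

# One starting centroid is taken from each group (an assumption for the demo).
clusters, centroids = k_means(points, centroids=[points[0], points[3]])

for cluster, centre in zip(clusters, centroids):
    print(centre, cluster)
```

Points inside each resulting group lie close to one another and far from the points of the other group, which is the similar/dissimilar property the definition describes.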
A decision tree is a structure that includes a root node, branches, and leaf nodes.
Each internal node denotes a test on an attribute, each branch denotes the
outcome of a test, and each leaf node holds a class label. The topmost node in the
tree is the root node.
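The structure described above can be sketched as a nested dictionary. This is only an illustration: the attributes (outlook, humidity, windy), the class labels, and the function name classify are invented for the example, and the tree is written by hand rather than learned from data.

```python
# Hypothetical decision tree. Each internal node tests one attribute,
# each branch is one outcome of that test, and each leaf is a class label.
tree = {
    "attribute": "outlook",          # the topmost node is the root node
    "branches": {
        "sunny": {
            "attribute": "humidity",
            "branches": {"high": "no", "normal": "yes"},
        },
        "overcast": "yes",           # a leaf node holding a class label
        "rainy": {
            "attribute": "windy",
            "branches": {True: "no", False: "yes"},
        },
    },
}

def classify(node, record):
    """Walk from the root to a leaf, following the branch that matches
    the record's value for the attribute tested at each internal node."""
    while isinstance(node, dict):
        outcome = record[node["attribute"]]
        node = node["branches"][outcome]
    return node

record = {"outlook": "sunny", "humidity": "high", "windy": False}
print(classify(tree, record))
```

Classifying a record only requires following one path from the root to a leaf, so the number of tests is bounded by the depth of the tree.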
The benefits of having a decision tree are as follows −
Web mining is generally divided into three types:-
1- Web content mining: - describes the discovery of useful information from
web content. It can be used to mine useful data, information, and knowledge
from the content of web pages.
2- Web structure mining: - helps to find useful knowledge or information patterns
in the structure of hyperlinks. It is concerned with discovering the model
underlying the link structure of the web, and it is used to study the topology of
hyperlinks with or without the descriptions of the links.
3- Web usage mining: - is used for mining web log records (access
information for web pages) and helps to discover users' access patterns for web
pages. A web server registers a web log entry for every web page that is accessed.
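For web structure mining (item 2 above), a minimal sketch of studying hyperlink topology is to extract the links from each page and count how many pages point at each target. The three pages and their HTML are made up for the example, and LinkExtractor and extract_links are hypothetical names; only the standard library html.parser module is used.

```python
from collections import Counter
from html.parser import HTMLParser

class LinkExtractor(HTMLParser):
    """Collect the href value of every <a> tag in a page."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

def extract_links(html):
    parser = LinkExtractor()
    parser.feed(html)
    return parser.links

# Hypothetical site: URL -> page content.
pages = {
    "/home": '<a href="/products">Products</a> <a href="/about">About</a>',
    "/products": '<a href="/home">Home</a> <a href="/about">About</a>',
    "/about": '<a href="/home">Home</a>',
}

# The link graph: which pages each page points to.
link_graph = {url: extract_links(html) for url, html in pages.items()}

# Number of inbound links per page, a very simple structural measure.
inbound = Counter(target for targets in link_graph.values() for target in targets)
print(inbound.most_common())
```

Counting inbound links is only the simplest structural measure; link-analysis algorithms build on the same graph.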
Some of the techniques used to discover and analyze web usage patterns are:-
i) Session and visitor analysis
Session analysis is performed on the preprocessed log data, which includes
records of visitors, days, sessions, etc. This information can be used to
analyze the behavior of visitors.
ii) OLAP (Online Analytical Processing)
OLAP performs multidimensional analysis of complex data.
It can be applied to different parts of the log data over a given interval of
time.
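Both techniques can be sketched on a toy web log. The log records below are invented, the 30-minute session timeout is an assumption (the text gives no rule for splitting sessions), and the counting by day and page only imitates an OLAP-style roll-up with plain Python; it is not an OLAP engine.

```python
from collections import Counter
from datetime import datetime, timedelta

SESSION_TIMEOUT = timedelta(minutes=30)  # assumed gap that starts a new session

# Hypothetical preprocessed log: (visitor, timestamp, page).
log = [
    ("10.0.0.1", "2024-01-01 09:00", "/home"),
    ("10.0.0.1", "2024-01-01 09:05", "/products"),
    ("10.0.0.1", "2024-01-01 11:00", "/home"),
    ("10.0.0.2", "2024-01-01 09:10", "/home"),
    ("10.0.0.2", "2024-01-02 14:00", "/about"),
]

def parse(timestamp):
    return datetime.strptime(timestamp, "%Y-%m-%d %H:%M")

def count_sessions(records):
    """Session analysis: a visitor starts a new session on the first
    request or after a gap longer than SESSION_TIMEOUT."""
    last_seen = {}
    sessions = Counter()
    for visitor, timestamp, _page in sorted(records, key=lambda r: parse(r[1])):
        when = parse(timestamp)
        previous = last_seen.get(visitor)
        if previous is None or when - previous > SESSION_TIMEOUT:
            sessions[visitor] += 1
        last_seen[visitor] = when
    return sessions

sessions = count_sessions(log)

# Multidimensional view: hits per (day, page) cell.
hits_by_day_page = Counter(
    (parse(timestamp).date().isoformat(), page) for _, timestamp, page in log
)

# Roll-up along the page dimension: hits per day.
hits_by_day = Counter()
for (day, _page), hits in hits_by_day_page.items():
    hits_by_day[day] += hits

print(dict(sessions))
print(dict(hits_by_day_page))
print(dict(hits_by_day))
```

The same records answer different questions depending on which dimensions (visitor, day, page) are kept and which are summed away.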
Text mining:- is the process of deriving high-quality information from text.
High-quality information is typically derived by identifying patterns and
trends through means such as statistical pattern learning.
Text mining also involves examining large collections of written resources
to generate new information and to transform unstructured text into
structured data for use in further analysis.
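One simple way to turn unstructured text into structured data is to count how often each term occurs in each document. The sketch below assumes a tiny hand-picked stop-word list and two made-up sentences; term_frequencies is a hypothetical helper, and real text mining pipelines do considerably more (stemming, weighting, and so on).

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"the", "a", "is", "of", "and", "to", "in", "from"}

def term_frequencies(text):
    """Lower-case the text, split it into alphabetic tokens, and count
    every token that is not a stop word."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(token for token in tokens if token not in STOP_WORDS)

documents = [
    "Text mining is the process of deriving information from text.",
    "Data mining finds patterns in structured data.",
]

# One row of term counts per document: structured data ready for analysis.
table = [term_frequencies(document) for document in documents]

for row in table:
    print(dict(row))
```

Each row can then be fed to further analysis, such as the clustering or classification techniques mentioned earlier.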
Areas of text mining
1. Information extraction
2. Natural language processing
3. Data mining
4. Information retrieval