You are on page 1of 10

Data Mining

A process used to turn raw data into


useful information
PROCESS:
Job Understanding
Data understanding
Data preparation
Process modeling
Process evaluation
Deployment

Data Mining Applications in Govt.

E-government is a modern way that


government department provides services
for the public
E-government means that government
uses modern information and
communication technique, integrate
management and service by network
technique, and realize optimization
recombination of government organization
structure and workflow on the Internet.

Common Uses of Data


Mining

Fraud or non-compliance anomaly


detection CMAD (compliance monitoring
for anomaly detection), primary monitoring
system comparing some predetermined
conditions of acceptance with actual data.
For eg: Credit Card fraud detections
Lie detection
Criminal Investigation and homeland
security examine trends, locations, past
records etc

Data mining techniques


1. Multi-dimensional Cross-table analysis
Step 1: Analysis of the data
Step 2: Define variables for multiple choice
questions; define a variable for each topic.
Step 3: The data file after transformation
(the partial data) is listed.
Step 4: Multi-dimensional cross-table
analysis

Data mining techniques


2. Correlation analysis method
variables X and Y carry on the observation, on a group of data:
xi, yi (i = 1, 2,..., n) , then the correlation coefficient formula is

Where y ,x respectively are the arithmetic average values.


Here | Rxy | 1.
0 < | Rxy | < 1, means X and Y are right relevant
1 < Rxy < 0, then there is negative correlation b/w X and Y
|Rxy| closer to 1, implies that their exists a remarkable linear
relationship b/w X and Y.
Rxy is close to 0, then X and Y are not related
|Rxy| = 1, then X and Y are completely related.

Data mining programs in US


While there are undoubtedly many
classified projects in this category, some
have been disclosed. For example:
Investigative Data Warehouse (IDW).
The FBI describes the IDW as its single
largest repository of operational and
intelligence information; it serves as a
centralized data access point for FBI
agents across the country.

Data mining programs in US

Total Information Awareness (TIA)


The Defense Departments Advanced
Research Projects Agency (DARPA)
began a program after 9/11 to gather
vast amounts of domestic and foreign
data

Data mining programs in US

Secure Flight / CAPPS II.

Also following 9/11, the Federal Aviation


Administration (later the Transportation Security
Administration (TSA)) began work to develop a
replacement for the existing air passenger
screening system (Computer-Assisted Passenger
Prescreening System, or CAPPS) to screen air
passengers for inclusion on watch lists or for
terrorist or criminal threat. The system was
designed to use information acquired from
government sources, airlines, and commercial
data brokers.

Other applications

Bio-informatics
Text mining
Text clustering
Financial and banking sectors
Scientific enquery and research analysis
Corporate surveillance
Medical and healthcare
Marketing and retailing
Bibliomining data mining + bibliometrics + statistics
+ reporting tools, extract patterns of behaviour based
artifacts from library systems

THANK YOU.

You might also like