Professional Documents
Culture Documents
Data Collection
Case Study Diamonds
Diamonds have been treasured as gemstones since their use as
religious icons in ancient India. Their usage in engraving tools
also dates to early human history. The popularity of diamonds
has risen since the 19th century because of increased supply,
improved cutting and polishing techniques, growth in the world
economy, and innovative and successful advertising campaigns.
SI1 Slightly Included - diamonds have noticeable inclusions that are easy to very easy for
a trained grader to see when viewed under 10x magnification. The SI category is divided
SI2 into two grades; SI1 denotes a higher clarity grade than SI2.
VS1 Very Slightly Included - diamonds have minor inclusions that are difficult to somewhat
easy for a trained grader to see when viewed under 10x magnification. The VS category
is divided into two grades; VS1 denotes a higher clarity grade than VS2. Typically the
VS2 inclusions in VS diamonds are invisible without magnification.
VVS1 Very Very Slightly Included - diamonds have minute inclusions that are difficult for a
skilled grader to see under 10x magnification. The VVS category is divided into two
VVS2 grades; VVS1 denotes a higher clarity grade than VVS2.
6
Key messages this week you will
learn...
What is Statistics
What we need for Statistical Analysis
The nature of Statistical Data
The distinction between Population and Sample
Once data are collected, there is the need to organise and present
them in a manner to facilitate their interpretation.
Organisation &
Presentation
of Data
Analysis of
Data Collection of
Data
Conclusion &
Recommendation
People Nature
10
Nature of Statistical Data
Data are categorised in two ways: primary data and
secondary data
Random
selection
13
Lecture Example 1: Diamonds
The buyer wishes to determine the average carat weight, the
most common colour, the overall clarity and the quality of
the cut of diamonds displayed by a major jewellery retailer.
14
Lecture Example 1: Diamonds
The data are described by the following variables:
Carat weight
Cut
Clarity
Colour
Price
15
Lecture Exercise 1: Your Turn (Stop and Consider)
Fifty per cent of teens sext by mobile phone
(August 2, 2015, by Cosima Marriner, Sun-Herald senior writer)
16
Lecture Exercise 1 (ctd)
Most of the 1200 teens
surveyed who had sexted
said they sent the image to
a person with whom they
had a relationship.
Forty per cent had sent a
sext to more than one
person in the past year.
Only six per cent of sexters
reported sending an image
on to a third party for whom
the picture wasn't originally
intended.
17
Lecture Exercise 1 (ctd)
The criminology researchers who conducted the survey said the results
underscored the mismatch between sexting laws, which classify the
practice as child abuse or child pornography and do not distinguish
between consensual and non-consensual sexting, and the reality of teen
sexting.
"Lots of kids are doing it, but not very often, and not with many people,"
Murray Lee, associate professor in criminology at Sydney University, said.
"For the most part sexting is an exploration of their sexuality. Sometimes
it can move into the field of bullying but that's very rare."
But school cybersafety lecturers said boys were regularly badgering girls
to send them sexual images. "Not a day goes by that I don't deal with
girls around Australia under pressure to send these photos," said Susan
McLean, a leading cyber safety expert, and former police officer.
Random
selection
23
Sources of errors in Censuses and
Sample Surveys
Sampling Errors
Suppose we want to find out the average weight of all students of
the University of Canberra.
We may do this by selecting a sample of say 200 students and
finding the average weight of our sample of students. What we get
is an estimate.
We may expect this estimate to be close to the true average weight
of all students. Can we expect this estimate to be exactly equal?
Differences between the estimates based on samples (statistics) and
the true population values (parameters) are called sampling
errors.
24
Sources of errors in Censuses and
Sample Surveys (ctd)
Non-Sampling Errors
Both the census and the sample survey are subject to other errors
called non-sampling errors.
Representativeness
The sample should capture the variation in the target population
in respect of the characteristic or characteristics under
study.
27
Sample Design (ctd)
Sampling Frames
A list from which we select a sample is called a sampling frame.
28
Sample Design Simple Random Sampling (SRS)
Definition
SRS is a method of random selection where every subset of the
chosen sample size has the same chance of selection.
Application of SRS
We need a list of all members of the target population (sampling
frame).
This list is numbered serially (e.g. if the population consists of
1,000 individuals, we number them 1, 2, 3 etc., up to 1,000).
Suppose we need a sample of size 100. We select 100 random
numbers (lying between 1 and 1,000) using Excel.
Example
Suppose we have a population of 1,000 and we need a sample of 75.
Then k=13
30
Sample Design Systematic Sampling (ctd)
Example
Suppose we have a population of 1,000 and we need a sample of
75.
Then k=13
31
Sample Design Stratified Random Sampling
Stratification i.e., the division of the target population into groups must
be on a criterion or on criteria relevant to the survey topic.
E.g., if we are stratifying by education level, we need to know every individuals
education level
33
Sample Design Cluster Sampling
34
Key messages this week you will
learn...
What is Statistics
What we need for Statistical Analysis
The nature of Statistical Data
The distinction between Population and Sample
Please note:
The book is a reference only; its the lecture content
which dictates what you read in the book.
36
Next Week
Presenting data: Tables and Graphs
37