Eliminating Duplicate Cases in Stata 1

* "h:\000\STATA_doc\DelDupCases.do"
* This is a sample program that eliminates duplicate cases from a dataset.
* The sample data mimics data from CPS.
* People can be interviewed for up to three years.
* This researcher wants to keep every case that was interviewed
* in 2008, plus any case that was interviewed in 2007 or 2009
* but not in 2008, and she wants only one case per person
* even if that person was interviewed in multiple years.
** I have color coded this for you.
** My comments are in green, my commands to Stata are in blue, and the things
** Stata has to tell me are in black.

** Get the data set.

use "h:\000\STATA_doc\test.dta", clear

** Show the data as they are now.

list
+----------------+
| year id v1 |
|----------------|
1. | 2007 1 1 |
2. | 2008 1 2 |
3. | 2009 1 3 |
4. | 2007 2 4 |
5. | 2008 2 5 |
|----------------|
6. | 2009 2 6 |
7. | 2007 3 7 |
8. | 2009 4 8 |
+----------------+

** Create a variable named tyear. If year == 2008 its value is 0;
** otherwise it is 1. This variable is used in the sort
** so that the 2008 case, if one exists, comes first within each id.
gen tyear = 1
replace tyear = 0 if year == 2008
(2 real changes made)
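
** A possible shortcut (an addition, not part of the original program):
** a logical expression evaluates to 1 when true and 0 when false, so
** the flag could be built in one step. The name tyear2 is hypothetical
** and is used only for this comparison.
gen byte tyear2 = (year != 2008)
** assert stops with an error if the two versions ever disagree.
assert tyear2 == tyear
drop tyear2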

** Sort by id, then by tyear, so that cases with the same id sit next to
** each other and any 2008 case comes first within its id.

sort id tyear

** Show the data as they are now.

list
+------------------------+
| year id v1 tyear |
|------------------------|
1. | 2008 1 2 0 |
2. | 2009 1 3 1 |
3. | 2007 1 1 1 |
4. | 2008 2 5 0 |
5. | 2007 2 4 1 |
|------------------------|
6. | 2009 2 6 1 |
7. | 2007 3 7 1 |
8. | 2009 4 8 1 |
+------------------------+
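
** The order of the tyear == 1 rows within an id is not guaranteed,
** because sort breaks ties arbitrarily. That would matter for a person
** interviewed in 2007 and 2009 but not in 2008 (there is none in this
** sample), since either row could end up first and be the one kept.
** One possible fix, offered as a sketch and not as the author's method,
** is to add year as a last sort key so the earliest year comes first:
sort id tyear year
** The stable option is another choice; it keeps tied rows in the order
** they had before the sort:
** sort id tyear, stable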

1 Prepared by Patty Glynn, University of Washington, November 1, 2010. Thanks to Sara Vera for testing this.

** The following command creates a variable named ppdup that holds the
** value of id from the previous case (the observation just before it).
** [_n-1] asks Stata to look at the previous case
** (similar to “lag” in SAS and SPSS).
** The first case has no previous case, so its ppdup is missing.

gen ppdup = id[_n-1]

(1 missing value generated)

** Show the data as they are now.

list
+--------------------------------+
| year id v1 tyear ppdup |
|--------------------------------|
1. | 2008 1 2 0 . |
2. | 2007 1 1 1 1 |
3. | 2009 1 3 1 1 |
4. | 2008 2 5 0 1 |
5. | 2009 2 6 1 2 |
|--------------------------------|
6. | 2007 2 4 1 2 |
7. | 2007 3 7 1 2 |
8. | 2009 4 8 1 3 |
+--------------------------------+
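
** Before dropping anything, it can help to see which cases are flagged.
** This is an optional check, not part of the original program: count
** reports how many cases repeat the previous id, and list shows them.
count if ppdup == id
list year id v1 if ppdup == id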

** Drop each case whose id is the same as the id in the previous case.
** This keeps only the first case for each id.
drop if ppdup == id
(4 observations deleted)

** Show the data as they are now.

list
+--------------------------------+
| year id v1 tyear ppdup |
|--------------------------------|
1. | 2008 1 2 0 . |
2. | 2008 2 5 0 1 |
3. | 2007 3 7 1 2 |
4. | 2009 4 8 1 3 |
+--------------------------------+
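
** An optional check (an addition, not in the original program) that
** each id now appears only once. isid exits with an error if id does
** not uniquely identify the cases or contains missing values;
** duplicates report tabulates how many copies of each id remain.
isid id
duplicates report id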

** All of the key commands, without annotations or “list” commands.

use "h:\000\STATA_doc\test.dta", clear

gen tyear = 1
replace tyear = 0 if year == 2008
sort id tyear
gen ppdup = id[_n-1]
drop if ppdup == id
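
** A more compact variant, offered as a sketch and not as the author's
** method. bysort sorts by id and tyear and treats each id as a group;
** within a group, _n == 1 is the first case, which is the 2008 case
** whenever one exists. No helper variable like ppdup is needed.
use "h:\000\STATA_doc\test.dta", clear
gen byte tyear = (year != 2008)
bysort id (tyear): keep if _n == 1
drop tyear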
