Data Processing

Data Editing

Data editing took place at a number of stages throughout the processing (see Other Processing), including:
a) Office editing and coding
b) During data entry
c) Structure and completeness checking
d) Secondary editing
e) Structural checking of SPSS data files

Detailed documentation of the editing of data can be found in the data processing guidelines.

Other Processing

Data were processed in clusters, with each cluster being processed as a complete unit through each stage of data processing. Each cluster went through the following steps:
1) Questionnaire reception
2) Office editing and coding
3) Data entry
4) Structure and completeness checking
5) Verification entry
6) Comparison of verification data
7) Back up of raw data
8) Secondary editing
9) Edited data back up
After all clusters were processed, the data were concatenated and the following steps were completed for all data files:
10) Export to SPSS in 5 files (hh - household, hl - household members, wm - women, bh - birth history, ch - children under 5)
11) Recoding of variables needed for analysis
12) Adding of sample weights
13) Calculation of wealth quintiles and merging into data
14) Structural checking of SPSS files
15) Data quality tabulations
16) Production of analysis tabulations

Details of each of these steps can be found in the data processing documentation, data editing guidelines, data processing programs in CSPro and SPSS, and tabulation guidelines.

Data entry was carried out by 11 data entry operators and 1 data entry supervisor. To ensure quality control, all data were double entered and internal consistency checks were performed. Procedures and standard programs developed under the global MICS3 project and adapted to the Yemen questionnaire were used throughout. Data processing began after data collection had been conducted in October 2006 and was completed in December 2006. All range checks and skips were controlled by the program, and operators could not override them. A limited set of consistency checks was also included in the data entry program. Open-ended responses ("Other" answers) were not entered or coded, except in rare circumstances where the response matched an existing code in the questionnaire.
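The range and skip controls described above can be illustrated with a minimal sketch. It is written in Python rather than CSPro, and the field names, valid codes and skip rule are hypothetical; the actual rules are defined in the CSPro data entry application.

```python
# Hypothetical field specifications, for illustration only.
FIELD_RANGES = {
    "age": range(0, 96),        # assumed valid ages 0-95
    "attends_school": (1, 2),   # assumed codes: 1 = yes, 2 = no
    "grade": range(1, 13),      # assumed grades 1-12
}
FIELD_ORDER = ["age", "attends_school", "grade"]


def check_range(field, value):
    """Return True if value is an allowed code for the field."""
    return value in FIELD_RANGES[field]


def next_field(field, value):
    """Return the next field to key, or None when the record is finished.

    Applies one illustrative skip: "grade" is skipped when the person
    does not attend school.
    """
    if field == "attends_school" and value == 2:
        return None
    idx = FIELD_ORDER.index(field)
    return FIELD_ORDER[idx + 1] if idx + 1 < len(FIELD_ORDER) else None
```

Because the entry program, not the operator, decides whether a value is accepted and which field comes next, an out-of-range code or a missed skip cannot reach the data file.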

Structure and completeness checking ensured that all questionnaires for the cluster had been entered, were structurally sound, and that women's and children's questionnaires existed for each eligible woman and child.
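As a rough illustration of the completeness part of this check, the sketch below compares the eligible women and children listed in each household with the individual questionnaires actually entered. The data layout and the function name are assumptions made for this example; they do not reproduce the CSPro programs used in the survey.

```python
def check_cluster(households, women, children, expected_households):
    """Return a list of completeness problems for one cluster.

    households: dict mapping household id to a dict with the keys
        "eligible_women" and "eligible_children" (lists of line numbers)
    women, children: sets of (household id, line number) pairs for the
        individual questionnaires that were entered
    expected_households: number of household questionnaires received
    """
    problems = []
    if len(households) != expected_households:
        problems.append(
            f"{len(households)} households entered, {expected_households} expected"
        )
    for hh_id in sorted(households):
        hh = households[hh_id]
        for line in hh["eligible_women"]:
            if (hh_id, line) not in women:
                problems.append(f"household {hh_id}: no women's questionnaire for line {line}")
        for line in hh["eligible_children"]:
            if (hh_id, line) not in children:
                problems.append(f"household {hh_id}: no children's questionnaire for line {line}")
    return problems
```

An empty list means the cluster passes this check; any message would send the cluster back for correction before verification entry.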

All variables were 100% verified using independent verification, i.e. double entry of the data. The two datasets were compared separately, and one or both were then modified to correct keying errors. Corrections were made by the operators who had originally keyed the files.
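The comparison step of double entry can be sketched as follows. This is a simplified Python illustration, assuming each entry pass is held as a dictionary of records keyed by a questionnaire identifier; the survey itself used CSPro for this comparison.

```python
def compare_entries(first, second):
    """Return a list of (record_id, field, first_value, second_value)
    tuples, one for every field where the two entry passes disagree.

    first, second: dicts mapping record id to a dict of field values.
    A record or field missing from one pass is reported with None.
    """
    diffs = []
    for rec_id in sorted(set(first) | set(second)):
        a = first.get(rec_id, {})
        b = second.get(rec_id, {})
        for field in sorted(set(a) | set(b)):
            if a.get(field) != b.get(field):
                diffs.append((rec_id, field, a.get(field), b.get(field)))
    return diffs
```

Each reported difference still has to be resolved against the paper questionnaire, since the comparison alone cannot tell which of the two passes holds the keying error.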

After completion of all processing in CSPro, all individual cluster files were backed up before concatenating data together using the CSPro file concatenate utility.

For tabulation and analysis, SPSS versions 10.0 and 14.0 were used. Version 10.0 was originally used for all tabulation programs except child mortality. Later, version 14.0 was used for child mortality, data quality tabulations and other analysis activities.

After transferring all files to SPSS, certain variables were recoded for use as background characteristics in the tabulation of the data, including the grouping of age, education and geographic areas as needed for analysis. In the process of recoding ages and dates, some random imputation of dates (within calculated constraints) was performed to handle missing or "don't know" ages or dates. Additionally, a wealth (asset) index of household members was calculated using principal components analysis, based on household assets, and both the score and quintiles were included in the datasets for use in tabulations.
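The idea of random imputation within calculated constraints can be illustrated with a short sketch. It assumes dates are held as century month codes (months counted from January 1900 = 1), a representation commonly used in household surveys; whether the Yemen SPSS programs work exactly this way is an assumption, and the function names are hypothetical.

```python
import random


def to_cmc(year, month):
    """Convert a year and month to a century month code (January 1900 = 1)."""
    return (year - 1900) * 12 + month


def from_cmc(cmc):
    """Convert a century month code back to (year, month)."""
    return 1900 + (cmc - 1) // 12, (cmc - 1) % 12 + 1


def impute_birth_cmc(birth_year, interview_cmc, rng):
    """Impute a missing birth month when only the birth year is known.

    The result is drawn uniformly between January of the birth year and
    the earlier of December of that year or the interview month, so an
    imputed birth can never fall after the interview.
    """
    lower = to_cmc(birth_year, 1)
    upper = min(to_cmc(birth_year, 12), interview_cmc)
    if lower > upper:
        raise ValueError("birth year is after the interview date")
    return rng.randint(lower, upper)


# Example: child born in 2006, month unknown, interviewed in October 2006.
rng = random.Random(2006)  # fixed seed so the imputation is reproducible
imputed = impute_birth_cmc(2006, to_cmc(2006, 10), rng)
```

Other known information, such as a reported age, would narrow the lower and upper bounds further before the random draw is made.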

Scripts/Programs

Secondary data processing programs - SPSS, PAPFAM, Strategic Information Section, Division of Policy and Planning (DPP), UNICEF NYHQ, 2007-01-01, English [eng], Yemen [yem]
Contributor(s): Strategic Information Section, Division of Policy and Planning (DPP), UNICEF NYHQ
Data Processing\Yemen MICS Syntax.zip

Data Entry Programme, PAPFAM, Strategic Information Section, Division of Policy and Planning (DPP), UNICEF NYHQ, 2006-10-01, English [eng], Yemen [yem]
Contributor(s): Strategic Information Section, Division of Policy and Planning (DPP), UNICEF NYHQ
Data Processing\CSPRO.zip

Generated: MAY-27-2009 using the IHSN Microdata Management Toolkit