Data Generation Process Workshop
Yue Hu
Tsinghua University
Download the pool of surveys from the DCPOtools website
Have a look at the downloaded spreadsheet
Freeze Panes from Window
country_var is empty)
archive: dataverse, gesis, icpsr, roper, or ukds labels
dataverse: use the data_link to access the questionnaire
file_id in the search box
icpsr) or “Studies/Datasets” (roper)



Download the template from the DCPOtools website.
survey: survey in the surveys_data spreadsheet;
variable: The question index, e.g., “q56,” “v122”;
question_text: The complete sentences read to the people taking the survey, or as close to that as you can find;
response_categories: The number and the label of each of the options, e.g., “1. Strongly agree, 2. Agree, 3. Neither agree nor disagree, 4. Disagree, 5. Strongly disagree”.
If you’re sure there are no relevant questions in the survey, enter the survey and put “NA” under variable, and move on.
You may want to go over one archive at a time.
If there’s anything you think is important but that doesn’t fit the structured columns, put it in the note column.
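A quick self-check of a filled template could look like the sketch below. It assumes the template has been exported as CSV with the column names listed above plus a note column whose header is `note`; the function name `check_template` and the sample rows are hypothetical and not part of DCPOtools.

```python
import csv
import io

# Column names taken from the template description above.
REQUIRED = ["survey", "variable", "question_text", "response_categories"]


def check_template(csv_text):
    """Return a list of (row_number, problem) for a filled template."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    missing = [c for c in REQUIRED if c not in (reader.fieldnames or [])]
    if missing:
        return [(0, "missing columns: " + ", ".join(missing))]
    for i, row in enumerate(reader, start=2):  # row 1 is the header
        survey = (row["survey"] or "").strip()
        variable = (row["variable"] or "").strip()
        if not survey:
            problems.append((i, "survey is empty"))
        if not variable:
            problems.append((i, "variable is empty; use NA if no relevant question"))
        elif variable != "NA":
            # A real question needs its wording and its response options.
            for col in ("question_text", "response_categories"):
                if not (row[col] or "").strip():
                    problems.append((i, col + " is empty"))
    return problems


# Hypothetical sample rows, for illustration only.
sample = '''survey,variable,question_text,response_categories,note
survey_a,q56,"Full question wording here","1. Strongly agree, 2. Agree",
survey_b,NA,,,
survey_c,v122,,"1. Yes, 2. No",
'''
problems = check_template(sample)
print(problems)
```

Rows marked “NA” under variable pass with the other columns empty, which matches the rule for surveys without relevant questions.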
Three people per group, one group per topic.
Make sure to record the full sentences of the questions
Following Landis & Koch (1977), let’s aim for a κ of 0.8.
A high κ is not the ultimate goal; in other words, no! fake! consistency!
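To see where a κ value comes from, here is a minimal Python sketch of Cohen's κ for one pair of coders, κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The slides do not say which κ variant the workshop uses; with three people per group, computing it pair by pair is only one option. The function name `cohen_kappa` and the codings are hypothetical.

```python
from collections import Counter


def cohen_kappa(coder_a, coder_b):
    """Cohen's kappa for two coders who labelled the same items."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must label the same non-empty set of items")
    n = len(coder_a)
    # Observed agreement: share of items with identical labels.
    p_o = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    # Expected chance agreement from each coder's label frequencies.
    freq_a, freq_b = Counter(coder_a), Counter(coder_b)
    p_e = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if p_e == 1:
        # Both coders used one identical label throughout; kappa is
        # formally undefined, treated here as full agreement.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Hypothetical codings: is each of 10 survey questions relevant to the topic?
coder_1 = ["yes", "yes", "no", "no", "yes", "no", "yes", "no", "no", "yes"]
coder_2 = ["yes", "yes", "no", "no", "yes", "no", "yes", "yes", "no", "yes"]
kappa = cohen_kappa(coder_1, coder_2)
print(round(kappa, 2))
```

In this made-up example the coders disagree on one item out of ten, so p_o is 0.9 while p_e is 0.5, which gives a κ of 0.8.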
Communicate with your partners twice:
Make sure you record the data in the same way.