...
- Copy and paste the questionnaire from the PDF into Word.
- Remove “junk” from the questionnaire (e.g., help screens, derived variables, page numbers) and any line breaks or formatting errors that may have been introduced in the copy-and-paste process.
- Colour code the elements of the questionnaire (e.g., change the font colour of all the code lists to yellow).
  - Brown = question label
  - Green = question literal
  - Red = instruction
  - Yellow = code list
  - Orange = response domain
  - Black = statement label
  - Pink = statement
  - Light blue = sequence
  - Dark blue = condition label
  - Purple = condition text
  - Grey = loop
- Whilst doing the colour-coding, move the question labels to a separate line before the question literal, and add labels for the other constructs also on a separate line before the construct (e.g., add condition labels to the line above the condition text, add statement labels to the line above the statement).
- Note: Do not add qi_ or qc_ before the question label - the parse_ncds_capi_txt parser will do this.
- Tidy up the questionnaire (i.e., group all the instructions together; check for repeated labels and re-label accordingly).
- Add information about the response domains. Format: Label,Type,Type 2,Format,Min,Max. See the readme on GitLab for formatting examples.
- Add information about loops. Format: {label, _variable, start value (always “1”), end value (when provided)}. Note that loop whiles are added in Archivist, not at the tagging stage. See the readme on GitLab for formatting examples.
- Add the CLOSER condition logic in curly brackets, and a condition label on a separate line above the condition logic. See the readme on GitLab for formatting examples.
- Copy and paste the questionnaire into Excel:
  - Add a new column of numbers in ascending order.
  - Remove any blank rows.
  - Sort the numbers column back to ascending order.
  - Filter the text by colour so it is grouped together by colour.
  - Add a new column for the element tags.
  - Tag the elements (e.g., add "question label" to the tags column for all the brown text, add "question literal" to the tags column for all the green text, etc.).
  - Sort by the numbers column in ascending order, applying the sort to all columns, so that the text is back in the order of the questionnaire.
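
The colour key and the Excel steps above can also be sketched in code, which may help when checking a tagged file. This is only an illustrative sketch, not part of the documented workflow: the function name `tag_rows` and the representation of each row as a `(text, colour)` pair are assumptions made for the example, and the sample rows are invented.

```python
# Colour key copied from the list above (colour -> element tag).
COLOUR_TO_TAG = {
    "brown": "question label",
    "green": "question literal",
    "red": "instruction",
    "yellow": "code list",
    "orange": "response domain",
    "black": "statement label",
    "pink": "statement",
    "light blue": "sequence",
    "dark blue": "condition label",
    "purple": "condition text",
    "grey": "loop",
}


def tag_rows(rows):
    """Mimic the Excel steps for a list of (text, colour) pairs.

    `rows` is assumed to be in questionnaire order. Returns a list of
    (number, text, tag) tuples, also in questionnaire order.
    """
    # Add a column of ascending numbers so the order can be restored later.
    numbered = [(n, text, colour.lower())
                for n, (text, colour) in enumerate(rows, start=1)]
    # Remove blank rows.
    numbered = [row for row in numbered if row[1].strip()]
    # Group the rows by colour (the "filter by colour" step).
    grouped = sorted(numbered, key=lambda row: row[2])
    # Tag each row according to its colour.
    tagged = [(n, text, COLOUR_TO_TAG[colour]) for n, text, colour in grouped]
    # Sort by the numbers column to get back to questionnaire order.
    return sorted(tagged, key=lambda row: row[0])


# Hypothetical example rows, invented for illustration only.
example = [
    ("AGE", "Brown"),
    ("How old are you?", "Green"),
    ("", "Black"),
    ("ENTER AGE IN YEARS", "Red"),
]
result = tag_rows(example)
```

Because the numbers are assigned before blank rows are removed, the numbering may contain gaps; this does not matter, since the numbers are only used to restore the original order.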
...