Conversation
added in ICES surveys to diet score derived variable
updated function diet_score_cat3 to include master survey cycles
added in master survey cycles for required diet score variables to derive diet score
included master survey cycles for diet score and diet-score dependent variables
yulric
left a comment
There was a problem hiding this comment.
The database names are not consistent between the variables and variable details sheet. The variable details sheet uses the _i suffix whereas the variables sheet uses the _m suffix. Can you fix that?
yulric
left a comment
There was a problem hiding this comment.
I'm still seeing the extra zeros in some lines. I've replied to the comments that have them.
removed extra zero in cchs2013_2014_m cycle name for FVCDCAR, FVCDFRU, FVCDJUI, FVCDPOT, FVCDSAL, FVCDTOT, FVCDVEG
…s; fixed all their databaseStart and variableStart info too
|
Reviewed the FVC and diet_score variable changes (FVCDFRU, FVCDSAL, FVCDPOT, FVCDCAR, FVCDVEG, FVCDJUI, diet_score, diet_score_cat3). No issues found. Checked for:
Note: This review covers only the FVC/diet_score variables. The variables.csv full-file rewrite (column reordering, new metadata columns) was not reviewed here. |
Code reviewReviewed 9 derived FVC/diet variables and 30 raw FVC variables for PUMF and Master across 2001-2018 cycles. L6 integration test: cross-cycle prevalenceRan No step changes at the 2014-2015 era boundary. The 2015+ variable renames (FVCDVFRU, FVCDVGRN, FVCDVORA, FVCDVPOT, FVCDVVEG, FVCDVJUI) are correctly mapped. Master ( Issues found and fixedAll issues below have been fixed in commits P0 -- data bug:
P1 -- naming/mapping errors: P2 -- metadata quality: Schema:
Checked: era boundary defaults, databaseStart consistency (variables.csv vs variable_details.csv), PUMF/Master naming conventions, pre-2007 cycle letters, DV function specifications (diet_score_fun, diet_score_fun_cat), dummyVariable naming, swapped recEnd values, label/metadata consistency, known error patterns, and PUMF integration across all cycles. Note: Post-approval commit 16a8f3a was pushed on 2026-02-10 (after yulric's approval on 2025-12-04). Review fixes applied in commits 8939654 and d8f3890. CEP: |
variables.csv: - Fix chs→cchs typo in databaseStart for 30 FVC_* variables (2011/2013 master) - Fix double spaces in FVC_4E, FVC_5E labels - Fix FVCDTOT "consumptoin" typo and trailing dash in labelLong - Fix inconsistent labelLong formatting for FVCDFRU, FVCDSAL, FVCDVEG - Add descriptions for 7 FVCD* derived frequency variables - Improve diet_score_cat3 labelLong and description - Add reviewNotes for 30 FVC_* variables variable_details.csv: - Fix chs→cchs typo in databaseStart/variableStart for 144 FVC_* rows - Fix FVC_6D swapped recEnd values (P0 data bug: valid range mapped to NA) - Fix FVC_1A and FVC_5A dummy variable suffix-recEnd misalignment - Rename _NA::a/_NA::b to _NAa/_NAb for FVC_1A-6A and diet_score_cat3 - Set diet_score_cat3 Func row dummyVariable to N/A - Fix double spaces in FVC_4E, FVC_5E catLabel and related fields
- Fix metadata_registry.yaml: replace _NA::[a-z] with _NA[a-z] in regex,
remove _cont{N} pattern (continuous variables use N/A), add missing
category examples (_NAa, _NAb)
- Fix variable_details.yaml: update dummyVariable notes to match actual
convention with regex pattern and correct examples
See issue #172 for propagation to other branches.
Review summary, L6 integration test script, PUMF test results, and cross-cycle prevalence QMD from the diet-score PR review.
FVCDFRU, FVCDSAL, FVCDPOT, FVCDCAR, FVCDVEG, FVCDJUI (diet_score dependent variables) updated to include master survey cycles from 2001-2018
diet_score updated to include master survey cycles from 2001-2018
diet_score_cat3 updated to include master survey cycles from 2001-2018
master survey cycles added to FVCDFRU, FVCDSAL, FVCDPOT, FVCDCAR, FVCDVEG, FVCDJUI, diet_score and diet_score_cat3 in variables.csv