@wolfderby
No description provided.

…g DDL (csv columns, types, awk_nl_count removed)
… comments, script, and README (increase length to 32)
…d integrity check

- load_missing_product_adverse_effect.sh: Stage CSV and insert missing product_adverse_effect rows
- load_missing_product_to_rxnorm.sh: Stage CSV and insert missing product_to_rxnorm rows
- patch_missing_vocab_rxnorm_from_staging.sh: Insert placeholder RxNorm vocab entries and reinsert mappings
- patch_missing_vocab_meddra_from_staging.sh: Insert placeholder MedDRA vocab entries and reinsert adverse effects
- integrity_check_and_cleanup.sh: Verify no unmatched staging rows and drop staging tables
- Add an is_placeholder boolean column to the vocab tables and mark placeholder rows
- For vocab tables that have the is_placeholder column, count only non-placeholder rows
- Check that the column exists first, to avoid errors on tables without it
- Fix csv_count_diff mismatches for vocab_meddra_adverse_effect and vocab_rxnorm_product
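
As a rough illustration of the placeholder-aware counting described in the bullets above, the sketch below builds the two queries involved. The helper names column_exists_query and count_query are hypothetical and not taken from this PR. The onsides schema name and the "t"/"f" boolean output of psql -tA are assumptions, as is treating NULL as "not a placeholder".

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical, not the PR's actual code.

# Emit a query that reports whether schema.table has the given column.
column_exists_query() {
  local schema="$1" table="$2" column="$3"
  printf "SELECT EXISTS (SELECT 1 FROM information_schema.columns WHERE table_schema = '%s' AND table_name = '%s' AND column_name = '%s');\n" \
    "$schema" "$table" "$column"
}

# Emit the row-count query for schema.table.
# has_placeholder is "t" or "f" (assumed to come from psql -tA output).
count_query() {
  local schema="$1" table="$2" has_placeholder="$3"
  if [ "$has_placeholder" = "t" ]; then
    # Exclude placeholder rows; IS NOT TRUE keeps rows where the flag is NULL.
    printf 'SELECT count(*) FROM %s.%s WHERE is_placeholder IS NOT TRUE;\n' "$schema" "$table"
  else
    # Table has no is_placeholder column: count every row.
    printf 'SELECT count(*) FROM %s.%s;\n' "$schema" "$table"
  fi
}
```

Under those assumptions, the result of the first query could feed the second, for example has=$(psql -tAc "$(column_exists_query onsides vocab_rxnorm_product is_placeholder)") followed by psql -tAc "$(count_query onsides vocab_rxnorm_product "$has")".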
… restore script

- database/qa/run_qa_bulk.sh: Script to run QA logging for all CSVs in a directory
- .github/: GitHub Actions workflows for CI/CD
- database/schema/postgres_restore_constraints.sql: Script to restore FK/PK constraints
…n QA script

- Explain that wc -l may overcount due to embedded newlines
- Clarify that csv_count_diff is the key metric for import accuracy
- Document wc_l_count, csv_record_count, and select_count_on_domain
- Explain select_count_diff vs csv_count_diff
- Include examples and troubleshooting tips
- Added DDL for the about table, with metadata columns, to postgres.sql
- Created populate_about_table.sql to insert version, description, counts, etc.
- Created run_populate_about.sh to execute the population
- Populated the table in cem_development_2025 with current stats
- Remove defaults for PGHOST, PGPORT, PGUSER, PGDATABASE
- Add checks and prompts for each if not set in environment
- Password can still be handled via PGPASSWORD or .pgpass
- Table names in the SQL are already prefixed with onsides.
- populate_about_metadata.sql: Inserts row counts and attributes for OnSIDES tables
- run_populate_about_metadata.sh: Shell script to execute with prompts for env vars
- Populated the table with current stats (row counts, version, data sources)
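
The note above that wc -l may overcount because of embedded newlines can be shown with a small sketch. The function name csv_record_count is borrowed from the metric name in the list, but the implementation here is an assumption, not the PR's QA script. It assumes standard CSV quoting (double quotes, with "" as the escape) and counts the header as a record.

```shell
#!/usr/bin/env bash
# Sketch only: count logical CSV records, not physical lines.
csv_record_count() {
  awk '
    {
      # gsub replaces each double quote with itself and returns how many it found
      quotes += gsub(/"/, "&")
      # An even running total means no quoted field is still open,
      # so this physical line ends a logical record
      if (quotes % 2 == 0) { records++; quotes = 0 }
    }
    END { print records + 0 }
  ' "$1"
}

# Demo file: the second record has a newline inside a quoted field.
demo_csv="$(mktemp)"
printf 'id,note\n1,"line one\nline two"\n2,plain\n' > "$demo_csv"

echo "wc -l lines: $(wc -l < "$demo_csv" | tr -d ' ')"   # 4 physical lines
echo "csv records: $(csv_record_count "$demo_csv")"      # 3 records, header included
rm -f "$demo_csv"
```

Here the physical line count exceeds the record count by one, which is the kind of gap a line-based count introduces when a field contains a newline.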