Patient Data Consolidation Tool

Project Overview

This Python script consolidates patient laboratory data from a CSV file, aggregating multiple entries for the same patient and collection date into a single row.
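The consolidation step described above can be sketched with a pandas group-by. This is a minimal illustration, not the script's actual code: the sample rows, the measurement column names (Glucose, Sodium), and the "keep the first non-null value per column" merge rule are assumptions.

```python
import pandas as pd

# Hypothetical sample: two rows for the same patient and collection date,
# each carrying a different measurement.
rows = pd.DataFrame(
    {
        "Patient": ["P001", "P001", "P002"],
        "CollectDate": ["2024-01-05", "2024-01-05", "2024-01-06"],
        "Glucose": [5.4, None, 6.1],
        "Sodium": [None, 140.0, 138.0],
    }
)


def consolidate(df: pd.DataFrame) -> pd.DataFrame:
    """Collapse rows sharing (Patient, CollectDate) into a single row.

    GroupBy.first() keeps the first non-null value in each column.
    That merge rule is an assumption made for this sketch.
    """
    return df.groupby(["Patient", "CollectDate"], as_index=False, sort=False).first()


consolidated = consolidate(rows)
print(consolidated)
```

If two rows for the same key report different values for the same measurement, this rule silently keeps the earlier one, so a real pipeline may want to detect such conflicts first.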

Features

- Processes large CSV files with multiple rows per patient
- Handles columns with mixed data types
- Splits output into multiple files for easier management
- Robust error handling for data type conversions
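One common way to handle mixed data types in pandas is to coerce a column to numbers and turn anything unparseable into NaN. The sketch below shows that approach; the helper name coerce_numeric and the sample values are hypothetical, and the script itself may convert types differently.

```python
import pandas as pd


def coerce_numeric(series: pd.Series) -> pd.Series:
    """Convert a column to numbers; entries that cannot be parsed become NaN."""
    return pd.to_numeric(series, errors="coerce")


# Hypothetical column mixing floats, numeric strings, free text, and gaps.
raw = pd.Series([5.4, "6.1", "pending", None])
clean = coerce_numeric(raw)
print(clean)
```

Coercing rather than raising keeps one bad cell from aborting a long run, at the cost of hiding the original text, so it can be worth counting the NaN values introduced per column.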

Prerequisites

- Python 3.8+
- pandas library
- numpy library

Installation

1. Clone the Repository

git clone <your-repository-url>
cd patient-data-processor

2. Create Virtual Environment (Recommended)

python3 -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

3. Install Dependencies

pip install pandas numpy

Usage

Run the Script

python consolidate_patient_data.py

Customization

- Modify input_file in the script to process a different CSV file
- Adjust the chunk_size parameter to control the size of each output file
- Set output_dir to specify the output location
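To show how chunk_size and output_dir might interact, here is a minimal sketch that writes a DataFrame as a series of CSV files. The function name write_chunks and the file naming pattern are assumptions for illustration; only the two parameter names come from the text above.

```python
import tempfile
from pathlib import Path

import pandas as pd


def write_chunks(df: pd.DataFrame, output_dir: str, chunk_size: int) -> list:
    """Write df into output_dir as CSV files of at most chunk_size rows each."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for number, start in enumerate(range(0, len(df), chunk_size), start=1):
        # The file naming scheme is an assumption made for this sketch.
        path = out / f"consolidated_chunk_{number:03d}.csv"
        df.iloc[start:start + chunk_size].to_csv(path, index=False)
        paths.append(path)
    return paths


# Demo in a throwaway directory: 5 rows with chunk_size=2 gives 3 files.
with tempfile.TemporaryDirectory() as tmp:
    demo = pd.DataFrame({"Patient": ["A", "B", "C", "D", "E"], "Value": [1, 2, 3, 4, 5]})
    written = write_chunks(demo, tmp, chunk_size=2)
    print([p.name for p in written])
```

A smaller chunk_size produces more, smaller files; the last file holds whatever rows remain.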

Input Requirements

A CSV file with patient data. Columns should include:
- Patient
- CollectDate
- Various measurement columns
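Before a long run, it can help to confirm that the header contains the required columns. The sketch below reads only the first row of the file using the standard library; the helper name missing_columns is hypothetical, and the script may perform this check differently or not at all.

```python
import csv
import io

REQUIRED_COLUMNS = ("Patient", "CollectDate")


def missing_columns(csv_file) -> list:
    """Return the required column names that are absent from the header row."""
    header = next(csv.reader(csv_file), [])
    present = {name.strip() for name in header}
    return [name for name in REQUIRED_COLUMNS if name not in present]


# In-memory sample; for a real file use open(path, newline="").
sample = io.StringIO("Patient,CollectDate,Glucose\nP001,2024-01-05,5.4\n")
print(missing_columns(sample))
```

Because only the header is read, the check costs the same for a file of any size.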

Output

- Multiple CSV files in the specified output directory
- chunk_summary.txt with processing details
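The exact contents of chunk_summary.txt are not documented here. As a rough illustration of what such a summary could hold, the sketch below writes one line per chunk file plus a total; the function name write_summary and the line layout are assumptions, not the script's real format.

```python
import tempfile
from pathlib import Path


def write_summary(output_dir: str, chunk_rows: dict) -> Path:
    """Write chunk_summary.txt listing each chunk file and its row count.

    chunk_rows maps a file name to its number of rows. The layout of the
    summary is an assumption made for this sketch.
    """
    lines = [f"{name}: {rows} rows" for name, rows in chunk_rows.items()]
    lines.append(f"Total files: {len(chunk_rows)}")
    lines.append(f"Total rows: {sum(chunk_rows.values())}")
    path = Path(output_dir) / "chunk_summary.txt"
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return path


# Demo in a throwaway directory with hypothetical file names.
with tempfile.TemporaryDirectory() as tmp:
    summary = write_summary(tmp, {"chunk_001.csv": 2, "chunk_002.csv": 1})
    print(summary.read_text(encoding="utf-8"))
```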

Handling Large Files

- The script is optimized for files with hundreds of thousands of rows
- It uses memory-efficient processing techniques
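The specific techniques are not listed here. One widely used option in pandas is to read the CSV in fixed-size pieces with the chunksize argument of read_csv, consolidate each piece, and then consolidate the partial results. The sketch below assumes that approach and a "first non-null value" merge rule; the function name consolidate_in_chunks and the sample data are hypothetical.

```python
import io

import pandas as pd

KEYS = ["Patient", "CollectDate"]


def consolidate_in_chunks(source, rows_per_read: int) -> pd.DataFrame:
    """Consolidate a CSV without loading every raw row at once."""
    partials = []
    for chunk in pd.read_csv(source, chunksize=rows_per_read):
        # Reduce each piece to one row per key before keeping it.
        partials.append(chunk.groupby(KEYS, as_index=False, sort=False).first())
    if not partials:
        return pd.DataFrame(columns=KEYS)
    # The same key can appear in several pieces, so merge the partials again.
    combined = pd.concat(partials, ignore_index=True)
    return combined.groupby(KEYS, as_index=False, sort=False).first()


# Hypothetical data where one patient's rows fall into different pieces.
csv_text = (
    "Patient,CollectDate,Glucose,Sodium\n"
    "P001,2024-01-05,5.4,\n"
    "P002,2024-01-06,6.1,138\n"
    "P001,2024-01-05,,140\n"
)
result = consolidate_in_chunks(io.StringIO(csv_text), rows_per_read=2)
print(result)
```

Memory use is then bounded by one piece of raw rows plus the already consolidated partial results, rather than by the full input file.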

Troubleshooting

- Ensure all required libraries (pandas, numpy) are installed
- Check that the input file is a valid CSV and that its column names include Patient and CollectDate
- Verify that Python 3.8 or later is in use
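The first and last checks can be automated with the standard library alone. The following is a small sketch, not part of the repository: the function name environment_problems is hypothetical, while the version and package requirements are taken from the Prerequisites section.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 8)
REQUIRED_PACKAGES = ("pandas", "numpy")


def environment_problems(packages=REQUIRED_PACKAGES, min_python=MIN_PYTHON) -> list:
    """Return human-readable descriptions of any unmet prerequisites."""
    problems = []
    if sys.version_info < tuple(min_python):
        problems.append("Python %d.%d or later is required" % tuple(min_python))
    for name in packages:
        # find_spec returns None when a top-level package cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(f"Missing package: {name}")
    return problems


for problem in environment_problems():
    print(problem)
```

An empty result means the interpreter version and both libraries were found; it does not verify that the installed library versions are compatible with the script.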


GlennRTC/patient-data-processor
