Home

Welcome to the dataDisk Wiki!

Overview

dataDisk is a Python package designed to simplify the creation and execution of data processing pipelines. It provides a flexible framework for defining sequential tasks, applying transformations, and validating data. Additionally, it includes a ParallelProcessor for efficient parallel execution.
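
To make the ideas in the overview concrete, here is a minimal plain-Python sketch of a sequential pipeline with one validation step and one transformation step, plus a parallel runner. It deliberately does not use dataDisk itself: the names run_pipeline, run_parallel, require_non_negative, and double are hypothetical and chosen only for illustration, and the package's actual classes and signatures may differ.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Iterable, List

# Hypothetical names for illustration only; this is NOT the dataDisk API.
Step = Callable[[Any], Any]


def run_pipeline(value: Any, steps: Iterable[Step]) -> Any:
    """Apply each step to the result of the previous one, in order."""
    for step in steps:
        value = step(value)
    return value


def require_non_negative(x: float) -> float:
    """Validation step: return the value unchanged, or raise if it is invalid."""
    if x < 0:
        raise ValueError(f"negative value: {x}")
    return x


def double(x: float) -> float:
    """Transformation step."""
    return x * 2


def run_parallel(values: List[Any], steps: List[Step], workers: int = 4) -> List[Any]:
    """Run the same pipeline over many inputs concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda v: run_pipeline(v, steps), values))


steps = [require_non_negative, double]
single = run_pipeline(3, steps)          # validate, then transform one value
batch = run_parallel([1, 2, 3], steps)   # same pipeline over several inputs
```

A thread pool is used here only to keep the sketch short; CPU-bound steps would usually call for a process pool instead.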

Getting Started

Installation

To use dataDisk in your project, install it with pip:

pip install dataDisk
