This Dataset comprises of Synthetically Contaminated ASE Data from 40 LCL cell-line samples from the GEUVADIS Consortium. ASE was produced by spiking in a set proportion of reads (identified by the filename <contamination_in_percent>.csv) from contamination sample NA19159. ASE was produced at the variant level following best practice protocols.
The xz archive contains file .csv where each file represents ASE data from each of the 40 samples at that contamination percentage. Rows in the file correspond to Gene IDS (idenitified by ENSEMBL ID) and column names represent the various sample IDS.