-
Notifications
You must be signed in to change notification settings - Fork 13
Added Preprocessing Bundle to ML_Core #25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
@RogerDev Please review |
lilyclemson
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@vzeufack Good code and testing! There are a few merge conflicts and minor typos. Please resolve.
Preprocessing/StandardScaler.ecl
Outdated
| SHARED numberLayout := Preprocessing.Types.numberLayout; | ||
|
|
||
| /** | ||
| * Computes averages and stdevs for each feature in baseData. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please try to void shorthand: stdevs = standard deviation (stdevs)
Preprocessing/LabelEncoder.ecl
Outdated
| RETURN Result; | ||
| ENDMACRO; | ||
|
|
||
| <<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove line 59. It looks like a leftover from a merge conflict
| $.TestOneHotEncoder.RunOneHotEncoderTests; | ||
| $.TestStandardScaler.RunStandardScalerTests; | ||
| $.TestMinMaxScaler.RunMinMaxScalerTests; | ||
| <<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please resolve the merge conflict
| $.TestAreEqualRows.TestDifferentRows(); | ||
|
|
||
| $.TestCompare.TestEqualData(); | ||
| <<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please resolve the merge conflict
| END; | ||
|
|
||
| /** | ||
| <<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please resolve the merge conflict
| PROJECT(ROWS(LEFT), XF(LEFT))); | ||
|
|
||
| #UNIQUENAME(comparisonResult) | ||
| <<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please resolve the merge conflict
e6dff78 to
92ac0e2
Compare
lilyclemson
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some minor typos need correction. A few descriptions need more details.
Great job! @vzeufack
Preprocessing/LabelEncoder.ecl
Outdated
| * </pre> | ||
| */ | ||
| EXPORT GetMapping(key) := FUNCTIONMACRO | ||
| IMPORT Preprocessing.Utils.LabelEncoder; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It may cause broken code. please use relative path.
Preprocessing/LabelEncoder.ecl
Outdated
| * <p> Data with categorical values replaced by numbers. | ||
| */ | ||
| EXPORT Encode(dataToEncode, key) := FUNCTIONMACRO | ||
| IMPORT Preprocessing.Utils; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It may cause broken code. please use relative path.
Preprocessing/LabelEncoder.ecl
Outdated
| * <p> Data with categorical values replaced by their original labels. | ||
| */ | ||
| EXPORT Decode(dataToDecode, encoderKey) := FUNCTIONMACRO | ||
| IMPORT Preprocessing.Utils; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It may cause broken code. please use relative path.
| IMPORT STD; | ||
| IMPORT $.Files; | ||
|
|
||
| spray := STD.File.SprayDelimited('192.168.56.101', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please abstract the IP address
| IMPORT STD; | ||
| IMPORT $.Files; | ||
|
|
||
| STD.File.SprayDelimited('192.168.56.101', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please abstract the IP address
| * | ||
| * @param partialKey: same record structure as the key (see below). | ||
| * <p> Mapping between feature names and categories. | ||
| * Some names are mapped to empty categories such that |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add a description for the case where the names are mapped to non-empty categories.
Preprocessing/MinMaxScaler.ecl
Outdated
| t_FieldReal := MLC.types.t_FieldReal; | ||
|
|
||
| /** | ||
| * shifts the values in a range [min, max]. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor typo: Shift
Preprocessing/MinMaxScaler.ecl
Outdated
|
|
||
|
|
||
| /** | ||
| * scales the data using the following formula: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo: Scale
Preprocessing/OneHotEncoder.ecl
Outdated
| END; | ||
|
|
||
| /** | ||
| * Produces a mapping between numbers when encoded to numbers when decoded. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please rephrase this line for better description of the function.
Preprocessing/StratifiedSplit.ecl
Outdated
| END; | ||
|
|
||
| /** | ||
| * Determines y stats from full data. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please explain y stats
|
Is this branch going to be merged? Analytics would like to use the preprocessing module to prepare data for deep learning training. I have cloned @vzeufack 's repository yet run into this error when trying to import the module: Not sure if merging the branch would resolve this issue or not. Please advise! |
|
@Zelazny7 If ML_Core bundle is already installed previously, it may conflict with the downloaded ML_Core bundle. Rename the downloaded bundle should solve the issue. Please let me know if anything is unclear. |

The current version of the Preprocessing Bundle includes: