Train with validation data CU-861n1bn7h #10

CallMeMSL · 2023-05-05T12:34:54Z

Changes:

This implements the addition of a validation set for training and the retrieval of the train/validation score. A bug in the JSON parser for the train params has been fixed, which lets you now specify an array of values for a key. Additionally, bins can now be aligned between datasets, that are loaded from a file.

Problems:

With this implementation, only one validation set can be added.
This pull request breaks the current API by additionally requiring an Optional for the train and from_file methods.
Additionally, I found an instance where the addition of a big validation dataset leads to a segfault in the train step. See the ignored` test.

Possible Solutions

API Changes & only one validation set: This could be changed by changing the API to a builder pattern (as already mentioned). I thought of something along those lines:

let booster = Booster::new(&params)
    .train_data(...)
    .validation_data(...)
    .validation_data(...)
    .validation_data(...)
    .train_step_callback(...)
    .train_step_callback(...)
    .fit() // or fit_predict()

The callbacks could also be used for #6 and maybe should be done together with #3.
As for the broken test: I am not sure what is causing the segfault and I hope that I just somehow just load/handle the validation data wrong.

…than one metrix is used.

… provided for key was not formatted properly

…esults

leofidus · 2023-05-05T13:24:18Z

src/booster.rs

+        let out_strs = (0..num_metrics)
+            .map(|_| {
+                CString::new(" ".repeat(metric_name_length))
+                    .unwrap()
+                    .into_raw() as *mut c_char
+            })
+            .collect::<Vec<_>>();
+        lgbm_call!(lightgbm_sys::LGBM_BoosterGetEvalNames(
+            self.handle,
+            num_metrics,
+            &mut num_eval_names,
+            metric_name_length as u64,
+            &mut out_buffer_len,
+            out_strs.as_ptr() as *mut *mut c_char
+        ))?;
+        let output: Vec<String> = out_strs
+            .into_iter()
+            .map(|s| unsafe { CString::from_raw(s).into_string().unwrap() })
+            .take(num_eval_names as usize)
+            .collect();


I know the string logic is taken from feature_name(),j but if you are getting segfaults this (and feature_names) might be worth investigating. It is a bit suspicious, especially how the CString::from_raw documentation says you aren't supposed to change the string's length, which this probably does.

A better solution might be to allocate the strings as Vec<u8> initialized with 0s, and read them in with CString::from_vec_with_nul

Made a PR that should fix this, and makes the eval_names test pass: #11

leofidus · 2023-05-05T13:39:16Z

Possible Solutions

API Changes & only one validation set: This could be changed by changing the API to a builder pattern (as already mentioned). I thought of something along those lines:
let booster = Booster::new(&params)
    .train_data(...)
    .validation_data(...)
    .validation_data(...)
    .validation_data(...)
    .train_step_callback(...)
    .train_step_callback(...)
    .fit() // or fit_predict()
The callbacks could also be used for #6 and maybe should be done together with #3. As for the broken test: I am not sure what is causing the segfault and I hope that I just somehow just load/handle the validation data wrong.

That would make a lot of sense. Doesn't have to be part of this PR, we can also do a general cleanup PR that rewrites APIs to make more sense (another point would e.g. be the pervasive use of &str instead of AsRef<Path>, which the original author copied over from the XGBoost crate).

I'd maybe make the params part of the fit function though (unless that causes any problems) to make it easier to train multiple models with the same data but different parameters.

…with_val_data # Conflicts: # src/booster.rs

fix unsoundness of eval_names

leofidus · 2023-07-24T07:39:12Z

Task linked: CU-861n1bn7h LightGBM Validation Data

CallMeMSL added 6 commits May 4, 2023 13:02

change function signature and fix tests

e12857a

implement get_eval_names

4510965

tried to increase test complexity. eval_names() test fails when more …

6182be1

…than one metrix is used.

fixed bug in parameter formatting: edgecase where multiple values are…

87995cc

… provided for key was not formatted properly

implemented bin matching for dataset file loading, implemented eval r…

c2ee104

…esults

fix clippy warnings

7f6e73a

CallMeMSL requested a review from leofidus May 5, 2023 12:34

CallMeMSL linked an issue May 5, 2023 that may be closed by this pull request

Allow adding validation data, return metrics #5

Open

leofidus reviewed May 5, 2023

View reviewed changes

leofidus and others added 4 commits May 10, 2023 18:17

don't prematurely drop validation data

151a11f

fix seg_fault and wrong train loop indexing

5e9d239

Merge remote-tracking branch 'origin/train_with_val_data' into train_…

e9d9b94

…with_val_data # Conflicts: # src/booster.rs

fix unsoundness of eval_names

8779e60

leofidus mentioned this pull request May 12, 2023

fix unsoundness of eval_names #11

Merged

leofidus and others added 2 commits May 15, 2023 13:35

don't panic in sanity check

f1f0e76

Merge pull request #11 from DeepSignSecurity/eval_names_memory_safe

29154e0

fix unsoundness of eval_names

CallMeMSL changed the title ~~Train with validation data~~ Train with validation data CU-861n1bn7h Jul 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Train with validation data CU-861n1bn7h #10

Train with validation data CU-861n1bn7h #10

Uh oh!

CallMeMSL commented May 5, 2023

Uh oh!

leofidus May 5, 2023 •

edited

Loading

Uh oh!

leofidus May 12, 2023

Uh oh!

leofidus commented May 5, 2023

Possible Solutions

Uh oh!

leofidus commented Jul 24, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Train with validation data CU-861n1bn7h #10

Are you sure you want to change the base?

Train with validation data CU-861n1bn7h #10

Uh oh!

Conversation

CallMeMSL commented May 5, 2023

Changes:

Problems:

Possible Solutions

Uh oh!

leofidus May 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

leofidus May 12, 2023

Choose a reason for hiding this comment

Uh oh!

leofidus commented May 5, 2023

Possible Solutions

Uh oh!

leofidus commented Jul 24, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

leofidus May 5, 2023 •

edited

Loading