[mlpack-git] [mlpack/mlpack] mlpack_preprocess_split doesn't work when absolute paths are specified (#667)

Ryan Curtin notifications at github.com
Thu Jun 2 14:57:08 EDT 2016


Here's what I tried to do to play with it:

```
$ bin/mlpack_preprocess_split -i ~/datasets/corel.csv -I ~/datasets/corel.csv -t corel-train.csv -T corel-test.csv -v
[DEBUG] Compiled with debugging symbols.
[WARN ] You did not specify --training_labels_file, so the trainingset labels file name will be automatically set to 'train_/home/ryan/datasets/corel.csv'.
[WARN ] You did not specify --test_labels_file, so the test set labels file name will be automatically set to 'test_/home/ryan/datasets/corel.csv'.
[WARN ] You did not specify --test_ratio, so it will be automatically set to 0.2.
[INFO ] Loading '/home/ryan/datasets/corel.csv' as CSV data.  Size is 32 x 37749.
[INFO ] Loading '/home/ryan/datasets/corel.csv' as CSV data.  Size is 32 x 37749.
[INFO ] Training data contains 30200 points.
[INFO ] Test data contains 7549 points.
[INFO ] Saving CSV data to 'corel-train.csv'.
[INFO ] Saving CSV data to 'corel-test.csv'.
[WARN ] Cannot open file 'train_/home/ryan/datasets/corel.csv' for writing; save failed.
[WARN ] Cannot open file 'test_/home/ryan/datasets/corel.csv' for writing; save failed.
[INFO ] 
[INFO ] Execution parameters:
[INFO ]   help: false
[INFO ]   info: ""
[INFO ]   input_file: /home/ryan/datasets/corel.csv
[INFO ]   input_labels_file: /home/ryan/datasets/corel.csv
[INFO ]   test_file: corel-test.csv
[INFO ]   test_labels_file: ""
[INFO ]   test_ratio: 0.2
[INFO ]   training_file: corel-train.csv
[INFO ]   training_labels_file: ""
[INFO ]   verbose: true
[INFO ]   version: false
[INFO ] 
[INFO ] Program timers:
[INFO ]   loading_data: 1.012701s
[INFO ]   saving_data: 0.530123s
[INFO ]   total_time: 1.631944s
```

So I guess, we need to be more careful with prepending "train_", we have to do it in the right place.  Maybe some string tokenization is necessary. :(

Also, Keon, I made a few formatting and documentation changes in 6a94eb6, please let me know if I broke anything.

---
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/mlpack/mlpack/issues/667
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.cc.gatech.edu/pipermail/mlpack-git/attachments/20160602/ec536c65/attachment.html>


More information about the mlpack-git mailing list