[mlpack-git] (blog) master: Keon Week Ten (b51bcc8)

gitdub at mlpack.org gitdub at mlpack.org
Tue Jul 26 12:51:04 EDT 2016


Repository : https://github.com/mlpack/blog
On branch  : master
Link       : https://github.com/mlpack/blog/compare/f099a745dcae8f2f867e1ffd963ba4ef1f24c72f...b51bcc8a6e2830de86b1df72e693ba4e03130f8c

>---------------------------------------------------------------

commit b51bcc8a6e2830de86b1df72e693ba4e03130f8c
Author: Keon Kim <kwk236 at gmail.com>
Date:   Wed Jul 27 01:51:04 2016 +0900

    Keon Week Ten


>---------------------------------------------------------------

b51bcc8a6e2830de86b1df72e693ba4e03130f8c
 content/blog/KeonWeekEight.md |  2 +-
 content/blog/KeonWeekNine.md  | 35 +++++++++++++++++++++++++++++++++++
 2 files changed, 36 insertions(+), 1 deletion(-)

diff --git a/content/blog/KeonWeekEight.md b/content/blog/KeonWeekEight.md
index df6dfd0..a0839cf 100644
--- a/content/blog/KeonWeekEight.md
+++ b/content/blog/KeonWeekEight.md
@@ -9,7 +9,7 @@ DatasetMapper & Imputer
 
 1) Optimized Imputer a little bit. The details are discussed in the pull request [#694](https://github.com/mlpack/mlpack/pull/694).
 
-3) Debugged and polished some comments.
+2) Debugged and polished some comments.
 
 Descriptive Statistics
 
diff --git a/content/blog/KeonWeekNine.md b/content/blog/KeonWeekNine.md
new file mode 100644
index 0000000..ff7470e
--- /dev/null
+++ b/content/blog/KeonWeekNine.md
@@ -0,0 +1,35 @@
+Title: Dataset and Experimentation Tools : Week-9 Highlights
+Date: 2016-07-26 16:00:00
+Tags: gsoc, dataset, data
+Author: Keon Kim
+
+This week, pull request for DatasetMapper & Imputer is merged. I thank Zoq, Rcurtin and especially, Tham for all the feedbacks.
+I feel like I gave them more work than I did.
+
+DatasetMapper & Imputer
+
+1) I added Impute() function that applies imputation to all dimensions in the given matrix.
+
+2) I made a program called mlpack_preprocess_check (previously called mlpack_preprocess_verify in this blog).
+I will make a pull request after adding comments and docs.
+
+Descriptive Statistics
+
+1) After discussing a little how to manage statistics class, I put it into the preprocess/ folder
+because it will only be used for preprocess_describe command line program. It's sole purpose is to
+provide cleaner interface.
+I might even consider removing the class because the code length became too large for a small program.
+I will make this decision as soon as possible and make a pull request next week.
+
+2) I optimized some functions in statistics class.
+
+3) changed class Statistics to DescriptiveStatistics to be more specific.
+
+Documentations
+
+1) I made lists of algorithms implemented in mlpack in README.md and updated to date.
+
+Other
+
+1) I replaced cross_validation's split function with data::Split() inside dt_utils.
+I will make a pull request regarding this after a few performance checks.




More information about the mlpack-git mailing list