[mlpack-svn] [MLPACK] #298: Enhancement of linear regression library

MLPACK Trac trac at coffeetalk-1.cc.gatech.edu
Sat Aug 10 18:28:24 EDT 2013


#298: Enhancement of linear regression library
--------------------------------+-------------------------------------------
  Reporter:  sumedhghaisas      |        Owner:              
      Type:  enhancement        |       Status:  new         
  Priority:  major              |    Milestone:  mlpack 1.0.7
 Component:  mlpack             |   Resolution:              
  Keywords:  linear_regression  |     Blocking:              
Blocked By:                     |  
--------------------------------+-------------------------------------------

Comment (by rcurtin):

 Hello Sumedh,

 Thank you very much for the contribution!  I will be taking a look through
 it shortly and integrating your improvements.  The improvement of
 Predict() looks good; I will have to make sure it still passes the test.
 I do think the equation for theta, as implemented, is not correct because
 Armadillo uses column-major memory ordering.  See
 http://www.mlpack.org/doxygen.php?doc=matrices.html for a better
 explanation (and let me know if that doesn't clarify).

 Are you willing to write a test for the ridge regression and the cost
 function calculation?  Everything that mlpack provides is tested; you can
 find the tests in src/mlpack/tests/.  We already have one for linear
 regression in src/mlpack/tests/linear_regression_test.cpp.  A good test
 for ridge regression might be to give a dataset where the inversion will
 fail if lambda = 0 (i.e. pass in a data matrix X where (X * X^T) is not
 invertible), but will give good results for some nonzero lambda.  You
 could use the current test as an example.  A test for the cost should not
 be hard; give an example dataset or two, and compare the cost with the
 cost that is expected for that dataset.

 Let me know if you have any questions.  Again, thanks for the
 contribution.  I will be working at least the modifications to Predict()
 in over the next day or two, and once I have a test for the new
 functionality I am happy to add that too.

 Thanks!

 Ryan

-- 
Ticket URL: <http://trac.research.cc.gatech.edu/fastlab/ticket/298#comment:1>
MLPACK <www.fast-lab.org>
MLPACK is an intuitive, fast, and scalable C++ machine learning library developed at Georgia Tech.


More information about the mlpack-svn mailing list