[mlpack-svn] [MLPACK] #372: CFTest takes an hour in debug mode

MLPACK Trac trac at coffeetalk-1.cc.gatech.edu
Mon Nov 24 17:58:51 EST 2014


#372: CFTest takes an hour in debug mode
---------------------+------------------------------------------------------
 Reporter:  rcurtin  |        Owner:                                    
     Type:  defect   |       Status:  new                               
 Priority:  major    |    Milestone:  mlpack 1.1.0                      
Component:  mlpack   |     Keywords:  CFTest, CF, slow, NMFALS, NMF, ALS
 Blocking:           |   Blocked By:                                    
---------------------+------------------------------------------------------
 According to Jenkins, CFTest takes nearly an hour in debug mode and half
 an hour in non-debug mode:

 http://big.cc.gt.atl.ga.us/job/mlpack%20-%20nightly%20matrix%20build/arch=linux-amd64,armadillo=armadillo-svn,buildmode=debug/lastCompletedBuild/testReport/MLPACKTest/CFTest/

 http://big.cc.gt.atl.ga.us/job/mlpack%20-%20nightly%20matrix%20build/arch=linux-amd64,armadillo=armadillo-svn,buildmode=release/lastCompletedBuild/testReport/MLPACKTest/CFTest/

 (or if those links don't work, you can look at the nightly matrix build,
 select a configuration, then click on the test result and narrow down to
 CFTest, and it should show how long it takes)

 I think that testing is quite important so we should be certain that the
 test is comprehensive, but I suspect that we can significantly cut down
 the runtime of that particular test suite (with each test taking 10 or 20
 minutes).

 A first guess would be that the tolerance of the NMF ALS factorizer is too
 tight, or simply that NMF-ALS is slow.  So, what should happen in this
 ticket is that we should investigate the cause of the slowdown (either
 using the Timer class and displaying the output, or with gprof), then
 determine if the real issue is a bottleneck in NMFALSFactorizer, or if the
 convergence tolerance is needlessly tight, or if NMFALSFactorizer is even
 the best choice for these tests and as the default CF factorizer.
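
 To make the "tolerance is too tight" hypothesis concrete, here is a rough
 sketch of the kind of measurement I mean.  It is not mlpack code:
 IterateUntilConverged() is a made-up stand-in for an iterative factorizer
 (it just halves a residual each step), and TimeSeconds() is a hypothetical
 helper built on std::chrono rather than the Timer class.  The point is
 only that a tighter tolerance means more iterations, and that wrapping the
 call in a timer shows where the wall-clock time goes.

```cpp
#include <chrono>
#include <cstddef>

// Hypothetical stand-in for an iterative factorizer (NOT NMF-ALS): the
// residual is halved on every iteration until it is no larger than
// `tolerance` or `maxIterations` is reached.  Returns the iteration count.
std::size_t IterateUntilConverged(const double tolerance,
                                  const std::size_t maxIterations)
{
  double residual = 1.0;
  std::size_t iterations = 0;
  while (residual > tolerance && iterations < maxIterations)
  {
    residual *= 0.5;
    ++iterations;
  }
  return iterations;
}

// Hypothetical helper: runs the callable once and returns the elapsed
// wall-clock time in seconds, measured with a monotonic clock.
template<typename Function>
double TimeSeconds(Function&& f)
{
  const auto start = std::chrono::steady_clock::now();
  f();
  const auto stop = std::chrono::steady_clock::now();
  return std::chrono::duration<double>(stop - start).count();
}
```

 With this toy model, a tolerance of 1e-2 needs 7 iterations and 1e-5 needs
 17, so comparing iteration counts and TimeSeconds() results for a few
 tolerances on the real factorizer should tell us fairly quickly whether
 the tolerance or the per-iteration cost is the problem.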

 Siddharth and Sumedh, I've CC'ed you guys because you were significantly
 involved with the CF module development in the past and might have some
 insight here.

-- 
Ticket URL: <http://trac.research.cc.gatech.edu/fastlab/ticket/372>
MLPACK <www.fast-lab.org>
MLPACK is an intuitive, fast, and scalable C++ machine learning library developed at Georgia Tech.

