[mlpack-svn] [MLPACK] #372: CFTest takes an hour in debug mode
MLPACK Trac
trac at coffeetalk-1.cc.gatech.edu
Mon Nov 24 17:58:51 EST 2014
#372: CFTest takes an hour in debug mode
---------------------+------------------------------------------------------
Reporter: rcurtin | Owner:
Type: defect | Status: new
Priority: major | Milestone: mlpack 1.1.0
Component: mlpack | Keywords: CFTest, CF, slow, NMFALS, NMF, ALS
Blocking: | Blocked By:
---------------------+------------------------------------------------------
According to Jenkins, CFTest takes nearly an hour in debug mode and half
an hour in non-debug mode:
http://big.cc.gt.atl.ga.us/job/mlpack%20-%20nightly%20matrix%20build/arch
=linux-amd64,armadillo=armadillo-
svn,buildmode=debug/lastCompletedBuild/testReport/MLPACKTest/CFTest/
http://big.cc.gt.atl.ga.us/job/mlpack%20-%20nightly%20matrix%20build/arch
=linux-amd64,armadillo=armadillo-
svn,buildmode=release/lastCompletedBuild/testReport/MLPACKTest/CFTest/
(or if those links don't work, you can look at the nightly matrix build,
select a configuration, then click on the test result and narrow down to
CFTest, and it should show how long it takes)
I think that testing is quite important so we should be certain that the
test is comprehensive, but I suspect that we can significantly cut down
the runtime of that particular test suite (with each test taking 10 or 20
minutes).
A first guess would be that the tolerance of the NMF ALS factorizer is too
tight, or simply that NMF-ALS is slow. So, what should happen is this
ticket is that we should investigate the cause of the slowdown (either
using the Timer class and displaying the output) or with gprof, then
determine if the real issue is a bottleneck in NMFALSFactorizer, or if the
convergence tolerance is needlessly tight, or if NMFALSFactorizer is even
the best choice for these tests and as the default CF factorizer.
Siddharth and Sumedh, I've CC'ed you guys because you were significantly
involved with the CF module development in the past and might have some
insight here.
--
Ticket URL: <http://trac.research.cc.gatech.edu/fastlab/ticket/372>
MLPACK <www.fast-lab.org>
MLPACK is an intuitive, fast, and scalable C++ machine learning library developed at Georgia Tech.
More information about the mlpack-svn
mailing list