LAPACKException(2) keeps happening #25

jwmi · 2020-11-05T04:15:05Z

Hi,
I've been using TSVD extensively -- thanks for this great package. However, I keep getting the following error when calling tsvd. Below is a minimal example in which failure occurs around 63% of the time. The error occurs on both Mac and Windows. Is it possible to fix this issue?
Thanks!
Jeff (@jwmi)

> using TSVD
> tsvd([1 0 ; 0 1], 1)

ERROR: LAPACKException(2)
Stacktrace:
 [1] chklapackerror at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\lapack.jl:38 [inlined]
 [2] bdsdc!(::Char, ::Char, ::Array{Float64,1}, ::Array{Float64,1}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\lapack.jl:5437
 [3] #svd!#162 at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:203 [inlined]
 [4] svd!(::Bidiagonal{Float64,Array{Float64,1}}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:203
 [5] svd(::Bidiagonal{Float64,Array{Float64,1}}; kw::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:207
 [6] svd at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:207 [inlined]
 [7] biLanczos(::Array{Int64,2}, ::Int64; maxiter::Int64, initvec::Array{Float64,1}, tolconv::Float64, tolreorth::Float64, stepsize::Int64, debug::Bool) at C:\Users\jwmil\.julia\packages\TSVD\Kpryb\src\svd.jl:171
 [8] _tsvd(::Array{Int64,2}, ::Int64; maxiter::Int64, initvec::Array{Float64,1}, tolconv::Float64, tolreorth::Float64, stepsize::Int64, debug::Bool) at C:\Users\jwmil\.julia\packages\TSVD\Kpryb\src\svd.jl:232
 [9] #tsvd#7 at C:\Users\jwmil\.julia\packages\TSVD\Kpryb\src\svd.jl:359 [inlined]
 [10] tsvd(::Array{Int64,2}, ::Int64) at C:\Users\jwmil\.julia\packages\TSVD\Kpryb\src\svd.jl:359
 [11] top-level scope at REPL[12]:1

The text was updated successfully, but these errors were encountered:

andreasnoack · 2020-11-05T08:32:48Z

So #26 fixes your minimal reproducer but I'd be interested in knowing if it also fixes your real use cases so please comment if the issue remains after the patch release.

jwmi · 2020-11-05T20:12:37Z

Hi @andreasnoack
Thanks for working on this! Even with this patch release, I'm still running into the same error in general use cases. Here is a simple simulation in which the error occurs on rank-one matrices.

using TSVD
I,J,K,k,n = 10,2,1,1,10000
for i = 1:n; tsvd(randn(I,K)*randn(J,K)',k); end

ERROR: LinearAlgebra.LAPACKException(2)
Stacktrace:
 [1] chklapackerror at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\lapack.jl:38 [inlined]
 [2] bdsdc!(::Char, ::Char, ::Array{Float64,1}, ::Array{Float64,1}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\lapack.jl:5437
 [3] #svd!#162 at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:203 [inlined]
 [4] svd!(::LinearAlgebra.Bidiagonal{Float64,Array{Float64,1}}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:203
 [5] svd(::LinearAlgebra.Bidiagonal{Float64,Array{Float64,1}}; kw::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}) at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:207
 [6] svd at D:\buildbot\worker\package_win64\build\usr\share\julia\stdlib\v1.5\LinearAlgebra\src\bidiag.jl:207 [inlined]
 [7] biLanczos(::Array{Float64,2}, ::Int64; maxiter::Int64, initvec::Array{Float64,1}, tolconv::Float64, tolreorth::Float64, stepsize::Int64, debug::Bool) at C:\Users\jwmil\.julia\packages\TSVD\U0N8S\src\svd.jl:175
 [8] _tsvd(::Array{Float64,2}, ::Int64; maxiter::Int64, initvec::Array{Float64,1}, tolconv::Float64, tolreorth::Float64, stepsize::Int64, debug::Bool) at C:\Users\jwmil\.julia\packages\TSVD\U0N8S\src\svd.jl:236
 [9] #tsvd#7 at C:\Users\jwmil\.julia\packages\TSVD\U0N8S\src\svd.jl:363 [inlined]
 [10] tsvd(::Array{Float64,2}, ::Int64) at C:\Users\jwmil\.julia\packages\TSVD\U0N8S\src\svd.jl:363
 [11] top-level scope at .\REPL[3]:1

The frequency with which an error occurs depends on the dimensions and rank of the matrix. Playing around a bit with the following chunk of code, it seems to occur more frequently on matrices in which one of the two dimensions is small and the rank is low.

I,J,K,k,n = 10,2,1,1,10000
f(A,k) = (try; tsvd(A,k); 0; catch; 1; end)
error_rate = sum([f(randn(I,K)*randn(J,K)',k) for i=1:n])/n
println("Error rate = ",error_rate)

Error rate = 0.0636

andreasnoack · 2020-11-07T20:28:34Z

Thanks for the new example. I think a proper fix here will require a some real effort. One of the new vectors is proportional to the previous basis vector so it ends as the zero vector after the axpy. I couldn't find a description of the issue in @rmlarsen's thesis but I might have missed it. Unfortunately, I don't have time to dive into the details right now, but I plan to take a closer look as soon as I have some spare cycles.

rmlarsen · 2020-11-08T20:11:33Z

This is an absolutely essential issue to deal with then implementing the algorithm in finite precision arithmetic. If you study the PROPACK implementation, you'll see the code related to this. See for example Jack Poulson's copy of PROPACK here: https://github.com/poulson/PROPACK/blob/master/double/dlanbpro.F#L386 (Easier than referring to my old tarball.) Rasmus

…

On Sat, Nov 7, 2020 at 12:28 PM Andreas Noack ***@***.***> wrote: Thanks for the new example. I think a proper fix here will require a some real effort. One of the new vectors is proportional to the previous basis vector so it ends as the zero vector after the axpy. I couldn't find a description of the issue in @rmlarsen <https://github.com/rmlarsen>'s thesis but I might have missed it. Unfortunately, I don't have time to dive into the details right now, but I plan to take a closer look as soon as I have some spare cycles. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#25 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEA72DQI2NVH6RSIESDA6ILSOWUX7ANCNFSM4TK3JQSQ> .

rmlarsen · 2020-11-08T20:15:23Z

BTW: There are lots of little details related to estimating norms for the stopping criterion, refining error bounds etc. that are not discussed in their final form in the thesis. On Sun, Nov 8, 2020 at 12:11 PM Rasmus Munk Larsen <[email protected]> wrote:

…

This is an absolutely essential issue to deal with then implementing the algorithm in finite precision arithmetic. If you study the PROPACK implementation, you'll see the code related to this. See for example Jack Poulson's copy of PROPACK here: https://github.com/poulson/PROPACK/blob/master/double/dlanbpro.F#L386 (Easier than referring to my old tarball.) Rasmus On Sat, Nov 7, 2020 at 12:28 PM Andreas Noack ***@***.***> wrote: > Thanks for the new example. I think a proper fix here will require a some > real effort. One of the new vectors is proportional to the previous basis > vector so it ends as the zero vector after the axpy. I couldn't find a > description of the issue in @rmlarsen <https://github.com/rmlarsen>'s > thesis but I might have missed it. Unfortunately, I don't have time to dive > into the details right now, but I plan to take a closer look as soon as I > have some spare cycles. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#25 (comment)>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AEA72DQI2NVH6RSIESDA6ILSOWUX7ANCNFSM4TK3JQSQ> > . >

RomeoV · 2023-10-20T21:32:48Z

I'm still getting this error fyi. Has anyone tried adapting the lines of code from the PROPACK reference above to the KSVD.jl implementation?

EDIT: Actually I found in my case that this happened in two cases:

if I put in a (Nx1) vector into tsvd, which could maybe be considered user error. (This happened sometimes automatically during another algorithm).
Somewhat randomly, when no orthogonal vector is found. Inspired by the link to PROPACK above, I inserted
a reorthogonalization step after v and alpha are computed (svd.jl:54):

            if α < τ * maximum(size(A)) * eps()  # orthogonalization failed, see https://github.com/poulson/PROPACK/blob/2465f89d5b1fba56de71c3e69e27d017c3dc2295/double/dlanbpro.F#L384
                @warn "Restart orthogonalization"
                b = V[1:(j-1)]
                B = KrylovKit.OrthonormalBasis(b ./ norm.(b))
                for _ in 1:3
                    v = rand!(v) * 2 .- 1
                    KrylovKit.orthonormalize!(v, B, KrylovKit.ModifiedGramSchmidt())
                    α = norm(v)
                    if !(α < τ * maximum(size(A)) * eps())
                        break
                    end
                end
            end

This seems to fix my issue for now.

See JuliaLinearAlgebra#25.

andreasnoack mentioned this issue Nov 5, 2020

Add a slight perturbation to initial α if it is one so ensure that #26

Merged

andreasnoack closed this as completed in #26 Nov 5, 2020

andreasnoack reopened this Nov 6, 2020

RomeoV added a commit to RomeoV/TSVD.jl that referenced this issue Oct 21, 2023

Add retrying normalization if necessary.

e2d474e

See JuliaLinearAlgebra#25.

RomeoV mentioned this issue Nov 1, 2023

Add retrying normalization if necessary. #39

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LAPACKException(2) keeps happening #25

LAPACKException(2) keeps happening #25

jwmi commented Nov 5, 2020

andreasnoack commented Nov 5, 2020

jwmi commented Nov 5, 2020

andreasnoack commented Nov 7, 2020

rmlarsen commented Nov 8, 2020 via email

rmlarsen commented Nov 8, 2020 via email

RomeoV commented Oct 20, 2023 •

edited

Loading

LAPACKException(2) keeps happening #25

LAPACKException(2) keeps happening #25

Comments

jwmi commented Nov 5, 2020

andreasnoack commented Nov 5, 2020

jwmi commented Nov 5, 2020

andreasnoack commented Nov 7, 2020

rmlarsen commented Nov 8, 2020 via email

rmlarsen commented Nov 8, 2020 via email

RomeoV commented Oct 20, 2023 • edited Loading

RomeoV commented Oct 20, 2023 •

edited

Loading