Closed
Labels: LAPACK issue (Deficiency in code imported from Reference-LAPACK)
Description
Hi,
I am trying to build OpenBLAS using
- NVHPC 23.7
- CUDA 12.1.1
Environment variables:
- CFLAGS: -O2 -tp=host -Mflushz
- CXXFLAGS: -O2 -tp=host -Mflushz
- FCFLAGS: -O2 -tp=host -Mflushz
- FFLAGS: -O2 -tp=host -Mflushz
- F90FLAGS: -O2 -tp=host -Mflushz
I ran the LAPACK test suite with

make lapack-test BINARY='64' CC='nvc' FC='nvfortran' MAKE_NB_JOBS='-1' USE_OPENMP='1' USE_THREAD='1'

after a minor patch to lapack-netlib/INSTALL/Makefile: I added -Mnomain to the testlsame and testsecond build recipes to resolve a multiple definition of `main' error. (I am not sure whether this is worth a separate issue: either the test there is wrongly compiling C sources for Fortran executables, or it is supposed to be like that and the compiler would normally disable the Fortran main automatically when needed.)

With that patch the test suite runs to completion, but I get several numerical (and also other) failures. I tested the following versions of OpenBLAS:
- 0.3.23
  --> LAPACK TESTING SUMMARY <--
  SUMMARY               nb test run    numerical error      other error
  ================      ===========    =================    ================
  REAL                  1108743        286    (0.026%)      0    (0.000%)
  DOUBLE PRECISION      1255005        280    (0.022%)      0    (0.000%)
  COMPLEX               762259         153    (0.020%)      0    (0.000%)
  COMPLEX16             699834         350    (0.050%)      0    (0.000%)
  --> ALL PRECISIONS    3825841        1069   (0.028%)      0    (0.000%)
- 0.3.24
  --> LAPACK TESTING SUMMARY <--
  SUMMARY               nb test run    numerical error      other error
  ================      ===========    =================    ================
  REAL                  1108743        286    (0.026%)      0    (0.000%)
  DOUBLE PRECISION      1255005        280    (0.022%)      0    (0.000%)
  COMPLEX               746859         189    (0.025%)      0    (0.000%)
  COMPLEX16             684434         386    (0.056%)      0    (0.000%)
  --> ALL PRECISIONS    3795041        1141   (0.030%)      0    (0.000%)
- 0.3.27
  --> LAPACK TESTING SUMMARY <--
  SUMMARY               nb test run    numerical error      other error
  ================      ===========    =================    ================
  REAL                  1108743        37171  (3.353%)      0    (0.000%)
  DOUBLE PRECISION      1255005        37165  (2.961%)      0    (0.000%)
  COMPLEX               762259         1448   (0.190%)      0    (0.000%)
  COMPLEX16             671348         37391  (5.570%)      14   (0.002%)
  --> ALL PRECISIONS    3797355        113175 (2.980%)      14   (0.000%)
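
To put the three summaries side by side, the ALL PRECISIONS totals above can be tabulated with a short Python sketch. The figures are copied from the summaries as reported; the names `totals` and `error_rate` are purely illustrative and are not part of OpenBLAS or the LAPACK test scripts.

```python
# ALL PRECISIONS totals copied from the summaries above:
# version -> (tests run, numerical errors, other errors)
totals = {
    "0.3.23": (3825841, 1069, 0),
    "0.3.24": (3795041, 1141, 0),
    "0.3.27": (3797355, 113175, 14),
}


def error_rate(run, errors):
    """Return errors as a percentage of the tests that were run."""
    return 100.0 * errors / run


# Numerical error rate per version, plus its ratio to the 0.3.23 rate.
rates = {v: error_rate(run, num) for v, (run, num, _other) in totals.items()}
baseline = rates["0.3.23"]

for version, (run, numerical, other) in totals.items():
    print(
        f"{version}: {rates[version]:.3f}% numerical errors, "
        f"{other} other errors, "
        f"{rates[version] / baseline:.1f}x the 0.3.23 rate"
    )
```

The point of the comparison is that 0.3.23 and 0.3.24 sit at roughly the same numerical error rate, while 0.3.27 is about two orders of magnitude higher with the same compiler and flags.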