Skip to content

Commit 9afc561

Browse files
authored
Merge pull request #36 from xianyi/develop
rebase
2 parents 02d60c1 + dca3e0c commit 9afc561

48 files changed

Lines changed: 2533 additions & 225 deletions

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

Changelog.txt

Lines changed: 44 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,48 @@
11
OpenBLAS ChangeLog
2+
====================================================================
3+
Version 0.3.9
4+
1-Mar-2020
5+
6+
common:
7+
* Fixed a miscompilation of the GETRF functions with CMAKE
8+
* Imported bugfix 390 from LAPACK (missing NaN propagation in xCOMBSSQ)
9+
* The size of the memory buffer used for splitting GEMM tasks across
10+
multiple threads can now be configured in the build system.
11+
12+
POWER:
13+
* Fixed several compilation problems related to endianness
14+
and ELF version on POWER8 and POWER9
15+
* Fixed use of the absolute value IAMIN/IAMAX instead of IMIN/IMAX
16+
* Fixed a race condition in the level3 blas code
17+
18+
MIPS64:
19+
* Fixed use of the absoltute value IAMIN/IAMAX instead of IMIN/IMAX
20+
21+
ARMV7:
22+
* Fixed a race condition in the level3 blas code
23+
* Fixed compilation on Android
24+
ARMV8:
25+
* Added support for Ampere EMAG8180
26+
* Added support for Neoverse N1
27+
* Improved performance of the blas_lock function
28+
* Fixed a race condition in the level3 blas code
29+
* Fixed a performance regression on TSV110-based servers
30+
31+
x86_64:
32+
* Fixed a long-standing error with undeclared register overwrites
33+
in the DSCAL microkernel for HASWELL,SKYLAKEX and ZEN
34+
* Fixed a long-standing bug in the SSE implementation of IAMAX
35+
* Fixed a CMAKE build failure with DYNAMIC_ARCH
36+
* Fixed cpu autodetection of Goldmont+, Cannon Lake and Ice Lake
37+
* Fixed a compilation failure on OSX with compiler name containing dash
38+
* Fixed compilation with MinGW on SkylakeX
39+
* Improved speed of the AVX512 GEMM3M kernel on SkylakeX
40+
* Added an AVX512 STRMM kernel for SkylakeX
41+
* Improved GEMM performance on Haswell and Zen
42+
43+
zarch:
44+
* fixed compilation of the DYNAMIC_ARCH code
45+
246
====================================================================
347
Version 0.3.8
448
9-Feb-2020

Makefile.system

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -327,7 +327,6 @@ ifeq ($(C_COMPILER), GCC)
327327
#Version tests for supporting specific features (MS_ABI, POWER9 intrinsics)
328328
GCCVERSIONGTEQ4 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \>= 4)
329329
GCCVERSIONGT4 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \> 4)
330-
GCCVERSIONGT5 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \> 5)
331330
GCCVERSIONGTEQ7 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \>= 7)
332331
GCCVERSIONGTEQ9 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \>= 9)
333332
GCCMINORVERSIONGTEQ7 := $(shell expr `$(CC) -dumpversion | cut -f2 -d.` \>= 7)
@@ -575,6 +574,7 @@ ifneq ($(C_COMPILER), GCC)
575574
DYNAMIC_CORE += POWER9
576575
endif
577576
ifeq ($(C_COMPILER), GCC)
577+
GCCVERSIONGT5 := $(shell expr `$(CC) -dumpversion | cut -f1 -d.` \> 5)
578578
ifeq ($(GCCVERSIONGT5), 1)
579579
DYNAMIC_CORE += POWER9
580580
else

TargetList.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -89,6 +89,7 @@ CORTEXA57
8989
CORTEXA72
9090
CORTEXA73
9191
NEOVERSEN1
92+
EMAG8180
9293
FALKOR
9394
THUNDERX
9495
THUNDERX2T99

0 commit comments

Comments
 (0)