Merge pull request OpenMathLib#4421 from ChipKerchner/power10Copies_DGEMM

Replace two vector loads with one vector pair load and fix endianness of stores - DGEMM PowerPC versions.
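
The Power10 macro file itself is not shown in this excerpt, so the following is only a rough plain-C model of the idea in the description, not the code from the pull request. It assumes the Power ISA 3.1 behaviour in which a paired load on a little-endian system places the lower-addressed 16 bytes in the odd register of the pair, so stores have to pick the registers in the matching order. The types `vec128` and `vec_pair` and all function names are hypothetical stand-ins for VSX registers and instructions.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical stand-ins: a 16-byte vector register and an even/odd pair. */
typedef struct { double d[2]; } vec128;
typedef struct { vec128 reg[2]; } vec_pair; /* reg[0] = even, reg[1] = odd */

/* Old scheme: two separate 16-byte loads, one per register. */
static void load_two(vec128 *a, vec128 *b, const double *src) {
    memcpy(a, src, sizeof *a);
    memcpy(b, src + 2, sizeof *b);
}

/* New scheme: one 32-byte paired load.
 * Assumption: on little-endian the lower-addressed half lands in the
 * odd register; on big-endian it lands in the even register. */
static vec_pair load_pair(const double *src, int little_endian) {
    vec_pair p;
    int low = little_endian ? 1 : 0;
    memcpy(&p.reg[low], src, sizeof(vec128));
    memcpy(&p.reg[1 - low], src + 2, sizeof(vec128));
    return p;
}

/* Store both halves in memory order by writing the register that holds
 * the lower-addressed half first. */
static void store_in_memory_order(double *dst, const vec_pair *p,
                                  int little_endian) {
    int low = little_endian ? 1 : 0;
    memcpy(dst, &p->reg[low], sizeof(vec128));
    memcpy(dst + 2, &p->reg[1 - low], sizeof(vec128));
}

/* Self-check: the paired path must reproduce what two single loads see. */
void check_roundtrip(void) {
    const double src[4] = {1.0, 2.0, 3.0, 4.0};
    vec128 a, b;
    load_two(&a, &b, src);
    for (int le = 0; le <= 1; le++) {
        double dst[4] = {0};
        vec_pair p = load_pair(src, le);
        int low = le ? 1 : 0;
        assert(memcmp(&a, &p.reg[low], sizeof a) == 0);
        assert(memcmp(&b, &p.reg[1 - low], sizeof b) == 0);
        store_in_memory_order(dst, &p, le);
        assert(memcmp(src, dst, sizeof src) == 0);
    }
}
```

Calling `check_roundtrip()` exercises both byte orders. If the store routine always wrote `reg[0]` first, the little-endian case would swap the two 16-byte halves in the output, which is the kind of ordering problem the description refers to.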
martin-frbg authored Jan 10, 2024
2 parents 7a6a246 + 5c5c1a1 commit 3599f2d
Showing 3 changed files with 664 additions and 0 deletions.
3 changes: 3 additions & 0 deletions kernel/power/dgemm_tcopy_16_power8.S
@@ -107,6 +107,9 @@ USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

#define o0 0

#ifdef POWER10
#include "dgemm_tcopy_macros_16_power10.S"
#endif
#include "dgemm_tcopy_macros_16_power8.S"

#define STACKSIZE 144