Group A: Implemented optimized matrix multiplication by Artorias17 · Pull Request #4 · AA-parallel-computing/Assignment-4-Optional

Artorias17 · 2026-05-31T18:34:11Z

Group A:

Ha Do (Student ID: 2402703)
Abhishek Roy (Student ID: 2502895)

Implemented and optimized the matrix multiplication. Implementation details are in the README.md file. The main speedup came from swapping the loop ordering from i -> j -> k to i -> k -> j, making the memory access pattern for all three matrices row-wise and introducing compiler flags for aggressive optimization, native SIMD, and loop unrolling.

System used for testing:

CPU: AMD Ryzen 7 8845HS
Architecture: x86-64
Cores: 8
Threads: 16

khanhhado1208 and others added 5 commits May 28, 2026 23:59

Implement blocked matrix multiplication and validation logic

e8c0dbb

Implement openMP multiplication and add gitignore

d384fde

Add python script to run and generate test results

e25a31b

Optimized by adding compiler flags and swapping loop ordering

dab97c4

Add cpu specs to README.md

c64045e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Group A: Implemented optimized matrix multiplication#4

Group A: Implemented optimized matrix multiplication#4
Artorias17 wants to merge 5 commits into
AA-parallel-computing:mainfrom
Artorias17:abhishek-roy

Artorias17 commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Artorias17 commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants