A research result on Performance Evaluation of OpenCIlk Program on Asymmetric Performance Multicore Processors #167
Replies: 2 comments 2 replies
-
Hi @adnan4791, thanks for your post. I would like to clarify your question, so that one of the OpenCilk experts can better answer it. To reiterate your background information, it sounds like you have used OpenCilk to evaluate the performance of a test (simple pi) program with the goal of exploring the performance characteristics of AMPs, and based on that, you have arrived at two proposed updates to Amdahl's law. For your question, are there any kinds of comments or advice you're seeking? Based on other correspondence I've had with you, I think you might want specifically want to know if anyone in our community is doing similar work, or knows of other people or conferences that would be relevant. Is that right? Best regards, |
Beta Was this translation helpful? Give feedback.
-
Here is an applied question related to this thread, posted recently: How well do Linux and Windows handle variation in CPU core performance? |
Beta Was this translation helpful? Give feedback.
-
Dear Experts,
I have done a performance evaluation of the OpenCIlk Program on an Asymmetric Performance Multicore Processor, AMPs for short. AMPs, such as Intel Core i5 1240P, integrate high-performance Core (P-CPUs) and efficient Core (E-CPUs). Their performance is imbalanced. Specifically, the E-CPU is slower. Hence the first problem, in this case, is the unfair workload distribution among the P-CPUs and E-CPUs. Some literature supposed that AMPs might cause a performance bottleneck due to the completion time being bound to slower CPUs in static scheduling. In this case the faster CPUs would suffer from reduced performance (reduced IPC).
I propose to apply OpenCilk as a solution to the problem for several reasons. The first reason is that OpenCilk is suitable for shared memory multicores such as AMPs. Second, OpenCilk implements lazy task creation, which has low overhead. Third, OpenCilk implements continuation steal with THE protocol, so steal operations do not cause significant overhead.
I evaluated the simple pi program, which has a regular workload, written in OpenCilk.
As expected, i found
I would like to write a paper, in this regard, i would like ask your comments and advice, and i apologize for my bad English.
Edited :
I am writing my questions here
i would like to thanks in advance for this opportunity,
adnan
Beta Was this translation helpful? Give feedback.
All reactions