Essay
Consider a multi-core processor with heterogeneous cores: A, B, C and D where core B runs twice as fast as A, core C runs three times as fast as A and cores C and A run at the same speed (ie have the same processor frequency, micro architecture etc). Suppose an application needs to compute the square of each element in an array of 256 elements. Consider the following two divisions of labor:
(a)
(b)
Compute (1) the total execution time taken in the two cases and (2) cumulative processor utilization (Amount of total time the processors are not idle divided by the total execution time). For case (b), if you do not consider Core D in cumulative processor utilization (assuming we have another application to run on Core D), how would it change? Ignore cache effects by assuming that a perfect prefetcher is in operation.
Correct Answer:

Verified
(1) Total execution Time
(a) Total execu...View Answer
Unlock this answer now
Get Access to more Verified Answers free of charge
Correct Answer:
Verified
(a) Total execu...
View Answer
Unlock this answer now
Get Access to more Verified Answers free of charge
Q1: Applying the send/receive programming model as outlined
Q2: Suppose we have a dual core chip
Q3: Why should there be stride-access for vector
Q4: How would you rewrite the following sequential
Q5: Consider the following GPU that consists of
Q6: Consider a multi-core processor with 64
Q7: Besides network bandwidth and bisection bandwidth, two
Q8: Vector architecture exploits the data-level parallelism to
Q10: Consider the following code that adds two
Q11: Consider a system with two multiprocessors with