Relax heuristic to apply vertical weight optimization
yeonbok committed Nov 10, 2024
1 parent 076e625 commit 412cbe6
Showing 1 changed file with 1 addition and 1 deletion.
@@ -145,7 +145,7 @@ static bool is_weight_vertical(const fully_connected_params& params, size_t outp
                     << "(computeUnitsCount : " << params.engineInfo.computeUnitsCount
                     << " min_num_threads : " << min_num_threads << ")" << std::endl;
     GPU_DEBUG_TRACE_DETAIL << "Use ofm_tile size 1 if the batch size is 1." << std::endl;
-    return (params.weights.IFM().v >= params.weights.OFM().v * 3
+    return (params.weights.IFM().v >= params.weights.OFM().v * 2
             && output_f / 2 /*most frequently used tile_ofm*/ <= min_num_threads);
 }
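
For reviewers: the only change is the IFM/OFM multiplier, lowered from 3 to 2, so the vertical weight path is taken for more shapes. The sketch below is a hypothetical standalone reduction of the return expression, not the actual kernel-selector code. The names weight_vertical_sketch, old_rule and new_rule are made up for illustration, and min_num_threads is passed in directly because its computation is not visible in this hunk.

```cpp
#include <cstddef>

// Hypothetical stand-in for the return expression of is_weight_vertical().
// ifm and ofm play the role of params.weights.IFM().v and params.weights.OFM().v;
// ratio is the multiplier this commit changes (3 before, 2 after).
constexpr bool weight_vertical_sketch(std::size_t ifm, std::size_t ofm,
                                      std::size_t output_f,
                                      std::size_t min_num_threads,
                                      std::size_t ratio) {
    return ifm >= ofm * ratio
        && output_f / 2 /* most frequently used tile_ofm */ <= min_num_threads;
}

// Rule before the commit: IFM must be at least 3x OFM.
constexpr bool old_rule(std::size_t ifm, std::size_t ofm,
                        std::size_t output_f, std::size_t min_num_threads) {
    return weight_vertical_sketch(ifm, ofm, output_f, min_num_threads, 3);
}

// Rule after the commit: IFM must be at least 2x OFM.
constexpr bool new_rule(std::size_t ifm, std::size_t ofm,
                        std::size_t output_f, std::size_t min_num_threads) {
    return weight_vertical_sketch(ifm, ofm, output_f, min_num_threads, 2);
}
```

With illustrative values ifm = 1024, ofm = 512, output_f = 512 and min_num_threads = 256, old_rule rejects the shape (1024 < 1536) while new_rule accepts it (1024 >= 1024 and 256 <= 256). Shapes with IFM at least 3x OFM pass both rules, and the thread-count condition is unchanged.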
