
Understand execution speed on Intel boards for HLS4ML #25

Open
gnperdue opened this issue Oct 11, 2019 · 4 comments

@gnperdue
Contributor

Currently, models run slower on Intel (a 3-layer model takes 50 clock cycles vs. 10 on Xilinx). We should eventually understand why.
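For context, raw cycle counts translate to wall-clock latency via the clock period. A minimal sketch of that arithmetic, assuming a hypothetical 250 MHz target clock (the thread does not state the actual clock frequency):

```python
# Convert pipeline latency in clock cycles to nanoseconds.
# NOTE: the 250 MHz clock frequency below is an illustrative
# assumption, not a figure quoted anywhere in this thread.
def latency_ns(cycles: int, clock_mhz: float) -> float:
    """Latency in ns = cycles * period, with period = 1000 / f_MHz."""
    return cycles * 1000.0 / clock_mhz

# Cycle counts quoted above for the 3-layer model:
intel_cycles, xilinx_cycles = 50, 10
clock_mhz = 250.0  # assumed for illustration

print(latency_ns(intel_cycles, clock_mhz))   # 200.0 (ns)
print(latency_ns(xilinx_cycles, clock_mhz))  # 40.0 (ns)
```

At the same clock, the 5x cycle-count gap is a 5x latency gap; a different achieved clock on each part would shift the absolute numbers but not the ratio.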

@gnperdue
Contributor Author

Status:

Christian has been testing how large a model the Intel tools can handle: a 5-layer, fully connected network with 100, 200, or 500 nodes in the intermediate layers. The 500-node version choked quickly; the 100-node version was OK after about a day, and the 200-node version also worked (it took a few days). The design seems to be filling up the chip with LUTs.

Follow-ups: this inherited code is not automatically hooked up to evaluate models from Keras, but we want to run them to evaluate the effectiveness of the implemented model. Also looking at how resource usage scales for a simple 1-layer model.
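The blow-up with layer width is expected to be roughly quadratic, since each fully connected layer needs one multiply-accumulate (MAC) per weight. A rough sketch of that scaling for the widths tried above (the 16-input / 5-output shape is a hypothetical example; only the hidden widths come from the thread):

```python
# Rough MAC (multiply-accumulate) count for a fully connected network,
# as a crude proxy for DSP/LUT demand when the design is fully unrolled.
# Each dense layer of shape (n_in -> n_out) contributes n_in * n_out MACs.
def mac_count(layer_sizes):
    return sum(a * b for a, b in zip(layer_sizes, layer_sizes[1:]))

for width in (100, 200, 500):
    # 5 hidden layers of equal width; 16 inputs and 5 outputs are
    # assumed for illustration, not taken from the thread.
    sizes = [16] + [width] * 5 + [5]
    print(width, mac_count(sizes))
# 100 ->    42,100 MACs
# 200 ->   164,200 MACs
# 500 -> 1,010,500 MACs
```

Going from 100- to 500-node hidden layers multiplies the MAC count by roughly 24x, which is consistent with the 500-node build choking while the 100- and 200-node builds eventually fit.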

@gnperdue
Contributor Author

Status:

I’ve been working on improving model performance using the Intel tools (reducing
component usage, latency for a given model).  I’ve made some improvements (and
done a number of comparisons to results with the Xilinx tools), but the next step is
to talk to some of the Intel experts.  I gave an update on the status in an HLS4ML 
'working meeting' on Friday.

In terms of who to talk to from Intel, we have some contacts through CERN that
others have communicated with in the past, but Nhan also just met someone based
in Chicago while he was at SC19 that we were hoping to arrange a face-to-face with
at FNAL. We'll email him this coming week to touch base and try to get the ball rolling.
(Nhan and I were also going to meet with Brian on Tuesday to go over results and
discuss setting up a meeting w/ the Intel contact.)

@therwig

therwig commented Dec 20, 2019

Update:
Made contact with local Intel experts after an introduction w/ Nhan at SC19. Had a first (virtual) meeting to discuss the Accelerator AI project in broad strokes and introduce the challenges we've faced so far in effectively porting hls4ml to Quartus HLS. Followed up since then and shared a code repository with an implementation of a basic model in Quartus (https://github.com/therwig/TestQuartusHLS). Our Intel contact will have a look and share tips on reducing latency and shifting compute to utilize DSPs.

@gnperdue
Contributor Author

  • ongoing saga with licenses for the Intel toolkit (the compiler piece seems to be okay, but the validation/simulation component, ModelSim, does not have a powerful enough license)
  • some potential to use the Mentor Catapult HLS tools targeting Intel hardware (i.e., a third-party solution); looks promising so far
