AWS hadoop runs never complete #142

blueeyedJ · 2015-03-04T00:02:33Z

Hi,
I'm very new to this and am having trouble running Hadoop on AWS.

Running 0.10.1.
Cmd: /pkb.py --cloud=AWS --benchmarks=hadoop_terasort --machine_type=t1.micro --terasort_num_rows=1000

Is there a minimum machine_type? I would think 1000 rows would go very quickly, but the run never completed, even after 3 hours. Any ideas or suggestions?
Thanks,

cmccoy · 2015-03-04T00:06:13Z

There should be a run-specific verbose log file in /tmp/perfkitbenchmarker/run_<run_uri>/pkb.log - could you paste the last few commands run?

I wouldn't be surprised if a t1.micro didn't have enough memory to run Hadoop, but I haven't tried.
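
For grabbing the last few lines of that log, a minimal sketch like the one below may help. The helper name `tail_lines` is hypothetical and not part of PerfKit Benchmarker; the log path is passed in as an argument, so substitute the actual run directory for `run_<run_uri>`.

```python
import sys
from collections import deque
from pathlib import Path


def tail_lines(path, n=20):
    """Return the last n lines of a text file, without trailing newlines.

    Hypothetical helper for illustration; not a PerfKit Benchmarker API.
    """
    with Path(path).open("r", errors="replace") as handle:
        # A deque with maxlen keeps only the most recent n lines in memory.
        return [line.rstrip("\n") for line in deque(handle, maxlen=n)]


if __name__ == "__main__":
    # Pass the path to pkb.log inside your own run directory.
    if len(sys.argv) > 1:
        for line in tail_lines(sys.argv[1]):
            print(line)
```

Saved as a script, this would be invoked with the path to the run's pkb.log as its only argument. On most systems `tail -n 20` on the same file does the same job.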

blueeyedJ · 2015-03-04T14:42:19Z

Yeah, that was it. You might consider adding a note to the pkb help output, something along the lines of "resource intensive workload, may not work well with micro tiers".

cmccoy · 2015-03-04T20:24:16Z

Glad to hear it. I opened #144 to track updating the help.

cmccoy mentioned this issue Mar 4, 2015

Note workloads that may fail on shared core / low memory instance types #144

Open

cmccoy closed this as completed Mar 4, 2015

mateusz-blaszkowski mentioned this issue Dec 21, 2015

Hadoop hangs because it cannot reach other instances by hostname #765

Open
