Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AWS hadoop runs never complete #142

Closed
blueeyedJ opened this issue Mar 4, 2015 · 3 comments
Closed

AWS hadoop runs never complete #142

blueeyedJ opened this issue Mar 4, 2015 · 3 comments

Comments

@blueeyedJ
Copy link

Hi,
I'm very new to this and am having trouble with hadoop and aws.

Running 0.10.1.
Cmd: /pkb.py --cloud=AWS --benchmarks=hadoop_terasort --machine_type=t1.micro --terasort_num_rows=1000

Is there a minimum machine_type? I would think 1000 rows would go very quickly but it never completed, even after 3 hours of running. Ideas/Suggestions?
Thanks,

@cmccoy
Copy link
Contributor

cmccoy commented Mar 4, 2015

There should be a run-specific verbose log file in /tmp/perfkitbenchmarker/run_<run_uri>/pkb.log - could you paste the last few commands run?

I wouldn't be surprised if a t1.micro didn't have enough memory to run Hadoop, but I haven't tried.

@blueeyedJ
Copy link
Author

yeah that was it. Might consider adding a note to the pkb help output. Something along the lines of "resource intensive workload, may not work well with micro tiers".

@cmccoy
Copy link
Contributor

cmccoy commented Mar 4, 2015

Glad to hear it. I opened #144 to track updating the help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants