-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Verify metrics using reporter approach #138
Comments
I assume you haven't done anything on this yet, @vmagotra, so I'm going to take a crack at it. I'm hoping this will lead the way toward a unit test for Sync up the total active urls value with state in UrlDbFunction (as @kkrugler suggested via a comment on Try using stream harness support for unit testing). |
For some reason, I'm getting the following failure when running locally (master + one unit test class I added): schmed-mb-air-2:flink-1.5.2 schmed$ ./bin/flink run ~/Projects/flink-crawler/target/flink-crawler-tool-1.0-SNAPSHOT.jar -commoncrawl 2017-22 -cachedir ~/Downloads/flink-crawler/common-crawl-cache -seedurls ~/Downloads/flink-crawler/common-crawl-seed-urls.txt -forcecrawldelay 0 -maxcontentsize 100000 -outputfile ~/Downloads/flink-crawler/common-crawl-content.txt |
From the Flink mailing list:
The text was updated successfully, but these errors were encountered: