A basic web crawler written in Node.js.
Crawljs depends on jsdom, which in turn depends on contextify, a native Node.js extension. Because contextify is compiled during installation, you need a C++ compiler on your machine to install and run this crawler.
npm install -g crawljs
crawljs http://nodejs.org
crawljs http://nodejs.org 500
Crawls only the first 500 URLs encountered.
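// Load the Crawler class (the path is relative to the calling script).
var Crawler = require("../lib/Crawler")
  , seed = "http://nodejs.org"  // URL to start crawling from
  , limit = 500;                // maximum number of URLs to crawl

// Create a crawler capped at `limit` URLs and start it from the seed.
var crawler = new Crawler(limit);
crawler.crawl(seed);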
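To illustrate what a URL limit means in practice, here is a minimal sketch of a breadth-first crawl that stops after a fixed number of URLs. It is not crawljs's actual implementation: the function crawlWithLimit, the getLinks callback, and the in-memory links table are all hypothetical stand-ins chosen so the example runs without network access or jsdom.

```javascript
// Hypothetical in-memory link graph standing in for real pages.
// (A real crawler would fetch each page and extract its links.)
var links = {
  "http://example.com/": ["http://example.com/a", "http://example.com/b"],
  "http://example.com/a": ["http://example.com/b", "http://example.com/c"],
  "http://example.com/b": ["http://example.com/"],
  "http://example.com/c": []
};

// Visit URLs breadth-first starting at `seed`, stopping as soon as
// `limit` URLs have been visited. `getLinks(url)` is assumed to return
// an array of outgoing links for `url` (or undefined if there are none).
function crawlWithLimit(seed, limit, getLinks) {
  var queue = [seed];
  var seen = Object.create(null); // URLs already queued, to avoid repeats
  var visited = [];
  seen[seed] = true;

  while (queue.length > 0 && visited.length < limit) {
    var url = queue.shift();
    visited.push(url);

    var out = getLinks(url) || [];
    for (var i = 0; i < out.length; i++) {
      if (!seen[out[i]]) {
        seen[out[i]] = true;
        queue.push(out[i]);
      }
    }
  }
  return visited;
}

var visited = crawlWithLimit("http://example.com/", 3, function (url) {
  return links[url];
});
console.log(visited);
```

With a limit of 3, this sketch visits the seed, then /a and /b, and never reaches /c. Tracking queued URLs in `seen` keeps pages that link back to each other from being counted twice against the limit.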