chital


Scrape web pages for groups of articles
Example of usage
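var chital = require( 'chital' );

// Describe the page to scrape and where each article field is found
var scraper = chital({
    url : "http://news.ycombinator.com/news",
    type : "text/html",
    selectors : {
        // Selector matching each article entry in the list
        list : "td:not([align]).title",
        // Fields to extract from each matched entry
        article : {
            url : {
                selector : "a",
                attr : "href"
            },
            src : "span",
            title : "a"
        }
    }
});

scraper.execute(function( err, items ) {
    if ( err ) {
        return console.log( err );
    }
    console.log( items );
});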
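The article fields in the configuration above come in two forms: a bare selector string (src, title) and an object with selector and attr (url). The sketch below shows one way to read those two forms uniformly. It is not part of chital: normalizeField and normalizeArticle are hypothetical helpers, and it assumes that a bare string means "take the element's text content", which the library may or may not do internally.

```javascript
// Hypothetical helper, not part of chital: expands a field spec into an
// explicit { selector, attr } pair.
function normalizeField( spec ) {
    if ( typeof spec === 'string' ) {
        // Assumption: a bare string selects the element's text content,
        // represented here as attr : null.
        return { selector : spec, attr : null };
    }
    return { selector : spec.selector, attr : spec.attr || null };
}

// Hypothetical helper: normalizes every field of an article spec.
function normalizeArticle( article ) {
    var normalized = {};
    Object.keys( article ).forEach(function( field ) {
        normalized[ field ] = normalizeField( article[ field ] );
    });
    return normalized;
}

// Same article spec as in the usage example
var article = {
    url : { selector : "a", attr : "href" },
    src : "span",
    title : "a"
};

console.log( normalizeArticle( article ) );
```

Read this way, url would be taken from the href attribute of the a element, while src and title would be taken from the text of the span and a elements.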
Sample configurations for several websites are in test/files/sitesToScrape.json.
Disclaimer
This library is a work in progress and under active development. It is currently in alpha and provides only very basic functionality. If you have any questions, feel free to open an issue on this repository.
License
Copyright (c) 2014 António Nuno Monteiro
This work is distributed under the MIT license. For more information refer to the LICENSE file.