This module is based on arc90's readability project.
Features
- Optimized for more websites.
- Supporting HTML5 tags(
article
,section
) and Microdata API. - Focusing on both accuracy and performance. 4x times faster than arc90's version.
- Supporting encodings such as GBK and GB2312.
- Converting relative urls to absolute for images and links automatically(Thank Guillermo Baigorria & Tom Sutton).
Example
Install
$ npm install node-readability
Note that as of our 2.0.0 release, this module only works with Node.js >= 2.0. In the meantime you are still welcome to install a release in the 1.x series(by
npm install node-readability@1
) if you use an older Node.js version.
from https://github.com/luin/readability