A specialized crawler for fetching the chapters of Chinese light novels.
Node.js
- Initialize the server
- Open your console.
- Go to the reader-crawler folder.
- Download the required libraries by running
npm install
- Run the server with
npm run start
- Send the first URL to the server
- Open your browser
- Make a GET request to the URL
localhost:<port>/crawl?url=<url>&title=<title>&chapters=<chapters>
where
- <port> is the port your server is listening on (default is 8888)
- <url> is the initial URL
- <title> is a string that appears in the page title, used to tell the useful URLs apart from the unrelated ones (a good choice is the novel's title)
- <chapters> is the approximate number of chapters the novel has
- <encoding> is the encoding of the page (optional; the default is GBK)
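Because the initial URL and the title usually contain characters that are not safe inside a query string (`:`, `/`, spaces, non-ASCII text), it can help to build the request URL programmatically instead of typing it by hand. The following is a minimal sketch, not part of the project: `buildCrawlUrl` is a hypothetical helper, and the example URL and title are placeholders. It assumes the server decodes standard form-encoded query strings. The optional encoding parameter is left out because its query parameter name is not shown above.

```javascript
// Hypothetical helper (not part of reader-crawler): builds the GET URL
// for the /crawl endpoint described above.
function buildCrawlUrl({ port = 8888, url, title, chapters }) {
  // URLSearchParams percent-encodes reserved characters, so the novel's
  // URL and title can be passed as plain strings.
  const params = new URLSearchParams({
    url,
    title,
    chapters: String(chapters),
  });
  return `http://localhost:${port}/crawl?${params.toString()}`;
}

// Placeholder values, for illustration only.
const requestUrl = buildCrawlUrl({
  url: "http://example.com/book/1.html",
  title: "Example Novel",
  chapters: 120,
});

// Paste the printed URL into the browser's address bar.
console.log(requestUrl);
```

The printed URL can be opened in a browser as in the step above, or requested with any HTTP client while the server is running.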
Victor Sin Yu Chen
Center of Informatics (CIn), UFPE