Conversation
Member
puzzle-it
commented
May 22, 2012
- I've added extractor class, it's haven't been tested yet but i am thinking about possible tests, suggestions are always welcome, it's based on the same principle of Crawler class, but it's HTMLParser is completely customizable, the customization can be made passing a dictonary containing different functions.
- I've added a test case for Crawler.crawl() method, it parses an html page and give back the result of an assert, between Crawler.crawl() output and an handy inserted list
- New suggestions and ideas are always welcome
…file for crawler and added test/run_tests.py file for running all tests it needs to be fixed, tests from tests/crawler_test.py can be run using python command, added updater/extractor.py file, it contains first feature extractor sketch it haven't been debugged yet
…er class in updater/crawler.py file to check exsistence of __url_list attribute, strings are inserted in __url_list as simple strings avoiding unicode ones
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.