
Conversation


@rosle commented Apr 3, 2020

What happened

✅ Store the keyword search result

  • Total results
  • Non-AdWords Link
  • AdWords Link

✅ Display the result on the keyword page

Insight

Add a new links table

User -(has many)-> Keywords -(has many)-> Links

A Link stores the result URL, whether that URL is an AdWords link or not, and where it appears on the page (top or bottom). A rough sketch of the schema is below.
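
As an illustration only - this is not the PR's actual code, and the field names and module paths are assumptions - the links schema could look something like:

```elixir
# Illustrative sketch of the proposed links table, assuming Ecto.
# Field names and module paths are assumptions, not the PR's code.
defmodule GoogleCrawler.Google.Link do
  use Ecto.Schema

  schema "links" do
    # The result URL scraped from the search page
    field :url, :string
    # Whether this URL is an AdWords result
    field :is_adwords, :boolean, default: false
    # Where the link appears on the page: "top" or "bottom"
    field :position, :string

    # User -(has many)-> Keywords -(has many)-> Links
    belongs_to :keyword, GoogleCrawler.Keyword

    timestamps()
  end
end
```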

Proof of Work

[Screenshot: Screen Shot 2563-04-16 at 12 07 03]

```elixir
end

defmodule GoogleCrawler.Google.Scrapper do
  alias GoogleCrawler.Google.ScrapperResult
```

@olivierobert commented

Where is ScrapperResult defined?

@rosle (Owner Author) replied

@olivierobert In this file above here 😆 ⬆️

@olivierobert replied

Oh I see. Not sure about having more than one module per file though 🤔

@rosle (Owner Author) replied

Agree 👍 I think it's better to separate it 🤔
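
For reference, a one-module-per-file split could look like this (a rough sketch; the file paths and struct fields are assumptions, not the PR's actual code):

```elixir
# lib/google_crawler/google/scrapper_result.ex  (path is an assumption)
defmodule GoogleCrawler.Google.ScrapperResult do
  # Struct fields are illustrative only
  defstruct total_results: 0, links: []
end

# lib/google_crawler/google/scrapper.ex  (path is an assumption)
defmodule GoogleCrawler.Google.Scrapper do
  alias GoogleCrawler.Google.ScrapperResult

  # ... scraping functions build and return a %ScrapperResult{} ...
end
```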

@rosle force-pushed the feature/scrap-content branch from 236cd35 to 007238c on April 14, 2020 11:17
```diff
@@ -1,12 +1,74 @@
 defmodule GoogleCrawler.Google.Scrapper do
```
@olivierobert commented Apr 15, 2020

I wondered from the beginning why you added two p's to "scrapper". Only today did I check whether you were right about it, as I was making some improvements to my implementation as well 🙈


Well, according to Google, "scrapper" is probably not what you are looking for:

[Screenshot: Google search results for "scrapper"]


While scraping/scraper is:

[Screenshot: Google search results for "scraping"/"scraper"]

@rosle (Owner Author) replied

Wait, which one is correct 😂?? So it's scraper, right? Now I'm confused 😲

@olivierobert replied Apr 15, 2020

Yes, scraper (the one below). A scrapper is a tool to scrap things off, not related to web scraping.

@rosle (Owner Author) replied

I'll fix the name 😂 👍

@rosle force-pushed the feature/scrap-content branch 2 times, most recently from 2b145dc to 26dbb21 on April 15, 2020 08:34
@rosle force-pushed the feature/scrap-content branch from 26dbb21 to 4690bfb on April 15, 2020 08:36
@rosle force-pushed the feature/scrap-content branch from 9119f50 to d5d2274 on April 15, 2020 11:43
```diff
@@ -1,4 +1,9 @@
 defmodule GoogleCrawler.SearchKeywordWorker do
```
@rosle (Owner Author) commented

I am really not confident about this GenServer 😂

I think it's because:

  1. The state is quite complex, I think 🤔 💭 - it now stores a map of %{task_ref => {%Keyword{}, retry_count}}, so I don't know whether it is hard to read or not.

  2. I am not sure about the error handling part. There are 2 cases now:

  • When the task fails -> this is handled by handle_info({:DOWN, ...}, ...).
  • When something goes wrong while inserting the record -> this also needs to retry, so I extracted the retry logic out into a new function. It looks a bit weird.

The main problem is that I'm not sure whether it should be like this or not - is it the right way to do it? I probably need to learn more about GenServer 😵 (A rough sketch of the shape I mean is below.)
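
As an illustration only, here is a rough sketch of the pattern described above - not the PR's actual code; the supervisor name, @max_retries, the client API, and the task body are all assumptions:

```elixir
# Rough sketch, not the PR's code: a worker whose state maps each task's
# monitor ref to the keyword being searched and its retry count.
defmodule GoogleCrawler.SearchKeywordWorker do
  use GenServer

  @max_retries 3

  def start_link(opts), do: GenServer.start_link(__MODULE__, %{}, opts)

  def search(pid, keyword), do: GenServer.cast(pid, {:search, keyword})

  # State shape: %{task_ref => {%Keyword{}, retry_count}}
  def init(state), do: {:ok, state}

  def handle_cast({:search, keyword}, state) do
    {:noreply, start_task(keyword, 0, state)}
  end

  # The task crashed (or the insert raised inside it): retry up to @max_retries.
  def handle_info({:DOWN, ref, :process, _pid, _reason}, state) do
    {{keyword, retry_count}, state} = Map.pop(state, ref)
    {:noreply, maybe_retry(keyword, retry_count, state)}
  end

  # The task succeeded; demonitor with :flush so no :DOWN message arrives.
  def handle_info({ref, _result}, state) when is_reference(ref) do
    Process.demonitor(ref, [:flush])
    {:noreply, Map.delete(state, ref)}
  end

  defp start_task(keyword, retry_count, state) do
    task =
      Task.Supervisor.async_nolink(GoogleCrawler.TaskSupervisor, fn ->
        # Scrape the search result and insert the records here;
        # raising produces the :DOWN message handled above.
        keyword
      end)

    Map.put(state, task.ref, {keyword, retry_count})
  end

  defp maybe_retry(keyword, retry_count, state) when retry_count < @max_retries do
    start_task(keyword, retry_count + 1, state)
  end

  defp maybe_retry(_keyword, _retry_count, state), do: state
end
```

The monitor ref returned by Task.Supervisor.async_nolink/2 doubles as the map key, so a :DOWN message can be traced back to the keyword and its retry count.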

@rosle marked this pull request as ready for review on April 16, 2020 05:35