Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scrape URLs then get to there. #52

Open
mizzunet opened this issue Jun 6, 2022 · 1 comment
Open

Scrape URLs then get to there. #52

mizzunet opened this issue Jun 6, 2022 · 1 comment

Comments

@mizzunet
Copy link

mizzunet commented Jun 6, 2022

I'm looking for,

  • Parses URLs
  • Visit to each parsed URLs
  • Parse data from visited page.

For example,

  • Get books URLs from Goodreads
  • Visit those places
  • Get books' data from the visited pages.

This is possible with colly, I wonder if it's possible with geziyor.

@glaslos
Copy link
Contributor

glaslos commented Dec 23, 2022

You can next calls with parsers like this:

g.Get("goodreads/allbooks", func(g *geziyor.Geziyor, r *client.Response) {
	// parse page for book links
	g.Get("booklink", func(g *geziyor.Geziyor, r *client.Response) {
		// parse book page
	})
})

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants