Hey Everyone, Just got my Lifetime VIP! Cant believe the amount of content and people's helpful input over here in VIP Section!!!
I am looking to scrape all the directory listings from
http://www.golden*p*ages.ie can anyone please recommend a scraper that can do this or how best should I approach this. Does anyone on this forum offer this as a service?
Many thanks in advance! :)
Actually Webharvey is available hete just search itm
WebHarvy
Visual Web Ripper
Hey Bouli,
I just stumbled on this site and saw your post, figured I'd make my first post a profitable one.
Here's how I scrape sites... This is on a linux machine of course
wget --wait=20 --limit-rate=20K -r -p -U Mozilla
http://www.golden%2Ap%2Aages.ie/
This will take time, but will scrape the entire site and probably not raise alarms. I scraped a 500+ site with this, but it took 20 hours. If you dont have time to wait, get on public wifi, or connect to a VPN and remove "--wait=20 --limit-rate=20K -r -p"
Have fun.
Just the other day i took 4 days downloading an entire site with 2,85Gb in size.
I used TeleportUltra, another great tool.
And it was worth it... impecable! you just need to change some adsense code, some logos, some few texts and urls and.. voilá!
you need to scrape some info from that site or you need to copy the entire site on your computer (html, site structure, links and so on)?