11-26-2012, 02:30 PM
Okay so I needed to acquire contacts for a specific company type so I created this little program to download webpages from a list of urls and extract all of the email contacts from the pages and save to a text file "C:\Extracted-Emails.txt".
How do you get the url list?
You can use hrefer (shared here on this forum already) to get laser targeted urls to contact pages (likely to have the email that I'm looking for). If you have like 10k urls it will take a few hours to go through the list, depending on your internet connection.
Once the program is done running you will typically have a lot of duplicates (when emails are embedded into html its in the page twice and this parses for all of the emails, duplicate or not) so just use any "duplicate line remover" to clean up.
Download
[hide]http://www16.zippyshare.com/v/13656175/file.html[/hide]
Virus Total
[hide]https://www.virustotal.com/file/8633a598a3ea5e9268beb1e78166b772e123bec99bacdcc9ec96fb6fe6fd0680/analysis/1353903681/[/spoiler]
How do you get the url list?
You can use hrefer (shared here on this forum already) to get laser targeted urls to contact pages (likely to have the email that I'm looking for). If you have like 10k urls it will take a few hours to go through the list, depending on your internet connection.
Once the program is done running you will typically have a lot of duplicates (when emails are embedded into html its in the page twice and this parses for all of the emails, duplicate or not) so just use any "duplicate line remover" to clean up.
Download
[hide]http://www16.zippyshare.com/v/13656175/file.html[/hide]
Virus Total
[hide]https://www.virustotal.com/file/8633a598a3ea5e9268beb1e78166b772e123bec99bacdcc9ec96fb6fe6fd0680/analysis/1353903681/[/spoiler]