E-mail Extractor

Last Update
9 July 2013
Regular License
$9
Extended License
$45
Sales
41

E-Mail Extractor is a web spider (PHP script) which crawls through the web and extracts e-mail addresses from webpages. You just have to enter start webpage URL, select crawling mode, set maximum number of URLs to crawl and launch a spider. It will walk around the web and collect e-mail addresses.

Features

Start webpage URL: spider starts working with this page.

2 crawling modes: same domain URLs only and all URLs.

Maximum number of URLs: spider can crawl limited and unlimited number of URLs.

Crawling statistics: list of URLs, e-mails and error log.

AJAX-ed interface: modern jQuery-driven interface.

Clean code: clean PHP and JavaScript code can be used for study purposes.

CURL and fsockopen supported: spider can work through either fsockopen or CURL.

Easy terminarion and resumption: close browser to stop the spider, open resumption URL to continue crawling.

Easy to install: edit inc/config.php.

Limitations

Spider doesn’t extract e-mails from images. Spider doesn’t extract e-mails from password protected area of websites. Some websites may block spiders. Make sure that using spiders is not against of your hosting provider TOS/TAC.

Requirements

PHP version 5.0 or greater MySQL version 5.0 or greater

Installation Let’s imagine that you have website http://www.website.com/ and you want to install script there.

Create folder email-extractor (use any other name) in root of your domain. Once created it can be reached by URL: http://www.website.com/email-extractor/

Make sure that folder email-extractor has permissions 0755; index.php and ajax.php have permissions 0644. Edit inc/config.php and set MySQL database parameters.