Web Data Extractor Pro portable is a web scraping tool specifically designed for mass-gathering of various data types. It can harvest URLs, phone and fax numbers, email addresses, as well as meta tag information and body text. Special feature of WDE Pro is custom extraction of structured data.
This high-speed and multithreaded program works by using a keyword into search engines, by spidering a website or a list of URLs from a file. You can also allow it to follow external links from the original pages, with the capability to go as deep into the URL paths as you need and actually search the entire Internet.
Web Data Extractor is superior for harvesting structured information and specific data types related to the keywords you provide by searching through multiple layers of websites.
• Completely new powerful spidering engine
• Completely reworked UI – slick & sexy
• Pro version of WDE doesn’t have any limits – feel free to process thousands of sites, gigabytes of data
• Extremely fast search and accuracy
• Extract any data you want by Custom data extraction
• Support of working with proxy servers’ list
• New session management allows you manage huge amount of data
• Brand new simplified user interface
• Unicode support
• Added checkbox “Get redirected URL” on the “Custom Data Editor” form to extract urls (e.g. website addresses) that are presented through a redirect
• Added checkbox “Mark Non-Responding Proxies Like Inactive Automatically”. If during the session proxy server determined as «bad» (not working), it is automatically marked as inactive, and it’s not used in the session
• Added new option “Use single line merge” to merge data into a single string. For example, you can export t-shirt colors like: “T-Shirt”, “Black, Yellow, Red, Green”
• Significantly improved loading of public proxy servers from the Internet
• “Human Factor” option has been improved
• Improved a parser of closed by JS email adresses
• Improved option of passing Google-captcha when searching data via Google
• We also made various minor changes and improvements based on feedbacks from our customers