DataExcavator - Web Scraping Server C#DataExcavator - Web Scraping Server C#

Name: DataExcavator - Web Scraping Server C#
Brand: vappv
SKU: 21203
Availability: OutOfStock
Rating: 5.0 (2 reviews)

Web data scraper - extract any data from any website. Export data in .xls / .json / .sql / .csv formats.

Average rating of 5.0 based on 2 votes

Home / Scripts & Code / C#

DataExcavator - Web Scraping Server C#

Web data scraper - extract any data from any website. Export data in .xls / .json / .sql / .csv f...

Average rating of 5.0

× This item is temporarily not available for sale

Live Demo Add bookmark 4 likes

Screenshots

Show all 11 screenshots

Overview

Data scraper for any website

Amazon scraper, facebook scraper, aliexpress scraper

Extract any data from any website - chromium scraper, JS support scraper

Data extractor, data parser - extract data with CSS-selectors and XPath.

Data Excavator - reasons to buy

Scraper support - we can help you extract any data from any website

Amazon data scraper - how to extract and parse data from Amazon.com website

📱 About application:

Data Excavator – powerful C# server for crawling, scraping and saving any data from websites. With the data excavator, you simply can scrape any data from any website and use it for you own purposes. It's a really simple and fast solution with minimal entry point for everyone who want to mine data and don’t want to read many of tutorials. Scraping process working based on .css and x-path selectors. Application includes crawling server, grabbing server (scraping server) and IO server. Each server written in pure multi-threaded model. Do you have 8-cores processor? Good. May be, 12-cores? Very good! The data excavator is directly depended from your PC quality – he can works at powerful servers. In general, with a good hardware, you can boost the data excavator to scraping websites in “monster-mode”, and make 100, 500, 1000 scraping requests per second. Do you really want to making professional data mining? Ok, then just use the Data excavator and forget about other ways to mine data. Our solution is the really fast native server, written with pure quality and with the best specific algoritms.

Most of existing data scraping solutions from competitors works pretty linear – you must do every scraping step yourself with browser plugin. Alternatively you must to use page-to-page switching with pressing “Scrape data” magic button. Of course, there is a lot of professional data-mining solutions with high price and original quality. But there is not so many good solutions with good price and performance.

The Data Excavator can be used in most of situations when you need to extract any-typed data from any website. May be, you want to create a e-commerce project and you search for a goods data source? May be you want to build a service for prices comparing? May be you are a big data specialist and must prepare some data set for analysing? Any task in data scraping that you can imagine you can solve with the Data Excavator application.

For example, take a look at how well our program manages to extract data from the Aliexpress website. We simply take any page and sequentially extract all data from it. You don't need any settings - we have a ready-made configuration.

Scrape data from AliExpress, aliexpress scraper

What are the key differences between our application and others? We offer a complete scraping server. It literally does everything you need to extract data, from multiple settings and automatic .css selectors, to exporting data on the fly. Based on our application, you can create large systems for automatic data scraping and analysis. Our application includes many comments on the source codes. You won't have any trouble understanding the interface structure and calls to system libraries. Our main pride is multithreaded scraping. We have made the application parallel in everything that was possible. You can create multiple projects and extract data from multiple sites simultaneously. Each project has its own thread pool (oh yes!) which can be increased or decreased. Each project has a separate thread pool for scanning pages, and a separate thread pool for parsing downloaded pages.

Our application is based on the Chromium Embedded Framework (CEF) - that is, it has a full-fledged Chromium browser built into it. This allows you to extract data from any site, even those where content is not immediately downloaded or requires a login. This fundamentally distinguishes us from our competitors - our application is suitable for scraping almost any site.

The application uses the ExcavatorSharp library (also our library). It is distributed using the subscription model. The application already has a special key for Codester built in, which does NOT require an additional subscription for several years. The library is well documented, and allows you to build your own scraper based on it.

💻 How it works

Our application is written in C#. Yes, it's a full C# (.NET) scraping server. We used a multi-threaded model in order to extract data from any site as fast as possible. Our application supports authorization and interaction with sites via JS. We try to make the interface simple, under which there is a fairly powerful engine.

Data Excavator - scraping server

🌎 What tasks you can solve

1. Scrape any data from any e-commerce websites, like: amazon, ebay, aliexpress, walmart and many other.
2. Scrape any data from any social network: facebook, twitter, instagram, linked in,
3. Scrape any data from any cryptocurrency exchange website
4. Scrape any data from any supplier website
5. Export of scraped data: .xlsx / .xls / .json /.csv and other

💸 Price policy

As you know, now the basic cost of using any competitive online scraping service is from $ 50 per month (price analysis at the beginning of 2020). The market also offers products that have a one-time price of about $ 150 - $200. Our own subscription model (not via codester) is about $150 per year. Buying the source codes of the application through CODESTER, you get the exclusive price of 150$ with the validity of the built-in key in the range of 2 years. In fact, you get the application at $75 per year, or $6.25 per month. We just enter the market and recruit the first audience for our service. That's why we offer these "drop" prices. Perhaps in half a year or a year we will revise them upwards. No, really, it's almost impossible to find something that works good at $6.25 a month. Yeah, that's a crazy price. But in modern conditions, only this approach allows us to compete with the giants of this market.

🖼 Special: working with pictures and BLOB data

Our system is able to work with images and other binary files. You can extract literally any information from the target page - images, media files, binary data and so on. Even if the image is packaged in the data:[blob] format, the system will correctly process it. All images are stored in files on your hard drive. When exporting, we collect the archive, which contains the exported data, as well as a set of images.

📁 Export of results

Once you have collected data from some site, you can export it. We support export in xlsx, csv, json, mysql formats. We write text data into a file and place images from the site in a folder next to the file. These images are linked to the data via the "images" column in the table, or via the corresponding parameter in the JSON object (depending on the export format you choose).

Scraping data from aliexpress.com

Aliexpress.com extract data - scraper

🛠 App modules and libraries

Our application is written in C#, platform .NET Framework. It includes the following modules and libraries:

1. CEF (Chromium Embedded Framework)
2. CEFSharp - connector between C# and CEF
3. EPPlus - working with Excel
4. RestSharp - working with remote calls ($_GET / $_POST)
5. ExcavatorSharp - library for parallel crawling and scraping
6. HtmlAgilityPack - parsing data from DOM
7. Newtonsoft.JSON - packing data into JSON format
8. log4net - data logging

💡 What scraping tasks can I solve with the application?

Scrape any website online with C# - amazon, Walmart, eBay, google, craigslist, aliexpress

With our C# scraper you can extract data from most well-known sites. Basically, it doesn't matter what the site looks like or how it displays the data. Even if a site requires a login and password, or displays dynamic content with a delay - we can still extract data from its pages. You can scrape data, for example, from the following websites:

Amazon.com
Walmart.com
Aliexpress.com
Ebay.com
Google.com
Craigslist.org
Sears.com
Kroger.com
Costco.com
Google.com
Bing.com
Wikipedia.org
Nytimes.com
Nypost.com
Washingtonpost.com
Wsj.com
Hr.com
Iherb.com
And much more!

At your disposal is a ready-made library of standard projects. No need to deal with anything - just use the ready-made settings from the list!

Ready-made scraping templates

⚠ Warnings:

When you buy our application, you must understand what it can and cannot do. The application is intended for data scraping, i.e. - you define the elements from which the application should get data, for example - headers, images, descriptions of something, etc. The application will scan pages and extract this data in strict accordance.

Please note that this is not a magic bullet that will automatically google, find the sites you want and extract data from them without your participation.

As a minimum of knowledge you should understand how .css-selectors or xpath work. You should also be familiar with general web data extraction skills such as proxying, $_GET and $_POST queries, page scanning management through templates and regular expressions.

Also, if you want to extract data to fill your site, you must understand that the system scans the data and then exports it to some format, or sends it via some http(s) link. The system does not know how to automatically insert data into your site.

☎ Additional options:

Fully free support! We are literally just entering the market and recruiting an audience for our solution. We went crazy and laid out the source codes for our application. If you want to build a solution for data scraping based on our experience, we will be glad to advise you!

Features

Pure multi-threadeded scraping (you can scrape many different websites in parallel)
Multithreaded crawling - get data from website in parallel mode
Browser-engine crawling - parse data from downloaded pages in parallel mode
Support for multiple proxy servers
$_GET and $_POST user args - download pages with set of args
Dynamic content crawling - get content created with JS, ActiveX and other. Wait for AJAX calls
Interaction of user JS-code with pages of the site
Robots.txt ans Sitemaps support
Pages reindexing support
User-defined crawling behaviors
Respect or disrespect for selected links
Analysis of robots.txt under the selected user agent
Multi-dimensional data extracting
Multithreaded data extraction
Exporting data: .xls, .xlsx, .csv, .sql, .json
Exporting data online via HTTP(S) url
Overview grabbed data into UI
Import&Export projects settings
Project settings testing on specified page
Grab only links from specified page (if you want)
Project performance metrics board
Forcing specified links reindexing
Grabbing website links administration panel
Projects interactive dashboard
Supports attributes downloading - blobs, images

Requirements

A. Application usage:

1. .NET Framework 4.5.2 or 4.6
2. X64 processor (because most of scraping tasks uses 1Gb of RAM as minimum)
3. Free space on HDD (1Gb+)
4. Windows 7, Windows 8, Windows 10

B. Application development:

1. IDE: VIsual Studio 2019

Instructions

❔ How it works:

1. Create new project and complete project settings (or use default settings set)
2. Specify a set of links to scraping
2. Start project
3. Wait while application will scrape specified links
4. Export data to preffered format, like a .xls / .xlsx /.csv / .json

💼 How to create new project (less then 3 minutes):

1. Click on "New project (express)"
2. Complete target website address
3. Click on "Auto detect .CSS-selectors"
4. Click on "Create new project"
DONE! System will automatically detect .CSS selectors and set all settings to default values.

🤔 Links:

Website URL: https://data-excavator.com
Common questions and UI: https://data-excavator.com/faq...
Core library information: https://data-excavator.com/exc...
Core library docs: https://data-excavator.com/exc...
Contact us: https://data-excavator.com/con...

If you are an developer, see readme file after purchasing.

Reviews

Mar 2, 2022

CINETIXCS Purchased

Rating:

Great dev, great help,great product, I can just recommend it!
Apr 28, 2020

Jesse10 Purchased

Rating:

Great product, best scraper, better than Parsehub, Data Miner, Instant Data Scraper, with outstanding support. Thanks so much!

Other items by this author

View author's shop

Free support
Future product updates
Quality checked by Codester
Lowest price guarantee

Not available

Information

Category	Scripts & Code / C#
First release	17 April 2020
Last update	11 August 2020
Files included	.cs, .dll, .xml
Software version	.NET 4.5, .NET 4.6
Tags	data, e-commerce, parser, data scraper, scraper, crawling, crawler, grabber, web scraper, scraping, Extract, parsing