Create a scraper / script / crawler to extract product data from an online shop - go through all products - export to CSV or Excel

In progress, posted Oct 29, 2013, paid on delivery

Dear freelancers,

We need an efficient web scraper that we can run on one of our own servers. WE ARE LOOKING FOR AN EXPERIENCED DEVELOPER - work must be flawless!

The following should be done:

The website to scrape/crawl is: [login to view URL]

--> It is an online shop with almost 80k products. The scraper should start with these main top-level categories:

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

It should then scrape EVERY SINGLE product within these categories (as said, around 80,000 items).
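A minimal sketch of the category walk described above. Python is used purely for illustration, and every name here is hypothetical: `crawl_categories` is the traversal, and `list_page` stands in for whatever code downloads one listing page and returns its product links plus any further listing pages (subcategories, pagination). The shop's real page structure is not known here, so a small in-memory `FAKE_SITE` replaces HTTP requests.

```python
from collections import deque
from typing import Callable, Iterable, List, Set, Tuple

# Hypothetical callable: listing URL -> (product URLs, further listing URLs)
ListPage = Callable[[str], Tuple[List[str], List[str]]]


def crawl_categories(start_urls: Iterable[str], list_page: ListPage) -> List[str]:
    """Walk listing pages breadth-first and return each product URL once."""
    queue: deque = deque()
    seen_pages: Set[str] = set()
    for url in start_urls:
        if url not in seen_pages:
            seen_pages.add(url)
            queue.append(url)

    products: List[str] = []
    seen_products: Set[str] = set()
    while queue:
        page_url = queue.popleft()
        product_urls, more_pages = list_page(page_url)
        # A product may be listed in several categories; keep the first hit only.
        for product_url in product_urls:
            if product_url not in seen_products:
                seen_products.add(product_url)
                products.append(product_url)
        # Queue subcategory / pagination pages that have not been visited yet.
        for next_url in more_pages:
            if next_url not in seen_pages:
                seen_pages.add(next_url)
                queue.append(next_url)
    return products


# Stand-in for the real shop: two categories that share products and link to each other.
FAKE_SITE = {
    "/cat-a": (["/p/1", "/p/2"], ["/cat-a?page=2"]),
    "/cat-a?page=2": (["/p/2", "/p/3"], []),
    "/cat-b": (["/p/3", "/p/4"], ["/cat-a"]),
}

found = crawl_categories(["/cat-a", "/cat-b"], FAKE_SITE.__getitem__)
print(found)
```

Tracking visited pages and products separately keeps the walk finite even when categories link back to each other, and it avoids exporting a product twice when it appears in more than one category.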

The following information should be exported for EACH product. Please use this URL to understand the items explained below: [login to view URL]

1) URL (e.g. "[login to view URL]")

2) Breadcrumbs (e.g. "Início > Masculino > Esporte Masculino > Calçados > Tenis")

3) Brand Name - located above product name (e.g. "Puma")

4) Product Name (e.g. "Tênis Puma Axis 2 Branco")

5) Image URLs --> ALL Images in product page --> USE default resolution (not zoom image) of ~275px × 400px (e.g. "[login to view URL] ; [login to view URL] ........ etc etc")

6) Current Price (e.g. "99,90")

7) Old Price - if applicable (e.g. "199,90")

8) Installment payments - if applicable (e.g. "5 x 19,98")

9) Available sizes: (e.g. "38, 39, 40, 41, 42, 43")

10) ALL Available Data in the tab "Detalhes do produto" --> Data here is:

--> A) a short text description AND

--> B) a list with multiple different entries (NOTE: products do not always have all these entries --> compare [login to view URL] versus [login to view URL]):

--> List items could be:

- Description (plain text above actual list)

- SKU (e.g. "RA870APM16PQL")

- Modelo (e.g. "POLO RALPH LAUREN 89460PRL")

- Material (e.g. "Algodão")

- Composição (e.g. "100% Algodão")

- Cor (e.g. "Preto")

- Lavagem (e.g. "Lavar a mão")

- Medidas (e.g. "Ombro: 17cm/ Manga: 23cm/ Tórax: 116cm/ Comprimento: 76cm")

- Categoria (e.g. "Premium Masculino > Roupas > Pólos > Pólo Manga Curta")

--> That is all data we need for EACH product
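The field formats listed above can be normalized with a few small helpers. This is a sketch under stated assumptions, not a definitive implementation: the function names are hypothetical, the price parser assumes the Brazilian convention of "." for thousands and "," for decimals (the posting only shows values like "99,90"), and the detail list is assumed to arrive as (label, value) pairs already extracted from the "Detalhes do produto" tab.

```python
import re
from decimal import Decimal
from typing import Dict, List, Optional, Tuple

# Detail entries named in the posting; a product may lack any of them.
DETAIL_KEYS = ["SKU", "Modelo", "Material", "Composição", "Cor",
               "Lavagem", "Medidas", "Categoria"]


def parse_price(text: str) -> Optional[Decimal]:
    """Turn "99,90" (or, by assumption, "R$ 1.299,90") into a Decimal."""
    cleaned = re.sub(r"[^\d.,]", "", text)  # drop currency symbols and spaces
    if not cleaned:
        return None  # e.g. no old price on the page
    return Decimal(cleaned.replace(".", "").replace(",", "."))


def parse_installments(text: str) -> Optional[Tuple[int, Decimal]]:
    """Turn "5 x 19,98" into (5, Decimal("19.98")); None if not offered."""
    match = re.search(r"(\d+)\s*x\s*([\d.,]+)", text, flags=re.IGNORECASE)
    if not match:
        return None
    return int(match.group(1)), parse_price(match.group(2))


def parse_sizes(text: str) -> List[str]:
    """Turn "38, 39, 40" into ["38", "39", "40"]."""
    return [size.strip() for size in text.split(",") if size.strip()]


def normalize_details(pairs: List[Tuple[str, str]]) -> Dict[str, str]:
    """Map (label, value) pairs onto fixed columns; missing entries become ""."""
    found = {label.strip().rstrip(":"): value.strip() for label, value in pairs}
    return {key: found.get(key, "") for key in DETAIL_KEYS}


print(parse_price("99,90"))
print(parse_installments("5 x 19,98"))
print(parse_sizes("38, 39, 40, 41, 42, 43"))
print(normalize_details([("Cor:", "Preto"), ("Material", "Algodão")]))
```

Mapping the detail list onto a fixed set of columns keeps every exported row the same width, so products that lack some entries still line up in the CSV.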

***NOTE*** --> We will need to run the script MULTIPLE times per week: SO: The script MUST be efficient and FAST. The data should be extracted and then saved on the server (in CSV or any other Excel-importable format). It must be possible to run the script on OUR server.
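One common way to meet the speed requirement is to fetch product pages concurrently and stream the rows straight into a CSV file. The sketch below assumes a hypothetical `fetch_product(url)` that downloads and parses one product page into a dict; the column names are illustrative only, and multi-valued fields are joined with " ; " as in the image example above. A fake fetcher and an in-memory buffer stand in for the real shop and the output file.

```python
import csv
import io
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, TextIO

# Illustrative column names; the real export would also carry the detail entries.
COLUMNS = ["url", "breadcrumbs", "brand", "name", "image_urls",
           "price", "old_price", "installments", "sizes", "description"]


def export_products(urls: Iterable[str],
                    fetch_product: Callable[[str], Dict[str, object]],
                    out: TextIO,
                    workers: int = 8) -> int:
    """Fetch products in parallel and write one CSV row per product."""
    writer = csv.DictWriter(out, fieldnames=COLUMNS, extrasaction="ignore")
    writer.writeheader()
    count = 0
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map runs fetches concurrently but yields results in input order.
        for record in pool.map(fetch_product, urls):
            row = {key: " ; ".join(value) if isinstance(value, list) else value
                   for key, value in record.items()}
            writer.writerow(row)  # columns missing from the record are left empty
            count += 1
    return count


def demo_fetch(url: str) -> Dict[str, object]:
    """Fake fetcher standing in for the real download-and-parse step."""
    return {"url": url, "brand": "Puma",
            "image_urls": ["a.jpg", "b.jpg"], "price": "99,90"}


buffer = io.StringIO()
written = export_products(["/p/1", "/p/2"], demo_fetch, buffer, workers=2)
print(buffer.getvalue())
```

When writing to disk instead of a buffer, opening the file with `open(path, "w", newline="", encoding="utf-8")` follows the `csv` module's guidance and preserves the Portuguese characters. The worker count is a tuning knob: higher values shorten the run but put more load on the shop's server.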

***NOTE*** --> We are looking for a long-term developer - we will not just need ONE script, BUT similar scripts for 10 different online shops. SO: We are looking for somebody who can then also develop the other scripts.

Please get in touch if you have any questions.

Thank you very much,

Dan

PHP, Software Architecture, Web Scraping

Project ID: #5073992

About the project

21 proposals, remote project, active Oct 30, 2013

Awarded to:

name63

Hi! I can write a scraper in Perl, Python or PHP. Perl is the fastest option. Will you run this script on Linux?

$99 USD in 3 days
(5 reviews)
2.8

21 freelancers are bidding an average of $202 for this job

mantislin

Hi sir, I am a scraping expert. I have done many similar projects; please check my feedback and you will see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi

$157 USD in 4 days
(381 reviews)
7.7

ehsankayani

Hi, I am an expert in making scripts, and most of my scripts use threading, so they are extremely fast. Are you OK with a C# script that will run on Windows?

$147 USD in 3 days
(197 reviews)
7.8

cheapexcell

Hello, I am an expert in web scraping. I have done many similar jobs. Ready to start. Thanks, Alex

$515 USD in 10 days
(281 reviews)
7.4

gangabass

Hi, I'm an expert in web scraping, which is why I'm sure you'll be impressed with my work. I'm talking about a program which you can run every day and which will collect all the info you need from the target site. The program ...

$155 USD in 3 days
(452 reviews)
7.3

uumairkhalid

Hi. Expert web scraper/data miner here. Interested in your project. I assure you 100% accurate and good-quality work. I have a lot of scraping experience. Let me know if you want a sample before awarding. Let's start. Reg ...

$315 USD in 3 days
(153 reviews)
7.0

uumarkhalid31

Hi, I have scraped e-shops and e-commerce websites many times and I am also interested in this project. Let me do this work with perfection and accuracy. Thanks

$147 USD in 3 days
(226 reviews)
7.0

tanveerjavaid

I wrote some big scrapers to scrape Craigslist while working at a USA-based software house, so I can work on this project. Thanks

$150 USD in 0 days
(49 reviews)
6.2

freelance4hire80

Hi Dan, I'm an expert in web scraping, and you can refer to my freelancer profile for similar jobs. My solution: 1. The crawler script is written in Perl. 2. The crawler writes data into a MySQL database. You can export out ...

$222 USD in 3 days
(48 reviews)
6.2

acesolution

Hello Sir, I am highly interested in this project. Please give me a chance to work for you. Please message me and I will give you demos of my scraping work. Thanks

$263 USD in 4 days
(115 reviews)
6.7

ecartsolutions

Hello Sir, the requirements are very clear in your project description. I am a PHP/MySQL expert and can help you with the data scraping work. I can provide you with a sample of the scraped data within 12 hrs. Plea ...

$250 USD in 3 days
(256 reviews)
6.4

jitendraparmar07

Hello sir, I'm very pleased to submit this proposal to you. I'm a software developer. I can easily write a bot/scraper for your purpose. I'll write a bot/scraper that scrapes data from the website and import CSV /E ...

$185 USD in 4 days
(47 reviews)
5.4

computerpsycho

Hello Sir/Madam, I have 3+ years of experience in web scraping. I have scraped more than 600 websites. I can do this for you with utmost accuracy and on time. I have the No. 1 web scraping software. My bid price is ...

$30 USD in 3 days
(6 reviews)
2.9

denisfd23

The proposal has not been provided yet

$155 USD in 3 days
(0 reviews)
0.0

bqureshi103

Hi, I will provide you a PHP script which will collect all the data you need. Let me know if you have any questions. Regards Bilal

$200 USD in 3 days
(1 review)
0.0