554512 Scrapers for two websites

Completado Publicado Mar 1, 2012 Pagado a la entrega
Completado Pagado a la entrega

Two php scripts which scrape Television Series information the two websites below:

[url removed, login to view]

fields: (title, year, imdb website, description, categories (drama, comedy, etc..), actors, url of image. (The list has 334 titles)

[url removed, login to view]

You must traverse the tree, enter each series page to extract additional information.

fields to scrape: title, wikipedia page of base series e.g. /30_rock, , official website, imdb website, actors (the list has 217 series)

Each script must output to a separate csv file.

A third php script must merge the duplicate data: if two entries have the same imdb url, the wikipedia entry must be completed with the data from the imdb scrape.

Use php and curl with appropriate agents. No user interface, the scripts must be executed on command line, separately

Entrada de datos Odd Jobs PHP SQL Extracción de datos web

Nº del proyecto: #2300463

Sobre el proyecto

1 propuesta Proyecto remoto Activo Jul 11, 2012

Adjudicado a:

topman2009

Hi, I will do a good [login to view URL] check PMB. Thanks

$75 USD en 2 días
(0 comentarios)
0.0