City Data (repost)

Completed. Posted Feb 13, 2010. Paid on delivery.

The project entails writing a program to scrape Wikipedia for 11 fields of information on approximately 9000 US cities. The data would then be formatted in a spreadsheet for my use.

Several of the fields can be found in the summary box that Wikipedia shows on the right-hand side of each city page. The other fields (the ones that start with "1 if...") involve searching the text of the Wikipedia city entry and returning a value of "1" if the text is found. For these fields, I only care whether the Wikipedia article contains the text, for example "Home Rule City". In such a case, a "1" should be entered in the spreadsheet. Note that "Home Rule City" may also appear as "Homerule city", "home rule city", "home-rule city", etc.
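As a rough illustration of the "1 if..." fields, the sketch below flags a page when any spelling variant of "Home Rule City" appears in its text. The names `HOME_RULE_RE` and `flag_if_present` are hypothetical, and returning `0` when the text is absent is an assumption (the brief only says to enter a "1" when it is found; a blank cell may be preferred).

```python
import re

# Matches "Home Rule City", "Homerule city", "home-rule city", etc.
# Between "home" and "rule" there may be spaces, hyphens, or nothing.
HOME_RULE_RE = re.compile(r"\bhome[\s-]*rule\s+city\b", re.IGNORECASE)


def flag_if_present(text, pattern=HOME_RULE_RE):
    """Return 1 if the pattern occurs anywhere in the text, else 0 (assumed)."""
    return 1 if pattern.search(text) else 0
```

The other text-search fields could presumably reuse `flag_if_present` with a different compiled pattern for each phrase.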

I don't care about the program/script. I just care about the finished product, which would be the completed table.

## Deliverables

The list of city names, and the state in which each city is located, is in the attached Excel file. The list of fields is also in the attached Excel file. Basically, the job is to screen scrape the Wikipedia pages of the listed cities to fill out the table.
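A minimal sketch of the bookkeeping around the scrape, under stated assumptions: it guesses that each city's article lives at the usual "City, State" title on English Wikipedia (some cities use a different title, so a real run would need to handle misses), and it writes the finished rows as CSV, which Excel can open. The function names and the column names in the usage example are hypothetical; the actual 11 fields are defined in the attached file.

```python
import csv
from urllib.parse import quote


def wikipedia_url(city, state):
    """Build the likely article URL, assuming the 'City, State' title convention."""
    title = f"{city}, {state}".replace(" ", "_")
    # Keep the comma readable; other unsafe characters are percent-encoded.
    return "https://en.wikipedia.org/wiki/" + quote(title, safe=",")


def write_table(rows, fieldnames, out):
    """Write one dict per city to an open text file (or any file-like object)."""
    writer = csv.DictWriter(out, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
```

For example, `wikipedia_url("Springfield", "Illinois")` yields `https://en.wikipedia.org/wiki/Springfield,_Illinois`, and `write_table` can be called with a file opened via `open("cities.csv", "w", newline="")`.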

Data Entry, Engineering, MySQL, PHP, Project Management, Software Architecture, Software Testing

Project ID: #3176211

About the project

2 proposals. Remote project. Active Feb 13, 2010.

Awarded to:

rxhector2k5

See private message.

$85 USD in 14 days
(63 reviews)
4.8

2 freelancers are bidding an average of $77 for this job

RockStone435

See private message.

$68 USD in 14 days
(352 reviews)
7.9