Data Manipulation 101

Posted by Jad on July 30, 2007

One of the application’s core functionalities is monitoring changes on different sources. Some of which have some kind of webservices available while other don’t. By monitoring I mean getting, validating, storing and comparing the data over time. For that, different data manipulations are required:

  • fetching
  • scraping
  • parsing
  • cleansing
  • storing
  • mining
I will share in this serie of posts different resources, code snippets, benchmarks and techniques related to one or more of the above. Maybe if I get enough time, I can cover them all and group them in one easy-to-understand chapter: data manipulation 101.

The sandbox environement I am using:

And now, on with the show. Next up, Data Scraping.

Trackbacks

Use this link to trackback from your own site.

Comments

Leave a response

Comments