Archive for the ‘Web Mining’ Category

Avinash Kaushik – Things about Privacy Laws and Implications on Data Collection and Analysis

Privacy is a very important issue when it comes to digital. The way data is collected online and what happens to it is a much-scrutinized issue (and rightly so).

Digital data collection is also exceedingly complex, perhaps a reflection of the organic nature, and subsequent explosion, of the internet. Even sophisticated users find it difficult to know everything, so one can hardly expect ordinary digital users to know what's really happening.

For example, people are often shocked to hear that even when a site has no web analytics or advertising analytics tool installed, their behavior on the site is automatically logged in the web server's log files. Information such as the IP address, the page requested, timestamps, browser IDs and more is stored. These server logs can then be used for basic reporting with off-the-shelf software.
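To make the server-log point concrete, here is a minimal sketch of the kind of basic reporting a log file allows. It assumes the logs use the widely used Combined Log Format written by Apache and nginx; the sample lines, the IP addresses and the helper names parse_line and page_counts are made up for illustration and do not come from the original post.

```python
import re
from collections import Counter

# Assumed layout: Combined Log Format
# ip identity user [timestamp] "METHOD path PROTOCOL" status size "referrer" "user agent"
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referrer>[^"]*)" "(?P<user_agent>[^"]*)"'
)


def parse_line(line):
    """Return the logged fields as a dict, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def page_counts(lines):
    """Count requests per page path, skipping lines that cannot be parsed."""
    counts = Counter()
    for line in lines:
        entry = parse_line(line)
        if entry is not None:
            counts[entry["path"]] += 1
    return counts


# Hypothetical sample data (documentation-range IP addresses).
SAMPLE_LINES = [
    '203.0.113.7 - - [10/Jul/2012:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 2326 "-" "Mozilla/5.0"',
    '198.51.100.23 - - [10/Jul/2012:13:56:02 +0000] "GET /about.html HTTP/1.1" 200 1480 "http://example.com/" "Mozilla/5.0"',
    '203.0.113.7 - - [10/Jul/2012:13:57:11 +0000] "GET /index.html HTTP/1.1" 304 0 "-" "Mozilla/5.0"',
]

for path, hits in page_counts(SAMPLE_LINES).most_common():
    print(path, hits)
```

Even this small script recovers who requested which page, when, and with which browser, with no analytics tag on the site at all.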

See more about this point at: Occam’s Razor

Categories: Big Data, Web Mining

Data Jujitsu: The art of turning data into product

“There’s a method to solving data problems that avoids the big, heavyweight solution, and instead concentrates on building something quickly and iterating. Smart data scientists don’t just solve big, hard problems; they also have an instinct for making big problems small.” From Peter Skomoroch – Principal Data Scientist at LinkedIn.

Download the free report at:

http://radar.oreilly.com/2012/07/data-jujitsu.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+oreilly%2Fradar%2Fatom+%28O%27Reilly+Radar%29