Visualizing International Migration

The Wittgenstein Centre for Demography and Global Human Capital has created an interactive visualization of major migration patterns occurring from 1990-2010. The team behind the visualization used statistical missing data methods to approximate the true number of migrants, which is often difficult to identify due to illegal migration or varying recording and enforcement standards from country to … Continue reading Visualizing International Migration

Facebook Introduces Data Tool to Better Understand Users

Facebook announced a new service for companies that will allow them to better understand their customers. The service lets a marketer access anonymized user data to determine what they are saying about brands, activities, subjects, and events. Facebook expects this data will allow companies to better understand the needs of their consumers and therefore develop … Continue reading Facebook Introduces Data Tool to Better Understand Users

Gender equality report: an example of how big data can address big problems

A new report from the Bill and Melinda Gates Foundation and the Bill, Hillary, and Chelsea Clinton Foundation cites data collection and analysis as a valuable tool in solving problems and measuring progress in the fight against gender inequality. The report, called “No Ceilings”, uses 850,000 data points on gender issues to identify progress and … Continue reading Gender equality report: an example of how big data can address big problems

Statistical controls tell us how the gender pay gap works, not that it isn’t real

Patricia Arquette’s speech about the gender pay gap during Sunday night’s Oscars has the subject back on the public agenda, and with it the controversy over whether it’s really true that women earn 23 percent less than men. Max Ehrenfreund at the Washington Post writes that “some of that gap is due to women’s own … Continue reading Statistical controls tell us how the gender pay gap works, not that it isn’t real

Prof. Yann LeCun on His Quest to Unleash Deep Learning and Make Machines Smarter

Few people have been more closely associated with Deep Learning than Yann LeCun, 54. Working as a Bell Labs researcher during the late 1980s, LeCun developed the convolutional network technique and showed how it could be used to significantly improve handwriting recognition; many of the checks written in the United States are now processed with his … Continue reading Prof. Yann LeCun on His Quest to Unleash Deep Learning and Make Machines Smarter

US Department of Agriculture cultivates open data

The burgeoning open data movement has taken hold in federal agencies as well as state and local governments. Open data increases citizens’ confidence in government and fosters innovation and economic growth. Additionally, open data can improve agency operations as data is available in a central location. In the last year, the Department of Agriculture has … Continue reading US Department of Agriculture cultivates open data

Top Mistakes Developers Make When Using Python for Big Data Analytics

Interesting article by Karolina Alexiou. Regarding mistake #1, I disagree. I do it all the time, and it’s faster than finding, understanding, and fine-tuning a piece of code that will work for you, unless you are looking for something basic such as computing correlations for weighted observations. If you are good, your reinvented wheel will … Continue reading Top Mistakes Developers Make When Using Python for Big Data Analytics