Data Science Articles & Videos

Interview with Greg Linden - Developer of Amazon Recommendation EngineAs you know, we’ve been trying to cover from every angle, the innovations that ecommerce sites in general, and Amazon.com specifically, brought to the world. That is why I was thrilled to get to speak with Greg Linden, who was one of the Amazon engineers who was responsible for a lot of the personalization and data-driven innovations at Amazon, especially the recommendation engine...

Markov Chain Monte Carlo Without all the BullshitI have a little secret: I don’t like the terminology, notation, and style of writing in statistics. I find it unnecessarily complicated. So to counter, here’s my own explanation of Markov Chain Monte Carlo...

An Experimental Bird Migration VisualizationEvery year hundreds of millions of birds migrate to and from their wintering and breeding grounds, often traveling hundreds, if not thousands of kilometers twice a year... one tool, radar, has the ability to measure the mass flow of migrants both day and night at a temporal and spatial resolution that cannot be matched by any other monitoring toolts...

Machine Learning at American Express: Benefits and RequirementsCurious to know how American Express uses machine learning successfully, in production, at very large scale? An audience of over 300 recently got a peek into this big data story thanks to a presentation by Chao Yuan, SVP at American Express who heads their Modeling and Decision Science Team for US Consumer Business...

End-to-End Training of Deep Visuomotor PoliciesPolicy search methods based on reinforcement learning and optimal control can allow robots to automatically learn a wide range of tasks. However, practical applications of policy search tend to require the policy to be supported by hand-engineered components for perception, state estimation, and low-level control. We propose a method for learning policies that map raw, low-level observations, consisting of joint angles and camera images, directly to the torques at the robot's joints...

Bug Prediction at Google
As Google's code base and teams increase in size, it becomes more unlikely that the submitter and reviewer will even be aware that they're changing a hot spot. In order to help identify these hot spots and warn developers, we looked at bug prediction. Bug prediction uses machine-learning and statistical analysis to try to guess whether a piece of code is potentially buggy or not, usually within some confidence range....

Pandas / dplyr comparison for pandas 0.16This notebook compares pandas and dplyr. The comparison is just on syntax (verbage), not performance. Whether you're an R user looking to switch to pandas (or the other way around), I hope this guide will help ease the transition...