Now is the time to begin thinking of Data Science as a profession not a job, as a corporate culture not a corporate agenda, as a strategy not a stratagem, as a core competency not a course, and as a way of doing things not a thing to do.

Key ideas from a podcast with Deep Learning gurus Geoff Hinton, Yoshua Bengio, and Yann LeCun, where they explain the power of distributed representation and also propose a new open paper review process.

Michael Stonebraker, a database pioneer and a serial entrepreneur, won the 2014 ACM Turing Award (which carries $1 million prize) for fundamental contributions to the concepts and practices in modern database systems.

Successful analytics in the big data era does not start with data and software, but with hands-on, immersive training and goal-driven strategy - get it from The Modeling Agency in Washington DC (May), Toronto (Aug)

Analytics 2015 will go beyond the typical “buzz” about Big Data and the cloud, providing unique opportunities to learn about potential analytics applications to the Internet of Things, as well as practical implementations of cognitive computing, unstructured data analytics, and real-time decisions based on streaming data.

Word vectors in NLP, Machine Learning's place in programming, hardware for deep learning, Machine Learning interviews, and neural graphics engines are all topics covered this week on /r/MachineLearning.

The challenge will award a cash prize to developers that write the most interesting demo, application or show case utilizing the S4 capabilities for text analytics, linked data and knowledge graphs. Submissions due Mar 31.

Ontotext will show how news and media publishers can use semantic publishing technology to more efficiently generate content while increasing audience engagement through personalization and recommendations.

We discuss the future of distributed storage for enterprise, Scale-up vs. Scale-out, software design patterns in Cloud era, microservices model and the place for legacy database in modern enterprise IT.

White House report examines how companies are using big data and analytics to charge different prices to different customers (price discrimination), looks at both benefits and risks, and concludes that many concerns can be addressed by existing anti-discrimination and consumer protection laws.

Due to the big success of the first run, this 6 week online course is repeated on Coursera, starting April 1. This free course provides data science knowledge that can be applied directly to analyze and improve processes in a variety of domains.

Autoencoders are an extremely exciting new approach to unsupervised learning and for many machine learning tasks they have already surpassed the decades of progress made by researchers handpicking features.

Machine learning packages for Python, Java, Big Data, Lua/JS/Clojure, Scala, C/C++, CV/NLP, and R/Julia are represented using a cute but ill-fitting metaphor of a periodic table. We extract the useful links.

This week on /r/MachineLearning, we have a new NLP-focused deep learning course from Stanford, an introduction to scikit-learn, visualization of music collections, an implementation of DeepMind, and NLP using deep learning and Torch.

In statistical modeling, there are various algorithms to build a classifier, and each algorithm makes a different set of assumptions about the data. For Big Data, it pays off to analyze the data upfront and then design the modeling pipeline accordingly.

A free trial of Trifacta is a good opportunity for data analysts to start wrangle the different shapes and sizes of data sets. We give an example of wrangling Bay Area Bike Share data to better understand biking around San Francisco.

This new online Graduate Certificate covers the entire life cycle of data and analytics-supportive decision-making and is built around the framework from the Institute for Operations Research and Management Sciences (INFORMS).

Automating Tinder with Eigenfaces, the elephant in the room of Machine Learning, the Jürgen Schmidhuber AMA, and Shazam's music recognition algorithm make up the top posts in the last month on /r/MachineLearning.

Jeff Leek book "Elements of Data Analytic Style" had a rocket launch, thanks to author course on Coursera. The book includes a useful checklist that can guide beginning data analysts or serve for evaluating data analyses.

CDO is vital in bridging the gap between the C-suite and the data team. The Chief Data Officer Summit in San Jose on April 28-29 will bring together top data leaders to discuss the growing responsibilities of the CDO. Special KDnuggets discount.