Practical Data Science in Ruby

PS: This will be my last CRB talk for a long while. Thank you for the support & allowing me to geek out

<3

"Data-Driven"

Data Infrastructure

Distributed Systems

ETL

MapReduce

Data Warehousing

High Availability

PubSub (live analysis)

A means of experimentation

Can We Do It Without This?

Yes. Sorta.

Where to Start?

Web Application Datapoints

Comments/Feedback

Traffic Patterns/Analysis

Navigation Recommendation (Netflix recommended content)

The Complicated Stuff

Recommender Models

Potential

Easy

Starts

Natural Language Analysis!

These generally are very linear algebra heavy, and require modern research

This too requires a lot of modern reading. Everything from K-Nearest Neighbors to Random Forest algorithms can be used and will be mentioned

Site Traversal

NLP

A field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages. As such, NLP is related to the area of human–computer interaction.

Simple NLP In Ruby

Sentiment Analysis

Sentimental

A simple gem to get us started

https://github.com/ybur-yug/CRB_ruby_data

$ gem install sentimental

A Simple Start

Sentimental is a simple gem for analyzing the sentiment, or positive/negative/neutral inflection of a string or corpus

Magic

Q&A + Yell At Me For Having A Python Example

Practical Data Science In Ruby

Practical Data Science In Ruby

Think you need Mesos, Kafka, Data Warehousing, and crazy concurrency for data analytics? Well you sort of do, but we can do some useful things still without it, and wont even have to add a new language to our stack!