MapR is proud to be a Platinum Sponsor of the 9th Annual Hadoop Summit, a leading conference for the Apache Hadoop community. This 3-day event features many of the Apache Hadoop thought leaders who will showcase successful Hadoop use cases, share development and administration tips and tricks, and educate organizations about how best to leverage Apache Hadoop as a key component in their enterprise data architecture.

Talks

The computing world seems lately to be all a quiver about the novelty deep learning models and how they seem so mysterious.

In fact, the basic ideas behind these systems are very closely related to commonly known algorithms like k-means clustering.

In this session, Ted will present a simple example of an anomaly detector built using k-means clustering and show how it provides a insight into how much more advanced models such as neural networks and recurrent networks.

The growing adoption of Spark SQL and Apache Drill presents an interesting conundrum- who should use which tool for what and when? Can they potentially be used together? This session will compare and contrast these technologies across a variety of parameters – architectures, ease of use, data types and data sources supported. We will demo a rich set of queries on some very interesting datasets to bring the representative use cases to life for both Spark SQL and Apache Drill.

Real-time requirements are becoming increasingly common for new applications. Developers and architects need a reliable way to move data streams fast from the point of creation across different systems in order to respond to events as they happen. But a successful architecture requires more than just the use of an event-based system; a sustainable solution needs to incorporate best practices to ensure its viability over the long-term.

Application developers and architects need to have foresight; they must anticipate common pitfalls and roadblocks from the start. Getting a real-time system right requires planning and provisioning for:

The Internet of Things is rapidly evolving in several directions, for several use cases, and on several platforms and protocols. Like the Betamax-VHS war and many others, it's not clear what the best way to build your IoT architecture today while the dust hasn't yet settled. Regardless of the underlying framework or use case, there are a few common components and criteria that must be satisfied for any IoT solution. The primary purpose of this presentation is to identify what are the lowest common denominator must-have functionalities in your solution. Open source software is becoming more and more mainstream as a de facto technology choice in all levels of organization. Contributing to and consuming open source software is especially important in cutting-edge technologies such as IoT.

The Stream is the Database - Revolutionizing Healthcare Data Architecture

Building a robust, responsive, secure data service for healthcare is tricky. For starters, healthcare data lends itself to multiple models: Document representation for patient profile view or update Graph representation to query relationships between patients, providers, and medications Search representation for advanced lookups Keeping these different systems up to date requires an architecture that can synchronize them in real time as data is updated. Furthermore, meeting audit requirements in Healthcare requires the ability to apply granular cross-datacenter replication policies to data and be able to provide detailed lineage information for each record. In this talk Will Ochandarena will describe how stream-first architectures can solve these challenges, and Brad Anderson will talk about how this has been implemented at Liaison Technologies.

Speakers

Ted Dunning

Ted Dunning is Chief Application Architect at MapR Technologies and committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects​. Ted has been very active in mentoring new Apache projects and is currently serving as vice president of incubation for the Apache Software Foundation​.​ Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems. He built fraud detection systems for ID Analytics (later purchased by LifeLock) and he has 24 patents issued to date and a dozen pending. Ted has a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting..

Sridhar Reddy

Sridhar is a Director of Professional Services for MapR Technologies. Sridhar has over 20 years of experience working with Java and JaveEE in many roles of the software development life cycle, including design, development, management, training and technology evangelism. Prior to MapR, Sridhar managed the Java Platform development team at PayPal, where he led a team of Java developers to build the next generation of the Java platform. Prior to PayPal, Sridhar worked as a Technology Evangelist at Sun Microsystems for over 10 years, where he increased awareness and adoption of Java technology in the worldwide developer community. While at Sun, Sridhar also managed the JavaOne Hands-On Labs as well as Sun Tech Days, a worldwide developer conference. Sridhar holds a BS in Mechanical Engineering from Osmania University in India, and an MS in Computer Science from the Florida Institute of Technology.

Will Ochandarena

Will Ochandarena is Director of Product Management at MapR, responsible for user experience and cloud. Prior to MapR, Will spent some time in the SeaMicro group at AMD, responsible for networking and cloud strategy, and before that was a product manager for the Nexus family of data center switches at Cisco. Will has an engineering degree from Rensselaer Polytechnic Institute, and an MBA from Santa Clara University.