Apache Spark

On the data analytics front, profound change is in the air, and open source tools are leading many of the changes. Sure, you are probably familiar with some of the open source stars in this space, such as Hadoop and Apache Spark, but there is now a strong need for new tools that can holistically round out the data analytics ecosystem. Notably, many of these tools are customized to process streaming data...Streaming data analytics are needed for improved drug discovery...While Apache Spark grabs many of the headlines in the data analytics space, given billions of development dollars thrown at it by IBM and other companies, several unsung open source projects are also on the rise. Here are three emerging data analytics tools worth exploring:

The Broad Institute of MIT and Harvard will release version 4 of the industry-leading Genome Analysis Toolkit under an open source software license. The software package, designated GATK4, contains new tools and rebuilt architecture. It is available currently as an alpha preview on the Broad Institute's GATK website, with a beta release expected in mid-June. Broad engineers announced the upgrade, as well as the decision to release the tool as an open source product, at Bio-IT World today...

Apache Cassandra company DataStax is snuggling up with Databricks. The partnership is designed to deliver open source code back to the Apache Spark and Apache Cassandra communities to ensure that developers always have the best tools...

Technology is constantly moving forward—well, maybe not always forward, but always moving. Even for someone who keeps an eye on the trends and their effect on programmers, discerning exactly where things are headed can be a challenge. My clearest glimpse into open source programming trends always comes in the fall when I work with my fellow chairs, Kelsey Hightower and Scott Hanselman, and our fantastic programming committee to sculpt the coming year's OSCON (O'Reilly Open Source Convention). The proposals that we get and the number focused on specific topics turn out to be good indicators of hot trends in the open source world. What follows is an overview of the top programming trends we saw in 2016...

...as new computing architectures and approaches rapidly evolve for cloud computing, for big data, for the Internet of Things (IoT), it's also becoming evident that the open source development model is extremely powerful because of the manner in which it allows innovations from multiple sources to be recombined and remixed in powerful ways. Consider the following examples...

Healthcare technology is shaking things up faster than ever before. Whether it’s the quicker pace or technology-resistant providers, it’s crucial for leaders to stay educated and up-to-speed on the industry’s top developments. Here are 10 insurance technology trends that should be top of mind for 2016...Open-source will continue to make inroads: Microsoft's recent acceptance of open-source technologies such as Hadoop, Spark and D3.js in its DBMS and BI offerings is a clear indication that vendors are having a hard time keeping closed-source software competitive.

Machine learning and artificial intelligence have quickly gained traction with the public through applications such as Apple’s Siri and Microsoft’s Cortana. The true promise of these disciplines, though, extends far beyond simple speech recognition performed on our smartphones. New, open source tools are arriving that can run on affordable hardware and allow individuals and small organizations to perform prodigious data crunching and predictive tasks.

We're just two months out from the OpenShift Commons Gathering coming up on November 7, 2016 in Seattle, Washington, co-located with KubeCon and CloudNativeCon. OpenShift Origin is a distribution of Kubernetes optimized for continuous application development and multi-tenant deployment. Origin adds developer and operations-centric tools on top of Kubernetes to enable rapid application development, easy deployment and scaling, and long-term lifecycle maintenance for small and large teams. And we're excited to say, the 1.3 GA release of OpenShift Origin, which includes Kubernetes 1.3, is out the door! Hear more about the release from Lead Architect for OpenShift Origin, Clayton Coleman...

Let's start with an uncontroversial point: Software developers and system operators love Kubernetes as a way to deploy and manage applications in Linux containers. Linux containers provide the foundation for reproducible builds and deployments, but Kubernetes and its ecosystem provide essential features that make containers great for running real applications...What you may not know is that Kubernetes also provides an unbeatable combination of features for working data scientists. The same features that streamline the software development workflow also support a data science workflow! To see why, let's first see what a data scientist's job looks like...

In an increasingly connected world, enterprises are extending new business models, processes and services across their employees, customers and partners. In his keynote presentation at WSO2Con US 2014 today, Dr. Sanjiva Weerawarana, WSO2 founder and CEO, will discuss how competing in this connected world requires a new holistic approach to IT architecture that harnesses the combined power of the cloud, APIs mobile computing, Internet of Things (IoT), analytics, DevOps, and integration...