What Storm Does

Storm is a distributed real-time computation system for processing large volumes of high-velocity data. Storm is extremely fast, with the ability to process over a million records per second per node on a cluster of modest size. Enterprises harness this speed and combine it with other data access applications in Hadoop to prevent undesirable events or to optimize positive outcomes.

Some of specific new business opportunities include: real-time customer service management, data monetization, operational dashboards, or cyber security analytics and threat detection.

Forums

Storm Tutorials

Try these Tutorials

Introduction Hadoop has always been associated with BigData, yet the perception is it’s only suitable for high latency, high throughput queries. With the contribution of the community, you can use Hadoop interactively for data exploration and visualization. In this tutorial you’ll learn how to analyze large datasets using Apache Hive LLAP on Amazon Web Services […]

A very common request from many customers is to be able to index text in image files; for example, text in scanned PNG files. In this tutorial we are going to walkthrough how to do this with SOLR. Prerequisites Download the Hortonworks Sandbox Complete the Learning the Ropes of the HDP Sandbox tutorial. Step-by-step guide […]

Introduction In this tutorial, you will learn about the different features available in the HDF sandbox. HDF stands for Hortonworks DataFlow. HDF was built to make processing data-in-motion an easier task while also directing the data from source to the destination. You will learn about quick links to access these tools that way when you […]

Introduction JReport is a embedded BI reporting tool can easily extract and visualize data from the Hortonworks Data Platform 2.3 using the Apache Hive JDBC driver. You can then create reports, dashboards, and data analysis, which can be embedded into your own applications. In this tutorial we are going to walkthrough the folllowing steps to […]

Introduction R is a popular tool for statistics and data analysis. It has rich visualization capabilities and a large collection of libraries that have been developed and maintained by the R developer community. One drawback to R is that it’s designed to run on in-memory data, which makes it unsuitable for large datasets. Spark is […]

Apache Zeppelin on HDP 2.4.2 Author: Vinay Shukla In March 2016 we delivered the second technical preview of Apache Zeppelin, on HDP 2.4. Meanwhile we and the Zeppelin community have continued to add new features to Zeppelin. These features are now available in the final technical preview of Apache Zeppelin. This technical preview works with […]

Yet Hortonworks value proposition to Enterprises isn’t one of its software being free — it’s about being 100 percent open source, expanding the Hadoop platform, and being able to support its customers and partners like no one else can.

Hortonworks Inc., whose Hadoop software has been a big driver of the current interest in Big Data, raised $100 million at a valuation of more than $1 billion as it prepares to go public sometime in 2015.

In this case we’re talking about delivery on the Stinger initiative, which teamed engineers from Hadoop distro provider Hortonworks with more than 140 developers to advance interactive SQL querying ability on Apache Hive at scale in pure open source.

As organizations have begun collecting and producing massive amounts of data, they have started to recognize the advantages of data analysis, but they are also struggling to manage the massive amounts of information they have.

Hortonworks, whose Hadoop software has been a big driver of the current interest in Big Data, raised $100 million at a valuation of more than $1 billion, co-led by investment giant BlackRock and hedge fund Passport Capital.

WANdisco PLC, which ensures companies can access vital applications and data through crises like data centre failures, signed a partnership on Thursday with Hortonworks, which provides software on which many such applications are written.

The Hadoop specialist has announced the general availability of Hortonworks Data Platform 2.0 for Windows, which is designed to bring the power of Hadoop 2.0's YARN-based architecture to Windows data centers.

That release of Hadoop, along with its "YARN" component, allows the Big Data technology to be used on petabyte-scale datasets without having to use the batch-oriented and laborious MapReduce algorithm.

As I reported last week, Apache Hadoop 2.0 was released to general availability, and now top Hadoop vendor Hortonworks has responded in kind with the 2.0 version of its own Hortonworks Data Platform (HDP) distribution.

Hadoop is maturing. On October 15, the Apache Software Foundation released Version 2 of the open source Java-based framework. - See more at: http://data-informed.com/hadoop-version-2-means-business/#sthash.m4JaMEZc.dpuf

With the release of the Hortonworks Data Platform 2.1 version of its Hadoop distribution, Hortonworks is packing in new enterprise features, including data access, data governance, data management, security and operations.

However, John Furrier, founder of SiliconANGLE, posits that Hortonworks, with their similar DNA being applied in the data world, is, in fact, the Red Hat of Hadoop. “The discipline required,” he says, “really is a long game.”

As a software executive, entrepreneur and now CEO of data-platform company Hortonworks, Bearden appreciates independent spirits. He also understands business intricacies that derail best-laid plans. Read More At Investor's Business Daily: http://news.investors.com/management-leaders-in-success/041714-697558-cornell-university-graduate-school-entrepreneurship-program.htm#ixzz31DvssKb9 Follow us: @IBDinvestors on Twitter | InvestorsBusinessDaily on Facebook

Cloudera and Hortonworks, rivals in the fast-growing market for Hadoop-related software and services, are stepping up their channel games by expanding their channel ecosystems and enlisting a growing number of solution provider, ISV and OEM partners.

At Foltz-Smith’s urging, TrueCar jumped into Hadoop with both feet. The executive, who hates proof of concepts (“Pick a real problem. Do not do POCs.”) got the okay to invest in a 2 PB cluster, licensed the Hadoop distribution from Hortonworks, and they were off and running.

There are scores of promising big data companies, but Fortune sought to cut through the noise and reached out to a number of luminaries in the field to ask which big data companies they believe have the biggest potential.

Hortonworks Inc., which sells Big Data Apache Hadoop software, has raised $100 million in new financing, valuing the company at more than $1 billion as it look to go public next year, the company announced Tuesday.

Hortonworks will serve up its Hadoop distribution platform to Accenture customers, while Accenture will offer support to clients looking to marry the Hortonworks data platform with their existing IT infrastructure.

Accenture (NYSE: ACN) has entered into an alliance agreement with Hortonworks, a leading contributor to and provider of enterprise Apache™ Hadoop®, in a further strategic move to build its big data and digital capabilities and bring big outcomes from big data and analytics to its clients.

The elusive promise of the Big Data app economy has inched a little closer to reality on Monday after Hortonworks expanded its partnership with Concurrent to package the startup’s Cascading development framework into its flagship Hadoop distribution.

Hortonworks Inc. and Concurrent Inc. announced this week they are partnering to make Hadoop development easier and quicker by combining the former's data platform with the latter's Cascading application development framework.

In case you're unfamiliar, the Hadoop software is likened to a database and may be seen as competition for traditional relational databse systems from Oracle (ORCL) and IBM (IBM), and, to a certain extent, the analytical capabilities of Teradata (TDC (thought that is a matter of debate), but it really is a bird of a different feather.

What was significant about the conversation is that, despite being employed by fierce competitors, Cutting and Murthy showed genuine appreciation for each other and respect for each other’s contributions to the Apache Hadoop project.

As covered yesterday by Gigaom’s Derrick Harris, major Hadoop distribution provider Hortonworks announced this morning its acquisition of XA Secure, a provider of fine-grained security and policy management for Hadoop.

Hortonworks has added Apache Kafka to tis Hadoop software platform as a technical preview. Kafka isn’t the most popular tool in the world, but it’s widely used among large web companies, making it a useful add-on for luring customers of that ilk.

Consultant Wayne Eckerson says Hadoop 2, with its key YARN component, qualifies as a flexible big data operating system. And it could quickly take the open source framework into the IT mainstream, he predicts.

Today, the legacy tech giant announced that it has dumped $50 million into Hortonworks, a leading distributor of Hadoop open-source software for storing, processing, and analyzing lots of different kinds of data.

Hewlett-Packard Co. will extend its strategic partnership to integrate engineering strategies with Hortonworks Inc., the venture-backed Big Data company that is preparing to go public next year. The extension is supported by a $50 million equity investment by H-P, which, as part of the partnership, uses Hortonworks Data Platform as the Hadoop component of its own big data platform, HP HAVEn. The companies also said in a joint...

Hewlett-Packard is putting more of its muscle and money behind Hortonworks by expanding its partnership with the Hadoop distribution vendor and investing $50 million in the company. - See more at: http://www.eweek.com/database/hp-invests-50-million-in-hadoop-distributor-hortonworks.html#sthash.WWc7YnbN.dpuf

Prominent Hadoop software and service companies Hortonworks and Pivotal have partnered to further develop software called Ambari, which would make it easier for enterprises to manage Hadoop distributions.

While the introduction of YARN in Hadoop version 2 helped to unhook the framework from its MapReduce roots, the folks at Hortonworks say the next step of the Hadoop journey will ride atop the Apache Tez engine.

Big data growth has driven strong use cases for Hadoop, the open source grid storage technology, according to executives representing Home Depot, Rogers Communications, Schlumberger, Symantec and Verizon at Hadoop Summit.

He said that organizations can unlock a great deal of potentially useful information and cost savings by using Hadoop, but that the marketplace for the platform has become quite crowded. - See more at: http://sdtimes.com/hadoop-summit-predicts-big-growth-in-future/#sthash.7qLiObTV.dpuf

In a chat with The Platform, Hortonworks VP of Corporate Strategy, Shaun Connolly said that when it comes to the Fortune 100, Hortonworks has significant share with 71% of the retail companies, 75% of the telcos, and around half of all the top banks on the list.

So when Hortornworks invited me to the opening of their new office in London this week, where a number of high profile customers were speaking, I thought it would be a good opportunity to get some insight into the real-life examples of organisations that are starting to find value in analysing unstructured data.

Instead, ZirMed is using Hortonworks' Hadoop distribution along with Apache Hive, open source software that lets SQL-savvy developers and business users at the company query data stored in the Hadoop Distributed File System (HDFS).

Royal Mail may not have the biggest cluster, but we do a lot of experimentation. And we’ve got a lot to prove, because [CEO] Moya Greene and the rest of the executive board are very excited about what we’re doing and are directing our efforts.

"We have signed a formal agreement driven by both companies at the executive level to make HANA plus Hadoop a winning combination for our customers," said Irfan Khan, senior vice president and general manager, SAP Big Data, in an interview.

"Integration with Apache Hadoop is part of SAP’s overall strategy to provide valuable insights across a continuum of data from the efficient storage of massive amounts of cold data, to petabyte-level storage of warm data to real-time and streaming data analysis," SAP said in a statement.

CEO Rob Bearden describes Hortonworks’ increasingly expansive corporate strategy, how the company aims to keep up with new big data technologies and why being a public company provides an edge over competitors.

Hortonworks has signed a definitive agreement to acquire Budapest-based SequenceIQ, a specialist in deployment automation technology for launching on-demand Hadoop clusters in the cloud or any environment that supports Docker containers.

Meet Fortune’s first class of Big Data All-Stars: 20 extraordinary people who we think are the best at connecting the dots, digging deep, and discovering the information that will transform the way businesses operate.

The collective knowledge of Hortonworks’ support and engineering teams has captured nearly a decade’s worth of operational best practices in this release to help customers improve troubleshooting and speed time to resolution when issues occur.

View Past Webinars

With its low costs and easy scalability, enterprises are using Hadoop for more applications. Modern Business Intelligence is interactive and pervasive, resulting in a high concurrency and query load for Hadoop. The result? Business users experience slow response time to their BI applications that only get worse as increased users are given access to BI. […]

Hortonworks DataFlow (HDF) is the complete solution that addresses the most complex streaming architectures of today’s enterprises. More than 20 billion IoT devices are active on the planet today and thousands of use cases across IIOT, Healthcare and Manufacturing warrant capturing data-in-motion and delivering actionable intelligence right NOW. “Data decay” happens in a matter of […]

Demand for cloud is through the roof and cloud architecture dominates Enterprise IT spending! This comes with the challenges of running enterprise workloads in the cloud securely and with ease. Attend this webinar where experts will take you through a novel solution to simplify provisioning and managing enterprise workloads while providing an open and […]

Hortonworks DataFlow (HDF) provides the only end-to-end platform that collects, curates, analyzes and acts on data in real-time, on-premises or in the cloud, with a drag-and-drop visual interface. HDF is an integrated solution with Apache Nifi/MiNifi, Apache Kafka, Apache Storm and Druid. In this exclusive Premier Inside Out, you will hear from the creator of […]

With IoT exploding across all verticals and with terabytes of data flowing in through multiple streaming sources, enterprises are having a hard time trying to gain real-time insights and take corrective action on data as it flows in. Hortonworks DataFlow (HDF) addresses the most compelling use cases of today’s enterprises struggling to find predictive insights […]

Apache NiFi is an easy to use, powerful, and reliable system to process and distribute data. It provides an end-to-end platform that can collect, curate, analyze, and act on data in real-time, on-premises, or in the cloud with a drag-and-drop visual interface. It’s being used across industries on large amounts of data that had stored […]

As data is growing at an exponential rate, organizations are increasingly looking to leverage streaming data from mobile devices, wearable technology, and sensors for real-time processing and analytics. Gartner estimates that “By 2020, 70% of organizations will adopt data streaming to enable real-time analytics.” However, not all systems are designed to handle real-time data ingest […]

Data is growing in data lakes, so are security and compliance risks. These risks stem from storing and processing sensitive data. In this webinar, we will go through a 4 step process to proactively discover and manage sensitive data within big data environments. We will discuss: The current challenges and recommended steps around automated data […]

Joe Niemiec (Senior Technical Director – Platform Enablement & Strategy) understands the YARN Capacity Scheduler and has worked with it across all kinds of deployments. While Capacity Management has many facets – from sharing, chargeback, and forecasting – the focus of this interactive webinar will be on the primary features available for platform operators. In […]

This webinar will discuss the emergence of the Data Science Platform as an integrated and collaborative environment for empowering advanced analytics across the utility enterprise. A successful Data Science Platform enables the proliferation of data, opening up new domains for analytics. Data that was previously unavailable to users can be easily and efficiently accessed, enhancing […]

Sprint, being one of the largest telecom organizations in the US with over 60 million subscribers relies on analytics and insights derived from their vast data landscape to drive better customer experience, improved business operations and detecting fraud. Modernizing their data platforms turned into a key strategic undertaking with several complex parameters related to ingesting […]

Join us for a joint session focussing on the powerful combination of Batch and Real-time Analytics. Come and be part of the discussion and hear from both Hortonworks and Dell EMC on this topical subject. Join Ross Porter, Systems Engineering Director, EMEA, at Dell EMC and Venkatesh Sellappa, Solutions Engineer EMEA at Hortonworks to learn […]

How do you modernize your existing data warehouse solutions to easily offload data into Apache Hadoop and Apache Hive? Does your organization lack the skill set to migrate data from RDBMS to Hadoop and Hive? Does your IT team need to offload and port workloads from Oracle, Db2 and Netezza? Find out how Hortonworks and […]

Enterprises realize that data is the fuel that data science teams crave but data is growing at an explosive rate, with COLD data growing at a much faster rate than HOT. As these data volumes grow beyond 75 TB, continuing to scale HDFS clusters using compute with local storage (DAS) gets very expensive and complex […]

Today’s Big Data teams demand solutions designed for Big Data that are optimized, secure, and adaptable to changing workload requirements. Working together, Hortonworks, IBM, and Attunity have designed an integrated solution that transfers large volumes of data to a platform that can handle rapid ingest, processing and analysis of data of all types from all […]

Hadoop has matured to become a key part of the next-gen data management platforms for enterprises worldwide. The growing production use of Hadoop in the cloud, on-premises, and out to the edge demands seamless management, security, and governance of all data, regardless of its deployment or type. Join Noel Yuhanna, Principal Analyst with Forrester […]

Data Science is being hailed as the next battleground for competitive differentiation that has the potential to transform businesses across industries. Too often enterprises fail to realize the full potential of their data science initiatives due their inability to leverage all the internal and external data that they have at their disposal. The productivity of […]

Tomorrow’s energy provider will rely on data-driven analytics for more decisions than ever. From customer programs to asset management and expanded energy portfolios, decisions increasingly need data in order to select the best possible outcome. One key to being competitive in the future will be for utilities to proactively develop and move forward with a […]

Self-service is key for organizations who either want to manage their Dataflows in real-time, or capture perishable insights from Data-in-motion. Join this webinar and learn how easy you can manage your dataflows, your schema objects, and streaming applications, in a self-service manner. These game-changing features in HDF 3.0 will make streaming analytics faster and […]

The University of North Texas (UNT) is one of the nation’s largest universities, with 12 schools and colleges, 42,000 students, and 380,000 active alumni. As part of the university’s strategic initiative designed to improve enrollment, retention and overall student experience, UNT chose Attunity Replicate software to enable a data lake using Hortonworks Data Platform for […]

Tomorrow’s energy provider will rely on data-driven analytics for more decisions than ever. For those utilities that want to lead the charge, building an open and secure analytics infrastructure platform will be the key to their success. One key to being competitive in the future will be for utilities to proactively build an analytics infrastructure […]

Organizations today are looking to exploit modern DataArchitectures that combine the power and scale of Big Data Hadoop platforms with operational data from their Transactional Systems. In order to react to situations in an agile manner in real-time, low-latency access to data is essential. Hortonworks and Oracle can provide comprehensive solutions that allow organizations to […]

Learn how a leading healthcare company is yielding big dividends from Big Data. Advisory Board, a healthcare firm serving 90% of U.S. hospitals, has multiple different business units and data science teams within their organization. In this webinar, they’ll share how they use technologies like Hadoop and Spark to address the diverse use cases for […]

Hadoop’s data analytics capabilities offer tremendous potential for deriving new and differentiated business insights. But, many organizations get bogged down with the DIY infrastructure decisions and fail to keep up with the evolving needs of their business. Dell EMC and Hortonworks can help organizations get past this challenge with proven and certified architectures which allow […]

Only 23% of businesses can integrate customer insights in real-time; join us to hear how this global retail department store architected and implemented their successful solution. Retailers need to create a more personalized shopping experience to increase customer satisfaction and conversions. That means aligning merchandising and marketing engagements across all channels, and that means choosing […]

When it comes to the data lakes and data warehouses, there’s no shortage of controversy: Is one better than the other? The real answer is, there’s no need for heated debate—a data lake actually complements the data warehouse. Integrating a data lake with your EDW is really just an evolution of architecture that can provide […]

The key of Big Data is the ability to capture greater customer insights by pulling together and understanding the relation between multiple pieces of information that we were unable to combine and analyse without the automated data processing. While the benefits of automated data processing enable companies to gain competitive edge and build stronger customer […]

Apache Ambari 2.5 helps customers simplify the experience for provisioning, managing, monitoring, securing and troubleshooting Hadoop deployments. Find out how the combination of Ambari and SmartSense delivers a path to success to help IT get Hadoop up and running effectively. The end result – you get the full business impact management and benefits of Big […]

Today enterprises are moving their data lakes to the cloud to help them execute faster, increase productivity, and drive innovation while leveraging the scale and flexibility of the cloud. However, such gains come with robust Authentication, Authorization and Audit (“AAA”) requirements needed for these workloads. In this interactive webinar: Learn how to get consistent security […]

You have a legacy system that no longer meet the demands of your current data needs, and replacing it isn’t an option. But don’t panic: Modernizing your traditional enterprise data warehouse is easier than you may think. Join us on August 1st at 11am PDT to hear from David Loshin, President of Knowledge Integrity, […]

To realize the full potential of modern data applications, organizations need to be able to capture perishable insights from data in motion. While flow management tools are available to help gather, route, filter and transform data from any source, companies have lacked equivalent tools for building the analytics apps needed to extract insight from streaming […]

How do you optimize Apache Spark workloads in the cloud? How do you tune your resources for maximum performance and efficiency? Find out how the new Hortonworks Flex support subscriptions enables IT agility and success in the cloud. We will cover: Options for running Data Science, Analytics and ETL workloads in the cloud Hortonworks support […]

Verizon Global Technology Services (GTS) was challenged by a multi-tier, labor-intensive process when trying to migrate data from disparate sources into a data lake to create financial reports and business insights. Join experts from Verizon GTS, Attunity and Hortonworks on June 8th at 11:00 a.m. PT/2:00 p.m. ET to learn more about how Verizon: Easily […]

The combination of big data and cloud is enabling the enterprise to unlock insights into data more quickly and with greater flexibility than ever before. To make this combination achieve it’s full potential, enterprises need an experience that marries the agility of cloud infrastructure with the power of data analytics. Join experts from Hortonworks and […]

As Hadoop based workloads are becoming ever more popular in the enterprise, the need for enterprise grade capabilities like active directory based authentication, multi-user support, and role based access control has never been more important. In this session, we are going to explore how you can create an HDInsight cluster joined to an Active Directory […]

Join experts from Ovum and Hortonworks to learn how to get big data analytics workloads up and running in the cloud immediately, and how it will accelerate your time-to-benefit and maximize your agility in the cloud. As big data workloads are moving to the cloud, the challenge for enterprises is the overwhelming number of choices […]

Deploying, right sizing, and the ability to meet seasonal peaks without disrupting availability are often seen as difficult challenges in a big data and Hadoop deployment. In this webinar, we will discuss these operational complexities and how to overcome them without adversely impacting the business. Join experts from Hortonworks and Robin Systems on May 25 […]

Every insurance company regardless of line of business is focused on enabling the digital customer experience with the goal of enhanced profitability, lowering costs and creating stronger customer loyalty. However, many are struggling to achieve this goal challenged a history of policy and/or product based business models. Additionally, many continue to use a traditional MDM […]

Join experts from Hortonworks and our guest from Forrester to learn how a next-gen connected data architecture can help accelerate time to value for your big data initiatives. Data today is a foundation for most businesses to drive better customer experience, better products and improved operational efficiencies. Growing adoption of cloud applications and platforms, in […]

Detecting an impersonator on your enterprise network is a complex and time-consuming game. It’s even more difficult when an attacker is able morph and dynamically change tactics and behaviors. A big data security analytics platform such as recently announced Top Level Project Apache Metron can make it easier to detect, investigate, assess, and remediate threats […]

Businesses are striving to get the most value out of their data and turn it into actionable insights. The shift towards becoming a data-centric organization requires a modern data architecture with the ability to access all critical enterprise data at the right time. This is easier said than done. Most organizations find themselves challenged by […]

Every insurance company regardless of line of business is focused on being more data-centric. Risk assessments based data is at the heart of analysis. Understanding and paying valid claims quickly is key to customer retention and loyalty. Creating new insurance offerings to meet market and customer demands is imperative to remain relevant. Today insurance companies […]

Improve the efficiency and accelerate job execution by moving traditional SAS workloads into Hadoop to modernize and optimize SAS analytics. How can we run traditional SAS® jobs, including SAS® Workspace Servers, on Hadoop worker nodes? The answer is SAS® Grid Manager for Hadoop, which is integrated with the Hadoop ecosystem to provide resource management, high […]

Scotiabank is an international financial institution with presence in over 55 countries and assets over 900 Billion dollars. Their decision to build an Enterprise Data Lake solution to manage their Big Data turned into a massive initiative with many complex parameters related to data extraction, data compression, secure data transfer over the network, data privacy […]

Hortonworks and Microsoft are working together to democratize Big Data. Large enterprises and early adopters are already harnessing the power of analytics and machine learning to increase productivity, identify emerging opportunities, and build competitive advantage. In this session, we will discuss how you can use Microsoft Azure HDInsight to discover insight from new data sources […]

Combining IOT, Customer Experience, and Enterprise Data What if you could derive real-time insights using ALL of your data? Join us for this webinar and learn how companies are combining “new” real-time data sources (i.e. IOT, Social, Web Logs) with continuously updated enterprise data from SAP and other enterprise transactional systems. This provides deep and up-to-the-second analytical […]

Only 23% of businesses can integrate customer insights in real-time. Learn how to change that. Join us to hear from industry experts on how to transform your organization’s data into the best omnichannel customer experience. Through this webinar, participants will hear how one retailer, with over 5 million customers and 750 brands, developed precise customer lifetime […]

The connected world creates a rate and volume of streaming cybersecurity data that is unprecedented, and attacks are increasingly sophisticated and multifaceted. Yet it is unreasonably time-consuming for security personnel to piece together data from multiple systems to assess the true nature of a single threat across an enterprise. Learn how big data and data […]

Today's data-driven organizations are challenged by typical EDWs which include the added costs of proprietary technologies and the labor-intensive inflexibility of the EDW model.
To summarize, EDW is expensive, rigid and inefficient. Smarter organizations are now turning to modern solutions to renovate their EDW.
Join this webinar as share the top 3 ways to optimize your EDW with Hadoop. We will cover, archiving, onboarding and the enrichment of data enabling you to kick start your journey to move data and processing to Hadoop.

Hortonworks SmartSense provides proactive recommendations that improve cluster performance, security and operations. And since 30% of issues are configuration related, Hortonworks SmartSense makes an immediate impact on Hadoop system performance and availability, in some cases boosting hardware performance by two times. Learn how SmartSense can help you increase the efficiency of your Hadoop hardware, through customized […]

As enterprises around the world bring more of their sensitive data into Hadoop data lakes, balancing the need for democratization of access to data without sacrificing strong security principles becomes paramount. In this webinar, Srikanth Venkat, director of product management for security & governance will demonstrate two new data protection capabilities in Apache Ranger – […]

Watch now by clicking on the “play” button below. It’s an exciting time for retailers as technology is driving a major disruption in the market. Whether you are just beginning to build a retail data analytics program or you have been gaining advanced insights from your data for quite some time, join Eric and Shish […]

Innovative mobile operators need to mine the vast troves of unstructured data now available to them to help develop compelling customer experiences and uncover new revenue opportunities. In this webinar, you’ll learn how HDB’s in-database analytics enable advanced use cases in network operations, customer care, and marketing for better customer experience. Join us, and get […]

Hortonworks Data Cloud for Amazon Web Services is a new product offering from Hortonworks that is delivered and sold via the AWS Marketplace. It allows you to start analyzing and processing vast amounts of data quickly. Powered by the Hortonworks Data Platform, Hortonworks Data Cloud is an easy-to-use and cost-effective solution for handling big data […]

Is your University taking advantage of Big Data to improve student performance and raise professor effectiveness, while reducing administrative workloads? Student performance data is increasingly being captured as part of software-based and online classroom exercises and testing. This data can be augmented with behavioral data captured from sources such as social media, student-professor meeting notes, […]

Hortonworks Data Cloud for Amazon Web Services is a new product offering from Hortonworks that is delivered and sold via the AWS Marketplace. It allows you to start analyzing and processing vast amounts of data quickly. Powered by the Hortonworks Data Platform, Hortonworks Data Cloud is an easy-to-use and cost-effective solution for handling big data […]

Part five in a five-part series, this webcast will be a demonstration of the integration of Apache Zeppelin and Pivotal HDB. Apache Zeppelin is a web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more. This webinar will demonstrate the configuration of the psql […]

Chief Data Officers in financial services have unique challenges: they need to establish an effective data ecosystem under strict governance and regulatory requirements. They need to build the data-driven applications that enable risk and compliance initiatives to run efficiently. In this webinar, we will discuss the case of a global banking leader and the anti-money […]

Apache MiNiFi is designed to make it practical to enable data collection from the second it is born, ideal for IoT scenarios where there are a large number connected devices or a need for a smaller and more streamlined footprint than Apache NiFi. Join us as we share a use case and demo of Apache […]

Customers are preparing themselves to analyze and manage an increasing quantity of structured and unstructured data. Business leaders introduce new analytical workloads faster than what IT departments can handle. Legacy IT infrastructure needs to evolve to deliver operational improvements and cost containment, while increasing flexibility to meet future requirements. By providing HDP on IBM Power Systems, Hortonworks […]

Rapid data growth from a wide range of new data sources is significantly outpacing organizations’ abilities to manage data with existing systems. Today’s data architectures and IT budgets are straining under the pressure. In response, the center of gravity in the data architecture is shifting from structured transactional systems to cloud based modern data architectures […]

Many organisations are now looking to stream operational data in real-time from their transactional RDBMS systems into Hadoop big data platforms, in order to support new analytics use cases. Such data can then be combined with other data stored on the Hadoop cluster, and critical decisions made on up-to-date information, with more reliable results. Real-time […]

Today’s European financial markets hardly resemble the ones from 15 years ago. The high speed of electronic trading, explosion in trading volumes, the diverse range of instruments classes and a proliferation of trading venues pose massive challenges. With all this complexity, market abuse patterns have also become egregious. Banks are now shelling out millions of […]

The fourth Industrial revolution is here, and competing to succeed in the 4.0 ‘digital’ world entails making the right decisions based on data driven pointers, to successfully implement your strategy. As we work with the entire stack of Fortune 100 organizations, we often see companies—particularly those operating across business lines with complex lines of businesses […]

Part four in a five-part series, this webcast will be a demonstration of the installation of Apache MADlib (incubating), an open source library for scalable in-database analytics, into Hortonworks HDB. MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data. […]

You know that your data warehouse is necessary for analytics initiatives that help guide management decisions and serve your customers better. But do you know how offloading data from your data warehouse to Hadoop can help you save money, improve performance and rebalance workloads? Join subject matter experts from HPE, Hortonworks and Attunity for a […]

Featured Speaker: Ibrahim Itani, Leader of Big Data Architecture and Technology, Verizon. With increasing data volumes and data sources, enterprises are outgrowing their traditional BI solutions and struggling to make use of the data collected on their new data platforms. Frequently, data engineers will resort to old habits of shifting data sets between repositories so […]

Streaming Analytics are the new normal. Customers are exploring use cases that have quickly transitioned from batch to near real time. Hortonworks Data Flow / Apache NiFi and Isilon provide a robust scalable architecture to enable real time streaming architectures. Explore our use cases and demo on how Hortonworks Data Flow and Isilon can empower […]

Johnson Controls delivers best-in-class building technologies and energy storage. In their quest to continually improve operations, they implemented a modern data architecture based on Hadoop. They started with a small successful proof of concept and recognized the need to make more of their data accessible to more teams. Johnson Controls was able to successfully integrate Big […]

Apache NiFi, Storm and Kafka augment each other in modern enterprise architectures. NiFi provides a coding free solution to get many different formats and protocols in and out of Kafka and compliments Kafka with full audit trails and interactive command and control. Storm compliments NiFi with the capability to handle complex event processing. Join us to […]

In this webinar, we will focus our presentation on Hortonworks Professional Services. The Professional Services team consists of members of the Worldwide Education and Consulting organization which enable and successfully implement client solutions around the Hortonworks Connected Data Platform. This initial session will cover: Who we are – Hortonworks Professional Services, the organization and methodology […]

Apache MiNiFi is designed to make it practical to enable data collection from the second it is born, ideal for IoT scenarios where there are a large number connected devices or a need for a smaller and more streamlined footprint than Apache NiFi. Join us as we walk through how Apache MiNiFI works, and how […]

Hadoop and The Internet of Things has enabled data driven companies to leverage new data sources and apply new analytical techniques in creative ways that provide competitive advantage. Beyond clickstream data, companies are finding transformational insights stemming from machine data and telemetry that are radically improving operational efficiencies and yielding new actionable customer insights. During […]

With the advent of Big Data platforms, Banking & Financial Services companies are building applications that create massive business value. However, the datasets being used often contain significant amounts of confidential, proprietary and highly sensitive data and so the potential benefits are held back by privacy concerns. In this joint webinar, Hortonworks and Privitar will […]

Hortonworks launched SmartSense to help customers quickly collect cluster configuration, metrics, and logs to proactively detect issues, and expedite support case resolution. In this webinar, Paul Codding, Senior Product Manager for SmartSense will walk the audience through the new functionality that has been launched as part of SmartSense 1.3. Learn how SmartSense 1.3 changes the […]

Part three in a five-part series, this webcast will be a demonstration of the integration of Hortonworks HDB and Apache Hadoop YARN. YARN provides the global resource management for HDB for cluster-level hardware efficiency, while the in-database resource queues and operators provide the database and query-level resource management for workload prioritization and query optimization. This […]

Fueled by ever-changing customer behaviors and an increasing number of industry disruptions, the modern enterprise requires analytics to stay ahead of the game. Today’s data warehouse needs continuous enhancements to address new requirements for advanced analytics, real-time streaming data, Big Data, and unstructured data. The focus should be on developing a forward-looking, future-proof view and […]

Optimizing manufacturing processes ultimately revolves around increasing output at reduced cost and improved quality. Manufacturers try to minimize inventory levels by scheduling just-in-time delivery of raw materials, but even the smallest miscalculation can cause stock-outs that lead to production delays. Sensors andRFID tags can capture supply chain data, but this creates a large, ongoing flow […]

Big data projects are only as valuable as their results – and the path it takes to get there isn’t always easy. Join us to learn about enterprise readiness features of Hortonworks DataFlow 2.0 with Ambari and Ranger for integrated installation, deployment and operations of data in motion components for streaming analytics of Apache NiFi, […]

How Hortonworks DataFlow and the HDF Certification Program make it easier and faster to integrate different systems together, with highlights on the latest processors added to Apache NiFi for Kafka, IoT, Slack, and more, all designed to accelerate your big data project and free-up resources for innovation.

The Global Credit Card industry is rapidly changing and the participants are increasingly facing new challenges with exploding volumes, regulatory pressures and new entrants competing for the market share. The industry has responded to these challenges by looking at avenues to cut costs, increase efficiencies and provide better, safer products and services to attract new […]

Hortonworks recently released the Hortonworks HDP 2.5 Sandbox, a free, comprehensive, easy-to-use, hands-on learning environment that provides the fastest onramp for anyone interested in learning, evaluating or using Apache Hadoop™ and the extended ecosystem in an enterprise. Join us in this interactive webinar as we discuss and demo features of the Hortonworks Sandbox, including: How […]

Who’s winning the deep forensic analysis ‘arms race’ for compliance? Real-time trade surveillance in global financial markets has created a data tsunami. With greater volumes of data comes greater compliance risk. CNBC reports U.S. Banks have been fined over $200B since the financial crisis. How are compliance teams fighting back to make more of the […]

Learn about Hortonworks DataFlow (HDFTM) and how you can easily augment your existing data systems – Hadoop and otherwise. Learn what Dataflow is all about and how Apache NiFi, MiNiFi, Kafka and Storm work together for streaming analytics.

Part two in a five-part series, this webcast will be a demonstration of Pivotal Extension Framework (PXF), an extensible framework that allows Hortonworks HDB to query external system data. This is really useful for both data loading, and also avoiding data loading for data that doesn’t need to reside within the database instance. PXF includes […]

Rapid data growth from a wide range of new data sources is significantly outpacing organizations’ abilities to manage data with existing systems. Organizations now look to capture all data, keep it longer, and prepare to use the data in new ways as business conditions evolve. As a result, legacy data architectures and IT budgets are […]

The closer Apache Hadoop comes to being a real time platform for your enterprise data, the more business critical the data integration layer becomes. To enable the real time platform, it requires both minimal impact on production with CDC (change data capture) and real time data via Apache Kafka and HDF™ Join us to hear real […]

Hadoop didn’t disrupt the data center. The exploding amounts of data did. But, let’s face it, if you can’t move your data to Hadoop, then you can’t use it in Hadoop. Join the experts from Hortonworks, the #1 leader in Hadoop development, and Attunity, a leading data management software provider, for a webinar where you’ll […]

Gartner predicts there will be 250 million connected vehicles by 2020. While automotive manufacturers are on track to drive connected vehicles implementation, are they poised to leverage the trillion-dollar opportunity from the gold-mine that is “sensor data”? Research from Morgan Stanley suggests, automotive manufacturers can save $488 billion by using predictive maintenance. By assessing in […]

This webcast is the first in a five-part series on Hortonworks HDB, demonstrating the installation procedures for installing Hortonworks HDB on Hortonworks HDP. HDB’s integration with Apache Ambari allows you to install and manage your high-performance SQL database alongside other Hadoop services. Starting with an existing HDP cluster, it will cover any required prerequisites and […]

Today organizations produce more data than ever and are continuously looking for solutions that allow them to gain deep insight into their business and monetize the data collected from multiple sources. Apache Spark helps you improve your business insights by providing a highly-scalable and interactive environment for analyzing data. Microsoft has worked with Hortonworks to […]

Companies of all sizes are challenged to keep up with emerging technologies that deliver a competitive advantage. Big data holds the key to a greater customer insight and stronger customer relationships. But risk of sensitive data exposure — and compliance violations— keep many organisations from pursuing big data initiatives and reaping the rewards of business-driven […]

Like all consumer packaged goods (CPG) companies, PepsiCo relies on huge volumes of data to accurately replenish its retailers with the appropriate amount and type of product. Across the CPG industry, most analysts exclusively rely on Excel and Access for data wrangling, but as PepsiCo’s data surpassed the capabilities of those tools, they knew they […]

In Neil Winters own words, Markel International sells its customers “a promise, not a product” and as a result, a continuous view of data across a huge variety of sources is critical. During this webinar, Cindy Maike, general manager for insurance at Hortonworks, will explore the approach Markel is taking to respond to the priorities […]

Success in the insurance industry depends on your company’s ability to quickly interact with customers at every point in the insurance life cycle, and then to make timely use of the new data to guide business decisions. Many of the customers and companies agree that Customer 360 is an important initiative, but many don’t know […]

Already strategic partners, Pivotal Software and Hortonworks deepened their relationship in Spring 2016 with the goal of providing enterprises the most complete modern data platform for advanced analytics and machine learning. As part of the expanded relationship, Hortonworks has introduced Hortonworks HDB, the market’s leading Hadoop Native SQL database and big data SQL machine learning […]

Big Data and Apache™ Hadoop® are driving tectonic shifts in enterprise data management (EDM) within the financial services industry. Open Enterprise Hadoop and the vendor ecosystem growing up around it are consolidating and standardizing data architectures at the leading financial institutions around the world—transforming expensive, inflexible, and proprietary data landscapes into economic, agile, open source […]

As organizations strive to identify and realize the value in Big Data, many now seek more agile and capable analytic systems. While many have piloted Hadoop as a data repository for simple workloads, there is much more value that can be created from Hadoop by leveraging the data in the platform more, to interact with […]

The promise of big data is greater than ever before…due to an explosion in the number and variety of data sources. This has caused a shift from traditional structured, and batch or periodic data warehouse environments to today’s more complex combination of structured with semi & unstructured data, along with the requirement to apply analytics […]

Payment card fraud has mushroomed into a massive challenge for consumers, financial institutions, regulators, and law enforcement. As the accessibility and usage of credit and debit cards grows and transaction volumes increase, banks are losing tens of billions of dollars on an annual basis to fraudsters. The Nilson Report estimated that of every dollar of […]

The vision of a digitally connected world is fast becoming a reality with The World Economic Forum predicting that over 50 billion devices will be connected by 2020. It’s a disruptive force for the insurance industry as you aspire to support your digital customers as well as create new revenue streams. Coupled with that, […]

CISOs are often asked to justify a growing budget by showing ROI. But at the end of the day the security organization spends a lot of money and the answer to what is the ROI is that nothing happened. Because when the security leader is doing their job they are invisible. That is a very […]

The explosion of new types of data in recent years has put tremendous pressure on the financial services data center, both technically and financially, and an architectural shift is underway in which multiple Lines of Business (LOBs) can consolidate their data into a unified data lake. This approach helps financial institutions address risk management and […]

Big data is transforming the way that organisations use and manage data. They now have more data in motion and at rest than ever before in higher velocities and from more sources across the organisation. Businesses can’t afford to miss opportunities for deeper insight due to time spent “data wrangling”. They are also looking for […]

Marketing was arguably one of the first lines of business to understand the impact data could have on growth and profitability. Indeed, an article in Forbes acknowledged that of all the ways businesses are looking to big data to streamline operations, “marketing is perhaps one of the most important”. On this webinar, you’ll hear how […]

Rapid data growth of traditional and new data sources is putting a strain on existing Enterprise Data Warehouse (EDW) resources and related IT budgets. Learn how to reduce the cost of an EDW by augmenting it with an EMC Data Lake and Hortonworks Data Platform (HDP). Today, Enterprises simply can’t afford to keep all data […]

Join this session as Hortonworks and eCube demonstrates how to drive actionable intelligence in real time with Hortonworks DataFlow, (HDF) and an array of Hadoop ecosystem tools. This session is a must attend for organisations challenged with optimising IoT data collection, analysing perishable insights and ultimately enriching the data lake with new data. We will […]

Whether it’s trying to remain relevant by servicing customers digitally through multiple channels, identifying new predictive variables to assess risk or seeking profitable growth, the insurance industry is under pressure on numerous fronts. At Hortonworks, we are working with companies like Zurich Insurance, Markel and MunichRe to enable them to increase premiums, optimize their […]

Every industry, every organisation, every department is going through a huge change, whether realised or not, as the opportunity to harness data to impact their business is ripe. Many European businesses are successfully leveraging new platform technologies to transform their organization using data. Whether they are renovating their existing infrastructure for substantial cost savings or […]

Join us for an exploration of how Hadoop Native SQL unleashes the power of Apache™ Hadoop® for business insights and predictive analytics. We will demonstrate how Pivotal HDB, powered by Apache HAWQ (incubating) allows near-real-time execution of ad-hoc queries at scale. Complete analytics tasks faster – in seconds or minutes, not hours or days, using […]

We’re fully into the “age of data”. New developments in connected and non-connected devices are multiplying the rate at which data is created thus challenging organizations to think differently about their data architecture. Today’s enterprises may not be equipped with collecting, curating and analysing this data in real time and always carry the pressures of driving competitive advantage. With the […]

In this webinar you’ll understand how Hortonworks Connected Data Platforms enables a modern big data solution to run on the EMC Isilon infrastructure. We will also share how the combined solution delivers unmatched flexibility, lower costs and more robust security. Register now to initiate analytics projects quickly and get results in minutes.

Nowadays every business is a data business, the successful enterprises master the value of their data. So how can you start on the path to being a data-defined enterprise? Is your existing IT infrastructure designed to take you on that path and handle all your structured and unstructured data, and everything in between? Apache™ Hadoop® […]

In the latest Forrester Wave report for Big Data Hadoop Cloud Solutions, Microsoft Azure came on top beating some very esteemed vendors. Learn how to complete your big data solution and join Microsoft and Hortonworks as we showcase Hortonworks DataFlow and how it complements Azure HDInsight enabling users to easily move their data to the […]

Rapid data growth of traditional and new data sources is putting a strain on existing Enterprise Data Warehouse (EDW) resources and related IT budgets. Learn how to reduce the cost of an EDW by augmenting it with an EMC Data Lake and Hortonworks Data Platform (HDP). Today, Enterprises simply can’t afford to keep all data […]

Warranty claims have direct financial impact on manufacturers and add substantial indirect costs from degradation of a company’s brand image, reduced customer loyalty, and potential legal liability. When manufacturers analyze their “after the fact” warranty data, it’s too late to identify issues proactively and be able to respond to reduce risk. Enter the era of […]

The explosion of new types of data in recent years has put tremendous pressure on the financial services data center, both technically and financially, and an architectural shift is underway in which multiple Lines of Business (LOBs) can consolidate their data into a unified data lake. This approach helps financial institutions address risk management and […]

Microsoft HDInsight has been chosen by Forrester to be the leader in their Hadoop Cloud Wave report. Join Microsoft and Hortonworks on June 6, 2016 at 10:00 AM PST to discuss how Hortonworks powers the Microsoft HDInsight platform and examples of how world leading corporations choose Microsoft HDInsight to run a variety of mission critical work loads.

With data becoming the heart of your business, the last thing you want is proprietary software holding you back. Hortonworks can help free you from the grip of “hybrid open” approaches that shackle Apache™ Hadoop® with proprietary extensions. Join this webinar to learn how.

You may be up all night wondering how enterprise organizations deal with large data volumes and data varieties without significantly increasing costs. And perhaps your existing data architectures are not equipped to handle today’s data? Join this webinar to learn how to optimize your data architecture and gain significant insights into cost savings with Hadoop on […]

Credit & payment card fraud has mushroomed into massive challenges for consumers, financial institutions, regulators and law enforcement. As the accessibility and usage of credit cards increase, banks are losing billions in fraudulent transactions. A recent Nielsen report suggests 5 cents per one dollar is lost to fraud. Banks are increasingly turning to Hadoop & […]

Join Hortonworks and Cisco at the upcoming webinar and hear from the industry leading experts on the latest trends and drivers for a modern data architecture and how can you benefit from it. We will cover: – How to become and data-driven organisation with Hortonworks Data Platform – How to build a super-scaling Hadoop cluster […]

Enterprises have come to rely heavily on Apache Hadoop for business critical applications. Therefore it is critical for them to resolve cluster issues rapidly when time is of the essence. What if you can have access to a support service that proactively monitors your Hadoop infrastructure, and also recommends tailored solutions and actions? Learn how […]

You may be up all night wondering how enterprise organizations deal with large data volumes and data varieties without significantly increasing costs. And perhaps your existing data architectures are not equipped to handle today’s data challenges? Join this webinar to learn how to optimize your data architecture and gain significant cost savings with Hadoop. We […]

Optimizing manufacturing processes ultimately revolves around increasing output at reduced cost and improved quality. Manufacturers try to minimize inventory levels by scheduling just-in-time delivery of raw materials, but even the smallest miscalculation can cause stock-outs that lead to production delays. Sensors and RFID tags can capture supply chain data, but this creates a large, ongoing […]

The emergence of Big Data has driven the need for a new data platform within the enterprise. Apache Hadoop has emerged as the core of that platform and is driving transformative outcomes across every industry. Join this webinar for an overview of the technology, how it fits within the enterprise, and gain insight into some […]

How do you keep track of large number of diverse data objects in your data lake that continue to increase every day? Now that Apache Hadoop has become a critical component of your data architecture, how do you know with confidence which piece of data came from which source and how did it change over […]

Today’s criminals and terrorist organizations are outpacing the performance of anti-money laundering (AML) programs by using new and unconventional ways to hide illicit transactions. While financial services firms have taken measures to improve programs, such as fine-tuning alert systems to reduce false positives, and investing in human capital to manage the growing number of investigations, […]

Log analytics solutions like Splunk have powerful capabilities, however in today’s world where everything is connected, non-pertinent data could flood your sophisticated and costly Splunk architectures. Hortonworks Dataflow optimizes Splunk architectures by filtering and forwarding only the most relevant data into Splunk systems while redirecting the remaining data into highly cost-effective Hadoop storage. Watch this […]

Today, telecommunications providers need relevant data in reasonable time and format to transform their business by acting on the insights generated through the data. Sprint has turned to Hadoop for scalable data storage and analytics in order to harness the data flowing from all possible directions at high speed and in various formats. Join Sprint, […]

Now every business is a data business. The ability to master the value of data has become a key driver of competitive advantage in industries of all kinds. Come and find out what’s new in the latest release of Hortonworks Data Platform 2.4. and find out what’s in store for the roadmap ahead.

Dataflow management in the word of hyper connected people, systems and things is challenging and complex. Hortonworks DataFlow provides a real-time, visual and interactive system to simplify data delivery from the Internet of Things to data centers, from data centers to the cloud and for everything in between. Join this webinar and find out more […]

You may be up all night wondering how enterprise organizations deal with large data volumes and data varieties without significantly increasing costs. And perhaps your existing data architectures are not equipped to handle today’s data challenges? Join this webinar to learn how to optimize your data architecture and gain significant cost savings with Hadoop. We […]

Every business is a data business. To transform your organization and unlock the value of your data, you need a way to ingest, store and analyze every type of data in your organization. The open and connected data platforms enable you to make decisions faster. We will provide real world examples of companies that have […]

Retailers are laser-focused on understanding buyer sentiment and driving personalized engagement. It’s much easier to predict sales, revenue, and stock availability when you have a comprehensive understanding of customer buying behavior and path to purchase . Manthan’s Customer Analytics, running on Hortonworks Hadoop, leverages both structured data like sales history and unstructured data like social […]

You might be living under a bridge if you think Hadoop implementation comes without its challenges. How can such projects graduate from a POC into full production? Join Hortonworks and Cisco as we share the top five common Hadoop implementation fails and ways to avoid them. You will learn the industry best practices and hear […]

Every day, healthcare professionals must make critical decisions— often times without sufficiently accurate and transparent data. The healthcare industry is undergoing a revolution, driven by an irreversible surge in the quantity and availability of data. This data includes traditional clinical and transactional data such as claims, electronic medicalrecords (EMR), lab results, and radiological images. However, […]

Splunk architectures have powerful capabilities, however in today’s world where everything is connected, non-pertinent data could flood your sophisticated and costly Splunk architectures. Hortonworks Dataflow optimizes Splunk architectures by filtering and forwarding only the most relevant data into Splunk systems while redirecting the remaining data into highly cost-effective Hadoop storage. Join this webinar to learn […]

Big data centric businesses in financial services have begun realising the benefits of leveraging data science in areas not only in risk, fraud, compliance and 360 view of customer but also,digital transformation in retail banking, improved trading strategies in capital marketing. In this webinar we will discuss all of these and deep dive into popular […]

The banking sector continues to be a driving force of any economy and leading banks are adapting to consumer and technological advances that are presenting a multitude of business opportunities. Banks can now process huge amounts of data from both traditional and non-traditional sources in Hadoop giving them better insight into both their risks and […]

Insurance companies of all sizes are challenged to keep up with emerging technologies that deliver a competitive advantage. Big data holds the key to greater customer insight and stronger customer relationships. But risk of sensitive data exposure — and compliance violations — keeps many insurers from pursuing big data initiatives and reaping the rewards of […]

Rapid data growth from a wide range of new data sources is significantly outpacing organizations’ abilities to manage data with existing systems. Today’s data architectures and IT budgets are straining under the pressure. In response, the center of gravity in the data architecture is shifting from structured transactional systems to connected data platforms with Apache […]

Growth in mobile advertising has become explosive with the mass adoption of smart phones and an array of mobile apps shaping the businesses and lives of the modern consumers. With mobile advertising spend at its peak, ad-networks are now looking at new and smarter ways to handle data from the increase in audience engagement and […]

The global wealth management industry is one of the growth components for any financial institution worth its weight in gold – it is one of the most ripe for disruption. In this business and technology focused webinar, we’ll discuss the practice of wealth management & its lifecycle. We’ll also analyze the innovations being transferred into […]

When HP Lovecraft wrote of forbidden knowledge about non-human deities, knowledge which would reduce the reader to insanity, most people assumed that he was making up a fantasy world. In fact he was documenting Kerberos and its Hadoop integration. There are some things humanity was not meant to know. Most people are better off living […]

When HP Lovecraft wrote of forbidden knowledge about non-human deities, knowledge which would reduce the reader to insanity, most people assumed that he was making up a fantasy world. In fact he was documenting Kerberos and its Hadoop integration. There are some things humanity was not meant to know. Most people are better off living […]

Growth in mobile advertising has become explosive with the mass adoption of smart phones and an array of mobile apps shaping the businesses and lives of the modern consumers. With mobile advertising spend at its peak, ad-networks are now looking at new and smarter ways to handle data from the increase in audience engagement and […]

Financial Services is undergoing a major transformation and it is very evident that Banking as we know it will change dramatically over the next few years. Previous webinars has spent some time over the last year defining the Big Data landscape in Banking across Capital Markets, Retail Banking, Wealth & Asset Management, Hedge Funds etc. […]

The ROI of streaming analytics projects is often hindered by the difficulties of collecting and delivering data into the analytics platform. Hortonworks DataFlow provides a real-time, visual and interactive system to simply and effectively deliver data to the Kafka messaging bus for further processing by Spark Streaming, Storm, and even HBASE and HDFS. Join this […]

As digital consumption of rich media content explodes and with audience expectations at its peak, media providers have been challenged with not only delivering high-quality audience experiences but also the audience analytics in realtime to enable actionable insights for content publishers. Arkena, one of Europe’s leading media services organizations chose to power it’s analytical platform […]

A study of the top data breaches in 2015 reads like a “who’s who” of actors in society across governmental departments, banks and retail establishments. The financial services industry understands that a comprehensive & strategic approach to cybersecurity is now far from being an IT challenge a few years ago to a “must have”. As […]

Consumer Packaged Goods (CPG) companies such as PepsiCo rely on the seamless communication between a large interconnected network. In order to be successful, this network must include suppliers, production facilities, logistics partners and retailers. With a heavy reliance on coordination, each member of this network generates information in a wide variety of volumes and formats […]

Join us for a live 30 minute webinar to see how you can make data collection from the Internet of Anything fast, easy and secure. Designed to accelerate big data ROI from streaming analytics systems such as Spark and Storm, Hortonworks DataFlow delivers data from anywhere it originates to anywhere it needs to go. We […]

Hortonworks DataFlow (HDF), powered by Apache NiFi, is the first integrated platform that solves the complexity and challenges of collecting and transporting data from a multitude of sources be it big or small, fast or slow, always connected or intermittently available. Hortonworks DataFlow is a single combined platform for data acquisition, simple event processing, transport […]

The emergence of Big Data has driven the need for a new data platform within the enterprise. Apache Hadoop has emerged as the core of that platform and is driving transformative outcomes across every industry. Join this webinar for an overview of the technology, how it fits within the enterprise, and gain insight into some […]

Together, Hortonworks and WANdisco eliminate downtime and data loss to meet the most demanding Service Level Agreements (SLAs). This joint solution helps move customers into full production and expand their deployment footprint through an active-active replication architecture that achieves 100% continuous availability.

With the increasing number of organizations investing in big data projects, where do you start? Join, Dave Russell and Yael Widmann as they share their five steps towards a successful big data project. They will cover, business use cases, data sources, security and many more..

Hadoop’s cost effective scalability and flexibility to analyze all data types is driving organizations everywhere to embrace big data analytics. From proof of concept to deployment across the enterprise, Hortonworks and Datameer will share three common uses cases for Datameer on HDP and a demo of the Datameer Analytics Solution and we look forward to […]

Hadoop technology is becoming pervasive within the enterprise. But what are the real time use cases of Hadoop ? And what are the tangible benefits of Hadoop and Big Data technology? Register now to hear about how Hortonworks see the Hadoop market and how Hortonworks’ customers use Hadoop technology to transform their business.

In the era of consumer-centric “agile” supply chain strategies, companies are forced to act more like retailers in how they capture, analyze and use consumer data. This gives visibility to internal and external supply chain partners on how products are made, sold and used. But that visibility demands more data from more points across the supply chain. In […]

Hear from experts with deep experience in analytics, big data, and data integration: Forrester Research analyst educates you on the four areas of innovations—real-time Hadoop, machine learning accelerated solutions, simplification and automation, and security. Hadoop pioneer, Hortonworks, presents concrete steps to getting started with Hadoop for data discovery, single view, and predictive analytics. Data virtualization […]

Big data technologies are definitely disrupting traditional industries helping innovate, generate new insight and revenue streams. With Apache Hadoop’s ability to economically store large volumes of structured, unstructured or semi-structured data, organizations specifically in the retail-banking sector are now able to be more predictive and insightful towards consumers. Engagement across digital channels such as mobile, […]

In this webinar you will learn how the Hortonworks Data Platform offers an Open Enterprise Hadoop solution with EMC’s Elastic Cloud Storage (ECS) Platform. You will learn how the deep integration between EMC and Hortonworks delivers a Hadoop solution with unmatched scalability, storage efficiency & availability. Discover the benefits of robust data protection and geo-scale […]

Big Data platforms, powered by Open Source Hadoop, can economically store large volumes of structured, unstructured or semistructured data & help process it at scale thus enabling predictive and actionable intelligence. The Digital trend in Banking is now driving well established banking organizations to respond to the disruption being caused by emerging FinTechs. Given that […]

Credit & Payment Card fraud has mushroomed into a massive challenge for consumers, financial institutions,regulators and law enforcement. As the accessibility and usage of Credit Cards burgeons and transaction volumes increase, Banks are losing tens of billions of dollars on an annual basis to fraudsters. The Nilson Report (as 2013) estimated that of every dollar […]

Recent innovations in the Internet-enabled connected cars that we drive today have spawned a whole new set of opportunities and challenges for automakers. The opportunities come from the ability to capture detailed, current data on how drivers operate their vehicles and how those vehicles respond to that use. Join this webinar to learn how this […]

Banking is an increasingly complex as well as a global business. Leading Banks now generate a large amount of revenue in Global markets and this is generally true of all major worldwide banks. Financial crime is a huge concern for banking institutions given the complexity of the products they offer their millions of customers, large […]

Risk management is not just a defensive business imperative but the best managed banks can understand their holistic risks much better to deploy their capital to obtain the best possible business outcomes. Leading global banks are now leveraging Apache Hadoop and it’s ecosystem of projects to create holistic data management and governance architectures in support […]

To realize the potential from the massive and dynamically changing data stream generated by IoAT systems must be able to ingest and process this information in a timely fashion – before its value perishes. Data in motion from the IoAT (Internet of Anything) must be treated as dataflows—from source to destination—so that modern analytical applications […]

Pivotal HAWQ, one of the world’s most advanced enterprise SQL on Hadoop technology, coupled with the Hortonworks Data Platform, the only 100% open source Apache Hadoop data platform, can turbocharge your analytic efforts. Featuring a massively-parallel processing (MPP) SQL query engine that runs directly in the compute resources of a Hadoop cluster, Pivotal HAWQ and […]

As organizations pursue Hadoop initiatives in order to capture new opportunities for data-driven insight, data governance requirements can pose a key challenge. The management of information to identify its value and enable effective control, security and compliance for customer and enterprise data is a core requirement for both traditional and Modern Data Architectures. Apache Atlas […]

Big Data technology (led by Hadoop) is changing the landscape in areas as diverse as Risk Management, AML Compliance, Fraud Detection, Cyber Security and Customer Analytics. In this webinar, we will explore some of these global themes and discuss specific use cases & business areas that the largest Global Banks are leveraging Big Data across.

With Stinger and Stinger.next initiative, we have come a long way from making Hive a faster and more scalable tool for running SQL jobs on Hadoop. Come and hear our recent innovations as well as customer stories of how they use Hive to achieve success.

As the world generates even more volumes of data, from any device or any thing, companies are increasingly discovering the need to gain immediate insights and discern actions and predictions for their business. Massive data streams that originate from connected yet disparate sources including sensors, machines, geo-location devices, social feeds, web clicks, server logs and […]

SQL continues to be the most widely used language for big data analysis. It is no surprise that the SQL on Hadoop ecosystem is vibrant and robust, with many commercial and open source alternatives in the market. It is also an area of active innovation with different tools optimized for varying use cases. In this […]

This short clip from the Hortonworks DataFlow webinar will enable you to get started with HDF and design your first DataFlow. Speakers: Tim Hall – VP of Product Management, Hortonworks Joe Witt – Sr. Director of Engineering, Hortonworks

Your data is trying to tell you more about your business, are you listening? Join this webinar to hear how Quick Serve Restaurants and Retailers are using big data and open source Hadoop to listen and learn from their data. Our partner Blue Granite is uniquely qualified to drive quick time-to-value in Hadoop projects and […]

Predictive Analysis is a key use case of Big Data. Today, data driven organizations use advanced machine learning algorithms to understand and improve their business operations. In this session we will provide an overview of Spark’s machine learning capabilities. We will be running live interactive so that you can shape the conversation and the direction […]

The connected car drives a new data supply chain that is transforming the automotive industry. Combining the capabilities of Hortonworks and HARMAN services, automakers and suppliers will have access to a scalable platform form real-time insights, new innovative service creation, and predictive analytics-based solutions to minimize risk and reduce expenses. Join Dan Daogaru of Hortonworks, […]

The advent of connected manufacturing has ushered in an era where low-cost machine sensors take thousands of measurements per second at many points across the manufacturing process, enabling manufacturers to quickly detect anomalies and solve issues before they impact yield and quality. With Big Data insights, manufacturers can capitalize on this opportunity by following an […]

While in big data we can reap big rewards, it also poses significant risk, including misleading data and unexpected costs that could impact the business. It is critical for enterprises to put in place a data governance strategy to ensure that information remains accurate, consistent and trusted. With the ever evolving landscape of disparate data […]

It is increasingly evident that organizations can realize the full potential value of their data assets by combining the structured transactional data with semi-structured and unstructured data. Businesses also notice that to be agile and react to situations in real time, access to transactional data with low latency is essential. Low-latency transactional data brings additional […]

The world’s leading firms have recognized that data, along with human capital, is the most valuable asset they have today. The need for IT to digitize the business and provide actionable data to forecast market movements, improve customer experience, make flash offers, response to network errors, is paramount to sustainability. However, existing systems were not […]

In case you missed the webinar, you can access the slides here. The Personalized Medicine Initiative (PMI), part of the Life Sciences Institute of the University of BC, has deployed HDP and PHEMI Central Big Data Warehouse to collect, store and manage genomic and clinical data for Molecular You (MyCo). Molecular You is a ground breaking […]

YARN has fundamentally transformed the Hadoop landscape. It has transformed Hadoop from a single workload system to one that can now support a multitude of “fit for purpose” processing. Join this webinar where will be discussing the importance of YARN, it’s role in the Hadoop landscape and take part in votes that will shape the […]

Join this webinar as we discuss how the St. Louis metro system has reduced cost per mile by 30% by partnering with Hortonworks and LHP Telematics. They are now able to gather data from over 200 buses and analyze it to prevent part failures.

Join Grant Bodley for a webinar about the transformation of the automotive industry due to Big Data and the information highway it has created. Grant is the GM of Global Manufacturing Industry Solutions at Hortonworks and will explain: Forces transforming the automotive industry Disruptive innovation driven by Big Data and the Connected Car Open Enterprise […]

How do you get started with a big data project? The challenges of deploying and operating Hadoop clusters plague many organizations as much consideration is need on platforms, infrastructure, integration and the management of it all. Join Hortonworks and Canopy Cloud as we discuss the approaches are working for large enterprise organization, choosing the right […]

SQL continues to be the most widely used language for big data analysis. It is no surprise that the SQL on Hadoop ecosystem is vibrant and robust, with many commercial and open source alternatives in the market. It is also an area of active innovation with different tools optimized for varying use cases. In this […]

Hortonworks Data Platform (HDP) 2.3 represents the latest innovation from across the Hadoop ecosystem, especially in the area of security. With HDP 2.3, enterprises can secure their data using a gateway for perimeter security, provide fine grain authorization and auditing for all access patterns, and ensure data encryption over the wire as well as stored […]

This webinar will provide an overview of Hortonworks DataFlow (HDF), how it complements Hortonworks Data Platform, and the future roadmap. Join us to learn more about how to securely and easily collect, conduct, and curate dynamic Internet of Anything data into actionable insights for your business. We have responded to the remaining questions you […]

As more data is imported into Hadoop Data Lakes, how can we best secure sensitive data? What security options are available and what kind of best practices should be implemented? Join Vincent Lam of Protegrity and Syed Mahmood of Hortonworks as they jointly discuss securing HDP data lakes to leverage security in Hadoop without sacrificing usability. You’ll learn […]

You must be living under a rock if you haven’t noticed the momentum organisations are having across the enterprise with Big Data and Hadoop. Within the last few years there has been a fundamental shift in the way Hadoop is being used. From a specialist application to now serving multiple business units through a “data […]

As developers, we like to build and test applications in our favorite IDE. We prefer to debug applications using checkpoints and be able to trace through our code. This paradigm of development is disrupted in a distributed compute environment with execution spread across multiple hosts and JVMs. In this session we look into using Hadoop […]

Join us to hear how telecommunications service providers, retailers, and other consumer centric organizations can leverage Hadoop to gain access to predictive analytics, form a single view of their customer’s journey across channels and get a 360 degree voice of their customers. In this webinar, you will also learn how to dramatically enhance your customers’ […]

VHA (Voluntary Hospitals of America) is the largest member-owned health care company in the US delivering industry-leading supply chain management services and clinical improvement services to its members. At VHA, product, supplier, and member information is siloed across multiple sources. VHA sees value in consolidating the disparate data into a Data Lake, supported by the Hortonworks […]

In this webinar you’ll understand how the Hortonworks Data Platform delivers an Open Enterprise Hadoop solution to run on EMC Isilon infrastructure. You will learn how the EMC Isilon storage solutions combined with the Hortonworks Data Platform deliver unmatched flexibility, lower cost, and deliver robust data protection and security. You’ll learn how you can easily […]

The recently launched HDP 2.3 is a major advancement of Open Enterprise Hadoop. It represents the best of community led development with innovations spanning Apache Hadoop, Apache Ambari, Ranger, HBase, Spark, and Storm. In this session we will provide an in-depth overview of new functionalities and discuss the impact on new and ongoing big data initiatives.

The emergence of Big Data has driven the need for a new data platform within the enterprise. Apache Hadoop has emerged as the core of that platform and is driving transformative outcomes across every industry. Join this webinar for an overview of the technology, how it fits within the enterprise and gain insight into some […]

View the Recording Over the last few years, the insurance industry appears to have fared reasonably well; however, as of year-end 2014 Returns on Equity (ROEs) have begun to fall due to a combination of capital accumulation, competitive pricing, weak investment returns and rising loss expense (Source: 2015 EY US Property/Casualty insurance outlook). Whether the […]

Wow! When have you ever sat in on a Big Data analytics discussion by three of the most influential CTOs in the industry? What do they talk about among themselves? Join Teradata’s Stephen Brobst, Informatica’s Sanjay Krishnamurthi, and Hortonworks’ Scott Gnau as they provide a framework and best practices for maximizing value for data assets […]

Rich media is exploding all around us. From our personal usage to retailers monitoring store traffic for optimized associate placement, there is wide and growing application of rich media. Despite the pervasive usage, enterprises have had limited choice of generally available tools to analyze rich media. In this session we will look into leveraging deep […]

Traditionally support has had a retroactive orientation, with a focus on helping customers with problems that have already occurred and need to be addressed. What if enterprises have access to a support service that proactively monitors their Hadoop infrastructures and, not only identifies potential issues, but also recommends specific solutions and actions? In this webinar, […]

Enhancing a customer experience has become essential for communication service providers to effectively manage customer churn and build a strong, long lasting relationship with their customers. This has become increasingly challenging as customer interactions occur across multiple channels. Understanding customer behavior and how it applies across channels is the key to ensuring the best level […]

HDP 2.3 represents the latest innovation from across the Hadoop ecosystem. Its focus is on easing enterprise adoption by eliminating administration complexities, improving developer productivity, enhancing security and data governance, and delivering proactive cluster monitoring. Join our webinar to learn more about exciting innovations and to see the break through user experience packaged within Hortonworks […]

Scalding is a scala DSL for Cascading, running on Hadoop. It’s a concise, functional and very efficient way to build big data applications. One significant benefit of Scalding —because it runs on top of Cascading— is that it allows easy porting of Scalding apps from MapReduce to newer, faster execution fabrics. In this webinar, Cyrille […]

By leveraging Big Data, you already know your data now comes from a variety of sources including CRM systems, files, spreadsheets, video, social media, payment data and more. As data is being collected from these diverse sources, sensitive information must be protected. The question is: how do you comprehensively secure your Big Data environment amid […]

Effective data governance is imperative to the success of Data Lake initiatives. Without governance policies and processes, information discovery and analysis is severely impaired. In this session we will provide an in-depth look into the Data Governance Initiative launched collaboratively between Hortonworks and partners from across industries. We will cover the objectives of Data Governance […]

Join this webinar to explore Hadoop security challenges and trends, learn how to simply the connection of your Hortonworks Data Platform to your existing Active Directory infrastructure and hear about real world examples of organizations that are achieving the following benefits: Secured Hortonworks environments thanks to Active Directory infrastructure for identity and authentication. Increased productivity […]

Hadoop and The Internet of Things has enabled data driven companies to leverage new data sources and apply new analytical techniques in creative ways that provide competitive advantage. Beyond clickstream data, companies are finding transformational insights stemming from machine data and telemetry that are radically improving operational efficiencies and yielding new actionable customer insights. We […]

In order to make data-driven decisions about risks and threats to facilities, assets and employees, Oil and Gas companies need a solution that can acquire, manage, integrate, analyze and explore your data more efficiently across a diverse set of data sources. During this webinar, you will learn how Hortonworks HDP and Novetta Entity Analytics can […]

In this webinar you’ll learn how Pivotal HAWQ, one of the world’s most advanced enterprise SQL on Hadoop technology, coupled with the Hortonworks Data Platform, the only 100% open source Apache Hadoop data platform, can turbocharge your Data Science efforts. Pivotal HAWQ allows you to leverage advanced analytics for your data in Hadoop using massively-parallel […]

Founded in February 2014, SequenceIQ has developed innovative products such as Cloudbreak, an elastic and cloud agnostic deployment solution for HDP clusters, and Periscope, which provides policy-based autoscaling for multi-tenant HDP clusters. The company has deep experience contributing to existing Apache Software Foundation projects and innovating within the open source community, best exemplified by the […]

Predictive Analysis is a key use case of Big Data. Today, data driven organizations use advanced machine learning algorithms to understand and improve their business operations. In this session we will provide an overview of Spark’s machine learning capabilities and leverage Apache Zeppelin’s web based notebook for interactive data science analysis.

Hadoop provides a powerful platform for data science and analytics, where data engineers and data scientists can leverage myriad data from external and internal data sources to uncover new insight. Such power is also presenting a few new challenges. On the one hand, the business wants more and more self-service, and on the other hand […]

HBase adoption continues to explode amid rapid customer success and unbridled innovation. HBase with its limitless scalability, high reliability and deep integration with Hadoop ecosystem tools, offers enterprise developers a rich platform on which to build their next generation applications. In this workshop we will explore HBase SQL capabilities, deep Hadoop ecosystem integrations and deployment […]

Companies in every industry look for ways to explore new data types and large data sets that were previously too big to capture, store and process. They need to unlock insights from data such as clickstream, geo-location, sensor, server log, social, text and video data. However, becoming a data-first enterprise comes with many challenges. Join […]

Join this webinar with Hortonworks and Skytree and learn how Communications Service Providers can enhance their customers experience by: – Creating a Data Lake for a 360 degree customer view. – Building dynamic customer profiles. – Leveraging a next-best-action streaming engine. You will learn more about how Hortonworks Hadoop Distribution Platform and Skytree Machine Learning […]

Today’s enterprises are challenged with capturing large amounts of data from a number of sources in a variety of formats, and then storing it in a cost-effective, timely manner. With your current data warehouse, this may seem overwhelming. It doesn’t have to be. With a Hadoop-based modern data warehouse, you can overcome these challenges and […]

Your Big Data strategy is only as good as the quality of your data. Today, deriving business value from data depends on how well your company can capture, cleanse, integrate and manage data. During this webinar, we will discuss how to eliminate the challenges to Big Data management inside Hadoop.

Securing Hadoop data is a hot topic for good reason – no matter where you are in your Hadoop implementation plans, it’s best to define your data security approach now, not later. Hortonworks and Voltage Security are focused on deeply integrating Hadoop with your existing data center technologies and team capabilities. Attend this discussion to […]

Apache Amabri is the only 100% open source management and provisioning tool for Apache Hadoop. Recent innovations of Apache Amabri have focused on opening Apache Ambari into a pluggable management platform that can automate cluster provisioning, deploy 3rd party software and provide custom operational and developers views to the end user. In this session we […]

Whether you are an insurer, reinsurer, broker or insurance service provider; everything you do is based on analytics. From underwriting to claims to agency and marketing, the smartest and most streamlined business operations at insurance companies are driven by advanced and intelligent analytics. But is your data ready? Are you an “Analytics Ready” insurer? Great […]

Many organizations are leveraging social media to understand consumer sentiment and opinions about brands and products. Analytics in this area, however, is in its infancy and does not always provide a compelling result for effective business impact. Learn how consumer organizations can benefit by integrating social data with enterprise data to drive more profitable consumer […]

Learn how a successful Hadoop project moves from use case discovery to successful implementation of analytic insights to the ability to deliver predictive analysis. Hortonworks and CSC combine forces on this webinar to help answer the question “How can I see results quickly and reliably in my big data project?”. Whether your goal is to […]

This webinar will cover the key fundamentals of Apache Spark and operational best practices for executing Spark jobs along with the rest of Big Data workloads. We will also provide a working example to showcase micro-batch and machine learning processing using Apache Spark.

Join Cloudian, Hortonworks and 451 Research for a panel-style Q&A discussion about the latest trends and technology innovations in Big Data and Analytics. Matt Aslett, Data Platforms and Analytics Research Director at 451 Research, John Kreisa, Vice President of Strategic Marketing at Hortonworks, and Paul Turner, Chief Marketing Officer at Cloudian, will answer your toughest questions […]

Join this webinar and hear about key trends for Hadoop in 2015. You will learn: How Hadoop opens a new world of analytic applications. How organizations can avoid the need to hire high-priced Hadoop consultants. Hadoop’s killer app for 2015. Mike Gualtieri, Principal Analyst at Forrester, and John Kreisa, Vice President Strategic Marketing at Hortonworks, will […]

Big Data Analytics is transforming how banks and financial institutions unlock insights, make more meaningful decisions, and manage risk. Join this webinar to see how you can gain a clear understanding of the customer journey by leveraging Platfora to interactively analyze the mass of raw data that is stored in your Hortonworks Data Platform. Our […]

In 2010 the Clinical Informatics Team at the University of California in Irvine, led by Charles Boicey, looked outside of the conventional Healthcare data ecosystem for new data management solutions – their existing Electronic Health Record and Enterprise Data Warehouse environments no longer met the organization’s needs. They researched “Big Data” technologies in companies such as […]

YARN has fundamentally transformed the Hadoop landscape. It has opened hadoop from a single workload system to one that can now support a multitude of fit for purpose processing. In this workshop we will provide an overview of Apache Slider that enables custom applications to run natively in the cluster as a YARN Ready Application. […]

Many enterprises are turning to Apache Hadoop to enable Big Data Analytics and reduce the costs of traditional data warehousing. Yet, it is hard to succeed when 80% of the time is spent on moving data and only 20% on using it. It’s time to swap the 80/20! The Big Data experts at Attunity and Hortonworks have […]

Hadoop is no longer optional. Companies of all sizes are in various phases of their own Big Data journey. Whether you are just starting to explore the platform or have multiple clusters up and running, everyone is presented with a similar challenge – developing their internal skillset. Hadoop specialists are hard to find. Hand coding […]

The Enterprise Data Lake has become the defacto repository of both structured and unstructured data within an enterprise. Being able to discover information across both structured and unstructured data using search is a key capability of enterprise data lake. In this workshop, we will provide an in-depth overview of HDP Search with focus on configuration, […]

Developers increasingly are building dynamic, interactive real-time applications on fast streaming data to extract maximum value from data in the moment. To do so requires a data pipeline, the ability to make transactional decisions against state, and an export functionality that pushes data at high speeds to long-term Hadoop analytics stores like Hortonworks Data Platform […]

As the Big Data Analytics and the Apache Hadoop ecosystem has matured and gained increasing traction in established industries with faster adoption in the insurance market than originally anticipated, it is clear that the potential benefits for data management and business intelligence are staggering. At the same time, many big data programs have stalled or failed […]

The Smart Content Hub solution from HP and Hortonworks enables a shared content infrastructure that transparently synchronizes information with existing systems and offers an open standards-based platform for deep analysis and data monetization. Join this webinar and learn how you can: 1/ Leverage a 100% of your data, including text, images, audio, video, and many […]

Apache Ambari is a single framework for IT administrators to provision, manage and monitor a Hadoop cluster. Apache Ambari 1.7.0 is included with Hortonworks Data Platform 2.2. In this 30-minute webinar, learn from the Hortonworks Ambari product manager Jeff Sposetti and Apache Ambari committer Mahadev Konar about new capabilities including: Improvements to Ambari core – […]

Apache HBase provides low-latency storage for scenarios that require real-time analysis and tabular data for end user applications. Join this 30-minute webinar to learn from Devaraj Das, Hortonworks founder and Apache HBase committer and Hortonworks product manager Carter Shanklin. Devaraj and Carter will discuss the HBase innovations that are included in HDP 2.2, including: support […]

No matter if you are new to Hadoop or have a mature cluster in production, scale will be a critical factor of your success with Hadoop. Are you ready to take the next big step as you scale out your data architecture? Please join Talend and Hortonworks on this webinar where we will help you […]

How can you simplify the management and monitoring of your Hadoop environment? Ensure IT can focus on the right business priorities supported by Hadoop? Join Hortonworks and HP in this webinar and learn how you can simplify the management and monitoring of your Hadoop environment, and ensure IT can focus on the right business priorities […]

Almost every week, news of a proprietary or customer data breach hits the news wave. While attackers have increased the level of sophistication in their tactics, so too have organizations advanced in their ability to build a robust, data-driven defense. Join Hortonworks and Sqrrl to learn how a Modern Data Architecture with Hortonworks Data Platform […]

Many organizations have become aware of the importance of big data technologies, such as Apache Hadoop but are struggling to determine the right architecture to integrate it with their existing analytics and data processing infrastructure. As companies are implementing Hadoop, they need to learn new skills and languages, which can impact developer productivity. Often times […]

What if you could assemble all your data in one system and run your critical analytic applications in parallel, regardless of the format, age or location of the data? Today, thanks to the economics of Apache Hadoop-based data platforms, in particular YARN, this is possible. Listen to this relay and hear directly from our experts how […]

Earlier this year, the open source community delivered the Stinger Initiative to improve speed, scale and SQL semantics in Apache Hive. Now Stinger.next is underway, to build on those initial successes. In this 30-minute webinar, Hortonworks co-founder Alan Gates and Hortonworks Hive product manager Raj Baines discuss SQL queries in HDP 2.2: ACID transactions and […]

Big Data is moving to the next level of maturity and it’s all about the applications. Dhruv Kumar, one of the minds behind Cascading, the most widely used and deployed development framework for building Big Data applications, will discuss how Cascading can enable developers to accelerate the time to market for their data applications, from […]

In this 30-minute webinar Balaji Ganesan, Hortonworks senior director for enterprise security strategy and Vinay Shukla, director of product management, discuss HDP 2.2’s features for delivering comprehensive security in the platform. Balaji and Vinay will discuss Apache Ranger and Apache Knox and how they are integrated in HDP 2.2 to provide fine grain authorization, auditing […]

Financial services companies can reap tremendous benefits from ‘Big Data’ and they have moved quickly to deploy it. But these companies also place heavy demands on ‘Big Data’ infrastructure for flexibility, reliability and performance. In this webinar, Hortonworks joins WANDisco to look at three examples of using ‘Big Data’ to get a more comprehensive […]

As the ratio of memory to processing power rapidly evolves, many within the Hadoop community are gravitating towards Apache Spark for fast, in-memory data processing. And with YARN, they use Spark for machine learning and data science use cases along side other workloads simultaneously. This is a continuation of our YARN Ready Series, aimed at […]

What if your organization could study months and years worth of historical data from disparate sources, without sampling, to pinpoint risks for your business and compliance reporting? These risks come from uncertainty in financial markets, threats from project failures, legal liabilities, credit risk, accidents and natural disasters as well as deliberate attack from an adversary. […]

Massive new data volumes are forcing a transformation in the data center and driving a new modern data architecture that includes Apache Hadoop. As organizations are developing new analytic applications to drive their business forward, many of these new applications are being deployed with Hadoop and HP hardware to meet the growing demands of their data. Join […]

New types of data flow into and around today’s retail businesses with the speed and volume that many retailers are unable to process with their traditional data architectures. Apache Hadoop, integrated within a modern data architecture, delivers aggregated, detailed data. Retailers can use this data for new batch, interactive and real-time analytics aligned with their […]

At the center of many data-driven businesses is Scalding. Scalding is a Scala library based on the Cascading framework and is designed to simplify application development on Hadoop and YARN. Please join us as Jonathan Coveney, Sr. Software Engineer at Twitter, teaches us about Scalding, and how Twitter uses it to perform a variety of […]

Data is exponentially increasing in both types and volumes, creating opportunities for businesses. To fully realize the potential of this new data, analysts recommend the shift from a single platform to a data ecosystem. Multiple systems are needed to exploit the variety and volume of data sources. A flexible data repository such as a data […]

Data is exponentially increasing in both types and volumes, creating opportunities for businesses. Watch this video and learn from three Big Data experts: John Kreisa, VP Strategic Marketing at Hortonworks, Imad Birouty, Director of Technical Marketing at Teradata, and John Haddad, Senior Director of Product Marketing at Informatica.

Join Ofer Medelvitch, Director of Data Science of Hortonworks and Michael Zeller, Founder and CEO of Zementis as they present key learnings as to what drives successful implementations of big data analytics projects. Their knowledge comes from working with dozens of companies from small cloud-based start-ups to some of the largest companies in the world. […]

This is a continuation of our YARN Ready Series, aimed at helping developers learn the different ways to integrate to YARN and Hadoop. Apache Ambari is a completely open operational framework for provisioning, managing and monitoring Apache Hadoop clusters. In this webinar, learn how to use Ambari to also manage YARN. Register at the right […]

Now that you are moving your Hadoop POC into production, you know that sensitive customer and corporate data (credit card numbers, intellectual property, customer files, and more) need protection. Now the question becomes: How do you keep all this sensitive data secure, as it moves into Hadoop, as it is stored and as it moves […]

This is the third in the YARN Ready webinar series covering how to integrate to YARN; this event focuses on using Tez. Tools and applications that are YARN Ready have been verified to work within YARN, and there are a number of ways to integrate. Part 1 covered native YARN, part 2 covered Slider and […]

Join Hortonworks and Cisco as we discuss trends and drivers for a modern data architecture. Our experts will walk you through some key design considerations when deploying a Hadoop cluster in production. We’ll also share practical best practices around Cisco-based big data architectures and Hortonworks Data Platform to get you started on building your modern […]

This is the second in our series covering how to integrate to YARN using Slider. Tools and applications that are YARN Ready have been verified to work within YARN, and there are a few ways to integrate. Part 1 was integration natively with YARN, part 2 is Slider, and Tez is covered in an upcoming […]

Join Hortonworks and CSC, in this interactive webinar to: Understand the trends and drivers for Hadoop and Big Data for Manufacturing. Leverage Apache Hadoop for ingesting, storing, and discovery analytics to identify patterns that have actionable value to the business. Explore use cases and best practices that can guide you through your Big Data strategy. […]

This is the first in a series covering how to integrate using native YARN. Tools and applications that are YARN Ready have been verified to work within YARN, and there are a few ways to integrate, one of which is natively. Others include Slider and Tez, covered in the next webinars.

Join us in this interactive webinar as we walk through use cases on how you can use SAS In-Memory Statistic for Hadoop and SAS Visual Statistic with Hortonworks’ Data Platform (HDP) to reveal insights in your big data and redefine how your organization solves complex problems. Hortonworks and SAS together offers unprecedented speed and flexibility […]

Apache Ambari is a single framework for IT administrators to provision, manage and monitor a Hadoop cluster. Apache Ambari 1.6.1 includes support for Hortonworks Data Platform 2.1. In this 30-minute webinar, Jeff Sposeti, Hortonworks senior director of product management, and Mahadev Konar, Hortonworks co-founder and committer for Apache Ambari, will discuss new Ambari capabilities, including: […]

As more applications are created using Apache Hadoop that derive value from the new types of data from sensors/machines, server logs, click-streams, and other sources, the enterprise “Data Lake” forms with Hadoop acting as a shared service. While these Data Lakes are important, a broader life-cycle needs to be considered that spans development, test, production, […]

For the first time, Hortonworks Data Platform ships with Apache Storm for processing stream data in Hadoop. In this 30-minute webinar, Himanshu Bari, Hortonworks senior product manager, and Taylor Goetz, Hortonworks engineer and committer to Apache Storm, will discuss Storm and stream processing in HDP 2.1, including: Key requirements of a streaming solution and common […]

The recently launched YARN Ready Program will accelerate multi-workload Hadoop in the Enterprise. The webinar provides an overview of the program and how to get started with integrating new and existing applications with YARN-based Hadoop.

Apache Solr is the open source platform for searching data stored in Hadoop. Solr powers search on many of the world’s largest Internet sites, enabling powerful full-text search and near real-time indexing. Whether users search for tabular, text, geo-location or sensor data in Hadoop, they find it quickly with Apache Solr. Hortonworks Data Platform 2.1 […]

Join Revolution Analytics and Hortonworks in this interactive webinar to discuss how customers are using Hadoop and R in the real world for Data Mining and Predictive Analytics. We’ll show an end-to-end customer churn analytics demonstration (leveraging Revolution Analytics, Hortonworks and Tableau) serving three user personas: a website visitor, a data scientist and a business […]

Most successful Apache Hadoop implementations start small in scope and scale with a single analytic application but can quickly grow. Mature deployments can have many applications running off a single shared data lake. Improved application lifecycle will accelerate creation of new apps to meet new business needs. As the Hadoop environment grows and becomes increasingly […]

The YARN framework introduced as part of Hadoop 2 has prompted the emergence of data lakes, reservoirs and hubs as organizations wake to the capabilities of Enterprise Hadoop in a Modern Data Architecture. In this 30-minute webinar, Rohit Bakhshi, product manager at Hortonworks, and Vinod Vavilapalli, who leads YARN development at Hortonworks, present an overview […]

Presented by independent analyst & Big Data thought leader, Mike Ferguson Join Mike Ferguson, as he explores how the growing business demand to analyse new sources of data is impacting on traditional architectures and how these architectures need to change to accommodate big data analytical workloads. What’s Driving the big data agenda New data and […]

Are your business users able to quickly access and report on the massive amount of data flowing into Hadoop? Learn how leading companies are already accelerating the speed of innovation using the combination of Hadoop and the Actian Analytics Platform. In this webinar, Hortonworks and Actian will describe how you can: Deploy modern data architecture […]

Owen O’Malley and Carter Shanklin hosted the second of our seven Discover HDP 2.1 webinars. Owen and Carter discussed the Stinger Initiative and the improvements to Apache Hive that are included in HDP 2.1: Faster queries with Hive on Tez, vectorized query execution and a cost-based optimizer New SQL semantics and datatypes SQL-standard authorization The […]

Difficult challenges and choices face today’s healthcare and pharmaceutical industry. Listen to this replay and hear from industry leaders in pharmaceutical, healthcare, and Big Data technologies on how they’re unleashing Big Data to drive real business impact.

In this webinar, Charles Boyce will present how UC Irvine Health turned to Hadoop and Hortonworks Data Platform to improve clinical operations in the hospital and its scientific research at the medical school. Their team is building a quantified medical practice that reduces re-admissions, speeds new research projects, and tracks patient vital stats on a […]

Join Hortonworks and Concurrent to learn how to accelerate your big data application development with the popular Cascading framework and Hortonworks Data Platform. In this webinar, we will Describe how developers can create future proof, data-driven applications built on Apache Hadoop Take advantage of the latest Hadoop processing frameworks like YARN and Tez Learn more […]

Join experts from Forrester, Hortonworks & Skytree to discuss the impact of big data on fraud management for financial institutions. One of the key risk management activities of financial service organizations includes the detection and prevention of fraud. In this webinar, Forrester Research, Hortonworks and Skytree will focus on challenges for enterprise fraud management professionals […]

Retailers need the complete picture – a 360-degree view of the customer, the pulse on brand sentiment, personalized promotions, and an optimal shopping experience. When Hadoop is integrated with modern retail operations, it dramatically reduces the cost of capturing, ingesting and storing data. By implementing self-service analytics capabilities on top of Hadoop, you can gain […]

Hadoop is a great platform for storing and processing massive amounts of data. Elasticsearch is the ideal solution for Searching and Visualizing the same data. Join us to learn how you can leverage the full power of both platforms to maximize the value of your Big Data. Attend this webinar and we’ll walk you through: […]

What if your organization could obtain a 360 degree view of the customer across offline, online and social and mobile channels? Attend this webinar with Splunk and Hortonworks and see examples of how marketing, business and operations analysts can reach across disparate data sets in Hadoop to spot new opportunities for up-sell and cross-sell. We’ll […]

Join Hortonworks and Actian, as we address the challenges faced by companies trying to implement their Big Data Strategy. In this webinar, we will identify some of the top challenges around analytics with big data and highlight how existing skills can be used to solve these challenges. Additionally, we will also provide real-world use cases […]

Attend this webinar to see the power of combining the Hortonworks Data Platform with Microsoft’s ubiquitous Windows, Office, SQL Server, Parallel Data Warehouse, and Azure platform to build the Modern Data Architecture for Big Data. In less than an hour, we’re walk you through: Building a hybrid Modern Data Architecture using Hadoop within a Microsoft […]

Our last webinar on “Getting Started Writing YARN Applications” covered benefits and steps you can take to get prepared for developing your application or integrating your existing application on YARN. Visit our get-started on YARN page for more information. This webinar goes into a bit more detail, where we will walk through YARN code and […]

2013 was certainly another fast moving year for the Enterprise Hadoop market. We witnessed the emergence of the YARN-based architecture of Hadoop 2 and a strong ecosystem embracement that will fuel its next big wave of innovation. So what’s in store for 2014? Join Shaun Connolly, VP of Strategy, Hortonworks where he’ll be covering the Enterprise Hadoop State of the […]

There is alot of information available on the benefits of Apache YARN but not much on just how to get started building applications. Join us to learn what you need to do to take the first steps towards developing your application or integrating your existing application on YARN. Agenda: Benefits Overview: Application benefits from YARN, […]

Is Hadoop ready for high-concurrency complex BI and Advanced Analytics? Roaring performance and fast, low-latency execution is possible when an in-memory analytical platform is paired with the Apache Hadoop framework. Join Hortonworks and Kognitio for an informative Web Briefing on putting Hadoop at the center of your modern data architecture—with zero disruption to business users. In this […]

Join the conversation with experts from Hortonworks and WANdisco as they explain how to achieve maximum availability, performance and scalability for multi-data center deployments of Hadoop. In this webinar, we’ll: Examine the key drivers and use cases for High Availability, performance and scalability for Apache Hadoop. Walk through an overview of reference architecture for a […]

The explosion of data in the enterprise has created a new class of storage and processing requirements that were never envisioned. Enterprises are seeing exponential growth in machine generated data, sensor data, social data, web logs and other data types and are looking to find value in this data. Much of this data was once […]

Hortonworks announces the release of Hortonworks Data Platform (HDP) 2.0, the first commercial Hadoop distribution built on the stable Hadoop 2.2 GA release from the Apache Software Foundation. Join us for a webinar outlining the YARN based architecture of HDP 2.0 and how it enables new workloads in the modern data architecture. We’ll discuss: YARN, […]

Join Hortonworks and Microstrategy to: Discuss the modern architecture for Business Intelligence on top of Hadoop as a data source. Learn how our joint solution helps enterprises store, process and analyze vast amounts of structured and unstructured data to deliver business insights throughout an organization. Discover what new benefits Hadoop 2.0 offers and how the […]

Among the advantages of cloud computing are faster access to compute and storage resources and utility pricing. With the added power of Apache Hadoop, businesses can now easily provision and manage Hadoop-ready infrastructure to analyze vast amounts of data and develop new business applications in the open cloud. While these are great benefits there are […]

When you’re analyzing big data, are you also analyzing your data? Are you even collecting telemetry data to understand how customers are using your product and where they may be having issues? Join Platfora and Hortonworks to: Discuss how Apache Hadoop and Big Data Analytics are driving the Modern Data Architecture Learn how Enterprise is using Platfora […]

How can you understand what customers are thinking, and how can you respond to sentiment, either positive or negative, in real time? Can you gain competitive advantage from knowing what consumers are saying about you or your competition online? Join Hortonworks and Tableau Software to: Discuss how Apache Hadoop and data discovery and visualization tools […]

Join us in this interactive webinar to discuss trends and business drivers for Hadoop. Learn how Hortonworks and Revolution Analytics play a role in the modern data architecture. See how you can run R natively in Hortonworks to simply move your R-powered analytics to Hadoop.

How do you turn data from many different sources into actionable insights and manufacture those insights into innovative information-based products and services? Industry leaders are accomplishing this by adding Hadoop as a critical component in their modern data architecture to build a data lake. A data lake collects and stores data across a wide variety […]

There certainly is no shortage of hype when it comes to the term “Big Data”. One thing we can be sure of is that massive data volumes are driving a new modern data architecture that includes Hadoop in the mix. But what does that architecture look like with traditional infrastructure like the enterprise data warehouse? […]

Big Data is a trend that has engulfed today’s IT industry and one that organizations are struggling to manage. The size of the digital universe this year will be tenfold what it was just five years earlier. Therefore, organizations must find smarter data management approaches that enable them to effectively corral and optimize their data. […]

Join us for a deep dive into Apache Hadoop YARN and how you can start using it in your development efforts. Hortonworks is here to help, learn about Hortonworks Office Hours for YARN. We will also touch on the Hortonworks YARN Certification program.

With the momentum behind Big Data growing and use of the Apache Hadoop architecture increasing in enterprise-grade deployments, a comprehensive data protection strategy is needed to mitigate risk of breach and assure global regulatory compliance. As Hadoop platforms go deeper and wider into business critical applications, enterprise customers need security tools that fit and scale […]

Customer insight and marketplace predictions are a few of the profitable benefits found in big data technology. Leading companies are using the advanced analytics solution to find new revenue streams, increase customer satisfaction and optimize the supply chain. You’ve already got the data. It’s time to put it to work. Join Hortonworks and Pactera to […]

Hortonworks recently released Hortonworks Data Platform 2.0 (HDP2.0). This new release packages the most recent innovation form the Apache Hadoop community. This distribution includes the first delivery of next generation resource management Apache YARN which has been four years in the making. It allows for a wide range of data processing applications to run natively […]

In this webinar, we will discuss how Apache Hadoop works with your current infrastructure and how you can use data discovery and visualization tools to gain deeper insights from new data types stored in Hadoop and your existing data center investments. Join Hortonworks and Tableau Software and learn how to get started to visualize Hadoop […]

Apache Hive is the de facto standard for Hadoop SQL interaction today. However, it was originally built for “batch” workloads and as Hadoop gains in popularity, enterprise requirements for Hive to become more real time or interactive have evolved… the Hive community has responded. Please join us for this webinar to find out how the […]

While Hadoop has emerged as a key technology in Big Data, many business analysts and users are still trying to figure out how Hadoop fits into their analytics strategy. If you are wondering where Hadoop fits in with the rest of your data sources for Strategic Analytics, this webinar will help you to understand key […]

How can Hadoop take advantage of OpenStack and how can the OpenStack meet the needs of a demanding Hadoop cluster? In this session, we will briefly look at the Hadoop’s design decisions; come up with the best practices for deploying and running Hadoop on OpenStack and some of the challenges around it. We’ll also look […]

Join this webinar to discuss best practices for designing and building a solid, robust and flexible Hadoop platform on an enterprise virtual infrastructure. Attendees will learn the flexibility and operational advantages of Virtual Machines such as fast provisioning, cloning, high levels of standardization, hybrid storage, vMotioning, increased stabilization of the entire software stack, High Availability […]

For this webinar, two of the most trusted experts in their fields to examine how big data technologies are being used today by practical big data practitioners. Eric Baldeschwieler (aka E14, @eric14), CTO and Founder of Hortonworks, and Hadoop luminary will provide perspective on the role of Massively Parallel Processing (MPP) Relational Databases in the modern data platform architecture. Stephen Brobst, CTO […]

Hadoop is deployed for a variety of uses, including web analytics, fraud detection, security monitoring, healthcare, environmental analysis, social media monitoring, and other purposes. Deriving meaningful insights from all this data can be a challenge, and the architectural approach you choose will make a difference in what you can and cannot achieve with reporting and […]

According to IDC, Windows Servers run more than 50% of the servers in the Enterprise Data Center. Hortonworks has worked closely with Microsoft to port Apache Hadoop to Windows to enable organizations to take advantage of this emerging Big Data technology. Join us in this informative webinar to hear about the new Hortonworks Data Platform […]

In this session, attendees will learn how to use R in the distributed environment of Hadoop using the rmr package. Additionally, the R package googleVis will be used to show how application development teams can incorporate the power of R and the power of Google Chart Tools into their applications quickly and easily. The result […]

Hortonworks and Appnovation will help you get better understanding of what Big Data is, what all is involved for companies that are quickly accumulating exceedingly large amounts of complex data, what the options are to handle this information and most importantly, what this data can do for the company once translated into a usable format. […]

Entravision Communications Corporation (NYSE: EVC) is a diversified Spanish-language media company with a unique group of media assets including television stations, radio stations and digital platforms. In 2011, they made the strategic decision to build a data analytics, modeling and insights division to expand the value of its traditional advertisement services. Join us in this […]

Hortonworks recently unveiled the Hortonworks Sandbox, a free, comprehensive, easy-to-use, hands-on learning environment that provides the fastest onramp for anyone interested in learning, evaluating or using Apache Hadoop™ in an enterprise. Join us in this interactive webinar as we discuss and demo features of the Hortonworks Sandbox, including: How to download and use the Sandbox […]

Hortonworks continues to innovate throughout all Hadoop related projects, packaging the most enterprise-ready components, such as Ambari, into the Hortonworks Data Platform (HDP). Please join us in this interactive webinar as we present real-world use cases of Enterprise customers that are finding success with HDP and their Big Data initiatives. We will also introduce new […]

Hadoop’s cost effective scalability and flexibility to analyze all data types is driving organizations everywhere to embrace big data analytics. From proof of concept to deployment across the enterprise, join Datameer and Hortonworks as we answer the ‘now what?’ when rolling out your Hadoop big data analytics project. This webinar will address critical project components […]

In 2012, we released Hortonworks Data Platform powered by Apache Hadoop and established partnerships with major enterprise software vendors including Microsoft and Teradata that are making enterprise ready Hadoop easier and faster to consume. As we start 2013, we invite you to join us for this live webinar where Shaun Connolly, VP of Strategy at Hortonworks, […]

Big Data is everywhere. And at the center of the big data discussion is Apache Hadoop, a next-generation enterprise data platform that allows you to capture, process and share the enormous amounts of new, multi-structured data that doesn’t fit into transitional systems. With Microsoft HDInsight, powered by Hortonworks Data Platform, you can bridge this new […]

Join Rohit Bakshi, Product Management, as he guides you through the current work on HA and Hadoop. Rohit will have a live demo of High Availability options on HDP 1.1 as well as answer any questions during this session.

YARN: The Future of Data Processing with Apache Hadoop Speaker: Arun C. Murthy, Hortonworks co-founder, VP of Apache Hadoop at Apache Software Foundation. The lead for the MapReduce project and YARN. Apache Hadoop MapReduce has been overhauled to emerge as Apache Hadoop YARN, a generic distributed application framework to support MapReduce and other application paradigms. […]

Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop […]

Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop […]

Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop […]

Are you a Systems Integrator or consultant working on Hadoop implementations? Working with Systems Integrators is a foundational aspect of Hortonworks business model. We see a unique and massive opportunity to leverage Hortonworks unequalled Hadoop expertise with our SI partners complementary technology and domain expertise, to enable high-value and repeatable Big Data solutions. Join us […]

Hortonworks and Teradata Aster have partnered to deliver advanced, powerful analytics of big data using Hadoop. Many have embrace this combined architecture that uses Hadoop and Teradata Aster analytics solutions as key ingredients to maximize value from ALL data. In this webinar where we will leave you with key takeaways to accelerate your Big Analytics […]

Data Integration is a key step in a Hadoop solution architecture. It is the first obstacle encountered once your cluster is up and running. OK, I have a cluster…now what? Complex scripts? For wide scale adoption of Apache Hadoop, an intuitive set of tools that abstract away the complexity of integration is necessary. Enter Talend […]

Microsoft and Hortonworks announced a strategic relationship earlier this year to accelerate and extend the delivery of Apache Hadoop-based distributions for Windows Server and Windows Azure. Join us in this 60-minute webcast with Rohit Bakashi, Product Manager at Hortonworks and Mike Flasko, Sr. Program Manager at Microsoft to discuss the work that’s being done since […]

Apache Hive provides SQL-like access to your stored data in Apache Hadoop. Apache HBase stores tabular data in Hadoop and supports update operations. The combination of these two capabilities is often desired, however, the current integration show limitations such as performance issues. In this talk, Hortonworks co-founder, Owen O’Malley, will present an overview of Hive […]

Scalability of the NameNode has been a key issue for HDFS clusters. Because the entire file system metadata is stored in memory on a single NameNode, and all metadata operations are processed on this single system, the NameNode both limits the growth in size of the cluster and makes the NameService a bottleneck for the […]

Join us for this free informative webinar to learn how the power of open source technologies address these data integration challenges. Hear from Rohit Bakhshi, Solution Architect at Hortonworks and Jim Walker, Director of Product Marketing at Talend, on Apache Hadoop best practices that data enthusiast of any skill-levels can leverage. Gain insights to different approaches organizations can take to avoid the complexity of uploading or extracting data from Hadoop. Also, see a live demonstration on how to load HDFS in less than five minutes without writing a line of code and how to create and run a pig script.

Hortonworks has been developing the next generation of Apache Hadoop MapReduce that factors the framework into a generic resource management fabric to support MapReduce and other application paradigms such as Graph Processing, MPI etc. High-availability is built-in from the beginning; as are security and multi-tenancy to support multiple users and organizations on large, shared clusters. […]

HCatalog is a metadata and table management system for Hadoop. It allows users to share data and metadata across Hive, Pig, and MapReduce. It also allows users to write their applications without being concerned how or where the data is stored, and insulates users from schema and storage format changes. In this talk, Hortonworks founder […]

Join Hortonworks founder Eric Baldeschwieler as he guides you through Hortonworks’ planned releases for the upcoming year. Eric has led the evolution of Apache Hadoop from a 20-node prototype to a 42,000-node service behind every click at Yahoo! In this webcast, Eric will guide you through the planned enhancements to the major Hadoop components in […]

UBS has been an early adopter of Hadoop and continues to test a number of data processing & analytics use cases. In this webcast, Executive Director at UBS, Dave Casper will discuss how Hadoop fits the overall Data & Architecture strategy at UBS. Joining Dave in this discussion is Abhishek Mehta, (Founder, Tresata) and Arun […]

The HDFS NameNode is a robust and reliable service as seen in practice in production at Yahoo, Facebook and other enterprises. However, the NameNode does not have automatic failover. A hot failover solution called HA NameNode is under active development (HDFS-1623) and making excellent progress. Join Hortonworks founder Sanjay Radia, as he outlines the approach […]

Apache Hadoop is the de-facto Big Data platform for data storage and processing. The current stable, production release of Hadoop is hadoop-0.20. The Apache Hadoop community is preparing to release hadoop-0.23 with several major improvements including HDFS Federation and NextGen MapReduce. In this webcast, Arun Murthy, the Apache Hadoop Release Master for hadoop.next, will discuss […]

The analytics world has expanded from simple transaction analytics to the correlation of transactions and interactions.Join Hortonworks and Datameer as they discuss their partnership around Apache Hadoop and how big data analytics is changing the face of BI. This webinar will include a demonstration of customer focused use cases. You will learn: How Hadoop-based analytics […]